ECS Insight assesses Elastic Compute Service (ECS) usage from the ECS Basics, Cost Insight, Automation, Reliability, Elasticity, and Security dimensions and provides optimization suggestions based on the assessment results.
Assessment dimensions
Assessment dimension | Description |
Assesses whether all ECS instances and the associated resources in the current account are properly distributed, identifies potential risks to business performance and high availability, and provides corresponding optimization suggestions. | |
Assesses the billing methods of ECS instances in the current account, the matching degree between instance types and workloads, and the reasonableness of fine-grained cost management, and provides corresponding optimization suggestions. | |
Assesses your use of automation capabilities to improve the O&M efficiency of ECS instances and associated resources, and provides corresponding optimization suggestions. | |
Assesses whether ECS instances in the current account have potential risks related to single points of stability and business high availability, and provides corresponding optimization suggestions. | |
Assesses the usage of elasticity capabilities for ECS resources in the current account, identifies elasticity risks, and provides corresponding optimization suggestions. | |
Assesses whether ECS instances and associated resources in the current account have data security, network security, and access security risks, and provides corresponding optimization suggestions. |
Levels of assessment items
For each assessment item, ECS Insight provides different results and corresponding severity levels. The following section describes the assessment results corresponding to each severity level and the recommended operations:
Critical (high-risk item): indicates that the usage of resources in the current account does not meet the criteria of an assessment item and high risks exist. We recommend that you resolve the issue and optimize the usage based on the best practices. A high-risk item scores 0 points.
Warning (warned item): indicates that the usage of resources in the current account does not meet the criteria of an assessment item and risks exist. We recommend that you resolve the issue and optimize the usage based on the best practices. The score varies based on the scoring rules.
Information (information item): indicates that the usage of resources in the current account does not meet the criteria of an assessment item, but the issue is not serious. You can choose whether to optimize the usage based on your business requirements. The score varies based on the scoring rules.
Unassessed Item (unassessed item): indicates that the resources in the current account do not meet the assessment threshold for an assessment item and ECS Insight does not assess the resources based on the item. You do not need to take note of the assessment item.
OK (Scored item): indicates that the usage of resources in the current account meets the criteria of an assessment item and no risk exists. You do not need to take note of the assessment item. The item gets the full score.
ECS Basics
In the ECS Basics dimension, ECS Insight assesses whether all ECS instances and the associated resources in the current account are properly distributed, identifies potential risks to business performance and high availability, and provides corresponding optimization suggestions.
ECS Insight provides the following categories of capabilities in the ECS Basics dimension: Basic Computing Capabilities, Basic Storage Capabilities, Basic Network Capabilities, and Basic API & Resource Management Capabilities. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Basic Computing Capabilities | Assesses whether all ECS instances in the account are properly distributed across zones, regions, and instance families. | Proper Cross-zone Distribution of ECS Instances | Checks whether all ECS instances in the account are properly distributed across different zones to achieve high availability and zone redundancy. | 10 |
| Yes |
|
No ECS Instances Deployed in Classic Network | Checks whether ECS instances that are deployed in the classic network, which are vulnerable to security risks, exist in the account. | 10 |
| Yes | Migrate specific or all ECS instances from the classic network to a virtual private cloud (VPC) based on your business requirements. For more information, see Migrate an ECS instance from the classic network to a VPC. | ||
No ECS Instances of Retired or Outdated Instance Families | Checks whether ECS instances of retired or outdated instance families exist in the account. If ECS instances have a pending following event, the instances belong to retired or outdated instance families: Instance Migration Events Due to Upgrades at Underlying Layer. | 10 |
| Yes | Migrate the ECS instances that belong to retired or outdated instance families at the earliest opportunity to improve the stability of ECS instances. For more information, see Events related to instance migration due to upgrades at the underlying layer. | ||
Basic Storage Capabilities | Assesses whether disks in the account are properly used. | No Basic Disks | Checks whether cloud disks of retired disk categories exist in the account. | 10 |
| Yes |
|
No Instances Equipped with Local Disks of Retired Disk Categories | Checks whether ECS instances that are equipped with local disks of retired disk categories exist in the account. | 10 |
| Yes | Back up the data of ECS instances that are equipped with local disks of retired disk categories based on your business requirements, release the ECS instances, and then create ECS instances that use local disks of new disk categories. For information about the instance types that are equipped with local disks, see Big data instance families and Instance families with local SSDs. | ||
Basic Network Capabilities | Assesses whether security groups were properly configured for all ECS instances in the account. | Non-default Security Groups | Checks whether non-default security groups exist and are used in the account. | 10 |
| No |
|
Basic API & Resource Management Capabilities | Assesses whether API operations are properly called and all Resource Access Management (RAM) users and resources in the account are properly managed. | Proper Tag-based Grouping of Resources | Checks whether resources in the account are properly grouped based on tags. | 20 |
| Yes | We recommend that you use tag policies to identify resources to which specific tags are not added at the earliest opportunity and add at least two custom tags to each ECS instance to which tags are not added. For more information, see Perform automatic tag detection, Create tags and add tags to resources, and Best practices for tag design. |
Proper API Call Success Rate | Checks whether a large number of API calls in the account failed. | 20 |
| Yes | Check whether the API responses meet your expectation based on your business requirements. For information about error codes, see Elastic Compute Service - Error Codes |
Cost Insight
In the Cost Insight dimension, ECS Insight assesses whether the billing methods for the ECS instances and associated resources in the current account are properly used and whether fine-grained cost management and analysis have been performed.
ECS Insight provides the following categories of capabilities in the Cost Insight dimension: Basic ECS Cost Analysis Capabilities, Refined ECS Cost Insight Capabilities, and Cost Analysis Capabilities. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Basic ECS Cost Analysis Capabilities | Assesses whether resource costs in the account are properly managed to prevent waste and business interruptions. | ECS Instance Refund Limit Reached | Checks whether the account has reached the monthly maximum number of refunds for subscription ECS instances in the current month. | 5 |
| Yes | To meet temporary business requirements, we recommend that you create Spot Instance ECS instances. For more information, see Pay-as-you-go. |
Use of Economical Mode | Checks whether ECS instances in the account are stopped in economical mode to prevent resource waste. | 10 |
| Yes |
| ||
ECS Instances Unavailable Due to Overdue Payments | Checks whether ECS instances in the account are unavailable due to overdue payments. | 10 |
| Yes |
| ||
ECS Instances Unavailable Due to Expiration | Checks whether subscription ECS instances in the account are unavailable due to expiration. | 10 |
| Yes |
| ||
Refined ECS Cost Insight Capabilities | Assesses whether ECS advanced capabilities, such as upgrade capabilities, were used in the account to optimize costs. | Auto-renewal of Subscription ECS Instances | Checks whether auto-renewal is enabled for subscription ECS instances in the account to prevent service interruptions. | 10 |
| Yes | Enable auto-renewal for subscription ECS instances that you want to retain for long-term use based on your business requirements. Auto-renewal helps reduce management costs and prevents the services on ECS instances from being interrupted if manual renewal for the instances fails. For more information, see Auto-renewal. |
Proper Spot Instance Utilization | Checks whether reserved instances in the account are underutilized. | 10 |
| Yes |
| ||
Proper Savings Plan Utilization | Checks whether savings plans in the account are underutilized. | 10 |
| Yes | To maximize cost savings, we recommend that you monitor the usage of savings plans and adjust savings plan configurations at the earliest opportunity based on your business requirements. Alibaba Cloud provides a utilization report and a coverage report for your savings plans. You can optimize the usage of savings plans based on the reports by following the suggestions in the View and optimize the usage of savings plans topic or the recommendations on the Savings Plan Purchase Scheme Calculation page in the Expenses and Costs console. | ||
Expiring Savings Plans | Checks whether savings plans that are about to expire within 30 days exist in the account. | 5 |
| Yes | Before a savings plan expires, you can manually renew or enable auto-renewal for the savings plan to extend the duration of the savings plan. You can renew a savings plan on the Overview tab of Savings Plan page in the Expenses and Costs console. You can also manually renew or enable auto-renewal for a savings plan on the Renewal page in the Expenses and Costs console. For more information, see Renewal guide for the international site (alibabacloud.com). | ||
Cost Analysis Capabilities | Assesses whether the cost analysis tools of Alibaba Cloud were used in the account to analyze and optimize costs. | Use of Cost Center and Cost Allocation Tags for Cost Management | Checks whether the Cost Center and cost allocation tags are used in the account to manage costs. | 10 |
| No | Use the Cost Center and cost allocation tags to allocate costs based on your business requirements. This provides references for enterprise cost allocation based on the resource usage and costs of split items. For more information, see Split Bill and Use tags to manage costs. |
Use of Budget Management | Checks whether the budget management feature is used in the account to manage costs. | 5 |
| No | Use the budget management feature to manage costs in a fine-grained manner based on your business requirements. For more information, see Budget management. | ||
Unused Resources | Checks whether unused resources exist in the account. | 10 |
| Yes | If you no longer require the EBS resources that remain unattached for three consecutive days or longer, we recommend that you release the resources to reduce costs. |
Automation
In the Automation dimension, ECS Insight assesses your use of automation capabilities to improve the O&M efficiency of ECS instances and associated resources, and provides optimization suggestions.
ECS Insight provides the following categories of capabilities in the Automation dimension: Basic Automation Capabilities, Advanced Automation Capabilities, and Premium Automation Capabilities. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Basic Automation Capabilities | Assesses whether automation tools were used in the account. | Use of ECS API Operations for O&M and Management | Checks whether ECS API operations are called to manage ECS instances in the account. | 10 |
| No | Call API operations to manage or maintain ECS instances based on your business requirements to improve O&M and management efficiency. For more information, see List of operations by function. |
Use of Latest SDKs to Create and Manage ECS Resources | Checks whether the latest SDKs are used to create or manage ECS instances in the account to improve O&M efficiency. | 10 |
| No | Use the latest SDKs to create or manage ECS resources based on your business requirements to improve the management efficiency of cloud resources. For more information, see Create and manage an ECS instance by using an SDK. | ||
Use of Latest CLIs to Create and Manage ECS Resources | Checks whether the latest CLIs are used to create or manage ECS instances in the account to improve O&M efficiency. | 10 |
| No | Use the latest CLIs to create or manage ECS resources based on your business requirements to improve the management efficiency of cloud resources. For more information, see CLI Reference. | ||
Advanced Automation Capabilities | Assesses whether advanced automation capabilities were used in the account. | Use of ROS to Provision ECS Resources | Checks whether ROS is used to provision resources in the account to improve O&M efficiency. | 10 |
| No |
|
Use of Cloud Assistant for O&M and Management | Checks whether Cloud Assistant is used to deploy, operate, and maintain ECS resources in the account to improve instance management efficiency. | 10 |
| No | Use Cloud Assistant to perform O&M on ECS instances based on your business requirements, including running commands and uploading files, to improve O&M efficiency. For more information, see Overview of Cloud Assistant and Use Cloud Assistant. | ||
Use of OOS for Automated O&M | Checks whether CloudOps Orchestration Service (OOS) is used to operate and maintain ECS instances in the account to improve instance management efficiency. | 10 |
| No | Use OOS to batch manage ECS instances based on your business requirements, including starting or stopping ECS instances based on a schedule and temporarily upgrading ECS instance bandwidth, to improve management efficiency. For more information, see Manage multiple instances at a time, Start or stop instances at a scheduled time, and Temporarily upgrade the bandwidth. | ||
Premium Automation Capabilities | Assesses whether premium automation capabilities were used in the account. | Use of OOS Scheduled O&M, Alert O&M, and Event-triggered O&M | Checks whether the Scheduled O&M, Alert O&M, and Event-triggered O&M features of OOS are used in the account to improve instance management efficiency. | 10 |
| No | Use the scheduled, event-triggered, or alert O&M tasks of OOS based on your business requirements to improve O&M automation and efficiency. For more information, see Perform scheduled O&M, Create an alert O&M task, and Create an event-trigger O&M task. |
Use of Patch Management for O&M and Management | Checks whether the patch management feature is used to scan for or install system patches on ECS instances in the account to improve instance security. | 10 |
| No | Use the patch management feature to automatically update system patches based on your business requirements to improve operating system security. For more information, see Patch management overview. | ||
Use of Software Package Management for O&M and Management | Checks whether the software package management feature is used to manage or upgrade applications in the account to improve instance management efficiency. | 10 |
| No | Use the software package management feature to manage the software used by different teams in a unified manner based on your business requirements. This improves cloud management efficiency. For more information, see Manage custom software on multiple ECS instances at a time. | ||
Use of OOS Templates to Create Images | Checks whether OOS templates are used to create images in the account to improve instance management efficiency. | 10 |
| No | Use OOS custom templates to create custom images based on your business requirements to improve image management efficiency. For more information, see Use OOS to create a custom image. |
Reliability
In the Reliability dimension, ECS Insight assesses whether ECS instances have potential risks related to single points of stability and business high availability, and provides optimization suggestions.
ECS Insight provides the following categories of capabilities in the Reliability dimension: Instance Stability, Performance Reliability, and Application Reliability. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Instance Stability | Assesses whether proactive O&M was performed on important ECS instances in the account to improve instance stability. | Fast Response to Scheduled O&M Events | Checks whether pending O&M events exist in the account. | 10 |
| Yes | View and handle ECS system events and configure event-driven automatic response rules based on your business requirements to reduce business risks. For more information, see Query and handle ECS system events and Subscribe to ECS system event notifications. |
No Instances Automatically Stopped | Checks whether ECS instances in the account are automatically stopped due to overdue payments or security issues. | 10 |
| Yes |
| ||
Snapshots Created Within Previous 7 Days | Checks whether snapshots were created in the account within the previous 7 days to improve instance data reliability. | 10 |
| No | We recommend that you configure automatic snapshot policies for disks attached to important ECS instances to back up disk data on a periodic basis. For more information, see Create an automatic snapshot policy. | ||
Performance Reliability | Assesses whether ECS instances that poorly performed in the account were detected and handled to improve performance reliability. | High CPU Utilization of ECS Instances | Checks whether ECS instances in the account have high CPU utilization. | 10 |
| Yes | Monitor the CPU utilization of ECS instances and configure alert rules based on your business requirements. If the CPU utilization is abnormal, you can handle the anomaly at the earliest opportunity to ensure that your business runs as expected. For more information, see Configure alert rules for an ECS instance. |
High Memory Usage of ECS Instances | Checks whether ECS instances in the account have high memory usage. | 10 |
| Yes | Monitor the memory usage of ECS instances and configure alert rules based on your business requirements. If the memory usage is abnormal, you can handle the anomaly at the earliest opportunity to ensure that your business runs as expected. For more information, see Configure alert rules for an ECS instance. | ||
High Storage Space Usage of Cloud Disks | Checks whether the storage space usage of cloud disks in the account is high. | 10 |
| Yes | Monitor the storage space usage of cloud disks on ECS instances, and resize the cloud disks whose storage space usage is high at the earliest opportunity based on your business requirements to reduce business risks. For more information, see Configure alert rules for an ECS instance. | ||
High Cloud Disk Utilization | Checks whether cloud disks in the account have high utilization. | 10 |
| Yes | Change the category or size of the cloud disks whose IOPS or BPS utilization exceeds 99% based on your business requirements. For more information, see Block storage performance and Change the category of a disk. | ||
Excessive Security Group Rules Associated with ECS Instances | Checks whether the number of security group rules associated with ECS instances in the account is higher than expected. | 10 |
| Yes | Delete unnecessary security group rules based on your business requirements. This prevents excessive security group rules from being associated with a single ECS instance. For more information, see Guidelines for using security groups and use cases. | ||
Normal ECS Instance Network Performance | Checks whether the network performance of ECS instances in the account degrades. | 10 |
| Yes | View and handle the Performance Degradation events based on your business requirements to prevent the events from affecting your business. For more information, see Query and handle ECS system events and Subscribe to ECS system event notifications. | ||
Application Reliability | Assesses whether deployment sets were used in the account to improve application reliability and availability. | Proper Use of Deployment Sets | Checks whether deployment sets are used in the account to improve cluster availability. | 10 |
| No | Create deployment sets and change the deployment sets of ECS instances based on your business requirements. Deploy ECS instances on different physical servers to improve the high availability and disaster recovery capabilities of your business. For more information, see Deployment set. |
Elasticity
In the Elasticity dimension, ECS Insight assesses the usage of elasticity capabilities for ECS resources, identifies elasticity risks, and provides optimization suggestions.
ECS Insight provides the following categories of capabilities in the Elasticity dimension: Basic Elasticity Capabilities, Advanced Elasticity Capabilities, and Premium Elasticity Capabilities. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Basic Elasticity Capabilities | Assesses whether basic elasticity capabilities of ECS were used in the account. | Use of Auto Scaling to Frequently Create or Release ECS Instances | Checks whether Auto Scaling is used to frequently create or release ECS instances in the account. | 10 |
| No | We recommend that you use Auto Scaling based on your business requirements. This helps you cope with business traffic fluctuations while improving resource utilization and effectively reducing costs. For more information, see Usage process of Auto Scaling. |
Use of Auto Provisioning for Large-scale Scaling Activities | Checks whether large-scale scaling activities are performed in the account but Auto Provisioning is not used. | 10 |
| No | |||
Advanced Elasticity Capabilities | Assesses whether infrequent scaling activities, high-load scaling groups, and scaling activities that have low success rates were detected in the account. | Active Scaling Groups Within the Previous Month | Checks whether scaling groups that do not have scaling activities for an extended period of time exist in the account. | 10 |
| No | A scaling activity may fail to complete for specific reasons. You must troubleshoot the issue at the earliest opportunity. For more information, see Troubleshoot scaling activity exceptions. |
Proper Success Rate of Scaling Activities | Checks whether the scaling activities of scaling groups in the account have an appropriate success rate. | 10 |
| No |
| ||
High-load Scaling Groups | Checks whether high-load scaling groups exist in the account. | 20 |
| No |
| ||
Premium Elasticity Capabilities | Assesses whether premium elasticity capabilities, such as cost optimization policies and instance type combination, were used in the account to improve elasticity. | Use of Spot Instances or Cost Optimization Policies in Recurring Tasks | Checks whether spot instances or cost optimization policies are used in recurring tasks in the account. | 20 |
| No | Use scaling groups to reduce costs based on your business requirements. For more information, see Save your money with Auto Scaling and Combine a cost optimization policy with the selection of multiple instance types. |
Use of Multi-instance Type and Multi-zone Scaling Groups | Checks whether risks that affect the scaling success rate of scaling groups in the account exist. | 20 |
| No |
|
Security
In the Security dimension, ECS Insight assesses whether ECS instances and associated resources in the current account have data security, network security, and access security risks, and provides corresponding optimization suggestions.
ECS Insight provides the following categories of capabilities in the Security dimension: Instance Data Security, Instance Access Security, and Instance Network Security. Each category contains multiple assessment items. The following table describes the scores, scoring rules, and best practices of each assessment item.
Category | Category description | Assessment item | Assessment item description | Full score (points) | Scoring rules | Whether to return affected resources | Best practices |
Instance Data Security | Assesses whether data protection capabilities, such as data encryption, were used in the account to improve instance data security. | Use of RAM Users and RAM Roles for Functional Operations | Checks whether RAM users and RAM roles, instead of the Alibaba Cloud account, are used to perform functional operations to improve account security. | 15 |
| Yes | We recommend that you use RAM users, RAM roles, and account groups to improve account security and ensure business security. For more information, see Create a RAM user and Grant permissions to a RAM user. |
Use of Disk Encryption | Checks whether disk encryption is used in the account to improve data security. | 10 |
| Yes | Encrypt system disks or data disks based on your business requirements to improve data security. For more information, see Encrypt a disk. | ||
Instance Access Security | Assesses whether security hardening capabilities were used in the account to improve instance access security. | Use of Key Pairs (Linux Only) | Checks whether key pairs are used to log on to Linux ECS instances in the account to improve instance access security. | 10 |
| No | Use key pairs to log on to Linux ECS instances based on your business requirements to improve instance access security. For more information, see Connect to a Linux instance by using an SSH key pair. |
Use of Non-root Users for Logons to Linux ECS Instances | Checks whether non-root users are used to log on to Linux ECS instances in the account to improve instance access security. | 10 |
| Yes |
| ||
Unrestricted Access to Specific Security Group Ports | Checks whether security groups in the account allow restricted access to specific ports. | 15 |
| Yes | Modify security group rules to control access to critical ports based on your business requirements, such as ports 20, 21, 1433, 1434, 3306, 3389, 4333, 5432, and 5500. For more information, see Security group rules and Guidelines for using security groups and use cases. | ||
EOL of Operating Systems on ECS Instances | Checks whether the operating systems used by ECS instances in the account approached or already reached end of life (EOL). | 10 |
| Yes | Technical support for operating systems of ECS instances may be discontinued due to reasons such as EOL, end of third-party support, and evolution of open source projects. In this case, we recommend that you migrate or upgrade the operating systems. For more information, see Migrate and upgrade the operating system of an ECS instance. | ||
Instance Network Security | Assesses whether the account was exposed to cybersecurity risks. | ECS Instances under DDoS Attacks | Checks whether ECS instances in the account are under DDoS attacks. | 15 |
| Yes | Configure proper scrubbing thresholds based on your business requirements to prevent unexpected scrubbing. For more information, see Configure traffic scrubbing thresholds. |