CloudMonitor provides an overview of alarms, key events, and resource usage.

This lets you check the resource usage and alarms of each cloud service in real time.

Alarm overview

CloudMonitor provides alert statistics, including the total number of alerts in the last seven days, the number of currently triggered alert rules, the number of alert rules with insufficient data, and the usage of alert SMS in the current month.

To view more information about alert rules, click the number of currently triggered alert rules or the number of alert rules with insufficient data.

Event overview

CloudMonitor summarizes all the exceptions and O&M events in the last 24 hours. The following table lists the key events that are supported.

| Service | Event |
| --- | --- |
| Host | CloudMonitor agent stopped |
| ApsaraDB for RDS | Primary/Secondary switchover |
| ApsaraDB for RDS | Instance failure |
| ApsaraDB for MongoDB | Instance failure |
| ApsaraDB for Redis | Primary/Secondary switchover |
| ApsaraDB for Redis | Instance failure |

Resource usage overview

CloudMonitor displays the overall resource usage of each service under your account. For OSS, CDN, and Log Service, CloudMonitor displays the cumulative resource usage in the current month. For other services, CloudMonitor displays the resource usage in real time by using the 95th percentile.

Statistical method: the 95th percentile

A percentile is a statistical measure: the value below which a given percentage of observations falls when the observations are sorted in ascending order.

The 95th percentile is the value below which 95% of the observations fall. For example, if the 95th percentile for the CPU usage of ECS instances is 34%, then 95% of the ECS instances have a CPU usage lower than 34%.

CloudMonitor uses the 95th percentile to measure the resource usage of most cloud services.
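
The calculation can be illustrated with a minimal sketch. It assumes the nearest-rank method, which returns the smallest observation that at least 95% of the observations do not exceed; the exact method CloudMonitor uses is not stated here. The function name `percentile_nearest_rank` and the sample data are hypothetical.

```python
import math


def percentile_nearest_rank(values, pct):
    """Return the pct-th percentile of values using the nearest-rank method.

    Assumption: nearest-rank is used for illustration only; other
    percentile definitions (for example, linear interpolation) can give
    slightly different results.
    """
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(values)                      # ascending order
    rank = math.ceil(pct * len(ordered) / 100)    # 1-based rank
    return ordered[rank - 1]


# Hypothetical CPU usage (%) sampled from 20 ECS instances.
cpu_usage = [12, 15, 9, 22, 30, 18, 25, 11, 34, 27,
             14, 19, 21, 16, 13, 29, 8, 24, 17, 91]

p95 = percentile_nearest_rank(cpu_usage, 95)
print(f"95th percentile CPU usage: {p95}%")
```

In this sample, a single instance at 91% does not affect the result, which is why a high percentile is often preferred over the maximum when summarizing usage across many instances.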

Resource metric description

| Service | Metric | Statistical method | Statistical period | Statistical range |
| --- | --- | --- | --- | --- |
| Host | CPU usage | 95th percentile | Real-time | All instances |
| Host | Memory usage | 95th percentile | Real-time | All instances |
| Host | Disk usage | 95th percentile | Real-time | All instances |
| Host | Outbound bandwidth to the public network | 95th percentile | Real-time | All instances |
| ApsaraDB for RDS | CPU usage | 95th percentile | Real-time | All instances |
| ApsaraDB for RDS | Input/Output operations per second (IOPS) usage | 95th percentile | Real-time | All instances |
| ApsaraDB for RDS | Connection usage | 95th percentile | Real-time | All instances |
| ApsaraDB for RDS | Disk usage | 95th percentile | Real-time | All instances |
| OSS | Total outbound traffic to the public network in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All buckets |
| OSS | Total number of PUT requests in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All buckets |
| OSS | Total number of GET requests in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All buckets |
| OSS | Storage size | Sum | The sum of the storage currently occupied by all OSS buckets | All buckets |
| CDN | Total traffic in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All domains |
| CDN | Peak network bandwidth | 95th percentile | Real-time | All domains |
| CDN | Access QPS | 95th percentile | Real-time | All domains |
| ApsaraDB for MongoDB | CPU usage | 95th percentile | Real-time | All instances |
| ApsaraDB for MongoDB | Memory usage | 95th percentile | Real-time | All instances |
| ApsaraDB for MongoDB | IOPS usage | 95th percentile | Real-time | All instances |
| ApsaraDB for MongoDB | Connection usage | 95th percentile | Real-time | All instances |
| ApsaraDB for MongoDB | Disk usage | 95th percentile | Real-time | All instances |
| ApsaraDB for Memcache | Cache hit ratio | 95th percentile | Real-time | All instances |
| ApsaraDB for Memcache | Cache usage | 95th percentile | Real-time | All instances |
| ApsaraDB for Redis | Memory usage | 95th percentile | Real-time | All instances |
| ApsaraDB for Redis | IOPS usage | 95th percentile | Real-time | All instances |
| ApsaraDB for Redis | Connection usage | 95th percentile | Real-time | All instances |
| EIP | Inbound bandwidth | 95th percentile | Real-time | All instances |
| EIP | Outbound bandwidth | 95th percentile | Real-time | All instances |
| Container Service | CPU usage | 95th percentile | Real-time | All instances |
| Container Service | Memory usage | 95th percentile | Real-time | All instances |
| Container Service | Outbound traffic to the public network | 95th percentile | Real-time | All instances |
| Log Service | Total inbound traffic in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All projects |
| Log Service | Total outbound traffic in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All projects |
| Log Service | Total number of requests in the current month | Sum | The cumulative value from 00:00 on the first day of the month to the current time | All projects |
| HybridDB | CPU usage | 95th percentile | Real-time | All instances |
| HybridDB | Memory usage | 95th percentile | Real-time | All instances |
| HybridDB | IOPS usage | 95th percentile | Real-time | All instances |
| HybridDB | Connection usage | 95th percentile | Real-time | All instances |
| HybridDB | Disk usage | 95th percentile | Real-time | All instances |
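
For metrics whose statistical method is Sum, the statistical period runs from 00:00 on the first day of the month to the current time. The following sketch shows one way to compute such a cumulative value from timestamped samples. The function names, the sample data, and the use of naive local timestamps are assumptions for illustration; the time zone CloudMonitor applies is not stated here.

```python
from datetime import datetime


def month_start(now):
    """Return 00:00 on the first day of the month that contains `now`."""
    return now.replace(day=1, hour=0, minute=0, second=0, microsecond=0)


def cumulative_this_month(samples, now):
    """Sum the values of all samples from the start of the month up to `now`.

    `samples` is an iterable of (timestamp, value) pairs.
    """
    start = month_start(now)
    return sum(value for ts, value in samples if start <= ts <= now)


# Hypothetical outbound traffic samples (timestamp, bytes).
samples = [
    (datetime(2024, 4, 30, 23, 59), 500),   # previous month: excluded
    (datetime(2024, 5, 1, 0, 0), 100),      # first instant of the month: included
    (datetime(2024, 5, 10, 12, 30), 250),   # included
    (datetime(2024, 5, 20, 8, 0), 400),     # later than `now`: excluded
]
now = datetime(2024, 5, 15, 9, 0)

total = cumulative_this_month(samples, now)
print(f"Cumulative traffic this month: {total} bytes")
```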