All Products
Search
Document Center

AnalyticDB:Metric overview

Last Updated:Mar 28, 2026

This page lists all monitoring metrics for AnalyticDB for MySQL, organized by category and edition.

Cluster health status

Health status metrics report the number of nodes in each state (Healthy, At-risk, or Unavailable) for each node type in your cluster.

Enterprise Edition and Basic Edition

MetricDescriptionReferences
Cluster access node statusReports the number of available and unavailable access nodes. The access layer handles protocol-layer access, SQL parsing and optimization, real-time write sharding, data scheduling, and query scheduling.View monitoring information (console) · DescribeDBClusterHealthStatus (API)
Elastic compute node health statusReports the number of available and unavailable elastic compute nodes. Elastic compute nodes are temporary resources scaled out for scheduled or on-demand scaling, within seconds or minutes.
Reserved resource node health statusReports the number of available, at-risk, and unavailable reserved resource nodes. Reserved resource nodes are pre-purchased resources using a storage-compute coupled architecture.

Data Lakehouse Edition and Data Warehouse Edition

MetricDescriptionReferences
Cluster access node statusReports the number of available and unavailable instance access nodes. The access layer handles protocol-layer access, SQL parsing and optimization, real-time write sharding, data scheduling, and query scheduling.View monitoring information (console) · API: Data Lakehouse Edition · Data Warehouse Edition
Compute node health statusReports the number of available and unavailable compute nodes. The compute engine uses distributed Massively Parallel Processing (MPP) and directed acyclic graph (DAG) architectures with elastic scheduling.
Data node health statusReports the number of available, at-risk, and unavailable data nodes. The storage engine is a distributed, high-availability (HA) engine based on the Raft protocol, with data sharding, Multi-Raft, tiered storage, and hybrid row-column storage.

Cluster performance monitoring

Node monitoring

Enterprise Edition and Basic Edition

MetricMetric keyMetric value nameUnitReferences
CPU utilizationAnalyticDB_CPUworker_max_cpu_used — Max CPU utilization of reserved resource nodes%View monitoring information (console) · DescribeDBClusterPerformance (API)
worker_p95_cpu_used — P95 CPU utilization of reserved resource nodes
worker_avg_cpu_used — Average CPU utilization of reserved resource nodes
executor_max_cpu_used — Max CPU utilization of elastic compute nodes
executor_p95_cpu_used — P95 CPU utilization of elastic compute nodes
executor_avg_cpu_used — Average CPU utilization of elastic compute nodes
BUILD jobsAnalyticDB_BuildTaskCountavg_build_task_count — Average number of BUILD jobs across all reserved resource nodescount
max_build_task_count — Maximum number of BUILD jobs across all reserved resource nodes

Compute memory usage

AnalyticDB_ComputeMemoryUsedRatio

The maximum compute memory usage of reserved resource nodes.

max_worker_compute_memory_used_ratio

%

The P95 compute memory usage of reserved resource nodes.

p95_worker_compute_memory_used_ratio

The average compute memory usage of reserved resource nodes.

avg_worker_compute_memory_used_ratio

The maximum compute memory usage of elastic compute nodes.

max_executor_compute_memory_used_ratio

The P95 compute memory usage of elastic compute nodes.

p95_executor_compute_memory_used_ratio

The average compute memory usage of elastic compute nodes.

avg_executor_compute_memory_used_ratio

Unavailable nodesAnalyticDB_UnavailableNodeCountworker_unavailable_node_count — Unavailable reserved resource nodesItem
executor_unavailable_node_count — Unavailable elastic compute nodes
Amount of read table dataAnalyticDB_Table_Read_Result_Sizetable_max_read_result_size — Max amount of read table dataMB
table_avg_read_result_size — Average amount of read table data
CPU utilization of access nodesAnalyticDB_RC_CPUrc_max_cpu_used — Max CPU utilization of access nodes%
rc_p95_cpu_used — P95 CPU utilization of access nodes
rc_controller_avg_cpu_used — Average CPU utilization of access nodes
Disk I/O throughputAnalyticDB_IOworker_max_read_bytes_ratio — Max disk read throughput of reserved resource nodesMB/s
worker_p95_read_bytes_ratio — P95 disk read throughput of reserved resource nodes
worker_avg_read_bytes_ratio — Average disk read throughput of reserved resource nodes
worker_max_write_bytes_ratio — Max disk write throughput of reserved resource nodes
worker_p95_write_bytes_ratio — P95 disk write throughput of reserved resource nodes
worker_avg_write_bytes_ratio — Average disk write throughput of reserved resource nodes
Disk IOPSAnalyticDB_IOPSworker_max_read_ratio — Max disk read operations on reserved resource nodesio/s
worker_p95_read_ratio — P95 disk read operations on reserved resource nodes
worker_avg_read_ratio — Average disk read operations on reserved resource nodes
worker_max_write_ratio — Max disk write operations on reserved resource nodes
worker_p95_write_ratio — P95 disk write operations on reserved resource nodes
worker_avg_write_ratio — Average disk write operations on reserved resource nodes
Disk I/O usageAnalyticDB_IO_UTILworker_max_io_util — Max disk I/O usage of reserved resource nodes%
worker_p95_io_util — P95 disk I/O usage of reserved resource nodes
worker_avg_io_util — Average disk I/O usage of reserved resource nodes
Disk I/O wait timeAnalyticDB_IO_WAITworker_max_io_await — Max disk I/O wait time of reserved resource nodesms
worker_p95_io_await — P95 disk I/O wait time of reserved resource nodes
worker_avg_io_await — Average disk I/O wait time of reserved resource nodes
Memory usage of access nodesAnalyticDB_RC_MemoryUsedRatiorc_max_memory_used_ratio — Max memory usage of access nodes%
rc_p95_memory_used_ratio — P95 memory usage of access nodes
rc_avg_memory_used_ratio — Average memory usage of access nodes
Disk I/O throughput of access nodesAnalyticDB_RC_IOrc_max_read_mebibytes — Max read throughput of access nodesMB/s
rc_p95_read_mebibytes — P95 read throughput of access nodes
rc_avg_read_mebibytes — Average read throughput of access nodes
rc_max_write_mebibytes — Max write throughput of access nodes
rc_p95_write_mebibytes — P95 write throughput of access nodes
rc_avg_write_mebibytes — Average write throughput of access nodes
Disk IOPS of access nodesAnalyticDB_RC_IOPSrc_max_read_iops — Max read operations on access nodesio/s
rc_p95_read_iops — P95 read operations on access nodes
rc_avg_read_iops — Average read operations on access nodes
rc_max_write_iops — Max write operations on access nodes
rc_p95_write_iops — P95 write operations on access nodes
rc_avg_write_iops — Average write operations on access nodes

Data Lakehouse Edition and Data Warehouse Edition

After switching a Data Warehouse Edition cluster from reserved mode (C32) to elastic mode, average CPU utilization increases. For details, see FAQ.
MetricMetric keyMetric value nameUnitReferences
CPU utilizationAnalyticDB_CPUexecutor_max_cpu_used — Max CPU utilization of compute nodes%View monitoring information (console) · API: Data Warehouse Edition · Data Lakehouse Edition
executor_p95_cpu_used — P95 CPU utilization of compute nodes
executor_avg_cpu_used — Average CPU utilization of compute nodes
worker_max_cpu_used — Max CPU utilization of data nodes
worker_p95_cpu_used — P95 CPU utilization of data nodes
worker_avg_cpu_used — Average CPU utilization of data nodes
BUILD jobsAnalyticDB_BuildTaskCountavg_build_task_count — Average number of BUILD jobs across all reserved resource nodesItem
max_build_task_count — Maximum number of BUILD jobs across all reserved resource nodes

Compute memory usage

AnalyticDB_ComputeMemoryUsedRatio

The maximum compute memory usage.

max_executor_compute_memory_used_ratio

%

The P95 compute memory usage.

p95_executor_compute_memory_used_ratio

The average compute memory usage.

avg_executor_compute_memory_used_ratio

Unavailable nodesAnalyticDB_UnavailableNodeCountworker_unavailable_node_count — Unavailable data nodescount
executor_unavailable_node_count — Unavailable compute nodes
Amount of read table dataAnalyticDB_Table_Read_Result_Sizetable_max_read_result_size — Max amount of read table dataMB
table_avg_read_result_size — Average amount of read table data
CPU utilization of access nodesAnalyticDB_RC_CPUrc_max_cpu_used — Max CPU utilization of access nodes%
rc_p95_cpu_used — P95 CPU utilization of access nodes
rc_controller_avg_cpu_used — Average CPU utilization of access nodes
Disk I/O throughputAnalyticDB_IOworker_max_read_bytes_ratio — Max disk read throughput of data nodesMB/s
worker_p95_read_bytes_ratio — P95 disk read throughput of data nodes
worker_avg_read_bytes_ratio — Average disk read throughput of data nodes
worker_max_write_bytes_ratio — Max disk write throughput of data nodes
worker_p95_write_bytes_ratio — P95 disk write throughput of data nodes
worker_avg_write_bytes_ratio — Average disk write throughput of data nodes
Disk IOPSAnalyticDB_IOPSworker_max_read_ratio — Max disk read operations on data nodesio/s
worker_p95_read_ratio — P95 disk read operations on data nodes
worker_avg_read_ratio — Average disk read operations on data nodes
worker_max_write_ratio — Max disk write operations on data nodes
worker_p95_write_ratio — P95 disk write operations on data nodes
worker_avg_write_ratio — Average disk write operations on data nodes
Disk I/O usageAnalyticDB_IO_UTILworker_max_io_util — Max disk I/O usage of data nodes%
worker_p95_io_util — P95 disk I/O usage of data nodes
worker_avg_io_util — Average disk I/O usage of data nodes
Disk I/O wait timeAnalyticDB_IO_WAITworker_max_io_await — Max disk I/O wait time of data nodesms
worker_p95_io_await — P95 disk I/O wait time of data nodes
worker_avg_io_await — Average disk I/O wait time of data nodes
Memory usage of access nodesAnalyticDB_RC_MemoryUsedRatiorc_max_memory_used_ratio — Max memory usage of access nodes%
rc_p95_memory_used_ratio — P95 memory usage of access nodes
rc_avg_memory_used_ratio — Average memory usage of access nodes
Disk I/O throughput of access nodesAnalyticDB_RC_IOrc_max_read_mebibytes — Max read throughput of access nodesMB/s
rc_p95_read_mebibytes — P95 read throughput of access nodes
rc_avg_read_mebibytes — Average read throughput of access nodes
rc_max_write_mebibytes — Max write throughput of access nodes
rc_p95_write_mebibytes — P95 write throughput of access nodes
rc_avg_write_mebibytes — Average write throughput of access nodes
Disk IOPS of access nodesAnalyticDB_RC_IOPSrc_max_read_iops — Max read operations on access nodesio/s
rc_p95_read_iops — P95 read operations on access nodes
rc_avg_read_iops — Average read operations on access nodes
rc_max_write_iops — Max write operations on access nodes
rc_p95_write_iops — P95 write operations on access nodes
rc_avg_write_iops — Average write operations on access nodes

Data size monitoring

Enterprise Edition and Basic Edition

MetricMetric keyMetric value nameUnitReferences
Disk usageAnalyticDB_DiskUsedRatiodisk_used_ratio — Average disk usage%View monitoring information (console) · DescribeDBClusterPerformance (API)
worker_max_node_disk_used_ratio — Max disk usage
Disk space usedAnalyticDB_DiskUsedSizecold_disk_used — Size of cold dataByte
hot_disk_used — Size of hot data
user_used_disk_max — Max hot data size per node
user_used_disk_avg — Average hot data size per node

Data Lakehouse Edition and Data Warehouse Edition

MetricMetric keyMetric value nameUnitReferences
Disk usageAnalyticDB_DiskUsedRatiodisk_used_ratio — Average disk usage%View monitoring information (console) · API: Data Lakehouse Edition · Data Warehouse Edition
worker_max_node_disk_used_ratio — Max disk usage
Disk space usedAnalyticDB_DiskUsedSizecold_disk_used — Size of cold dataByte
hot_disk_used — Size of hot data
user_used_disk_max — Max hot data size per node
user_used_disk_avg — Average hot data size per node

Workload monitoring

Enterprise Edition and Basic Edition

MetricMetric keyMetric value nameUnitReferences
Cluster connectionsAnalyticDB_Connectionsconnections — Successful connectionscountView monitoring information (console) · DescribeDBClusterPerformance (API)
Query failure rate¹AnalyticDB_QueryFailedRatioquery_failed_ratio — Query failure rate%
Query QPSAnalyticDB_QPSqps — Queries per secondop/s
etl_qps — Extract, transform, and load (ETL) QPS
Query response timeAnalyticDB_QueryRTquery_avg_rt — Average query response timems
query_max_rt — Max query response time
Query wait timeAnalyticDB_QueryWaitTimequery_avg_wait_time — Average query wait timems
query_max_wait_time — Max query wait time
Write TPSAnalyticDB_InsertTPSinsert_tps — Write transactions per secondop/s
Write response timeAnalyticDB_InsertRTinsert_avg_rt — Average write response timems
insert_max_rt — Max write response time
Write throughputAnalyticDB_InsertBytesinsert_in_bytes — Average write throughputMB
Update TPSAnalyticDB_UpdateTPSupdate_tps — Update TPSop/s
Update response timeAnalyticDB_UpdateRTupdateinto_avg_rt — Average update response timems
updateinto_max_rt — Max update response time
Delete TPSAnalyticDB_DeleteTPSdelete_tps — Delete TPSop/s
Delete response timeAnalyticDB_DeleteRTdelete_avg_rt — Average delete response timems
delete_max_rt — Max delete response time
Import TPSAnalyticDB_LoadTPSload_tps — Load TPSop/s

Data Lakehouse Edition and Data Warehouse Edition

MetricMetric keyMetric value nameUnitReferences
Cluster connectionsAnalyticDB_Connectionsconnections — Successful connectionscountView monitoring information (console) · API: Data Lakehouse Edition · Data Warehouse Edition
Query failure rate¹AnalyticDB_QueryFailedRatioquery_failed_ratio — Query failure rate%
Query QPSAnalyticDB_QPSqps — Queries per secondop/s
etl_qps — ETL QPS
Query response timeAnalyticDB_QueryRTquery_avg_rt — Average query response timems
query_max_rt — Max query response time
Query wait timeAnalyticDB_QueryWaitTimequery_avg_wait_time — Average query wait timems
query_max_wait_time — Max query wait time
Write TPSAnalyticDB_InsertTPSinsert_tps — Write TPSop/s
Write response timeAnalyticDB_InsertRTinsert_avg_rt — Average write response timems
insert_max_rt — Max write response time
Write throughputAnalyticDB_InsertBytesinsert_in_bytes — Average write throughputMB
Update TPSAnalyticDB_UpdateTPSupdate_tps — Update TPSop/s
Update response timeAnalyticDB_UpdateRTupdateinto_avg_rt — Average update response timems
updateinto_max_rt — Max update response time
Delete TPSAnalyticDB_DeleteTPSdelete_tps — Delete TPSop/s
Delete response timeAnalyticDB_DeleteRTdelete_avg_rt — Average delete response timems
delete_max_rt — Max delete response time
Import TPSAnalyticDB_LoadTPSload_tps — Load TPSop/s
¹ Query failure rate is calculated as follows:
Time range within 24 hours: Query failure rate = (Failed SQL queries in 1 minute / Total SQL queries in 1 minute) × 100%
Time range exceeding 24 hours: Query failure rate = (Failed SQL queries in 5 minutes / Total SQL queries in 5 minutes) × 100%

Resource group monitoring

Enterprise Edition, Basic Edition, and Data Lakehouse Edition

MetricMetric keyMetric value nameUnitReferences
CPU utilizationAnalyticDB_RP_CPUAnalyticDB_RP_CPU — Average CPU utilization of the resource group%View monitoring information (console) · DescribeDBClusterPerformance (API)
Query QPSAnalyticDB_RP_QPSAnalyticDB_RP_QPS — Query QPS of the resource groupop/s
Query response timeAnalyticDB_RP_RTAnalyticDB_RP_RT — Average response time of queries in the resource groupms
Query wait timeAnalyticDB_RP_WaitTimeAnalyticDB_RP_WaitTime — Total average wait time of queries in the resource groupms
(Xihe) Running SQL queriesAnalyticDB_RP_RunningQueries_CountAnalyticDB_RP_RunningQueries_Count — Running SQL queries in the resource groupUnit
Queued SQL queriesAnalyticDB_RP_QueuedQueries_CountAnalyticDB_RP_QueuedQueries_Count — Queued SQL queries in the resource groupcount
Computing resource usage²NoneTotalAcuNumber — Total computing resourcesACUView the computing and storage resource usage of a cluster (console) · DescribeClusterResourceUsage (API)
ReservedAcuNumber — Reserved computing resources
Storage resource usage²NoneTotalAcuNumber — Total storage resourcesACUView the computing and storage resource usage of a cluster (console) · DescribeStorageResourceUsage (API)
ReservedAcuNumber — Reserved storage resources
Resource usageNoneTotalAcuNumber — Total computing resourcesACUView the computing and storage resource usage of a cluster (console) · DescribeStorageResourceUsage (API)
ReservedAcuNumber — Reserved resources
Interactive resource groupNoneReservedAcuNumber — Min computing resourcesACUView the computing resource usage of a resource group (console)
MaxAcuNumber — Max computing resources
CurrentAcuNumber — Current computing resource usage
Job resource groupNoneReservedAcuNumber — Min computing resourcesACUView the computing resource usage of a resource group (console) · DescribeJobResourceUsage (API)
MaxAcuNumber — Max computing resources
CurrentAcuNumber — Current computing resource usage
SpotAcuNumber — Spot instance resource usage
Total ACU-hours used by a jobNoneTotalAcuNumber — Average ACU-hours used by a jobACUView the computing resource usage of a job (console)
Reserved ACU-hoursNoneReservedAcuNumber — Reserved ACU-hours out of total job ACU-hoursACU
Elastic ACU-hoursNoneElasticAcuNumber — Elastic ACU-hours out of total job ACU-hoursACU
² Computing resource usage and storage resource usage metrics are supported only by Data Lakehouse Edition.

Data Warehouse Edition

MetricMetric keyMetric value nameUnitReferences
CPU utilizationAnalyticDB_RP_CPUAnalyticDB_RP_CPU — Average CPU utilization of the resource group%View monitoring information (console) · DescribeDBClusterPerformance (API)
Query QPSAnalyticDB_RP_QPSAnalyticDB_RP_QPS — Query QPS of the resource groupop/s
Query response timeAnalyticDB_RP_RTAnalyticDB_RP_RT — Average response time of queries in the resource groupms
Query wait timeAnalyticDB_RP_WaitTimeAnalyticDB_RP_WaitTime — Total average wait time of queries in the resource groupms
Actual Pop-upsAnalyticDB_RP_ActualNodeAnalyticDB_RP_ActualNode — Nodes actually added when a scale-out plan executesItem
Number of Planned PoPsAnalyticDB_RP_PlanNodeAnalyticDB_RP_PlanNode — Nodes planned to be added based on a scheduled scaling plancount
Total nodesAnalyticDB_RP_TotalNodeAnalyticDB_RP_TotalNode — Total nodes in the resource group (basic nodes + actual scaled-out nodes from scheduled scaling)Unit
Basic nodesAnalyticDB_RP_OriginalNodeAnalyticDB_RP_OriginalNode — Basic nodes in the resource groupItem

Spark monitoring

Spark monitoring metrics are not available in the AnalyticDB for MySQL console. To view them, go to the CloudMonitor console.

MetricDescriptionMetricNameUnitReferences
Spark CPU utilization (%)CPU utilization of SparkSparkCpuUtilizationEci, SparkCpuUtilizationShenlong%View Spark monitoring information (console) · DescribeMetricList (API)
Spark memory utilization (%)Memory usage of SparkSparkMemoryUtilizationEci, SparkMemoryUtilizationShenlong%
Peak on-heap execution memory usage (B)Max JVM heap memory used while a Spark job runsSparkExecutorOnHeapExecutionMemoryBytesByte
Peak off-heap execution memory usage (B)Max memory used outside the JVM heap while a Spark job runsSparkExecutorOffHeapExecutionMemoryBytesByte
Peak on-heap storage memory usage (B)Max JVM heap memory used to store Spark data such as cached Resilient Distributed Datasets (RDDs)SparkExecutorOnHeapStorageMemoryBytesByte
Peak off-heap storage memory usage (B)Max JVM off-heap memory used to store Spark data such as cached RDDsSparkExecutorOffHeapStorageMemoryBytesByte
RDD storage disk usage (B)Disk space used by RDDs in SparkSparkExecutorDiskUsedBytesByte
Major GC count (count)Number of major garbage collections (GCs) performed by the JVM while a Spark job runsSparkExecutorMajorGCCountcount
Minor GC count (count)Number of minor GCs performed by the JVM while a Spark job runsSparkExecutorMinorGCCountUnit
Spark GC time (s)Total time consumed by Spark garbage collectionSparkExecutorTotalGCTimeSecondss
Spark shuffle read data size (B)Size of data read during a Spark shuffleSparkExecutorTotalShuffleReadBytesByte
Spark shuffle write data size (B)Size of data written during a Spark shuffleSparkExecutorTotalShuffleWriteBytesByte

References

Optimize cluster performance based on monitoring information — describes performance-related metrics, explains how to diagnose abnormal metric values, and provides troubleshooting and optimization guidance.