All Products
Search
Document Center

AnalyticDB:Metric overview

Last Updated:May 12, 2025

This topic provides an overview of all monitoring metrics in AnalyticDB for MySQL.

Cluster health status

Enterprise Edition and Basic Edition

Metric

Description

References

Cluster Access Node Status

The access layer of AnalyticDB for MySQL consists of multiple cluster access nodes and provides features such as protocol layer access, SQL parsing and optimization, real-time sharding of written data, data scheduling, and query scheduling.

Valid values:

  • Healthy: the number of available cluster access nodes.

  • Unavailable: the number of unavailable cluster access nodes.

Health Status of Elastic Compute Nodes

Elastic compute nodes refer to the computing resources temporarily scaled during scheduled scaling or on-demand scaling. The resources can be scaled within minutes or seconds at high efficiency.

Valid values:

  • Healthy: the number of available elastic compute nodes.

  • Unavailable: the number of unavailable elastic compute nodes.

Health Status of Reserved Resource Node Groups

Reserved resource nodes refer to the pre-purchased resources. You can change the single-node specifications and the number of nodes for the reserved resources by using configuration changes or scheduled scaling. Enterprise Edition and Basic Edition use the storage-compute coupled architecture to run both the compute and storage engines for the reserved resources nodes.

Valid values:

  • Healthy: the number of available reserved resource nodes.

  • Risky: the number of at-risk reserved resource nodes.

  • Unavailable: the number of unavailable reserved resource nodes.

Data Lakehouse Edition and Data Warehouse Edition

Metric

Description

References

Cluster Access Node Status

The access layer of AnalyticDB for MySQL consists of multiple cluster access nodes and provides features such as protocol layer access, SQL parsing and optimization, real-time sharding of written data, data scheduling, and query scheduling.

Valid values:

  • Healthy: the number of available cluster access nodes.

  • Unavailable: the number of unavailable cluster access nodes.

Health Status of Compute Nodes

The compute engine of AnalyticDB for MySQL consists of compute nodes and supports the integrated execution of distributed massively parallel processing (MPP) and directed acyclic graph (DAG) architectures. The compute engine can work with intelligent optimizers to support high concurrency and hybrid loads of complex SQL statements. In addition, the cloud native infrastructure allows compute nodes to implement elastic scaling within minutes or seconds based on business requirements. This ensures efficient resource utilization.

Valid values:

  • Healthy: the number of available compute nodes.

  • Unavailable: the number of unavailable compute nodes.

Health Status of Storage Node Groups

The storage engine of AnalyticDB for MySQL consists of storage node groups and supports highly consistent and highly available real-time data writes based on the Raft consensus protocol. The storage engine uses data sharding and Multi-Raft to support parallel processing, tiered storage to separate hot and cold data at lower costs, and hybrid row-column storage and intelligent indexing to provide ultra-high performance.

Valid values:

  • Healthy: the number of available storage nodes.

  • Risky: the number of at-risk storage nodes.

  • Unavailable: the number of unavailable storage nodes.

Cluster performance monitoring

Node monitoring

Enterprise Edition and Basic Edition

Metric

Metric key

Description

Metric value

Unit

References

CPU Utilization

AnalyticDB_CPU

The maximum CPU utilization of reserved resource nodes.

worker_max_cpu_used

%

The P95 CPU utilization of reserved resource nodes.

worker_p95_cpu_used

The P95 CPU utilization of reserved resource nodes.

worker_avg_cpu_used

The maximum CPU utilization of elastic compute nodes.

executor_max_cpu_used

The P95 CPU utilization of elastic compute nodes.

executor_p95_cpu_used

The average CPU utilization of elastic compute nodes.

executor_avg_cpu_used

BUILD Jobs

AnalyticDB_BuildTaskCount

The average number of BUILD jobs.

Note

This metric indicates the average number of BUILD jobs that run across all reserved resource nodes.

avg_build_task_count

N/A

The maximum number of BUILD jobs.

Note

This metric indicates the maximum number of BUILD jobs that run across all reserved resource nodes.

max_build_task_count

Compute Memory Usage

AnalyticDB_ComputeMemoryUsedRatio

The maximum compute memory usage of reserved resource nodes.

max_worker_compute_memory_used_ratio

%

The P95 compute memory usage of reserved resource nodes.

p95_worker_compute_memory_used_ratio

The average compute memory usage of reserved resource nodes.

avg_worker_compute_memory_used_ratio

The maximum compute memory usage of elastic compute nodes.

max_executor_compute_memory_used_ratio

The P95 compute memory usage of elastic compute nodes.

p95_executor_compute_memory_used_ratio

The average compute memory usage of elastic compute nodes.

avg_executor_compute_memory_used_ratio

Unavailable Nodes

AnalyticDB_UnavailableNodeCount

The number of unavailable reserved resource nodes.

worker_unavailable_node_count

N/A

The number of unavailable elastic compute nodes.

executor_unavailable_node_count

Amount of Read Table Data

AnalyticDB_Table_Read_Result_Size

The maximum amount of read table data.

table_max_read_result_size

MB

The average amount of read table data.

table_avg_read_result_size

CPU Utilization of Access Nodes

AnalyticDB_RC_CPU

The maximum CPU utilization of access nodes.

rc_max_cpu_used

%

The P95 CPU utilization of access nodes.

rc_p95_cpu_used

The average CPU utilization of access nodes.

rc_controller_avg_cpu_used

Disk I/O Throughput

AnalyticDB_IO

The maximum disk read throughput of reserved resource nodes.

worker_max_read_bytes_ratio

MB/s

The P95 disk read throughput of reserved resource nodes.

worker_p95_read_bytes_ratio

The average disk read throughput of reserved resource nodes.

worker_avg_read_bytes_ratio

The maximum disk write throughput of reserved resource nodes.

worker_max_write_bytes_ratio

The P95 disk write throughput of reserved resource nodes.

worker_p95_write_bytes_ratio

The average disk write throughput of reserved resource nodes.

worker_avg_write_bytes_ratio

Disk IOPS

AnalyticDB_IOPS

The maximum number of disk read operations on reserved resource nodes.

worker_max_read_ratio

N/A

The P95 number of disk read operations on reserved resource nodes.

worker_p95_read_ratio

The average number of disk read operations on reserved resource nodes.

worker_avg_read_ratio

The maximum number of disk write operations on reserved resource nodes.

worker_max_write_ratio

The P95 number of disk write operations on reserved resource nodes.

worker_p95_write_ratio

The average number of disk write operations on reserved resource nodes.

worker_avg_write_ratio

Disk I/O Usage

AnalyticDB_IO_UTIL

The maximum disk I/O usage of reserved resource nodes.

worker_max_io_util

%

The P95 disk I/O usage of reserved resource nodes.

worker_p95_io_util

The average disk I/O usage of reserved resource nodes.

worker_avg_io_util

Disk I/O Wait Time

AnalyticDB_IO_WAIT

The maximum disk I/O wait time of reserved resource nodes.

worker_max_io_await

ms

The P95 disk I/O wait time of reserved resource nodes.

worker_p95_io_await

The average disk I/O wait time of reserved resource nodes.

worker_avg_io_await

Memory Usage of Access Nodes

AnalyticDB_RC_MemoryUsedRatio

The maximum memory usage of access nodes.

rc_max_memory_used_ratio

%

The P95 memory usage of access nodes.

rc_p95_memory_used_ratio

The average memory usage of access nodes.

rc_avg_memory_used_ratio

Disk I/O Throughput of Access Nodes

AnalyticDB_RC_IO

The maximum read throughput of access nodes.

rc_max_read_mebibytes

MB/s

The P95 read throughput of access nodes.

rc_p95_read_mebibytes

The average read throughput of access nodes.

rc_avg_read_mebibytes

The maximum write throughput of access nodes.

rc_max_write_mebibytes

The P95 write throughput of access nodes.

rc_p95_write_mebibytes

The average write throughput of access nodes.

rc_avg_write_mebibytes

Disk IOPS of Access Nodes

AnalyticDB_RC_IOPS

The maximum number of read operations on access nodes.

rc_max_read_iops

N/A

The P95 number of read operations on access nodes.

rc_p95_read_iops

The average number of read operations on access nodes.

rc_avg_read_iops

The maximum number of write operations on access nodes.

rc_max_write_iops

The P95 number of write operations on access nodes.

rc_p95_write_iops

The average number of write operations on access nodes.

rc_avg_write_iops

Data Lakehouse Edition and Data Warehouse Edition

Metric

Metric key

Description

Metric value

Unit

References

CPU Utilization

Note

After you change a C32 Data Warehouse Edition cluster in reserved mode to elastic mode, the average CPU utilization increases. For more information, see the "FAQ" section of the View monitoring information of AnalyticDB for MySQL topic.

AnalyticDB_CPU

The maximum CPU utilization of compute nodes.

executor_max_cpu_used

%

The P95 CPU utilization of compute nodes.

executor_p95_cpu_used

The average CPU utilization of compute nodes.

executor_avg_cpu_used

The maximum CPU utilization of storage nodes.

worker_max_cpu_used

The P95 CPU utilization of storage nodes.

worker_p95_cpu_used

The average CPU utilization of storage nodes.

worker_avg_cpu_used

BUILD Jobs

AnalyticDB_BuildTaskCount

The average number of BUILD jobs.

Note

This metric indicates the average number of BUILD jobs that run across all reserved resource nodes.

avg_build_task_count

N/A

The maximum number of BUILD jobs.

Note

This metric indicates the maximum number of BUILD jobs that run across all reserved resource nodes.

max_build_task_count

Compute Memory Usage

AnalyticDB_ComputeMemoryUsedRatio

The maximum compute memory usage.

max_executor_compute_memory_used_ratio

%

The P95 compute memory usage.

p95_executor_compute_memory_used_ratio

The average compute memory usage.

avg_executor_compute_memory_used_ratio

Unavailable Nodes

AnalyticDB_UnavailableNodeCount

The number of unavailable storage nodes.

worker_unavailable_node_count

N/A

The number of unavailable compute nodes.

executor_unavailable_node_count

Amount of Read Table Data

AnalyticDB_Table_Read_Result_Size

The maximum amount of read table data.

table_max_read_result_size

MB

The average amount of read table data.

table_avg_read_result_size

CPU Utilization of Access Nodes

AnalyticDB_RC_CPU

The maximum CPU utilization of access nodes.

rc_max_cpu_used

%

The P95 CPU utilization of access nodes.

rc_p95_cpu_used

The average CPU utilization of access nodes.

rc_controller_avg_cpu_used

Disk I/O Throughput

AnalyticDB_IO

The maximum disk read throughput of storage nodes.

worker_max_read_bytes_ratio

MB/s

The P95 disk read throughput of storage nodes.

worker_p95_read_bytes_ratio

The average disk read throughput of storage nodes.

worker_avg_read_bytes_ratio

The maximum disk write throughput of storage nodes.

worker_max_write_bytes_ratio

The P95 disk write throughput of storage nodes.

worker_p95_write_bytes_ratio

The average disk write throughput of storage nodes.

worker_avg_write_bytes_ratio

Disk IOPS

AnalyticDB_IOPS

The maximum number of disk read operations on storage nodes.

worker_max_read_ratio

N/A

The P95 number of disk read operations on storage nodes.

worker_p95_read_ratio

The average number of disk read operations on storage nodes.

worker_avg_read_ratio

The maximum number of disk write operations on storage nodes.

worker_max_write_ratio

The P95 number of disk write operations on storage nodes.

worker_p95_write_ratio

The average number of disk write operations on storage nodes.

worker_avg_write_ratio

Disk I/O Usage

AnalyticDB_IO_UTIL

The maximum disk I/O usage of storage nodes.

worker_max_io_util

%

The P95 disk I/O usage of storage nodes.

worker_p95_io_util

The average disk I/O usage of storage nodes.

worker_avg_io_util

Disk I/O Wait Time

AnalyticDB_IO_WAIT

The maximum disk I/O wait time of storage nodes.

worker_max_io_await

ms

The P95 disk I/O wait time of storage nodes.

worker_p95_io_await

The average disk I/O wait time of storage nodes.

worker_avg_io_await

Memory Usage of Access Nodes

AnalyticDB_RC_MemoryUsedRatio

The maximum memory usage of access nodes.

rc_max_memory_used_ratio

%

The P95 memory usage of access nodes.

rc_p95_memory_used_ratio

The average memory usage of access nodes.

rc_avg_memory_used_ratio

Disk I/O Throughput of Access Nodes

AnalyticDB_RC_IO

The maximum read throughput of access nodes.

rc_max_read_mebibytes

MB/s

The P95 read throughput of access nodes.

rc_p95_read_mebibytes

The average read throughput of access nodes.

rc_avg_read_mebibytes

The maximum write throughput of access nodes.

rc_max_write_mebibytes

The P95 write throughput of access nodes.

rc_p95_write_mebibytes

The average write throughput of access nodes.

rc_avg_write_mebibytes

Disk IOPS of Access Nodes

AnalyticDB_RC_IOPS

The maximum number of read operations on access nodes.

rc_max_read_iops

N/A

The P95 number of read operations on access nodes.

rc_p95_read_iops

The average number of read operations on access nodes.

rc_avg_read_iops

The maximum number of write operations on access nodes.

rc_max_write_iops

The P95 number of write operations on access nodes.

rc_p95_write_iops

The average number of write operations on access nodes.

rc_avg_write_iops

Data size monitoring

Enterprise Edition and Basic Edition

Metric

Metric key

Description

Metric value

Unit

References

Disk Usage

AnalyticDB_DiskUsedRatio

The average disk usage.

disk_used_ratio

%

The maximum disk usage.

worker_max_node_disk_used_ratio

Disk Space Used

AnalyticDB_DiskUsedSize

The size of cold data.

cold_disk_used

Byte

The size of hot data.

hot_disk_used

The maximum size of hot data for storage nodes.

user_used_disk_max

The average size of hot data for storage nodes.

user_used_disk_avg

Data Lakehouse Edition and Data Warehouse Edition

Metric

Metric key

Description

Metric value

Unit

References

Disk Usage

AnalyticDB_DiskUsedRatio

The average disk usage.

disk_used_ratio

%

The maximum disk usage.

worker_max_node_disk_used_ratio

Disk Space Used

AnalyticDB_DiskUsedSize

The size of cold data.

cold_disk_used

Byte

The size of hot data.

hot_disk_used

The maximum size of hot data for storage nodes.

user_used_disk_max

The average size of hot data for storage nodes.

user_used_disk_avg

Workload monitoring

Enterprise Edition and Basic Edition

Metric

Metric key

Description

Metric value

Unit

References

Cluster Connections

AnalyticDB_Connections

The number of successful connections.

connections

N/A

Query Failure Rate1

AnalyticDB_QueryFailedRatio

The failure rate of queries.

query_failed_ratio

%

QPS

AnalyticDB_QPS

The queries per second (QPS).

qps

N/A

The extract, transform, load (ETL) QPS.

etl_qps

Query Response Time

AnalyticDB_QueryRT

The average query response time.

query_avg_rt

ms

The maximum query response time.

query_max_rt

Query Wait Time

AnalyticDB_QueryWaitTime

The average query wait time.

query_avg_wait_time

ms

The maximum query wait time.

query_max_wait_time

Write TPS

AnalyticDB_InsertTPS

The write transactions per second (TPS) of a cluster.

insert_tps

N/A

Write Response Time

AnalyticDB_InsertRT

The average write response time.

insert_avg_rt

ms

The maximum write response time.

insert_max_rt

Write Throughput

AnalyticDB_InsertBytes

The average write throughput of a cluster.

insert_in_bytes

MB

Update TPS

AnalyticDB_UpdateTPS

The update TPS of a cluster.

update_tps

N/A

Update Response Time

AnalyticDB_UpdateRT

The average update response time.

updateinto_avg_rt

ms

The maximum update response time.

updateinto_max_rt

Delete TPS

AnalyticDB_DeleteTPS

The delete TPS of a cluster.

delete_tps

N/A

Delete Response Time

AnalyticDB_DeleteRT

The average delete response time.

delete_avg_rt

ms

The maximum delete response time.

delete_max_rt

Import TPS

AnalyticDB_LoadTPS

The load TPS of a cluster.

load_tps

N/A

Data Lakehouse Edition and Data Warehouse Edition

Metric

Metric key

Description

Metric value

Unit

References

Cluster Connections

AnalyticDB_Connections

The number of successful connections.

connections

N/A

Query Failure Rate1

AnalyticDB_QueryFailedRatio

The failure rate of queries.

query_failed_ratio

%

QPS

AnalyticDB_QPS

The QPS.

qps

N/A

The ETL QPS.

etl_qps

Query Response Time

AnalyticDB_QueryRT

The average query response time.

query_avg_rt

ms

The maximum query response time.

query_max_rt

Query Wait Time

AnalyticDB_QueryWaitTime

The average query wait time.

query_avg_wait_time

ms

The maximum query wait time.

query_max_wait_time

Write TPS

AnalyticDB_InsertTPS

The write TPS of a cluster.

insert_tps

N/A

Write Response Time

AnalyticDB_InsertRT

The average write response time.

insert_avg_rt

ms

The maximum write response time.

insert_max_rt

Write Throughput

AnalyticDB_InsertBytes

The average write throughput of a cluster.

insert_in_bytes

MB

Update TPS

AnalyticDB_UpdateTPS

The update TPS of a cluster.

update_tps

N/A

Update Response Time

AnalyticDB_UpdateRT

The average update response time.

updateinto_avg_rt

ms

The maximum update response time.

updateinto_max_rt

Delete TPS

AnalyticDB_DeleteTPS

The delete TPS of a cluster.

delete_tps

N/A

Delete Response Time

AnalyticDB_DeleteRT

The average delete response time.

delete_avg_rt

ms

The maximum delete response time.

delete_max_rt

Import TPS

AnalyticDB_LoadTPS

The load TPS of a cluster.

load_tps

N/A

Note

Query Failure Rate1:

  • If you select a time range within 24 hours, the query failure rate per minute is displayed, The rate is calculated by using the following formula: Query failure rate = (Number of failed SQL queries in 1 minute/Total number of SQL queries in 1 minute) × 100%.

  • If you select a time range that exceeds 24 hours, the query failure rate for every 5 minutes is displayed. The rate is calculated by using the following formula: Query failure rate = (Number of failed SQL queries within 5 minutes/Total number of SQL queries within 5 minutes) × 100%.

Resource group monitoring

Enterprise Edition, Basic Edition, and Data Lakehouse Edition

Metric

Metric key

Description

Metric value

Unit

References

CPU Utilization

AnalyticDB_RP_CPU

The CPU utilization of a resource group.

AnalyticDB_RP_CPU

%

QPS

AnalyticDB_RP_QPS

The QPS processed by a resource group.

AnalyticDB_RP_QPS

N/A

Query Response Time

AnalyticDB_RP_RT

The average response time of queries processed by a resource group.

AnalyticDB_RP_RT

ms

Query Wait Time

AnalyticDB_RP_WaitTime

The average wait time of queries processed by a resource group.

AnalyticDB_RP_WaitTime

ms

(XIHE) Running Queries

AnalyticDB_RP_RunningQueries_Count

The number of running queries in a resource group.

AnalyticDB_RP_RunningQueries_Count

N/A

Queued Queries

AnalyticDB_RP_QueuedQueries_Count

The number of queued queries in a resource group.

AnalyticDB_RP_QueuedQueries_Count

N/A

Computing Resource Usage

Note

Only Data Lakehouse Edition supports this metric.

None

The total amount of computing resources.

TotalAcuNumber

ACU

The amount of reserved computing resources.

ReservedAcuNumber

Storage Resource Usage

Note

Only Data Lakehouse Edition supports this metric.

None

The total amount of storage resources.

TotalAcuNumber

ACU

The amount of reserved storage resources.

ReservedAcuNumber

Resource Usage

None

The total amount of computing resources.

TotalAcuNumber

ACU

The amount of reserved resources.

ReservedAcuNumber

Interactive resource group

None

The minimum amount of computing resources that can be used by a resource group.

ReservedAcuNumber

ACU

Console documentation: View information about the computing resources of a resource group

The maximum amount of computing resources that can be used by a resource group.

MaxAcuNumber

The amount of computing resources that are used by a resource group.

CurrentAcuNumber

Job resource group

None

The minimum amount of computing resources that can be used by a resource group.

ReservedAcuNumber

ACU

The maximum amount of computing resources that can be used by a resource group.

MaxAcuNumber

The amount of computing resources that are used by a resource group.

CurrentAcuNumber

The amount of spot instance resources that are used by the resource group.

SpotAcuNumber

Total ACU-hours

None

The average number of ACU-hours that are used by a job.

TotalAcuNumber

ACU

Console documentation: View the amount of computing resources used by a job

Reserved ACU-hours

None

The number of reserved ACU-hours that are used by a job.

ReservedAcuNumber

ACU

Elastic ACU-hours

None

The number of elastic ACU-hours that are used by a job.

ElasticAcuNumber

ACU

Data Warehouse Edition

Metric

Metric key

Description

Metric value

Unit

References

CPU Utilization

AnalyticDB_RP_CPU

The CPU utilization of a resource group.

AnalyticDB_RP_CPU

%

QPS

AnalyticDB_RP_QPS

The QPS processed by a resource group.

AnalyticDB_RP_QPS

N/A

Query Response Time

AnalyticDB_RP_RT

The average response time of queries processed by a resource group.

AnalyticDB_RP_RT

ms

Query Wait Time

AnalyticDB_RP_WaitTime

The average wait time of queries processed by a resource group.

AnalyticDB_RP_WaitTime

ms

Scheduled Nodes Actually Scaled Out

AnalyticDB_RP_ActualNode

The number of nodes that are added to a resource group in a scheduled scaling plan.

AnalyticDB_RP_ActualNode

N/A

Scheduled Nodes to Be Scaled Out

AnalyticDB_RP_PlanNode

The number of nodes that need to be added to a resource group in a scheduled scaling plan.

For information about how to create a scaling plan for a resource group, see Create a resource scaling plan.

AnalyticDB_RP_PlanNode

N/A

Total Nodes

AnalyticDB_RP_TotalNode

The total number of nodes in a resource group, which is calculated by using the following formula: Total number of nodes = Number of basic nodes + Number of effective nodes in scheduled scaling plans.

AnalyticDB_RP_TotalNode

N/A

Basic Nodes

AnalyticDB_RP_OriginalNode

The number of basic nodes in a resource group.

AnalyticDB_RP_OriginalNode

N/A

Spark monitoring

The AnalyticDB for MySQL console does not provide Spark monitoring information. To view Spark monitoring information, go to the CloudMonitor console.

Metric

Description

Metric value

Unit

References

Spark Cpu Utilization(%)

The CPU utilization of a Spark job.

  • SparkCpuUtilizationEci

  • SparkCpuUtilizationShenlong

%

Spark Memory Utilization(%)

The memory usage of a Spark job.

  • SparkMemoryUtilizationEci

  • SparkMemoryUtilizationShenlong

%

SparkExecutorOnHeapExecutionMemoryBytes(B)

The maximum amount of Java virtual machine (JVM) heap memory that is used to run a Spark job.

SparkExecutorOnHeapExecutionMemoryBytes

Byte

SparkExecutorOffHeapExecutionMemoryBytes(B)

The maximum amount of JVM non-heap memory that is used to run a Spark job.

SparkExecutorOffHeapExecutionMemoryBytes

Byte

SparkExecutorOnHeapStorageMemoryBytes(B)

The maximum amount of JVM heap memory that is used to store Spark data, such as the cached resilient distributed datasets (RDDs).

SparkExecutorOnHeapStorageMemoryBytes

Byte

SparkExecutorOffHeapStorageMemoryBytes(B)

The maximum amount of JVM non-heap memory that is used to store Spark data, such as the cached RDDs.

SparkExecutorOffHeapStorageMemoryBytes

Byte

SparkExecutorDiskUsedBytes(B)

The disk storage that is used by RDDs.

SparkExecutorDiskUsedBytes

Byte

SparkExecutorMajorGCCount(count)

The number of major garbage collections (GCs) that are executed by the JVM GC mechanism when a Spark job runs.

SparkExecutorMajorGCCount

N/A

SparkExecutorMinorGCCount(count)

The number of minor GCs that are executed by the JVM GC mechanism when a Spark job runs.

SparkExecutorMinorGCCount

N/A

SparkExecutorTotalGCTimeSeconds(s)

The GC duration of a Spark job.

SparkExecutorTotalGCTimeSeconds

s

SparkExecutorTotalShuffleReadBytes(B)

The shuffle read size of a Spark job.

SparkExecutorTotalShuffleReadBytes

Byte

SparkExecutorTotalShuffleWriteBytes(B)

The shuffle write size of a Spark job.

SparkExecutorTotalShuffleWriteBytes

Byte

References

Cluster performance optimization: describes the metrics about cluster performance and running status and helps you identify the causes of abnormal metrics and resolve issues.