ApsaraDB for ClickHouse provides metrics that let you monitor cluster health in real time, detect issues early, and locate performance bottlenecks quickly.
The metrics in the Community Edition and Enterprise Edition sections apply to clusters purchased after December 1, 2021. For clusters purchased before December 1, 2021, see Metrics for earlier cluster versions.
Community Edition
| Metric | Unit | Description |
|---|---|---|
| Disk throughput | Byte | Read and write throughput (bandwidth) of the disk. |
| Disk IOPS | counts/s | Number of read and write operations performed on the disk per second. |
| Disk usage | Byte | Amount of disk space used. |
| Disk usage - by single node | Byte | Total disk space used on a single node. |
| Disk usage | % | Ratio of used disk space to total available disk space. |
| Disk usage - by single node | % | Ratio of used disk space to total available disk space on a single node. |
| CPU utilization | % | Average CPU utilization of each node in the cluster. |
| inode usage | % | Ratio of used inodes to total available inodes. An inode is a Linux data structure that identifies a file or directory. |
| Memory usage | Byte | Memory used by each node in the cluster. |
| Memory utilization | % | Average memory utilization of each node in the cluster. |
| Number of data parts | count | Total number of data parts across the cluster. |
| Number of inactive data parts | count | Number of data parts that are no longer active. |
| ZooKeeper CPU utilization | % | Average CPU utilization of the ZooKeeper nodes in the cluster. |
| ZooKeeper memory utilization | % | Average memory utilization of the ZooKeeper nodes in the cluster. |
| Write bytes/sec | Byte | Volume of data written to each node per second. |
| Write rows/sec | rows/s | Number of rows written to each node per second. |
| QPS | counts/s | Number of queries processed per second. |
| TPS | counts/s | Number of transactions processed per second. |
| Network throughput | Byte | Network read and write throughput (bandwidth). |
| TCP connections | count | Number of TCP connections to the cluster. |
| HTTP connections | count | Number of HTTP connections to the cluster. |
| Number of running Init Queries | count | Number of queries running locally on the node, not forwarded from another node. |
| Number of running queries | count | Total number of queries currently running on the node. |
| Number of failed queries | count | Number of queries that failed. |
| Number of failed Insert queries | count | Number of INSERT statements that failed. |
| Number of delayed inserts | count | Number of INSERT statements that were delayed. |
| Number of failed Select queries | count | Number of SELECT queries that failed. |
| Number of running merges | count | Number of background merge tasks currently running. |
| Number of running mutations | count | Number of mutations (data modification tasks such as `ALTER TABLE ... UPDATE` or `ALTER TABLE ... DELETE`) currently running. |
| Number of temporary files for distributed tables | count | Number of temporary files created for distributed tables. |
| Number of MaterializeMySQL | count | Number of databases created using the MaterializeMySQL engine. |
| Cold storage usage | Byte | Amount of cold data currently stored. |
| Number of failed MaterializeMySQL synchronizations | count | Number of failed sync operations for databases created with the MaterializeMySQL engine. |
| Number of errors in Kafka external table consumption | count | Number of consumption failures for Kafka external tables. |
| ZooKeeper Leader | - | Indicates the ZooKeeper leader node. A value of 1 identifies the leader. |
| Number of Kafka external tables | count | Number of Kafka external tables created in the cluster. |
| Node queue accumulation | count | Number of data packets in the send queue of an ApsaraDB for ClickHouse node on ZooKeeper. |
| ZooKeeper average latency | ms | Average request latency of the ZooKeeper nodes. |
| Number of ZooKeeper nodes | count | Number of ZNodes in ZooKeeper. |
| ZooKeeper TPS | counts/s | Transactions per second processed by ZooKeeper. |
| Zxid usage | % | Zxid usage of ZooKeeper. When Zxid usage reaches 100%, a new leader election is triggered. |
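
The Zxid usage metric can be illustrated with a short sketch. A ZooKeeper zxid is a 64-bit number whose high 32 bits hold the leader epoch and whose low 32 bits hold a per-epoch counter. When the counter is exhausted, a new election starts a new epoch. The example below assumes that Zxid usage is the share of that 32-bit counter already consumed. The source does not state the exact formula, so `zxid_usage_percent` is a hypothetical helper, not the service's implementation.

```python
from typing import Tuple

COUNTER_BITS = 32
COUNTER_MAX = (1 << COUNTER_BITS) - 1  # 0xFFFFFFFF, largest per-epoch counter value


def split_zxid(zxid: int) -> Tuple[int, int]:
    """Split a 64-bit zxid into (epoch, counter)."""
    epoch = zxid >> COUNTER_BITS
    counter = zxid & COUNTER_MAX
    return epoch, counter


def zxid_usage_percent(zxid: int) -> float:
    """Assumed definition: percentage of the per-epoch counter range in use."""
    _, counter = split_zxid(zxid)
    return counter / COUNTER_MAX * 100.0


if __name__ == "__main__":
    # Illustrative value: epoch 5, counter halfway through its range.
    sample = (5 << COUNTER_BITS) | 0x80000000
    print(split_zxid(sample))
    print(round(zxid_usage_percent(sample), 2))
```

Under this assumption, a value close to 100% means the counter is about to roll over, which is when the leader election described above would occur.
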
Enterprise Edition
| Metric | Unit | Description |
|---|---|---|
| CPU utilization | % | Average CPU utilization of each node in the cluster. |
| CCU | count | Amount of compute resources (CCU) used by the cluster. |
| Memory usage | Byte | Memory used by each node in the cluster. |
| Memory utilization | % | Average memory utilization of each node in the cluster. |
| OSS usage | Byte | Amount of OSS storage used by the cluster. |
| Number of data parts | count | Total number of data parts across the cluster. |
| Number of inactive data parts | count | Number of data parts that are no longer active. |
| Keeper CPU utilization | % | CPU utilization of ClickHouse-Keeper. |
| Keeper memory usage | Byte | Memory used by ClickHouse-Keeper. |
| Keeper memory utilization | % | Memory utilization of ClickHouse-Keeper. |
| Write bytes/sec | Byte | Volume of data written to each node per second. |
| Write rows/sec | rows/s | Number of rows written to each node per second. |
| Network throughput | Byte | Network read and write throughput (bandwidth). |
| OSS network throughput | Byte | Read and write throughput (bandwidth) of OSS. |
| OSS IOPS | counts/s | Number of read and write operations performed on OSS per second. |
| TPS | counts/s | Number of transactions processed per second. |
| QPS | counts/s | Number of queries processed per second. |
| Number of running Init Queries | count | Number of queries running locally on the node, not forwarded from another node. |
| Number of running queries | count | Total number of queries currently running on the node. |
| Number of failed queries | count | Number of queries that failed on the node. |
| Number of failed Insert queries | count | Number of INSERT statements that failed. |
| Number of delayed inserts | count | Number of INSERT statements that were delayed. |
| Number of failed Select queries | count | Number of SELECT queries that failed on the node. |
| Number of running mutations | count | Number of mutations (data modification tasks such as `ALTER TABLE ... UPDATE` or `ALTER TABLE ... DELETE`) currently running. |
| Number of running merges | count | Number of background merge tasks currently running. |
| TCP connections | count | Number of TCP connections to the cluster. |
| HTTP connections | count | Number of HTTP connections to the cluster. |
| Number of failed MV refreshes | count | Number of failed refresh operations for refreshable materialized views in the cluster. |
| Average MV refresh duration | seconds/minutes | Average time to complete a refresh of refreshable materialized views in the cluster. |
| Maximum MV refresh duration | seconds/minutes | Longest time taken to complete a refresh of refreshable materialized views in the cluster. |
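
The three MV refresh metrics relate to one another as a failure count, a mean, and a maximum over the same set of refresh attempts. The sketch below shows that relationship on made-up data. The `RefreshRecord` shape and the `summarize_refreshes` function are hypothetical, and the choice to count only successful refreshes toward the durations is an assumption, since the source does not say how failed refreshes are treated.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class RefreshRecord:
    """One refresh attempt of a refreshable materialized view (hypothetical shape)."""
    view: str
    duration_seconds: float
    succeeded: bool


def summarize_refreshes(records: List[RefreshRecord]) -> Dict[str, Optional[float]]:
    """Return the failed count plus average and maximum duration of successful refreshes."""
    failed = sum(1 for r in records if not r.succeeded)
    # Assumption: only refreshes that completed contribute to the duration metrics.
    durations = [r.duration_seconds for r in records if r.succeeded]
    return {
        "failed": failed,
        "avg_seconds": sum(durations) / len(durations) if durations else None,
        "max_seconds": max(durations) if durations else None,
    }


if __name__ == "__main__":
    # Illustrative records only, not taken from a real cluster.
    sample = [
        RefreshRecord("mv_orders", 12.0, True),
        RefreshRecord("mv_orders", 18.0, True),
        RefreshRecord("mv_users", 3.0, False),
    ]
    print(summarize_refreshes(sample))
```
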
Metrics for earlier cluster versions
Earlier cluster versions are Community Edition clusters purchased before December 1, 2021.
| Metric | Unit | Description |
|---|---|---|
| CPU utilization | % | CPU utilization of each node in the cluster. |
| Memory utilization | % | Memory utilization of each node in the cluster. |
| Memory usage | Byte | Memory used by each node in the cluster. |
| Disk usage ratio | % | Ratio of used disk space to total available disk space. |
| Disk usage | Byte | Amount of disk space in use. |
| Disk IOPS | counts/s | Number of read and write operations performed on the disk per second. |
| Disk IOPS size | Byte | Amount of data read from or written to the disk per second. |
| Database connection usage ratio | % | Percentage of used database connections out of the maximum allowed. |
| Used database connections | count | Number of database connections currently in use. |
| TPS | counts/s | Number of transactions processed per second. |
| Write rows/sec | rows/s | Number of rows written to each node per second. |
| Write bytes/sec | Byte | Amount of data written to each node per second. |
| QPS | counts/s | Number of queries processed per second. |
| Average ZooKeeper wait time | ms | Average response latency of ZooKeeper. |
| Average I/O wait time | ms | Average I/O response latency. |
| Average CPU wait time | ms | Average CPU response latency. |
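
The ratio metrics in this table (Disk usage ratio and Database connection usage ratio) are each a used amount divided by a total, expressed as a percentage. A minimal sketch of that arithmetic follows. The function name `usage_percent` and the sample numbers are illustrative assumptions and do not come from the service.

```python
def usage_percent(used: float, total: float) -> float:
    """Return used / total as a percentage; total must be positive."""
    if total <= 0:
        raise ValueError("total must be positive")
    return used / total * 100.0


if __name__ == "__main__":
    # Illustrative numbers only, not taken from a real cluster.
    disk_ratio = usage_percent(used=750, total=1000)  # bytes used vs. total capacity
    conn_ratio = usage_percent(used=120, total=480)   # connections in use vs. allowed maximum
    print(disk_ratio, conn_ratio)
```
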