Application Real-Time Monitoring Service (ARMS) Prometheus monitoring collects only a fixed set of basic metrics at no extra cost. If your custom Grafana dashboard queries metrics outside this set, those panels return no data.
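To find out whether a metric that a panel is missing is part of the collected set, look it up in the following tables. The 211 basic metrics are grouped by Kubernetes component. The No. column gives each metric's position in the full list, so the numbers within a table are not contiguous, and some metric names appear under more than one number.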
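The lookup can also be scripted. The following is a minimal sketch, not an ARMS tool: it pulls metric-shaped names out of a PromQL expression with a simple heuristic and reports the ones that are not in the basic set. `BASIC_METRICS` holds only a few names copied from the tables below for illustration, `my_custom_metric` is a hypothetical name, and the regular-expression parsing is an assumption that handles common expressions only. It does not handle recording-rule names that contain colons, or single-quoted strings.

```python
import re

# Illustrative subset of the basic metrics listed in the tables below.
# A real check would include every metric name from the tables.
BASIC_METRICS = {
    "container_cpu_usage_seconds_total",
    "kube_pod_status_phase",
    "kubelet_volume_stats_used_bytes",
    "node_load1",
}

# PromQL words that are not metric names even without a following "(".
KEYWORDS = {
    "and", "or", "unless", "bool", "offset", "by", "without",
    "on", "ignoring", "group_left", "group_right",
}

# Clauses such as by (pod) or on (instance) contain label names only.
GROUPING = re.compile(
    r"\b(by|without|on|ignoring|group_left|group_right)\s*\([^)]*\)"
)


def metric_names(expr):
    """Heuristically extract metric names from a PromQL expression."""
    # Empty out double-quoted strings so their contents are not scanned.
    expr = re.sub(r'"[^"]*"', '""', expr)
    # Remove label matchers such as {namespace="default"}.
    expr = re.sub(r"\{[^}]*\}", "", expr)
    # Remove grouping and vector-matching clauses.
    expr = GROUPING.sub("", expr)
    names = set()
    for match in re.finditer(r"\b([A-Za-z_][A-Za-z0-9_]*)\b\s*(\(?)", expr):
        name, paren = match.groups()
        # An identifier followed by "(" is a function or an aggregation.
        if not paren and name not in KEYWORDS:
            names.add(name)
    return names


def uncollected(expr):
    """Return the metric names in expr that are not in BASIC_METRICS."""
    return metric_names(expr) - BASIC_METRICS


if __name__ == "__main__":
    # my_custom_metric is a hypothetical metric outside the basic set.
    print(uncollected("node_load1 / on (instance) my_custom_metric"))
```

Any name the script reports is a candidate explanation for an empty panel. Because the parsing is heuristic, confirm each reported name against the tables before drawing conclusions.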
Kubelet metrics
Volume statistics, pod worker latency, PLEG relist duration, node identity, and certificate management metrics from the kubelet node agent.
| No. | Metric |
|---|---|
| 8 | kubelet_pleg_relist_duration_seconds_bucket |
| 9 | kubelet_node_name |
| 10 | kubelet_pod_worker_duration_seconds_bucket |
| 11 | kubelet_certificate_manager_client_ttl_seconds |
| 12 | kubelet_certificate_manager_server_ttl_seconds |
| 13 | kubelet_certificate_manager_client_expiration_renew_errors |
| 14 | kubelet_server_expiration_renew_errors |
| 75 | kubelet_volume_stats_used_bytes |
| 76 | kubelet_volume_stats_inodes_used |
| 77 | kubelet_volume_stats_inodes_free |
| 78 | kubelet_volume_stats_inodes |
| 79 | kubelet_volume_stats_capacity_bytes |
| 80 | kubelet_volume_stats_available_bytes |
API server metrics
Request totals, latency, in-flight request counts, dropped requests, long-running request gauges, and client certificate expiration metrics for the Kubernetes API server.
| No. | Metric |
|---|---|
| 15 | apiserver_client_certificate_expiration_seconds_count |
| 16 | apiserver_client_certificate_expiration_seconds_bucket |
| 36 | apiserver_dropped_requests_total |
| 53 | apiserver_request_latencies_summary |
| 109 | apiserver_current_inflight_requests |
| 110 | apiserver_longrunning_gauge |
| 114 | apiserver_request_total |
| 115 | apiserver_request_count |
| 116 | apiserver_request_duration_seconds_bucket |
API aggregator metrics
Unavailable API service counts from the aggregation layer.
| No. | Metric |
|---|---|
| 17 | aggregator_unavailable_apiservice_count |
| 18 | aggregator_unavailable_apiservice |
Scheduler metrics
Pod scheduling attempts, pending pods, and scheduler cache size.
| No. | Metric |
|---|---|
| 3 | scheduler_pod_scheduling_attempts_bucket |
| 4 | scheduler_pending_pods |
| 5 | scheduler_scheduler_cache_size |
etcd metrics
Request duration, object counts, disk commit latency, leader changes, and database size metrics for etcd.
| No. | Metric |
|---|---|
| 40 | etcd_request_duration_seconds_count |
| 41 | etcd_request_duration_seconds_sum |
| 42 | etcd_object_counts |
| 43 | etcd_debugging_mvcc_keys_total |
| 44 | etcd_disk_backend_commit_duration_seconds_bucket |
| 45 | etcd_server_leader_changes_seen_total |
| 46 | etcd_server_has_leader |
| 47 | etcd_debugging_mvcc_db_total_size_in_bytes |
Kubernetes state metrics
Pod, Deployment, Node, DaemonSet, StatefulSet, Job, HPA, Service, PersistentVolume, and ResourceQuota state metrics from kube-state-metrics.
| No. | Metric |
|---|---|
| 1 | kube_pod_container_status_last_terminated_reason |
| 6 | kube_node_spec_taint |
| 7 | kube_node_status_capacity_pods |
| 26 | kube_resourcequota |
| 27 | kube_daemonset_status_current_number_scheduled |
| 28 | kube_daemonset_status_desired_number_scheduled |
| 29 | kube_daemonset_status_number_misscheduled |
| 30 | kube_daemonset_updated_number_scheduled |
| 31 | kube_daemonset_status_number_available |
| 32 | kube_job_spec_completions |
| 33 | kube_job_status_succeeded |
| 34 | kube_job_failed |
| 35 | kube_persistentvolume_status_phase |
| 54 | kube_hpa_status_condition |
| 55 | kube_hpa_labels |
| 56 | kube_hpa_metadata_generation |
| 57 | kube_resourcequota |
| 58 | kube_pod_container_status_waiting_reason |
| 68 | kube_pod_labels |
| 69 | kube_deployment_labels |
| 70 | kube_node_labels |
| 71 | kube_pod_status_ready |
| 72 | kube_node_status_capacity |
| 73 | kube_node_status_condition |
| 74 | kube_pod_container_resource_limits |
| 81 | kube_pod_container_resource_limits |
| 86 | kube_node_labels |
| 87 | kube_deployment_status_replicas_unavailable |
| 88 | kube_job_status_failed |
| 89 | kube_job_status_active |
| 90 | kube_job_status_succeeded |
| 91 | kube_pod_container_status_restarts |
| 92 | kube_pod_container_status_terminated |
| 93 | kube_pod_container_status_waiting |
| 94 | kube_pod_container_status_running |
| 95 | kube_node_spec_unschedulable |
| 96 | kube_node_status_condition |
| 97 | kube_node_info |
| 98 | kube_node_status_allocatable_pods |
| 100 | kube_deployment_status_replicas_unavailable |
| 117 | kube_pod_container_resource_limits_memory_bytes |
| 118 | kube_pod_container_resource_limits_cpu_cores |
| 122 | kube_service_info |
| 123 | kube_pod_status_phase |
| 125 | kube_pod_labels |
| 126 | kube_deployment_spec_strategy_rollingupdate_max_unavailable |
| 127 | kube_deployment_metadata_generation |
| 128 | kube_deployment_status_observed_generation |
| 129 | kube_deployment_spec_replicas |
| 130 | kube_deployment_status_replicas_available |
| 131 | kube_deployment_spec_replicas |
| 132 | kube_deployment_status_replicas_updated |
| 133 | kube_deployment_created |
| 134 | kube_node_status_allocatable_memory_bytes |
| 135 | kube_node_status_allocatable_cpu_cores |
| 136 | kube_node_status_capacity_memory_bytes |
| 137 | kube_node_status_capacity_cpu_cores |
| 138 | kube_node_status_condition |
| 142 | kube_pod_container_resource_requests_memory_bytes |
| 143 | kube_pod_container_resource_requests_cpu_cores |
| 144 | kube_hpa_spec_max_replicas |
| 145 | kube_hpa_spec_min_replicas |
| 146 | kube_hpa_status_desired_replicas |
| 147 | kube_hpa_status_current_replicas |
| 148 | kube_pod_container_status_restarts_total |
| 154 | kube_pod_info |
| 155 | kube_pod_container_info |
| 158 | kube_daemonset_created |
| 159 | kube_statefulset_created |
| 160 | kube_deployment_created |
| 161 | kube_deployment_status_replicas |
| 162 | kube_statefulset_replicas |
| 163 | kube_daemonset_status_desired_number_scheduled |
| 164 | kube_deployment_status_replicas_available |
| 165 | kube_statefulset_status_replicas |
| 166 | kube_daemonset_status_number_ready |
| 167 | kube_pod_container_resource_requests_cpu_cores |
| 168 | kube_pod_container_resource_requests_memory_bytes |
| 209 | kube_pod_owner |
| 210 | kube_deployment_metadata_generation |
| 211 | kube_pod_deletion_timestamp |
Container metrics
CPU usage, memory usage, network I/O, filesystem I/O, and GPU accelerator metrics for containers.
| No. | Metric |
|---|---|
| 37 | container_fs_inodes_total |
| 38 | container_fs_inodes_free |
| 59 | container_cpu_load_average_10s |
| 60 | container_network_receive_errors_total |
| 61 | container_network_receive_packets_dropped_total |
| 62 | container_network_transmit_errors_total |
| 63 | container_network_transmit_packets_dropped_total |
| 64 | container_memory_max_usage_bytes |
| 65 | container_memory_cache |
| 66 | container_memory_swap |
| 67 | container_memory_failcnt |
| 83 | container_fs_writes_bytes_total |
| 84 | container_fs_reads_bytes_total |
| 85 | container_memory_usage_bytes |
| 105 | container_network_transmit_bytes_total |
| 106 | container_network_receive_bytes_total |
| 111 | container_memory_rss |
| 112 | container_spec_memory_limit_bytes |
| 113 | container_network_transmit_bytes_total |
| 119 | container_accelerator_memory_total_bytes |
| 120 | container_accelerator_memory_used_bytes |
| 121 | container_accelerator_duty_cycle |
| 139 | container_cpu_cfs_throttled_periods_total |
| 140 | container_cpu_cfs_periods_total |
| 141 | container_cpu_cfs_throttled_seconds_total |
| 149 | container_network_receive_bytes_total |
| 150 | container_memory_working_set_bytes |
| 152 | container_cpu_usage_seconds_total |
| 156 | container_fs_usage_bytes |
| 157 | container_fs_limit_bytes |
| 205 | container_spec_cpu_quota |
| 206 | container_network_transmit_packets_total |
| 207 | container_fs_write_seconds_total |
| 208 | container_fs_read_seconds_total |
Node metrics
CPU, memory, disk I/O, network, filesystem, and system load metrics from the node exporter.
| No. | Metric |
|---|---|
| 20 | node_filesystem_readonly |
| 21 | node_network_receive_errs_total |
| 22 | node_network_transmit_errs_total |
| 23 | node_timex_offset_seconds |
| 24 | node_timex_sync_status |
| 25 | node_network_up |
| 82 | node_filesystem_usage |
| 99 | node_filesystem_size |
| 101 | node_filesystem_free |
| 102 | node_uname_info |
| 124 | node_cpu_seconds_total |
| 169 | node_boot_time_seconds |
| 170 | node_memory_MemAvailable_bytes |
| 171 | node_memory_MemTotal_bytes |
| 172 | node_memory_MemFree_bytes |
| 173 | node_memory_Buffers_bytes |
| 174 | node_memory_Cached_bytes |
| 175 | node_filefd_allocated |
| 176 | node_filesystem_avail_bytes |
| 177 | node_filesystem_size_bytes |
| 178 | node_filesystem_free_bytes |
| 179 | node_load15 |
| 180 | node_load1 |
| 181 | node_load5 |
| 182 | node_disk_io_time_seconds_total |
| 183 | node_disk_read_time_seconds_total |
| 184 | node_disk_write_time_seconds_total |
| 185 | node_disk_reads_completed_total |
| 186 | node_disk_writes_completed_total |
| 187 | node_disk_io_now |
| 188 | node_disk_read_bytes_total |
| 189 | node_disk_written_bytes_total |
| 190 | node_disk_io_time_weighted_seconds_total |
| 191 | node_network_receive_bytes_total |
| 192 | node_network_transmit_bytes_total |
| 193 | node_netstat_Tcp_CurrEstab |
| 194 | node_sockstat_TCP_tw |
| 195 | node_netstat_Tcp_ActiveOpens |
| 196 | node_netstat_Tcp_PassiveOpens |
| 197 | node_sockstat_TCP_alloc |
| 198 | node_sockstat_TCP_inuse |
| 199 | node_exporter_build_info |
NVIDIA GPU metrics
GPU temperature, memory usage, power draw, and duty cycle metrics.
| No. | Metric |
|---|---|
| 39 | nvidia_gpu_num_devices |
| 48 | nvidia_gpu_temperature_celsius |
| 49 | nvidia_gpu_memory_used_bytes |
| 50 | nvidia_gpu_memory_total_bytes |
| 51 | nvidia_gpu_power_usage_milliwatts |
| 52 | nvidia_gpu_duty_cycle |
Machine-level metrics
CPU core count and total memory capacity of the node.
| No. | Metric |
|---|---|
| 151 | machine_memory_bytes |
| 153 | machine_cpu_cores |
REST client and HTTP metrics
REST client request totals and HTTP request/response size and duration metrics.
| No. | Metric |
|---|---|
| 2 | rest_client_requests_total |
| 200 | http_request_duration_microseconds |
| 201 | http_response_size_bytes |
| 202 | http_requests_total |
| 203 | http_request_size_bytes |
| 204 | rest_client_requests_total |
Go runtime and process metrics
Garbage collection duration, goroutine count, resident memory, and CPU usage for Go-based components.
| No. | Metric |
|---|---|
| 103 | go_gc_duration_seconds |
| 104 | go_goroutines |
| 107 | process_resident_memory_bytes |
| 108 | process_cpu_seconds_total |
Kubernetes build information
Kubernetes version and build metadata.
| No. | Metric |
|---|---|
| 19 | kubernetes_build_info |