This topic describes how to view the overall monitoring information about a Kubernetes cluster in the Application Real-Time Monitoring Service (ARMS) console. The overall monitoring information includes the total number of resources in the cluster, the number of abnormal resources in the cluster, the distribution of resources across namespaces, the performance data of services and Deployments, and the information about node resource usage.

Procedure

  1. Log on to the ARMS console.
  2. In the left-side navigation pane, click Kubernetes Monitoring.
  3. In the top navigation bar of the ARMS console, select the region where your cluster is deployed.
  4. On the Kubernetes Monitoring page, click the name of the Kubernetes cluster.
  5. On the cluster overview page, select a time range from the date and time picker in the upper-right corner.
    After you select a time range, you can view the overall monitoring information about the Kubernetes cluster in the time range.
    Note If an alert event occurs for a resource within the selected time range, that resource is considered abnormal. The statistics and curves of abnormal resources are displayed in red, whereas those of normal resources are displayed in blue.
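
The rule in the preceding note can be sketched in a few lines of Python. This is only an illustration of the logic, not how ARMS implements it: the `AlertEvent` type and the `abnormal_resources` function are hypothetical names, and the time range is assumed to be inclusive at both ends.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class AlertEvent:
    """Hypothetical record of an alert event raised for one resource."""
    resource: str
    occurred_at: datetime


def abnormal_resources(events, start, end):
    """Return the names of resources with at least one alert event in [start, end]."""
    return {e.resource for e in events if start <= e.occurred_at <= end}


events = [
    AlertEvent("pod/cart-1", datetime(2023, 1, 1, 10, 5)),
    AlertEvent("pod/pay-2", datetime(2023, 1, 1, 13, 0)),
]
abnormal = abnormal_resources(
    events, datetime(2023, 1, 1, 10, 0), datetime(2023, 1, 1, 11, 0)
)
print(sorted(abnormal))
```

Only `pod/cart-1` has an alert event inside the selected hour, so it is the only resource that would be shown in red for that range.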

Namespace

Each line in the Namespace section corresponds to one namespace and displays the total number of resources and the number of abnormal resources in that namespace.

Figure: Kubernetes Monitoring, resource overview, Namespace section
In the Namespace section, you can perform the following operations:
  • View the intersection of a namespace and a resource type.
    • If the intersection is displayed only in red, all the resources of this type in this namespace are abnormal. You can move the pointer over the intersection to view the number of abnormal resources.
    • If the intersection is displayed only in blue, all the resources of this type in this namespace are normal. You can move the pointer over the intersection to view the number of normal resources.
    • If the intersection is displayed in red and blue, both abnormal and normal resources of this type exist in this namespace. You can move the pointer over the intersection to view the number of abnormal resources.
    • If the intersection is empty, no resources of this type exist in this namespace.
    Note The larger the size of an intersection is, the more resources of the specified type exist in the specified namespace.
  • Click the red part in the intersection of a namespace and a resource type to view all the abnormal resources of this type in this namespace. For more information, see Overview of resources.
  • Click the blue part in the intersection of a namespace and a resource type to view all the normal resources of this type in this namespace. For more information, see Overview of resources.
  • Search for namespaces or change the order of the namespaces.
    Note By default, namespaces are sorted based on the number of abnormal resources.
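
As a rough illustration of how the intersection colors and the default ordering relate to the underlying counts, consider the following Python sketch. The function names `cell_color` and `default_order` are hypothetical, and the sort direction (most abnormal resources first) is an assumption, because the note above does not state it.

```python
def cell_color(normal, abnormal):
    """Classify a namespace/resource-type intersection by the colors it shows."""
    if normal == 0 and abnormal == 0:
        return "empty"          # no resources of this type in the namespace
    if normal == 0:
        return "red"            # every resource is abnormal
    if abnormal == 0:
        return "blue"           # every resource is normal
    return "red and blue"       # a mix of abnormal and normal resources


def default_order(abnormal_counts):
    """Sort namespace names by abnormal-resource count, largest first (assumed)."""
    return sorted(abnormal_counts, key=abnormal_counts.get, reverse=True)


# Hypothetical counts of abnormal resources per namespace.
counts = {"default": 1, "kube-system": 0, "prod": 4}
print(default_order(counts))
print(cell_color(normal=3, abnormal=1))
```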

Performance data of services

The Service Performance Overview section displays the performance data of the services in the current Kubernetes cluster in the selected time range. The performance data includes the top 10 services by the total number of requests, the top 10 services by the number of failed requests, the top 10 services by the average response time, the time series curves of the preceding three metrics, and the namespace to which each service belongs.

Figure: Kubernetes Monitoring, overview of top 10 services
In the Service Performance Overview section, you can perform the following operations:
  • Move the pointer over the curve of a metric for a service to view the statistics at a specific point in time.
  • Click a namespace to view all the services in the namespace. For more information, see Service tab.
  • Click the name of a service to view the details of the service. For more information, see Service details.
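
The three top 10 rankings described in this section can be reproduced from per-service statistics with a simple sort. The following Python sketch is an illustration under assumptions: the `ServiceStats` fields and the `top_n` helper are hypothetical and do not correspond to an ARMS API or export format.

```python
from dataclasses import dataclass


@dataclass
class ServiceStats:
    """Hypothetical per-service totals for the selected time range."""
    name: str
    namespace: str
    requests: int
    failed_requests: int
    avg_response_ms: float


def top_n(stats, metric, n=10):
    """Return up to n services with the highest value of the given metric."""
    return sorted(stats, key=lambda s: getattr(s, metric), reverse=True)[:n]


stats = [
    ServiceStats("cart", "shop", 1200, 3, 45.0),
    ServiceStats("pay", "shop", 800, 12, 120.5),
    ServiceStats("search", "web", 5000, 0, 30.2),
]

for metric in ("requests", "failed_requests", "avg_response_ms"):
    ranked = top_n(stats, metric)
    print(metric, [(s.name, s.namespace) for s in ranked])
```

The same ranking applies to Deployments if the records describe Deployments instead of services.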

Performance data of Deployments

The Deployment Performance Overview section displays the performance data of the Deployments in the current Kubernetes cluster in the selected time range. The performance data includes the top 10 Deployments by the total number of requests, the top 10 Deployments by the number of failed requests, the top 10 Deployments by the average response time, the time series curves of the preceding three metrics, and the namespace to which each Deployment belongs.

Figure: Kubernetes Monitoring, overview of top 10 Deployments
In the Deployment Performance Overview section, you can perform the following operations:
  • Move the pointer over the curve of a metric for a Deployment to view the statistics at a specific point in time.
  • Click a namespace to view all the Deployments in the namespace. For more information, see Deployment tab.
  • Click the name of a Deployment to view the details of the Deployment. For more information, see Deployment details.

Information about node resource usage

The Node Resource Overview section displays the information about the usage of node resources in the current Kubernetes cluster in the selected time range. The information includes the top 10 nodes by CPU utilization, the top 10 nodes by memory usage, the top 10 nodes by disk usage, and the time series curves of the preceding three metrics.

Note If the average CPU utilization, memory usage, or disk usage of a node is greater than 70%, the node is considered abnormal. In this case, the corresponding time series curve is displayed in red.
Figure: Kubernetes Monitoring, node list
In the Node Resource Overview section, you can perform the following operations:
  • Move the pointer over the curve of a metric for a node to view the statistics at a specific point in time.
  • Click View All Nodes in the upper-right corner of the Node Resource Overview section to view all the nodes in the current Kubernetes cluster. For more information, see Node list.
  • Click the name of a node to view the details of the node. For more information, see Node details.
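
The 70% rule from the note in this section can be written as a one-line check. The following Python sketch assumes that the three values are averages over the selected time range, expressed as percentages; the names `THRESHOLD_PERCENT` and `is_node_abnormal` are hypothetical.

```python
THRESHOLD_PERCENT = 70.0  # threshold stated in the note for this section


def is_node_abnormal(cpu, memory, disk):
    """Return True if any average usage value is strictly greater than 70%."""
    return any(value > THRESHOLD_PERCENT for value in (cpu, memory, disk))


# Hypothetical nodes with (CPU, memory, disk) average usage in percent.
nodes = {
    "node-a": (35.0, 82.5, 40.0),
    "node-b": (70.0, 10.0, 10.0),
}
for name, usage in nodes.items():
    print(name, "abnormal" if is_node_abnormal(*usage) else "normal")
```

Because the note says "greater than 70%", a node at exactly 70% is treated as normal in this sketch.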

Contact us

If you have questions about how to use Kubernetes Monitoring, join the Kubernetes Monitoring DingTalk group (ID: 35568145) for technical support.