Application Real-Time Monitoring Service (ARMS) Kubernetes Monitoring allows you to view the network topologies of resources in a cluster, such as Services, workloads, nodes, and pods. You can also view the network topology relationships between each resource and the related cloud services. This topic describes how to view the network topologies of a cluster.

Procedure

  1. Log on to the ARMS console. In the left-side navigation pane, click Kubernetes Monitoring.
  2. On the Kubernetes Monitoring page, select a region from the top navigation bar and click the name of the cluster that you want to manage.
  3. In the left-side navigation pane, click Cluster Topology.
    By default, the Cluster Topology page displays the network topologies of workloads (Deployments, StatefulSets, and DaemonSets) in all namespaces and the network topology relationships between each workload and the related cloud services. Container monitoring - Deployment topology
  4. Optional:In the upper-left corner of the Cluster Topology page, select Service view, Node View, or Pod View to view the network topologies of Services, nodes, or pods, and the network topology between each resource and the related cloud services.
    Container monitoring - Service topology

Query topology nodes

You can perform the following operations to query topology nodes:

  • Select a namespace or cloud service from the Node Filtering drop-down list or search for a topology node by keyword to view the resource list of the specified namespace or cloud service.
  • Click the Save icon icon next to the Node Filtering drop-down list to save the current filter conditions.
    Note The Noe Filtering drop-down list displays the saved filter conditions. You can click a filter condition to quickly view the cluster topology that matches the filter condition.
    • Click the Edit icon icon next to the filter condition to modify the name of the filter condition and specify whether to set the cluster topology corresponding to the current filter condition as the default view.
    • Click the Delete icon icon next to the filter condition to delete the filter condition.
  • Click the Delete icon icon next to the Node Filtering drop-down list to delete all the selected filter conditions.
  • Click the Query icon icon next to the Node Filtering drop-down list to query the specified topology node.
  • Click the Refresh icon icon next to the Node Filtering drop-down list to refresh the current topology.
  • Select a time range to query in the upper-right corner of the Cluster Topology page. For example, you can query the network topologies of the last 30 minutes, last 1 hour, or last 12 hours.

View network topologies

You can perform the following operations on network topologies:
  • If a number icon is displayed in the lower-right corner of a namespace node, alert events exist in the current namespace. You can click the number icon to view the alert event list of the namespace. For more information, see Alert events. For information about how to create alert rules, see Create alert rules for Kubernetes Monitoring.
    Note
    • Before you view alert events, you must integrate a cloud service into Prometheus Service and create a Prometheus instance. For more information, see Configure alert rules for Alibaba Cloud services.
    • The number in each namespace indicates the number of resources in the namespace. The color of the circle around the number indicates the alert event status of the namespace. Red indicates that the alert event status is Critical, yellow indicates that the alert event status is Warning, gray indicates that the alert event status is Default, and blue indicates that no alert event is generated.
  • If a number icon is displayed in the upper-right corner of a resource node, alert events exist in the current resource. You can click the number icon to view the alert event list of the resource. For more information, see Alert events.
  • In a network topology, click the Show icon icon in the upper-right corner of a namespace node or cloud service node to show the current namespace or cloud service. Click the Hide icon icon in the upper-right corner to hide the current namespace or cloud service.
  • Move the pointer over a resource node to view the number of requests, the number of errors, and the response time of the resource. Click a resource node to view its details in the panel that appears. For more information, see Topology node details.
    Note You can view the node details only of the cloud services that are monitored by Alibaba Cloud Prometheus Service.
  • Move the pointer over the line between two resource nodes to view the number of requests, the number of errors, and the response time between the resources. Click the line between two resource nodes to view its details in the panel that appears. For more information, see Details of a line between topology nodes.
  • Move the pointer over a resource node and click Upstream and Downstream to query the upstream and downstream topologies of the resource.
In the lower part of a network topology, you can perform the following operations:
  • If the Tip icon icon is displayed at the bottom of the page, some cloud services associated with the current Kubernetes cluster are not monitored by Alibaba Cloud Prometheus Service. Click the icon to go to the Monitor Cloud Services page. For more information, see Create a Prometheus instance to monitor Alibaba Cloud services. If a cloud service is monitored by Alibaba Cloud Prometheus Service, click the cloud service node in the topology to view the details of the cloud service.
  • Click the Alert icon icon at the bottom of the page to view all alert events in the current Kubernetes cluster.
  • Click the Expand icon icon to show all namespaces.
  • Click the Fold icon icon to hide all namespaces.
  • Click the Adjust Topology icon icon to adjust the topology size to fit the page.
  • Click the Zoom In icon icon or scroll the mouse wheel up to zoom in the topology.
  • Click the Zoom Out icon icon or scroll the mouse wheel down to zoom out the topology.

Topology node details

Click a topology node in a network topology to view the details of the topology node in the panel that appears.

Topology node details panel
  • On the Performance tab, you can click the Expand icon in the Protocol drop-down list to view the number of requests, number of errors, average response time, number of slow calls, number of instances, and the time series curve of topology nodes that correspond to each protocol.

    In the Requests, Errors, Average Response Time, Slow Call, and Instances sections, click View List to view the details of each metric. For more information, see Protocol-related metrics.

  • On the Resources tab, you can view the expected number and actual number of instances for the current resource.
  • On the Network tab, you can view the number of new TCP connections and the average latency of successful TCP connections.
  • On the Associate Upstream and Downstream tab, you can view the upstream and downstream resource list of the current resource.

Details of a line between topology nodes

Click a line between two topology nodes in a network topology to view the details of the line in the panel that appears.

Details of a line between topology nodes
  • On the Performance tab, you can click the Expand icon in the Protocol drop-down list to view the number of requests, number of errors, average response time, number of slow calls, and the time series curve of the lines between topology nodes that correspond to each protocol.

    In the Requests, Errors, Average Response Time, Slow Call, and Instances sections, click View List to view the details of each metric. For more information, see Protocol-related metrics.

  • On the Network tab, you can view the network data of the current line.

Alert events

Click the number in a namespace node to view the alert event list of the namespace and all alert rules in the current Kubernetes cluster.

Alert event list

The Alert object tab displays the list of resources that trigger alerts. The list includes the resource name, namespace, and alert level.

The Alert events tab displays the content of the reported alert events. On the Alert events tab, you can perform the following operations:
The Alert rules tab displays all alert rules of the current Kubernetes cluster. On the Alert rules tab, you can perform the following operations: