Cluster inspection, provided by Container Intelligent Service (CIS), scans your Alibaba Cloud Container Compute Service (ACS) cluster for potential risks — such as insufficient cloud resource quotas or high usage of key Kubernetes resources — and suggests solutions for each identified issue. Use this feature to proactively monitor cluster health and address problems before they affect your workloads.
Prerequisites
Before you begin, make sure you have:
-
An ACS cluster. For more information, see Create an ACS cluster.
-
The cluster in the Running state.
To check the cluster state, log on to the ACS console and go to the Clusters page. If the Cluster Status column shows Running, the cluster is operating normally.
Configure cluster inspection
When cluster inspection runs, a data collection program is deployed in the cluster to gather inspection results. The program collects the system version, system load, status of container components, and key error messages in system logs. It does not collect business data or sensitive data.
-
Log on to the ACS console. In the left-side navigation pane, click Clusters.
-
On the Clusters page, find the cluster you want to inspect and click its ID. In the left-side navigation pane of the cluster details page, choose Inspections and Diagnostics > Cluster Inspections.
-
In the Schedule Rule section of the Cluster Inspection page, click Add.
-
In the Set Inspection Rule pane, set TimeZone and Regular Task Rule (Daily), read the warning, select I know and agree, and then click Save.
After you save the schedule rule, the system inspects the cluster automatically at the scheduled time.
To run an inspection immediately, click Start in the Reports section. The results appear in the Reports section after the inspection completes.
View inspection results
-
Log on to the ACS console. In the left-side navigation pane, click Clusters.
-
On the Clusters page, find the target cluster and click its ID. In the left-side navigation pane of the cluster details page, choose Inspections and Diagnostics > Cluster Inspections.
-
In the Reports section, click Details in the Operation column for the report you want to view.
-
On the Report page, review the identified risks, their impacts, and suggested solutions.
Risks are displayed in ascending order of severity levels, including Warning, Error, and Critical. Critical-level risks are highlighted.
For a full list of check items and how to resolve each risk type, see Check items and solutions.
What's next
Manage the inspection rule
In the Schedule Rule section, you can:
-
Click Edit to update the inspection schedule.
-
Click Delete to remove a rule. Once deleted, the cluster is no longer inspected on that schedule.
Run cluster diagnostics
After reviewing inspection results, use cluster diagnostics to investigate and troubleshoot the identified issues. For more information, see Work with cluster diagnostics.