Container Service for Kubernetes: ack-sysom-monitor

Last Updated: Nov 08, 2024

ack-sysom-monitor is an operating system kernel-level container monitoring component provided by Container Service for Kubernetes for ACK clusters. This topic introduces ack-sysom-monitor and describes the usage notes and release notes for ack-sysom-monitor.

Introduction

System Operations & Maintenance (SysOM) is a one-stop operating system O&M platform developed by the SysOM SIG in the OpenAnolis community. The platform supports complex operating system management tasks such as host management, system monitoring, anomaly diagnosis, log auditing, and security control. ack-sysom-monitor is the SysOM monitoring component. It collects kernel metrics and enhances the kernel. The component consumes only a small amount of resources and uses extended Berkeley Packet Filter (eBPF) technology to collect node and container metrics. It can help resolve issues such as system jitter, high latency, resource leaks, and pod memory anomalies. For more information, see SysOM kernel-level container monitoring.

The ack-sysom-monitor component exposes the metrics that it collects as standard Prometheus metrics. For more information about the metrics, see Metrics.
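Because the metrics use the standard Prometheus text exposition format, any Prometheus-compatible tooling can read them. The following is a minimal sketch of parsing that format in Python. The metric names and labels in the sample (`example_pod_cache_bytes`, `example_node_load`) are hypothetical placeholders, not the component's real metric names. The parser is also simplified: it does not unescape label values. For the actual metric names, see Metrics.

```python
import re

# Hypothetical sample in Prometheus text exposition format. The metric
# names and labels are placeholders, not real ack-sysom-monitor metrics.
SAMPLE = """\
# HELP example_pod_cache_bytes Illustrative gauge (hypothetical name).
# TYPE example_pod_cache_bytes gauge
example_pod_cache_bytes{namespace="default",pod="web-0"} 1048576
example_pod_cache_bytes{namespace="default",pod="web-1"} 2097152
example_node_load 0.5
"""

# name, optional {labels}, value, optional integer timestamp
LINE_RE = re.compile(
    r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)(?:\s+-?\d+)?$'
)
# key="value" pairs; escaped characters are matched but not unescaped
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_metrics(text):
    """Return a list of (name, labels, value) tuples from exposition text."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and # HELP / # TYPE comment lines.
        if not line or line.startswith("#"):
            continue
        match = LINE_RE.match(line)
        if match is None:
            continue  # ignore lines this simplified parser cannot read
        name, raw_labels, value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


if __name__ == "__main__":
    for name, labels, value in parse_metrics(SAMPLE):
        print(name, labels, value)
```

In production, a maintained client library or Managed Service for Prometheus itself would normally handle parsing. The sketch only illustrates what "standard Prometheus metrics" means on the wire.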

Billing of ack-sysom-monitor

After the ack-sysom-monitor component is enabled, related components automatically send monitoring metrics to Managed Service for Prometheus. These metrics are treated as custom metrics, and custom metrics are billed.

Before you enable this feature, we recommend that you read Billing overview to understand the billing rules of custom metrics. The fees may vary based on the cluster size and number of applications. You can follow the steps in View resource usage to monitor and manage resource usage.

Usage notes

For more information about how to use the ack-sysom-monitor component, see Enable ack-sysom-monitor.

Release notes

October 2024

Version: 1.1.0

Image address: registry-vpc.{{.Region}}.aliyuncs.com/acs/ack-sysom-monitor:v1.1.0-6dcd0ef-aliyun

Release date: 2024-10-28

Description:

  • The issue that cgroup V2 fails to recognize container changes is fixed.

  • The startup error is fixed.

Impact: No impact on workloads

August 2024

Version: 1.0.9

Image address: registry-vpc.{{.Region}}.aliyuncs.com/acs/ack-sysom-monitor:v1.0.9-ce26b3a-aliyun

Release date: 2024-08-09

Description:

  • The ARM64 architecture is supported.

  • The cgroup V2 interface is supported.

  • GPU and memory usage can be monitored.

  • Cached files on the node and shared memory can be displayed in the Pod File Cache panel.

Impact: No impact on workloads

May 2024

Version: 1.0.8

Image address: registry-vpc.{{.Region}}.aliyuncs.com/acs/ack-sysom-monitor:v1.0.8-27e0d9c-aliyun

Release date: 2024-05-17

Description:

  • ConfigMaps can be configured to expose metrics for SysOM.

  • By default, metrics for system namespaces are exposed.

  • Pod page cache monitoring can be used to monitor the page cache usage of the root cgroup.

  • The issue that delays caused by memory reclamation are not recorded in CentOS is fixed.

  • Pod CPU utilization anomalies are fixed.

Impact: No impact on workloads

December 2023

Version: 1.0.7

Image address: registry-vpc.{{.Region}}.aliyuncs.com/acs/ack-sysom-monitor:v1.0.7-5d8a59d-aliyun

Release date: 2023-12-12

Description:

  • The double free issue is fixed for PodMem.

  • The BTF file error in kernel version 5.10 is fixed for PodMem.

  • Resolution failures that occur when the number of metrics of the elastic network interface (ENI) used by the container network plug-ins is excessively large are fixed.

  • The data missing issue in kernel version 3.10 of CentOS 7.9 is fixed for PodMem.

Impact: No impact on workloads

September 2023

Version: 1.0.5

Image address: registry-vpc.{{.Region}}.aliyuncs.com/acs/ack-sysom-monitor:v1.0.6-0085be5-aliyun

Release date: 2023-09-21

Description: Operating system kernel-level node and pod memory monitoring is supported.

Impact: No impact on workloads
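The image addresses in the release notes contain a `{{.Region}}` placeholder that stands for the region ID of the cluster. As a minimal sketch, the placeholder can be filled in with plain string substitution. The helper name `render_image_address` is hypothetical, and `cn-hangzhou` is used only as an example region ID. Substitute the region of your own cluster.

```python
# Image address template copied from the 1.1.0 release note entry.
IMAGE_TEMPLATE = (
    "registry-vpc.{{.Region}}.aliyuncs.com/acs/"
    "ack-sysom-monitor:v1.1.0-6dcd0ef-aliyun"
)

PLACEHOLDER = "{{.Region}}"


def render_image_address(template, region):
    """Replace the {{.Region}} placeholder with a concrete region ID.

    Hypothetical helper for illustration only; it is not part of
    ack-sysom-monitor or of any Alibaba Cloud SDK.
    """
    if PLACEHOLDER not in template:
        raise ValueError("template does not contain the {{.Region}} placeholder")
    if not region:
        raise ValueError("region must be a non-empty string")
    return template.replace(PLACEHOLDER, region)


if __name__ == "__main__":
    # "cn-hangzhou" is an example value, not a recommendation.
    print(render_image_address(IMAGE_TEMPLATE, "cn-hangzhou"))
    # prints: registry-vpc.cn-hangzhou.aliyuncs.com/acs/ack-sysom-monitor:v1.1.0-6dcd0ef-aliyun
```

The `registry-vpc` prefix indicates a VPC endpoint, so the rendered address is expected to be reachable only from within a VPC in that region.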