All Products
Search
Document Center

Application Real-Time Monitoring Service:Helm versions

Last Updated:Dec 05, 2023

This topic describes the Helm versions that are provided by Managed Service for Prometheus.

2023

Helm version

Agent image version

Description

Release date

v1.1.17

arms-prom-operator:v4.0.0

  • Metric collection jobs can be created for cluster events. Cluster events can be displayed in the Kubernetes Deployment dashboard.

  • Self-monitoring metrics can be instrumented based on the service-level agreement (SLA) to stabilize the dashboard data. SLA stability data can be displayed in a self-monitoring dashboard.

  • ServiceMonitor supports the BasicAuth authentication method. Secrets must be in the same namespace as ServiceMonitor.

  • Metrics Metadata capabilities are provided to display the description of specific metrics.

  • The Agent Chart version can be passed to the server. Then, the server initializes or updates the dashboard based on the version.

  • Remote write self-monitoring metrics are supported to calculate the time consumed to send data in each batch.

  • Metrics about the errors and latency of basic metric collection are supported.

  • Metrics about the errors and latency of business metric collection are supported.

  • The queue_config parameter in remote write settings supports the following default values: min_shards=10, max_samples_per_send=5000, and capacity=10000. This improves the adaptability of large-scale clusters.

  • The service discovery methods, especially the PV settings of Container Storage Interface (CSI) data collection, are optimized.

  • The senderLoop distribution frequency is optimized, and the syncWorkersSeries frequency is modified to reduce unnecessary disturbances.

  • Some logs are simplified. Detailed information, such as the time consumed for trace capturing, can be displayed in some logs.

  • The collection period and collection timeout settings of basic metric collection jobs are separately configured, and the global configurations are no longer used. This reduces unnecessary interference on basic metric data collection.

  • The interaction logic in master-slave multi-replica mode is optimized. The Masters and Workers no longer affect each other. This helps improve stability.

  • The policy that specifies how the Master distributes Targets is optimized. This saves about 30% CPU utilization and 40% memory resources, and improves data collection performance.

  • metrics_relabel is optimized. CPU utilization is reduced by 70%.

  • The multi-tenancy listening logic of Informer is optimized to save CPU utilization by 20% in multi-tenancy scenarios.

  • Cache IP addresses can be automatically used if CoreDNS fails to resolve domain names in real time. This improves the success rate of data transmission.

  • The distribution and collection configuration logic of SendConfig is optimized to improve configuration stability.

  • The Master prefetching policy is optimized to reduce the resource overhead of Master, and improve Master service discovery and target scheduling capabilities.

  • Adaptive control is implemented on data packets that exceed 1 MB in size in a single batch. This reduces data loss caused by backend restrictions.

  • The issue that some ScrapeLoop Targets are repeatedly collected is fixed.

  • In multi-tenancy scenarios, the Label caches of pods are not updated in a timely manner. As a result, duplicate timelines are generated. This issue is fixed.

  • Some targets related to out-of-memory (OOM) errors or replica restarting are not collected. This issue is fixed.

  • Secret parsing issues and remote write Header transmission issues are fixed.

  • Occasionally, the Kubernetes-pods cannot be shut down. This issue is fixed.

  • The issue that the global default parameters and the external_labels parameter do not take effect is fixed. Parameters can be modified.

August 2023

v1.1.15

arms-prom-operator:v4.0.0

Helm is compatible with the Kubernetes version 1.26 of ACK clusters.

May 2023

v1.1.14

arms-prom-operator:v4.0.0

  • The memory usage is reduced by about 30% and the CPU utilization is reduced by about 50%.

  • Dependencies on CoreDNS domain name resolution are reduced, and data transmission stability is enhanced.

  • ServiceMonitor supports the BasicAuth authentication method.

  • Secret parsing issues of RemoteWrite are fixed.

  • Three self-monitoring metrics are released.

  • The description of specific metrics can be displayed in metrics metadata.

  • Metric collection jobs can be created for cluster events.

  • The multi-master mechanism is launched. It can handle service discovery and target scheduling in ultra-large clusters. The feature is disabled by default.

  • More than three bugs are fixed.

v1.1.13

gpu-prometheus-exporter:v2.3.6-994eaf7-aliyun

  • GPU-Exporter v2.3.6-994eaf7-aliyun is upgraded.

  • Clusters registered in ACK One can be monitored.

April 2023

v1.1.12

  • arms-prom-operator:v3.2.1

  • gpu-prometheus-exporter:v2.3.6-fdb40f2-aliyun

  • GPU-Exporter v2.3.6-fdb40f2-aliyun is upgraded.

  • Component image pulling is accelerated.

February 2023

2022

Helm version

Agent image version

Description

Release date

v1.1.11

arms-prom-operator:v3.2.1

  • The service degradation feature is provided. If a remote write fails, data collection on the primary link stably continues.

  • The Global Config in collection jobs can be modified.

  • The Remote Write feature is enhanced. If CoreDNS fails to resolve a domain name, the cached IP address corresponding to the domain name is sent.

  • Remote Write allows you to configure multiple IP addresses.

December 2022

v1.1.9

arms-prom-operator:v3.2.0

  • The Prometheus agent supports the following CPU architectures: AMD64, ARM, ARM64, ppc64le, and s390x.

  • The self-monitoring capabilities of the Prometheus agent are enhanced.

  • The garbage collection (GC) policy of the Prometheus agent is optimized.

  • The multi-replica Target scheduling policy is optimized to prevent memory leaks of Worker nodes.

  • The memory degradation issue of the Prometheus agent is fixed.

  • The deadlock of boundary conditions in multi-replica scenarios is fixed.

  • The following service discovery capabilities are provided: IONOS, PuppetDB, Uyuni, and Vultr.

September 2022

v1.1.7

  • arms-prom-operator:v3.1.0

  • gpu-prometheus-exporter:v2.3.6-2.0.0-0c0440f

The metrics and dashboards for the latest GPU exporter are supported. For more information, see Enable GPU monitoring for a cluster.

July 2022

v1.1.6

  • arms-prom-operator:v3.1.0

  • gpu-prometheus-exporter:v1.0.1-26c5321

The data collection issue of GPU exporter V1x is fixed.

June 2022

v1.1.5

arms-prom-operator:v3.1.0

  • The integration center is supported.

  • Clusters that contain more than 10,000 nodes are supported.

  • ServiceMonitors and PodMonitors that are not created in the Managed Service for Prometheus console can be synchronized to Managed Service for Prometheus.

  • ServiceMonitors and PodMonitors that are not created in the Managed Service for Prometheus console can be discovered by using the service discovery feature.

    Annotations: arms.prometheus.io/discovery=true|false
  • The maximum number of agent HPA replicas can be configured by specifying parameters.

  • The fields related to the Job metric of Managed Service for Prometheus can be edited.

  • The configuration files related to ServiceMonitor, PodMonitor, and Prometheus.yaml can be verified.

  • CPU, memory usage, and system stability are optimized.

May 2022

2021

Helm version

Agent image version

Description

Release date

v1.1.0

arms-prom-operator:v3.0.0

  • PodMonitor is supported.

  • Custom namespaces are supported.

  • GPU metrics are supported by CloudMonitor.

  • Agent logs are available in the Managed Service for Prometheus console.

October 2021

v1.0.0

arms-prom-operator:v3.0.0

  • Monitoring jobs can be scheduled based on the volume of captured metric data.

  • Horizontal Pod Autoscaling (HPA) capabilities can be provided on demand.

  • The performance is optimized.

  • The following bug is fixed: The metric data related to disk usage is displayed as zero.

September 2021

v0.1.8

arms-prom-operator:v0.1

  • The update capabilities are optimized.

  • The access to self-managed Kubernetes clusters over the Internet is optimized.

  • The following bug is fixed: The region labels of cloud services are inaccurate.

July 2021

2020

Helm version

Agent image version

Description

Release date

v0.1.5

arms-prom-operator:v0.1

  • The clusters of Container Service for Kubernetes (ACK) V1.18 are supported.

  • Images can be pulled over internal endpoints.

October 2020

v0.1.4

arms-prom-operator:v0.1

  • Out-of-the-box Kubernetes monitoring, including pod monitoring, node monitoring, and resource monitoring, is supported. This feature is used to monitor the Kubernetes container runtime where applications are deployed.

  • Web-based component monitoring, including nine common components such as MySQL, Redis, Kafka, ZooKeeper, and NGINX, is supported. This feature is suitable for scenarios where applications depend on middleware.

  • Managed Service for Prometheus is fully managed. It provides the prometheus.yaml collection rule, Grafana dashboards, and alerting system. Data can be migrated from self-managed Prometheus systems to Alibaba Cloud.

  • The bug related to authentication is fixed.

July 2020

v0.1.3

arms-prom-operator:v0.1

The number of resources used by the Prometheus agent is limited.

April 2020

2019

Helm version

Agent image version

Description

Release date

v0.1.2

arms-prom-operator:v0.1

First release.

August 2019