Elasticsearch: Configure cluster alerting

Last Updated: Aug 13, 2025

Alibaba Cloud Elasticsearch lets you monitor instances, customize alert thresholds, and receive alert notifications. With monitoring and alerting configured, you can track the cluster status and node disk usage in real time, receive prompt notifications about issues such as an abnormal cluster status or high node disk usage, and act before they affect the Elasticsearch service. This topic describes two ways to configure alerting for an Elasticsearch instance: one-click alerting and CloudMonitor alerting.

Enable one-click alerting

The one-click alerting feature for Alibaba Cloud Elasticsearch is provided by CloudMonitor and is disabled by default. When you enable this feature, the system automatically creates alert rules for issues such as an abnormal cluster status, high node disk usage (> 75%), and high node JVM heap memory usage (> 85%). These rules apply to all Elasticsearch instances in your Alibaba Cloud account.

  1. Log on to the Alibaba Cloud Elasticsearch console.

  2. In the left-side navigation pane, click Elasticsearch Clusters.

  3. On the Elasticsearch Instances page, click One-click Alerting.

  4. In the One-click Alerting dialog box, click Enable Now.

    Note

    If the button shows Disable Now, one-click alerting is already enabled. You do not need to perform the following steps.

  5. In the CloudMonitor console, turn on the One-click Alerting switch for the Elasticsearch service.

  6. (Optional) Return to the Alibaba Cloud Elasticsearch console to check whether the One-click Alerting feature is enabled.

    1. On the Elasticsearch Instances page, click the ID of the target instance.

    2. In the navigation pane on the left, choose Monitoring and Logs > Cluster Monitoring.

    3. Click the Infrastructure Monitoring tab. In the upper-right corner of the page, view the status of One-click Alerting.

      A status of Enabled confirms that one-click alerting is turned on.

Configure CloudMonitor alerting

  1. Go to the CloudMonitor console.

  2. In the navigation pane on the left, choose Alert Service > Alert Rules.

  3. Click Create Alert Rule.

  4. On the Create Alert Rule page, configure the alert rule.

    This example shows how to configure monitoring for Node Disk Usage, Cluster Status, and Node Heap Memory Usage_ES Service. The following table describes the parameters. You can use the default values for parameters that are not mentioned. For more information about the parameters, see Create an alert rule.

    Parameter

    Description

    Product

    Select Elasticsearch.

    Resource Range

    Select Instance.

    Associated Resources

    Add the instances to monitor.

    Rule Description

    Click Add Rule > Combined Metrics. In the Set Rule Description panel, enter a Rule Name and configure the following parameters:

    • Set Metric Type to Combined Metrics.

    • Set Alert Level to Warning (Warn).

    • Multi-metric Alert Description:

      Note

      This example configures three metrics. Click Add Metric to add a metric description.

      • Metric 1: Choose Cluster ID > Cluster Status and set the condition to Value >= 2.

      • Metric 2: Choose nodeName > Node Disk Usage and set the condition to Average >= 75%.

      • Metric 3: Choose nodeName > Node Heap Memory Usage_ES Service and set the condition to Average >= 85%.

    • Set Relationship Between Metrics to Alert If Any Condition Is Met (||).

    • Set Alert Threshold Triggers to 3 Consecutive Cycles (1 Cycle = 1 Minute).

    You can also configure an alert rule for a single metric to monitor disk usage. For more information, see Example of configuring a disk alert.

    Alert Contact Group

    Select an existing alert contact group. If you have not created one, see Create an alert contact or alert contact group.

    Note

    You can also expand Advanced Settings and enter a publicly accessible URL in the Alert Callback field. CloudMonitor then pushes alert notifications to the URL using POST requests. Only the HTTP protocol is supported. For more information, see Use the alert callback feature.

    When you configure alert rules for Alibaba Cloud Elasticsearch, you can refer to the following metric descriptions. For more information about metrics, see Metric descriptions and troubleshooting suggestions.

    | Metric | Description |
    | --- | --- |
    | Cluster Status | Required. Monitors whether the cluster status is normal (green) or abnormal (yellow or red). The statuses Green, Yellow, and Red correspond to the numeric values 0.00, 1.00, and 2.00. Configure the threshold for the Cluster Status metric based on these values. |
    | Node Disk Usage (%) | Required. Keep the alert threshold at or below 75%. Do not set it above 80%. |
    | Node Heap Memory Usage (%) | Required. Keep the alert threshold at or below 85%. Do not set it above 90%. |
    | Node CPU Utilization (%) | Optional. Set the alert threshold to 95% or lower. |
    | Node Load_1m | Optional. Use 80% of the number of CPU cores as a reference value. |
    | Cluster Query QPS (Count/Second) | Optional. Use your actual test results as a reference. |
    | Cluster Write QPS (Count/Second) | Optional. Use your actual test results as a reference. |
    | Full GC Count (count) | Optional. A value other than 0 indicates that the service is abnormal. |
    | Exception Count (count) | Optional. A value other than 0 indicates that the service is abnormal. |
    | Snapshot Status | Optional. A value of -1 or 0 indicates that the service is normal. A value of 2 indicates that the service is abnormal. |

  5. Click Confirm.

    Once the alert rule is configured, the designated alert contacts are notified through the configured notification methods whenever the rule is triggered.
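
The combined rule configured in step 4 can be read as a small piece of logic: three conditions joined with OR, which must hold for three consecutive one-minute cycles. The following Python sketch illustrates that reading only. It is not CloudMonitor code, the `CycleSample` structure and function names are hypothetical, and it assumes that a breach by any of the three metrics in a cycle counts toward the consecutive-cycle streak.

```python
from dataclasses import dataclass
from typing import Iterable

# Thresholds taken from the example rule in step 4.
STATUS_RED = 2          # Green = 0, Yellow = 1, Red = 2
DISK_PCT = 75.0         # Node Disk Usage, average
HEAP_PCT = 85.0         # Node Heap Memory Usage, average
CONSECUTIVE_CYCLES = 3  # 1 cycle = 1 minute


@dataclass
class CycleSample:
    """Metric values for one monitoring cycle (hypothetical structure)."""
    cluster_status: float
    disk_usage_avg: float
    heap_usage_avg: float


def any_condition_met(sample: CycleSample) -> bool:
    """Alert If Any Condition Is Met (||): OR across the three metrics."""
    return (
        sample.cluster_status >= STATUS_RED
        or sample.disk_usage_avg >= DISK_PCT
        or sample.heap_usage_avg >= HEAP_PCT
    )


def rule_fires(samples: Iterable[CycleSample]) -> bool:
    """Return True once the OR condition has held for 3 cycles in a row."""
    streak = 0
    for sample in samples:
        streak = streak + 1 if any_condition_met(sample) else 0
        if streak >= CONSECUTIVE_CYCLES:
            return True
    return False


healthy = CycleSample(0, 40.0, 60.0)
disk_high = CycleSample(0, 78.0, 60.0)
heap_high = CycleSample(0, 40.0, 90.0)

print(rule_fires([disk_high, disk_high, healthy, disk_high]))  # False
print(rule_fires([disk_high, heap_high, disk_high]))           # True
```

In the first call the healthy cycle resets the streak, so the rule does not fire. In the second call a different metric breaches in the middle cycle, which still counts under the assumption stated above.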
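
The Alert Callback option mentioned in step 4 pushes alert notifications to your URL as HTTP POST requests. A minimal receiver might look like the Python sketch below. This topic does not describe the payload format, so the sketch assumes no field names and simply decodes whatever body arrives. The `decode_body` helper, the `AlertCallbackHandler` class, and port 8080 are illustrative choices; see Use the alert callback feature for the actual request format.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs


def decode_body(content_type: str, body: bytes):
    """Decode a POST body without assuming any particular field names."""
    text = body.decode("utf-8", errors="replace")
    kind = content_type.split(";")[0].strip().lower()
    if kind == "application/x-www-form-urlencoded":
        # parse_qs returns a list per key; keep the first value of each.
        return {key: values[0] for key, values in parse_qs(text).items()}
    if kind == "application/json":
        try:
            return json.loads(text)
        except ValueError:
            return text
    return text


class AlertCallbackHandler(BaseHTTPRequestHandler):
    """Accepts POST requests and logs the decoded payload."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        payload = decode_body(self.headers.get("Content-Type", ""), body)
        print("alert callback received:", payload)
        self.send_response(200)
        self.end_headers()


def serve(port: int = 8080) -> None:
    """Listen over plain HTTP, because only HTTP callbacks are supported."""
    HTTPServer(("0.0.0.0", port), AlertCallbackHandler).serve_forever()
```

Calling `serve()` starts the listener. The host must be reachable from the internet at the URL you enter in the Alert Callback field.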

Example of configuring a disk alert

You can configure a disk usage alert in CloudMonitor to promptly receive alert notifications about the disk usage of instance nodes and handle related issues.

To configure a disk alert, follow the steps in Configure CloudMonitor alerting. The following table shows an example configuration for the alert rule.

| Parameter | Example |
| --- | --- |
| Rule Name | Set to Disk Usage Alert. |
| Metric Type | Select Simple Metric. |
| Metric | Choose nodeName > Node Disk Usage. |
| Threshold And Alert Level | Critical: Average over 3 consecutive cycles >= 80%. Warning: Average over 3 consecutive cycles >= 75%. Info: Average over 3 consecutive cycles >= 70%. |
| Monitoring Chart Preview | A preview of the monitoring chart for the metric. |
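
The tiered thresholds in this example can be illustrated with a short Python sketch. It is not CloudMonitor code. The function name is hypothetical, and the sketch assumes that when several levels match, the highest severity is the one reported.

```python
from typing import Optional, Sequence

# Levels and thresholds from the example table, highest severity first.
LEVELS = (("Critical", 80.0), ("Warning", 75.0), ("Info", 70.0))
CONSECUTIVE_CYCLES = 3


def disk_alert_level(cycle_averages: Sequence[float]) -> Optional[str]:
    """Return the highest level that the last 3 cycle averages all reach."""
    recent = cycle_averages[-CONSECUTIVE_CYCLES:]
    if len(recent) < CONSECUTIVE_CYCLES:
        return None  # not enough cycles observed yet
    for level, threshold in LEVELS:
        if all(value >= threshold for value in recent):
            return level
    return None


print(disk_alert_level([81.0, 82.0, 80.0]))  # Critical
print(disk_alert_level([76.0, 77.0, 75.0]))  # Warning
print(disk_alert_level([72.0, 76.5, 81.0]))  # Info
print(disk_alert_level([69.0, 81.0, 82.0]))  # None
```

The third call returns Info rather than Critical because only one of the three cycles reached 80%. A level applies only when all three consecutive cycle averages meet its threshold.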