Alibaba Cloud Elasticsearch lets you monitor instances, customize alert thresholds, and receive alert notifications. Configuring monitoring and alerting helps keep issues such as an abnormal cluster status or high node disk usage from affecting the Elasticsearch service: you can track the cluster status and node disk usage in real time, receive prompt alert notifications, and take preventive measures. This topic describes two ways to configure alerting for an Elasticsearch instance: one-click alerting and CloudMonitor alerting.
Enable one-click alerting
The one-click alerting feature for Alibaba Cloud Elasticsearch is provided by CloudMonitor and is disabled by default. When you enable this feature, the system automatically creates alert rules for issues such as an abnormal cluster status, high node disk usage (> 75%), and high node JVM heap memory usage (> 85%). These rules apply to all Elasticsearch instances in your Alibaba Cloud account.
Log on to the Alibaba Cloud Elasticsearch console.
In the left-side navigation pane, click Elasticsearch Clusters.
On the Elasticsearch Instances page, click One-click Alerting.

In the One-click Alerting dialog box, click Enable Now.
Note: If the button shows Disable Now, one-click alerting is already enabled. You do not need to perform the following steps.
In the CloudMonitor console, turn on the One-click Alerting switch for the Elasticsearch service.
(Optional) Return to the Alibaba Cloud Elasticsearch console to check whether the One-click Alerting feature is enabled.
On the Elasticsearch Instances page, click the ID of the target instance.
In the navigation pane on the left, choose .
Click the Infrastructure Monitoring tab. In the upper-right corner of the page, view the status of One-click Alerting.
A status of Enabled confirms that one-click alerting is turned on.

Configure CloudMonitor alerting
Go to the CloudMonitor console.
In the navigation pane on the left, choose .
Click Create Alert Rule.
On the Create Alert Rule page, configure the alert rule.
This example shows how to configure monitoring for Node Disk Usage, Cluster Status, and Node Heap Memory Usage_ES Service. The following table describes the parameters. You can use the default values for parameters that are not mentioned. For more information about the parameters, see Create an alert rule.
Parameter
Description
Product
Select Elasticsearch.
Resource Range
Select Instance.
Associated Resources
Add the instances to monitor.
Rule Description
Click . In the Set Rule Description panel, enter a Rule Name and configure the following parameters:
Set Metric Type to Combined Metrics.
Set Alert Level to Warning (Warn).
Multi-metric Alert Description:
Note: This example configures three metrics. Click Add Metric to add a metric description.
Metric 1: Choose and set the condition to Value >= 2.
Metric 2: Choose and set the condition to Average >= 75%.
Metric 3: Choose and set the condition to Average >= 85%.
Set Relationship Between Metrics to Alert If Any Condition Is Met (||).
Set Alert Threshold Triggers to 3 Consecutive Cycles (1 Cycle = 1 Minute).
You can also configure an alert rule for a single metric to monitor disk usage. For more information, see Example of configuring a disk alert.
Alert Contact Group
Select an existing alert contact group. If you have not created one, see Create an alert contact or alert contact group.
Note: You can also expand Advanced Settings and enter a publicly accessible URL in the Alert Callback field. CloudMonitor then pushes alert notifications to the URL using POST requests. Only the HTTP protocol is supported. For more information, see Use the alert callback feature.
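As a rough illustration of the receiving side of the alert callback, the following Python sketch accepts POST requests over HTTP and decodes the body. The payload format is not described in this topic, so the sketch assumes only that the body is either form-encoded or a JSON object and does not rely on any field names. The names parse_callback_body, AlertCallbackHandler, and serve, and the port 8080, are hypothetical choices for this example.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs


def parse_callback_body(content_type: str, body: bytes) -> dict:
    """Decode a POST body as JSON or form data without assuming field names."""
    text = body.decode("utf-8", errors="replace")
    if "application/json" in content_type.lower():
        # Assumption: a JSON body is a single object.
        return json.loads(text)
    # Otherwise treat the body as form data; keep the first value of each key.
    return {key: values[0] for key, values in parse_qs(text).items()}


class AlertCallbackHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that logs each alert notification it receives."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = parse_callback_body(
            self.headers.get("Content-Type", ""), self.rfile.read(length)
        )
        print("alert callback received:", payload)
        # Reply with 200 so the sender sees the push as delivered.
        self.send_response(200)
        self.end_headers()


def serve(port: int = 8080) -> None:
    """Listen for callbacks over plain HTTP; the port is an arbitrary choice."""
    HTTPServer(("0.0.0.0", port), AlertCallbackHandler).serve_forever()
```

Calling serve() on a host that is reachable at the URL entered in the Alert Callback field would print each notification as it arrives. Check the actual payload fields against Use the alert callback feature before relying on them.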
When you configure alert rules for Alibaba Cloud Elasticsearch, you can refer to the following metric descriptions. For more information about metrics, see Metric descriptions and troubleshooting suggestions.
Metric | Description |
Cluster Status | Required. Monitors whether the cluster status is normal (green) or abnormal (yellow or red). The cluster statuses Green, Yellow, and Red correspond to the numeric values 0.00, 1.00, and 2.00. Configure the threshold for the Cluster Status metric based on these values. |
Node Disk Usage (%) | Required. Set the alert threshold to less than 75%. Do not exceed 80%. |
Node Heap Memory Usage (%) | Required. Set the alert threshold to less than 85%. Do not exceed 90%. |
Node CPU Utilization (%) | Optional. Set the alert threshold to less than or equal to 95%. |
Node Load_1m | Optional. Use 80% of the number of CPU cores as a reference value. |
Cluster Query QPS (Count/Second) | Optional. Use the actual test results as a reference. |
Cluster Write QPS (Count/Second) | Optional. Use the actual test results as a reference. |
Full GC Count (count) | Optional. A value other than 0 indicates that the service is abnormal. |
Exception Count (count) | Optional. A value other than 0 indicates that the service is abnormal. |
Snapshot Status | Optional. A value of -1 or 0 indicates that the service is normal. A value of 2 indicates that the service is abnormal. |
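Two of the reference values above are simple lookups or arithmetic, and the following Python sketch shows one way to derive them when planning thresholds. The function names status_value and load_1m_reference are hypothetical helpers for illustration and are not part of any Alibaba Cloud SDK.

```python
# Numeric values that the Cluster Status metric reports for each status.
CLUSTER_STATUS_VALUE = {"green": 0.0, "yellow": 1.0, "red": 2.0}


def status_value(status: str) -> float:
    """Return the numeric metric value for a cluster status name."""
    return CLUSTER_STATUS_VALUE[status.strip().lower()]


def load_1m_reference(cpu_cores: int) -> float:
    """Return 80% of the CPU core count as a Node Load_1m reference value."""
    return 0.8 * cpu_cores


# Example: a rule that should fire on yellow or red uses a threshold of 1.0,
# and a node with 8 cores has a Load_1m reference value of about 6.4.
yellow_or_worse_threshold = status_value("Yellow")
load_reference_8_cores = load_1m_reference(8)
```

For instance, a condition of Value >= status_value("Red") fires only when the cluster is red, whereas Value >= status_value("Yellow") also fires on yellow.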
Click Confirm.
After the alert rule is configured, the designated alert contacts receive alert notifications through the configured notification methods whenever a condition in the rule is met.
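The logic of the example rule (three metrics joined by ||, triggered after 3 consecutive one-minute cycles) can be sketched in Python as follows. This is only an illustration of the rule's semantics under stated assumptions, not a description of how CloudMonitor evaluates rules internally: each Sample is assumed to hold the already aggregated values for one cycle, and the names Sample, breaches, and should_alert are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Sample:
    """Aggregated metric values for one monitoring cycle (assumed 1 minute)."""
    cluster_status: float  # 0 = green, 1 = yellow, 2 = red
    disk_usage_pct: float  # average node disk usage in the cycle
    heap_usage_pct: float  # average node heap memory usage in the cycle


def breaches(sample: Sample) -> bool:
    """Return True if any of the three example conditions is met (||)."""
    return (
        sample.cluster_status >= 2
        or sample.disk_usage_pct >= 75
        or sample.heap_usage_pct >= 85
    )


def should_alert(samples: Iterable[Sample], required_cycles: int = 3) -> bool:
    """Return True once the rule is breached in required_cycles cycles in a row."""
    streak = 0
    for sample in samples:
        # A cycle without a breach resets the consecutive count.
        streak = streak + 1 if breaches(sample) else 0
        if streak >= required_cycles:
            return True
    return False
```

Under these assumptions, a single spike in disk usage does not trigger an alert, but three cycles in a row in which any one of the conditions holds does, even if a different metric is responsible in each cycle.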
Example of configuring a disk alert
You can configure a disk usage alert in CloudMonitor to promptly receive alert notifications about the disk usage of instance nodes and handle related issues.
To configure a disk alert, see Configure CloudMonitor alerting. The following table shows an example configuration for the alert rule.
Parameter | Example |
Rule Name | Set to Disk Usage Alert. |
Metric Type | Select Simple Metric. |
Metric | Choose . |
Threshold And Alert Level | |
Monitoring Chart Preview | A preview of the monitoring chart for the metric. |