Notification policy - Cloud Monitor - Alibaba Cloud Documentation Center

A notification policy defines rules for matching, grouping, notifying, and escalating alerts or events to provide flexible alert routing and notification management.

Overview

A notification policy is a core component in the alert and event processing workflow. It supports the following features:

Define matching rules to filter alerts and events that require action.
Configure event grouping rules to reduce notification fatigue.
Specify notification objects and effective periods.
Configure repeated notifications, escalation policies, and recovery methods.
Link action integrations for automated processing.

Create a notification policy

Log on to the CloudMonitor 2.0 console, select the target workspace, and from the left-side navigation pane, choose alert center > notification management > notification policy.

Click Create Notification Policy and configure the following settings:

Basic Information:
- Name: The name of the notification policy. It must not exceed 120 characters.
- Description: A description of the notification policy. It must not exceed 120 characters.
Routing Rule: Routing rules define how events are routed for notification. The system evaluates these rules sequentially and stops after the first match.
- Deduplication and noise reduction: To reduce redundant notifications, configure grouping fields to aggregate similar events. You can select up to five grouping fields. In addition to common grouping fields, you can also specify custom fields based on the event data. By default, labels._cms_rule_id (alert rule ID) and resource.entity.entity_id (resource entity ID) are used.
- Routing rule configuration: Each notification policy can have multiple routing rules. Click Add Routing Rule to add a new rule.
  - Routing condition: Define custom matching conditions. You can add multiple conditions.
  - Notification object: Specify the recipients for the notification. You can add multiple recipients. For 'contact', 'contact group', or 'on-call schedule' objects, you must also select a notification method, such as a phone call, SMS, or email. Other object types do not require this.
  - Effective period: By default, the rule is effective 24/7. Click Modify Time to specify a custom effective period.

Notification Rule (Optional): Configure advanced notification rules, including notification templates, recovery notifications, auto-recovery, repeated notifications, and action integrations.

Notification template settings: Configure a notification template for each channel type.
Recovery notification:
Option
Description
Send recovery notification
Sends a notification when an alert is resolved.
Do not send recovery notification
Does not send a notification when an alert is resolved (default).

Auto-recovery:

Option	Description
Alerts do not recover automatically	If an alert expires and the underlying issue is not resolved, the alert remains active.
Alerts recover automatically after N minutes/seconds	The alert recovers automatically when it expires, even if the underlying issue is not resolved.

Repeated notification:

Option	Description
No repeat notification is required	Sends only one notification while the alert is active.
If an alert is not recovered, send a repeat notification every N minutes/seconds	Sends notifications periodically until the alert is recovered.

Action integration: Configure action integrations to run automatically when an alert is triggered or resolved.

Parameter	Description
Trigger action integration upon firing	Runs the specified action integration when an alert is triggered.
Trigger action integration upon recovery	Runs the specified action integration when an alert is resolved.

Escalation policy: Select a pre-configured escalation policy. If an alert is not handled within a specified time, it is automatically escalated.

After you complete the configuration, click OK.

The notification policy list page displays all created policies with the following information:

Field	Description
Name	The name of the notification policy.
Description	The description of the notification policy.
Grouping condition	The grouping fields used for event deduplication and noise reduction.
Notification mode	Displays the trigger method and mute time. For example, "Trigger immediately, no further notifications for 5 minutes."
Number of routing rules	The number of routing rules in the policy.
Number of custom notification templates	The number of custom notification templates configured in the policy.
Source	The source of the policy, such as Managed Service for Prometheus, ARMS, or CloudMonitor.
Last modified time	The time when the policy was last modified.
Status	The status of the policy (enabled or disabled).

The following operations are available on the list page:
- Create Notification Policy: Click the Create Notification Policy button to open the policy creation page.
- Search: Supports fuzzy search by policy name.
- Filter by Status: Filter policies by their enabled or disabled status.
- Sort: Sort by last modified time or status.
- Edit: Click the Edit button to modify the policy configuration.
- Delete: Click the Delete button to delete the policy. Policies synchronized from ARMS or CloudMonitor cannot be deleted.
- Enable/Disable: Use the toggle to enable or disable the policy.

Event matching process

When a policy receives an event, it first applies deduplication and noise reduction based on the configured grouping fields.
The system then matches the event against the routing rules in sequential order.
For each routing rule, the system checks if the event meets the routing conditions and is within the effective period.
The first matching routing rule is applied, and all subsequent rules are ignored.
A notification is sent to the notification objects specified in the matched rule.
Recovery notifications, auto-recovery, and repeated notifications are handled based on the configured notification rules.

Default values

The following default settings apply when you create a notification policy:

Configuration item	Default value
Grouping fields	`labels._cms_rule_id`, `resource.entity.entity_id`
Mute time	300 seconds (5 minutes)
Auto-recovery time	600 seconds (10 minutes)
Repeat notification interval	600 seconds (10 minutes)
Recovery notification	Do not send recovery notification
Effective period	All time

Policy source

Notification policies can be synchronized from multiple sources:

Source type	Description
Prometheus	Policies synchronized from Managed Service for Prometheus.
ARMS	Policies synchronized from ARMS.
CloudMonitor	Policies created locally in CloudMonitor 2.0.

Note: Policies synchronized from other sources may have functional limitations. For example, some settings may be read-only, and the policies themselves cannot be deleted.

Recommended configurations

Multi-level routing rules

Configure different notification methods for events with different severity levels:

Critical events (CRITICAL):
- Routing condition: Severity level is CRITICAL.
- Notification object: On-call personnel (phone + SMS).
- Effective period: 24/7.
Warning events (WARNING):
- Routing condition: Severity level is WARNING.
- Notification object: Development team (DingTalk group).
- Effective period: 09:00–18:00 on weekdays.
Info events (INFO):
- Routing condition: Any.
- Notification object: Operations team email.
- Effective period: All time.

Deduplication and noise reduction

Group by rule: Use labels._cms_rule_id to aggregate events generated by the same rule.
Group by resource: Use resource.entity.entity_id to aggregate events from the same resource.
Combined grouping: Use both the rule ID and resource ID for more granular event grouping.

Prevent notification fatigue

Configure mute time: Avoid a large number of repeated notifications in a short period.
Enable repeated notifications: Periodically remind relevant personnel about unhandled alerts.
Configure auto-recovery: Set a reasonable auto-recovery time for transient alerts.
Configure an escalation policy: Ensure that critical alerts are not missed.

FAQ

Routing rule matching order

Routing rules are evaluated sequentially from top to bottom. Because processing stops after the first match, you should place stricter, more specific rules before broader, catch-all rules.

Why am I not receiving notifications?

Check the following configurations:

Verify that the notification policy is enabled.
Ensure the routing conditions correctly match the event.
Confirm that the current time is within the effective period.
Verify that the contact information for the notification object is configured correctly.
Make sure the desired notification method is selected.

How can I avoid notification storms?

Configure grouping fields properly to aggregate related events.
Set an appropriate mute time.
Configure the repeated notification interval to avoid frequent notifications.
Use routing conditions to filter out low-priority events.

Option	Description
Send recovery notification	Sends a notification when an alert is resolved.
Do not send recovery notification	Does not send a notification when an alert is resolved (default).