A notification policy defines rules for matching, grouping, notifying, and escalating alerts or events to provide flexible alert routing and notification management.
Overview
A notification policy is a core component in the alert and event processing workflow. It supports the following features:
Define matching rules to filter alerts and events that require action.
Configure event grouping rules to reduce notification fatigue.
Specify notification objects and effective periods.
Configure repeated notifications, escalation policies, and recovery methods.
Link action integrations for automated processing.
Create a notification policy
Log on to the CloudMonitor 2.0 console, select the target workspace, and from the left-side navigation pane, choose .
Click Create Notification Policy and configure the following settings:
Basic Information:
Name: The name of the notification policy. It must not exceed 120 characters.
Description: A description of the notification policy. It must not exceed 120 characters.
Routing Rule: Routing rules define how events are routed for notification. The system evaluates these rules sequentially and stops after the first match.
Deduplication and noise reduction: To reduce redundant notifications, configure grouping fields to aggregate similar events. You can select up to five grouping fields. In addition to common grouping fields, you can also specify custom fields based on the event data. By default,
labels._cms_rule_id(alert rule ID) andresource.entity.entity_id(resource entity ID) are used.Routing rule configuration: Each notification policy can have multiple routing rules. Click Add Routing Rule to add a new rule.
Routing condition: Define custom matching conditions. You can add multiple conditions.
Notification object: Specify the recipients for the notification. You can add multiple recipients. For 'contact', 'contact group', or 'on-call schedule' objects, you must also select a notification method, such as a phone call, SMS, or email. Other object types do not require this.
Effective period: By default, the rule is effective 24/7. Click Modify Time to specify a custom effective period.
Notification Rule (Optional): Configure advanced notification rules, including notification templates, recovery notifications, auto-recovery, repeated notifications, and action integrations.
Notification template settings: Configure a notification template for each channel type.
Recovery notification:
Option
Description
Send recovery notification
Sends a notification when an alert is resolved.
Do not send recovery notification
Does not send a notification when an alert is resolved (default).
Auto-recovery:
Option
Description
Alerts do not recover automatically
If an alert expires and the underlying issue is not resolved, the alert remains active.
Alerts recover automatically after N minutes/seconds
The alert recovers automatically when it expires, even if the underlying issue is not resolved.
Repeated notification:
Option
Description
No repeat notification is required
Sends only one notification while the alert is active.
If an alert is not recovered, send a repeat notification every N minutes/seconds
Sends notifications periodically until the alert is recovered.
Action integration: Configure action integrations to run automatically when an alert is triggered or resolved.
Parameter
Description
Trigger action integration upon firing
Runs the specified action integration when an alert is triggered.
Trigger action integration upon recovery
Runs the specified action integration when an alert is resolved.
Escalation policy: Select a pre-configured escalation policy. If an alert is not handled within a specified time, it is automatically escalated.
After you complete the configuration, click OK.
The notification policy list page displays all created policies with the following information:
Field
Description
Name
The name of the notification policy.
Description
The description of the notification policy.
Grouping condition
The grouping fields used for event deduplication and noise reduction.
Notification mode
Displays the trigger method and mute time. For example, "Trigger immediately, no further notifications for 5 minutes."
Number of routing rules
The number of routing rules in the policy.
Number of custom notification templates
The number of custom notification templates configured in the policy.
Source
The source of the policy, such as Managed Service for Prometheus, ARMS, or CloudMonitor.
Last modified time
The time when the policy was last modified.
Status
The status of the policy (enabled or disabled).
The following operations are available on the list page:
Create Notification Policy: Click the Create Notification Policy button to open the policy creation page.
Search: Supports fuzzy search by policy name.
Filter by Status: Filter policies by their enabled or disabled status.
Sort: Sort by last modified time or status.
Edit: Click the Edit button to modify the policy configuration.
Delete: Click the Delete button to delete the policy. Policies synchronized from ARMS or CloudMonitor cannot be deleted.
Enable/Disable: Use the toggle to enable or disable the policy.
Event matching process
When a policy receives an event, it first applies deduplication and noise reduction based on the configured grouping fields.
The system then matches the event against the routing rules in sequential order.
For each routing rule, the system checks if the event meets the routing conditions and is within the effective period.
The first matching routing rule is applied, and all subsequent rules are ignored.
A notification is sent to the notification objects specified in the matched rule.
Recovery notifications, auto-recovery, and repeated notifications are handled based on the configured notification rules.
Default values
The following default settings apply when you create a notification policy:
Configuration item | Default value |
Grouping fields |
|
Mute time | 300 seconds (5 minutes) |
Auto-recovery time | 600 seconds (10 minutes) |
Repeat notification interval | 600 seconds (10 minutes) |
Recovery notification | Do not send recovery notification |
Effective period | All time |
Policy source
Notification policies can be synchronized from multiple sources:
Source type | Description |
Prometheus | Policies synchronized from Managed Service for Prometheus. |
ARMS | Policies synchronized from ARMS. |
CloudMonitor | Policies created locally in CloudMonitor 2.0. |
Note: Policies synchronized from other sources may have functional limitations. For example, some settings may be read-only, and the policies themselves cannot be deleted.
Recommended configurations
Multi-level routing rules
Configure different notification methods for events with different severity levels:
Critical events (CRITICAL):
Routing condition: Severity level is CRITICAL.
Notification object: On-call personnel (phone + SMS).
Effective period: 24/7.
Warning events (WARNING):
Routing condition: Severity level is WARNING.
Notification object: Development team (DingTalk group).
Effective period: 09:00–18:00 on weekdays.
Info events (INFO):
Routing condition: Any.
Notification object: Operations team email.
Effective period: All time.
Deduplication and noise reduction
Group by rule: Use
labels._cms_rule_idto aggregate events generated by the same rule.Group by resource: Use
resource.entity.entity_idto aggregate events from the same resource.Combined grouping: Use both the rule ID and resource ID for more granular event grouping.
Prevent notification fatigue
Configure mute time: Avoid a large number of repeated notifications in a short period.
Enable repeated notifications: Periodically remind relevant personnel about unhandled alerts.
Configure auto-recovery: Set a reasonable auto-recovery time for transient alerts.
Configure an escalation policy: Ensure that critical alerts are not missed.
FAQ
Routing rule matching order
Routing rules are evaluated sequentially from top to bottom. Because processing stops after the first match, you should place stricter, more specific rules before broader, catch-all rules.
Why am I not receiving notifications?
Check the following configurations:
Verify that the notification policy is enabled.
Ensure the routing conditions correctly match the event.
Confirm that the current time is within the effective period.
Verify that the contact information for the notification object is configured correctly.
Make sure the desired notification method is selected.
How can I avoid notification storms?
Configure grouping fields properly to aggregate related events.
Set an appropriate mute time.
Configure the repeated notification interval to avoid frequent notifications.
Use routing conditions to filter out low-priority events.