You can create threshold-triggered alert rules in CloudMonitor to monitor IoT resources, and send alert notifications to specified contacts by using the specified method. You can monitor the number of online devices in a specific product, the number of TSL model communication failures, and the number of times that data is forwarded by the rules engine. This topic describes how to create a threshold-triggered alert rule.

Procedure

  1. Log on to the CloudMonitor console.
  2. In the left-side navigation pane, choose Alerts > Alert Rules.
  3. On the Alert Rules page, click Create Alert Rule.
  4. In the Create Alert Rule panel, configure the parameters. The following table describes the parameters.
    1. Set the Product parameter to IoT Platform.
    2. Set the Resource Range parameter to All Resources.
    3. Click Add Rule. In the Add Rule Description panel, configure the parameters and click OK.

      In this example, three threshold-triggered rules whose alert level is critical are created, as shown in the following figure.

      The following table describes the rules shown in the preceding figure.

      Rule Name Rule Description Description
      DevicePropertyReportError The statistical period is 1 minute. If 100 or more property reporting failures are detected for three consecutive statistical periods, an alert is triggered. This rule is used to monitor the property reporting status of all devices in a temperature-based sensory product. If a large number of consecutive property reporting failures occur, you must check the devices and network status.
      MessageCountForwardedThroughRuleEngine_FC The statistical period is 1 minute. If at least 10,000 messages are forwarded to Function Compute for three consecutive statistical periods, an alert is triggered. If a device reports a temperature that is higher than 25°C, the rules engine forwards the temperature data to Function Compute based on the data forwarding rule. If 10,000 messages are forwarded, a large number of devices report more than two temperatures that are higher than 25°C within 1 minute. We recommend that you monitor the temperature changes and troubleshoot potential safety risks at the earliest opportunity.
      OnlineDevicesCount_MQTT The statistical period is 1 minute. If less than 4,800 devices are online for three consecutive statistical periods, an alert is triggered. A large number of devices are offline at the same time. You need to check the devices and network status.
    4. Set the Mute For parameter to 15 Minutes. If an alert fails to be cleared within 15 minutes after the alert is triggered, the system sends another alert notification for the alert.
    5. Configure the Alert Contact Group parameter and use the default values for other parameters.
  5. Click OK to create the rule.

Result

After a threshold-triggered alert rule is created, CloudMonitor continuously monitors IoT Platform resources based on the rule. If an alert rule is triggered, CloudMonitor sends alert notifications based on the specified notification method.