You can use the service monitoring and alerting feature to monitor the status of your services. If a threshold that is specified in an alert rule is exceeded, an alert notification is sent.

Background information

The following table describes the metrics that you can monitor for services deployed in Elastic Algorithm Service (EAS).
Metric Description
CPUConsumption The number of CPU cores that are consumed.
GPUUtilization The ratio of GPU utilization to the total GPU capacity.
MemoryComsumption The memory usage, in MB.
QueryPerSecondTotal The total number of calls per second.
ResponsePerSecondWithStatusCode2xx The number of responses with status code 2xx per second.
2xxResponseRatio The ratio of responses with status code 2xx to the total responses.
ResponsePerSecondWithStatusCode4xx The number of responses with status code 4xx per second.
4xxResponseRatio The ratio of responses with status code 4xx to the total responses.
ResponsePerSecondWithStatusCode5xx The number of responses with status code 5xx per second.
5xxResponseRatio The ratio of responses with status code 5xx to the total responses.
IngressTraffic The amount of inbound data, in Kbit/s.
EgressTraffic The amount of outbound data, in Kbit/s.

Step 1: Configure alert contacts

  1. Create an alert contact.
    1. Log on to the CloudMonitor console.
    2. In the left-side navigation pane, choose Alerts > Alert Contacts.
    3. On the Alert Contacts page, click Create Alert Contact.
    4. In the Set Alert Contact panel, set the Name, Email ID, and DingTalk Enterprise WeChat Flying Book slack Webhook(http https) parameters.
    5. Set the Alert Notification Information Language parameter. Then, click and hold the slider in the lower part of the panel and drag it to the right.
      This parameter has the following valid values:
      • Automatic: Notifications are sent in the language of the Alibaba Cloud site on which your account is registered. If your account is registered on China site (aliyun.com), notifications are sent in Chinese. If your account is registered on International site (alibabacloud.com) or Japan site (jp.alibabacloud.com), notifications are sent in English.
      • Chinese
      • English
    6. Click OK.
  2. Create an alert group.
    1. On the Alert Contacts page, click the Alert Contact Group tab.
    2. On the Alert Contact Group tab, click Create Alert Contact Group.
    3. In the Create Alert Contact Group panel, set the Group Name parameter and select existing contacts as needed to add them to the group. Use the default value for other parameters.
    4. Click Confirm.

Step 2: Configure alert rules

  1. In the CloudMonitor console, choose Alerts > Alert Rules in the left-side navigation pane.
  2. On the Alert Rules page, click Create Alert Rule.
  3. On the Create Alert Rule page, set the parameters in the Related Resource, Set Alert Rules, and Notification Method sections.
    Parameter Description
    Product The name of the service that is managed by CloudMonitor. Set this parameter to PAI-EAS inference service.
    Resource Range The range of resources to which the alert rule is applied. Valid values: All Resources and ServiceInstance.
    • All Resources: An alert notification is sent when the monitoring data of an EAS service triggers alert rules.
    • ServiceInstance: An alert notification is sent only when the monitoring data of the selected one or more services triggers alert rules.
    Alert Rule The name of the alert rule.
    Rule Description The details of the alert rule. This parameter defines the conditions to trigger alerts.
    Note The units of the metrics such as CPUConsumption, GPUUtilization, and MemoryComsumption that are specified in the Rule Description parameter are the same as those that are displayed along the y-axes on the EAS service monitoring page.
    Mute For The interval at which CloudMonitor resends an alert notification if the issue that triggers the alert persists.
    Effective Period The time period during which the alert rule is effective. CloudMonitor checks whether the monitoring data triggers the alert rule only during the effective period.
    Notification Contact The contact group to which an alert notification is sent. Select a contact group to which alert contacts are bound.
    Notification Methods

    The methods that are used to send alert notifications. In this example, Email + DingTalk (Info) is selected.

    Auto Scaling The auto scaling feature is unavailable if an alert rule configured for EAS services is triggered. You do not need to select this check box.
    Log Service If you select Log Service, the alert message is written to Log Service when an alert is generated. In this case, you must also set the Region, Project, and Logstore parameters.

    For more information about how to create a project and a Logstore, see Getting Started.

    Email Remark The additional information that you want to include in the alert notification email. After you set the Email Remark parameter, the specified remarks are included in the alert notification email.
    Alert Callback The callback URL that can be accessed over the Internet. CloudMonitor sends a POST request to push an alert to the specified callback URL. Only HTTP requests are supported.
  4. Click Confirm.