You can use the service monitoring and alerting feature to monitor the status of your services. If a threshold that is specified in an alert rule is exceeded, an alert notification is sent.
Background information
The following table describes the metrics that you can monitor for services deployed
in Elastic Algorithm Service (EAS).
Metric | Description |
---|---|
CPUConsumption | The number of CPU cores that are consumed. |
GPUUtilization | The ratio of GPU utilization to the total GPU capacity. |
MemoryComsumption | The memory usage, in MB. |
QueryPerSecondTotal | The total number of calls per second. |
ResponsePerSecondWithStatusCode2xx | The number of responses with status code 2xx per second. |
2xxResponseRatio | The ratio of responses with status code 2xx to the total responses. |
ResponsePerSecondWithStatusCode4xx | The number of responses with status code 4xx per second. |
4xxResponseRatio | The ratio of responses with status code 4xx to the total responses. |
ResponsePerSecondWithStatusCode5xx | The number of responses with status code 5xx per second. |
5xxResponseRatio | The ratio of responses with status code 5xx to the total responses. |
IngressTraffic | The amount of inbound data, in Kbit/s. |
EgressTraffic | The amount of outbound data, in Kbit/s. |
Step 1: Configure alert contacts
- Create an alert contact.
- Create an alert group.
- On the Alert Contacts page, click the Alert Contact Group tab.
- On the Alert Contact Group tab, click Create Alert Contact Group.
- In the Create Alert Contact Group panel, set the Group Name parameter and select existing contacts as needed to add them to the group. Use the default value for other parameters.
- Click Confirm.