The Operation Center service of DataWorks provides the automated O&M feature for node instances that are running on exclusive resource groups. You can customize O&M rules for the node instances based on your business requirements. This topic describes how to manage an automated O&M rule.

Background information

The automated O&M feature enables DataWorks to automatically perform O&M operations on node instances that are running on exclusive resource groups. DataWorks performs O&M operations based on the O&M rules that you created and the monitoring rules that are associated with the O&M rules. You can customize metrics and create an O&M rule for the node instances based on your business requirements. Then, you can associate the O&M rule with a monitoring rule. The monitoring rule can be an existing monitoring rule or a new monitoring rule. If the conditions specified in the O&M rule are met, the system automatically performs the O&M operation defined in the rule.

Limits

  • Only workspace administrators can create, modify, or delete automated O&M rules.
  • You can create automated O&M rules only for node instances that are running on exclusive resource groups.
  • You can view the execution records that are generated only within 30 days for automated O&M rules.
  • DataWorks supports only the automated O&M operation that stops running node instances when the resource usage of the exclusive resource groups used by the node instances is excessively high.
  • You can associate one automated O&M rule with only one monitoring rule. However, you can associate multiple automated O&M rules with the same monitoring rule.

Go the Automatic page

  1. Go to the DataStudio page.
    1. Log on to the DataWorks console.
    2. In the left-side navigation pane, click Workspaces.
    3. In the top navigation bar, select the region where your workspace resides, find the workspace, and then click Data Analytics in the Actions column.
  2. On the DataStudio page, click the Cycle Task icon in the upper-left corner and choose All Products > Operation Center.
  3. On the Operation Center page, choose Alarm > Automatic. Then, the Rule Management tab of the Automatic page appears.

Manage automated O&M rules

The Automatic page displays all the automated O&M rules that are created and the execution records of the rules. You can perform the following operations on this page.
Note Only workspace administrators can create or modify automated O&M rules. For more information about how to obtain the permissions of the workspace administrator role, see Add workspace members.
Automated O&M
  • Add an automated O&M rule
    On the Rule Management tab, click Add Rule in the upper-right corner. In the Created Rule dialog box, configure the following parameters.Add Rule
    Section Parameter Description
    Basic information Name The name of the automated O&M rule.
    Associated Monitoring Rule The monitoring rule with which you want to associate the automated O&M rule. For more information about how to create a monitoring rule, see Manage custom alert rules.
    Note You can associate only monitoring rules for exclusive resource groups for scheduling with automated O&M rules.
    O&M Operation The automated O&M operation that you want to perform if the automated O&M rule is triggered.
    Note Only the operation Terminate Running Instance is supported.
    Filter Condition Resource Group The name of the resource group that you specified when you create the monitoring rule.
    Note Only the names of exclusive resource groups for Data Integration and exclusive resource groups for scheduling are displayed.
    Workspace The name of the workspace to which the automated O&M rule is applied.
    Instance Type The type of the node instance to which the automated O&M rule is applied. Valid values: Auto Triggered Node Instances, Data Backfill Instance, Test Instances, and Manually Triggered Workflows.
    Scheduling Cycle The scheduling cycle of the node instance. Valid values: Minute, Hours, Day, Week, and Month.
    Priority The priority of the automated O&M rule. A larger value indicates a higher priority. Valid values:
    • 1
    • 3
    • 5
    • 7
    • 8
    Status The status of the node instance. Valid values: Waiting for Resources and Running.
    Whitelist Whitelist The list of the node instances for which the automated O&M rule does not take effect. If you want to add another node instance to the list, you can select the name of the node instance from the drop-down list below the node instance list and click Add.
    Note Automated O&M rules do not take effect for the node instances that are contained in the node instance list.
    Constraints on Rule Effective Period The time range within which the automated O&M rule is effective.
    Maximum Effective Times The maximum number of times that the automated O&M rule can be triggered.
    Minimum Effective Interval The minimum interval at which the automated O&M rule can be triggered.
  • Search for an automated O&M rule

    In the search box of the Rule Management tab, you can enter the name of an automated O&M rule to search for the rule.

  • View, modify, and delete an automated O&M rule
    • If you want to view the information about an automated O&M rule, find the desired rule in the automated O&M rule list of the Rule Management tab and click View in the Actions column.
    • If you want to modify an automated O&M rule, find the desired rule in the automated O&M rule list of the Rule Management tab and click View in the Actions column. In the View Rule dialog box, click Change in the lower-right corner.
    • If you want to delete an automated O&M rule, find the desired rule in the automated O&M rule list of the Rule Management tab and click Delete in the Actions column. In the Delete Rule message, click OK.
  • View the execution records of an automated O&M rule

    The Execution Records tab displays the execution information about automated O&M rules, including the time when the rules are executed, rule owners, and the number of node instances to which the rules are applied. If you want to view the detailed execution information about a rule, click View Details in the Actions column of the rule. In the Record Details dialog box, you can view the line chart that displays the resource usage of the desired resource group within the last 24 hours and the number of node instances that are waiting for resources since the rule took effect.

    • Instances Waiting for Resources/Resource Usage: This section provides a chart that displays the number of node instances that are waiting for resources and the resource usage of the desired resource group. You can move the pointer over a point in the chart to view the number of node instances that are waiting for resources and the resource usage of the desired resource group at the related point in time.
    • Terminated Node Instances: This section displays all the node instances whose running is stopped.

Summary

After you create an automated O&M rule, the system automatically monitors the resource usage of the resource group defined in the automated O&M rule. After the automated O&M rule is triggered, the system performs the O&M operation on the node instances that are running on the resource group. For more information about the O&M of resources in resource groups, see Use the resource O&M feature.