Operation Center is a one-stop operations and maintenance (O&M) platform in DataWorks. It monitors real-time task status, provides O&M actions such as Intelligent Diagnosis and Rerun, and uses Smart Baseline to track critical task completion times across large volumes of tasks. Operation Center also covers O&M for engines, resources, and scheduling.
Operation Center requires the desktop version of the Chrome browser (Chromium engine 69 or later).
Prerequisites
Tasks are automatically scheduled and run only after you deploy them to the production environment. Tasks in the development environment are not automatically scheduled.
Access Operation Center
Log on to the DataWorks console.
In the top navigation bar, select the desired region.
In the left-side navigation pane, choose Data Development and O&M > Operation Center.
Select the desired workspace from the drop-down list and click Go to Operation Center.
Functional modules
After you develop, commit, and deploy tasks in DataStudio, use Operation Center to perform O&M on auto triggered nodes, manually triggered nodes, and real-time nodes. Operations include running production tasks, diagnosing execution issues, monitoring statuses, viewing O&M metrics, and checking engine task lists.
Auto triggered node O&M
Module | Description | Availability |
View the DAG (Directed Acyclic Graph) for scheduled tasks. Perform actions such as Test and Backfill Data. | The Development Operation Center cannot automatically schedule and generate scheduled instances. | |
List instances generated after scheduled tasks are submitted to the scheduling system. View the instance DAG, use Perform Diagnostics, and Rerun the instance. | - | |
List instances generated after you test a scheduled task. Check execution status, view the instance DAG, use Perform Diagnostics, and Rerun the instance. | - |
Real-time node O&M
Module | Description | Availability |
Start, Terminate, and Undeploy real-time tasks. Configure Monitoring Setting to detect and handle exceptions during task execution. | - | |
Start, Stop, Undeploy, and Change Owner for real-time synchronization tasks. Configure Monitoring Setting to detect and handle exceptions during task execution. | - |
Manually triggered node O&M
Module | Description | Availability |
Query and view the DAG for manual tasks, manually triggered workflows, and event-triggered workflows. Run tasks, View Instances, and perform other actions. | - | |
View detailed instance information in the DAG. Perform actions such as View Runtime Log, Perform Diagnostics, View Code, and View Lineage. | - |
O&M Dashboard
Module | Description | Availability |
Display O&M metrics for scheduled tasks in reports. Provides dedicated O&M pages for batch and real-time synchronization tasks in Data Integration. | Not available in the Development Operation Center. |
O&M Assistant
Module | Description | Availability |
Manage data backfill tasks. | - | |
End-to-end analysis to locate the source of a problem. View a task's Running Details, Basic Information, Impact Baselines, and Historical Instance. | Not available in the Development Operation Center. | |
Create custom O&M rules. Define monitoring metrics and rules for instances running on a target resource group. When a rule triggers, a predefined O&M action runs automatically. | - |
For a task in a scheduled instance to run, all of the following conditions must be met:
All ancestor node instances are in the success state.
The scheduled execution time has been reached.
Sufficient scheduling resources are available.
The task is not in the Suspended or Frozen state.
Task monitoring
Use Smart Baseline to detect and alert on task anomalies. Use Rule Management, Alert Management, and Schedule to handle O&M alerts.
Module | Description | Availability |
Detect and send early warnings about exceptions that could prevent tasks on a baseline from completing on time. Ensures critical data is generated within the expected timeframe. Reduces configuration costs, avoids unnecessary alerts, and automatically monitors all critical tasks. | Not available in the Development Operation Center. | |
Configure custom monitoring rules to monitor task run status or resource usage. Detect and handle exceptions. | - | |
Centralize all alerts generated by the Node Alarm module, including baseline warning messages and event alerts from Smart Baseline, and alerts from custom rules and global rules. | - | |
Create a roster for handling O&M alerts. After you configure an on-duty schedule, DataWorks sends Alert Information to the on-duty personnel. | - |
Engine, resource, and scheduling O&M
Module | Description | Availability |
View details of Compute Engine (E-MapReduce) jobs. Find and address jobs that have run with errors before they block downstream tasks and disrupt instance execution. | Not available in the Development Operation Center. | |
Visualize resource group usage and instance task execution. Enable intelligent monitoring and Automated O&M for resource groups and instance tasks. | - | |
Create and manage Scheduling Calendars and Workspace-level Parameters to customize task scheduling. | - |
Appendix: Instance run status and execution diagnosis
Different colors and icons in Operation Center represent task execution status. For more information about execution prerequisites, see Intelligent Diagnosis.
No. | Status | Status icon | Run flowchart |
1 | Success |
|
|
2 | Not Running |
| |
3 | Failed |
| |
4 | Running |
| |
5 | Pending |
| |
6 | Suspended/Frozen |
|
