Operation Center is a one-stop platform for big data operations and maintenance (O&M) in DataWorks. After you develop, commit, and deploy tasks in Data Studio, use Operation Center to monitor task execution, diagnose failures, and keep critical data pipelines on schedule.
With Operation Center, you can:
-
Monitor real-time status across auto triggered, manually triggered, and real-time nodes
-
Diagnose task failures with Intelligent Diagnosis and rerun affected instances
-
Protect critical data delivery timelines with Smart Baseline
-
View Compute Engine (E-MapReduce) job details and resource group usage
Operation Center requires the desktop version of the Chrome browser (Chromium engine 69 or later).
Prerequisites for task execution
A scheduled instance runs only when all of the following conditions are met:
-
All ancestor node instances are in the Success state.
-
The task's scheduled execution time has been reached.
-
Sufficient scheduling resources are available.
-
The task is not in the Suspended or Frozen state.
Tasks are automatically scheduled and run only after they are deployed to the production environment. Tasks in the development environment are not automatically scheduled.
Access Operation Center
-
Log on to the DataWorks console.
-
In the top navigation bar, select the target region.
-
In the left-side navigation pane, choose Data Development and O&M > Operation Center.
-
Select the target workspace from the drop-down list, then click Go to Operation Center.
Functional modules
Task O&M
The Task O&M module covers three node types: auto triggered, real-time, and manually triggered. Use the O&M Dashboard for a high-level view of task health, then drill into specific modules or use the O&M Assistant to take corrective action.
| Module | What you can do | Environment availability |
|---|---|---|
| O&M Dashboard | Understand key O&M metrics for scheduled tasks at a glance; access dedicated O&M pages for batch and real-time synchronization tasks in Data Integration. | Not available in the Development Operation Center |
| Auto triggered node O&M | ||
| — Auto Triggered Nodes | View the DAG for scheduled tasks; run Test and Backfill Data. | The Development Operation Center cannot automatically schedule and generate scheduled instances |
| — Auto Triggered Instances | Identify which instances generated by the scheduling system succeeded or failed; view instance DAGs; run Perform Diagnostics and Rerun. | |
| — Test Instances | Check the execution status of test instances; view instance DAGs; run Perform Diagnostics and Rerun. | |
| Real-time node O&M | ||
| — Real-time Computing Nodes | Start, Terminate, and Undeploy real-time tasks; configure Monitoring Setting to detect and handle exceptions promptly. | — |
| — Real-time Synchronization Nodes | Start, Stop, Undeploy, and Change Owner for real-time synchronization tasks; configure Monitoring Setting to detect and handle exceptions promptly. | — |
| Manually triggered node O&M | ||
| — Manual Triggered Nodes | View DAGs for manual tasks, manually triggered workflows, and event-triggered workflows; run tasks and view instances. | — |
| — Manual Triggered Instances | Quickly locate instance details via DAG; run View Runtime Log, Perform Diagnostics, View Code, and View Lineage. | — |
| O&M Assistant | ||
| — Data Backfill | Create and manage data backfill tasks. | — |
| — Intelligent Diagnosis | Locate the root cause of a problem end-to-end. View a task's Running Details, Basic Information, Impact Baselines, and Historical Instance. | Not available in the Development Operation Center |
| — Automatic | Create custom O&M rules that monitor metrics and trigger predefined actions automatically when a rule fires on a target resource group. | — |
Task monitoring
The task monitoring module helps you catch anomalies before they delay critical data delivery.
| Module | What you can do | Environment availability |
|---|---|---|
| Smart Baseline | Detect anomalies early and receive warnings before tasks on a baseline miss their completion deadline. Smart Baseline reduces configuration overhead, filters out noise alerts, and automatically covers all critical tasks — so you do not need to set up individual rules for every node. | Not available in the Development Operation Center |
| Alarm | ||
| — Rule Management | Configure custom rules to monitor task run status or resource usage and detect exceptions promptly. | — |
| — Alert Management | View all alerts in one place: baseline warnings and event alerts from Smart Baseline, and alerts from custom rules and global rules. This centralizes all alerts generated by the Node Alarm module. | — |
| — Schedule | Set up an on-duty roster so Alert Information is sent to the right person when an alert fires or an instance requires maintenance. | — |
Other O&M features
| Module | What you can do | Environment availability |
|---|---|---|
| Engine Maintenance | Find and address Compute Engine (E-MapReduce) jobs with errors before they block downstream tasks. | Not available in the Development Operation Center |
| Resource | Visualize resource group usage and instance task execution; enable intelligent monitoring and Automated O&M for resource groups to reduce manual effort. | — |
| Tenant Schedule Setting | Create and manage Scheduling Calendars and Workspace-level Parameters to customize task scheduling across your workspace. | — |
Appendix: Instance run status and execution diagnosis
In Operation Center, each instance status has a distinct color and icon. The table below describes each status. For detailed diagnosis steps, see Intelligent Diagnosis.
| No. | Status | Status icon | Run flowchart |
|---|---|---|---|
| 1 | Success | <img> | <img> |
| 2 | Not Running | <img> | |
| 3 | Failed | <img> | |
| 4 | Running | <img> | |
| 5 | Pending | <img> | |
| 6 | Suspended/Frozen | <img> |
For a full explanation of status transitions and execution prerequisites, see Appendix: Instance run status and execution diagnosis.