All Products
Search
Document Center

DataWorks:View statistics on the O&M Dashboard

Last Updated:Feb 27, 2026

The Operations and Maintenance (O&M) dashboard displays the O&M stability assessment for auto triggered tasks, key O&M metrics, an overview of schedule resource usage, and the running details of one-time tasks and data integration sync tasks. This dashboard helps you obtain a high-level overview of all tasks in your workspace, quickly find and handle abnormal tasks, and improve your O&M efficiency.

Usage notes

You can view the O&M overview for your workspace from the following perspectives: O&M for auto triggered tasks, O&M for one-time tasks, and O&M for data integration tasks.

  • Specific project: View the O&M overview for the selected workspace. In this view, you can see the O&M overview for the workspace and for data integration sync tasks.

  • All Projects: View the O&M overview of all workspaces in the current account. In this view, you cannot separately view the O&M overview of Data Integration sync tasks.

Limits

  • The O&M dashboard feature is not supported in the development environment of a standard mode workspace.

    Note

    In the top menu bar of the Operation Center, you can click to switch between the Production and Development environments.

  • Auto Triggered Task tab: Collects O&M information for only auto triggered tasks and their instances. Other types of tasks and instances are not included.

  • One-time Task tab: Collects O&M information for only manually triggered workflows and their inner node instances.

  • Data Integration Task tab: Collects O&M information for only offline and real-time data integration sync tasks.

Go to the O&M dashboard

Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose Data Development and O&M > Operation Center. On the page that appears, select the desired workspace from the drop-down list and click Go to Operation Center.

View O&M information for auto triggered tasks

On the Auto Triggered Task tab, you can view the O&M overview, which includes the O&M stability assessment, key concerns, recurring instance status distribution, recurring instance completion status, and scheduling resource group usage.

O&M stability assessment

The O&M stability of your workspace is assessed based on the overall running status of tasks in the workspace.

Workspace

Single workspace

All my workspaces

Stability diagram

image

image

Stability description

The stability health status is categorized into four levels: Excellent, Good, Fair, and Poor. A high-risk or low-risk tag indicates that the workspace health is poor and requires immediate optimization.

  • Switch to the All my workspaces view at the top of the page to see the O&M stability, number of recurring instances, and completion status of recurring instances for all workspaces you have joined.

  • You can also click View Details in the Operation column for a specific workspace to view its O&M stability details.

View key concerns

The Key Concerns section displays abnormal items based on smart baselines and auto triggered task exceptions. You can view these items from a workspace or personal perspective. This lets you view abnormal issues for the entire workspace or only for the tasks you own. Find and fix these issues immediately to prevent them from affecting your business.

Abnormal issue type

Problem Description

References

Diagram

Baseline instance breach

The number of baseline instances that breached their committed completion time today.

A baseline instance breach means that the estimated completion time of a task on the baseline exceeds the committed time, and an alert is triggered because the task did not complete on time.

Baseline instances

异常问题

Baseline instance warning

The number of baseline instances with warnings today.

The warning margin ensures that important data in complex dependency scenarios is generated on time. Exceeding this margin may cause tasks to fail to complete on time, leading to an exception.

Committed time and warning margin for baselines

Error event

The number of error events today.

When a task is monitored by a baseline, an error event is generated if the task fails. A failed task can block its descendant nodes. Handle the failed task promptly to ensure its descendant nodes can run normally.

Event Management

Slowdown event

The number of slowdown events today.

When a task is monitored by a baseline, a slowdown event is generated if the task runs slowly. A slowdown means the current runtime of the task is significantly longer than its average runtime over a past period.

Isolated task

The number of auto triggered tasks that have no upstream dependencies.

When a node has no upstream dependencies, it becomes an isolated node and can no longer be automatically scheduled.

Isolated nodes

Frozen task

Counts the number of paused auto triggered tasks.

After an auto triggered task is frozen, the instances it generates will also be in the Frozen state. Frozen instances will not run and will block their descendant nodes.

Freeze and unfreeze tasks

Expired task

The number of auto triggered tasks whose scheduling validity period has expired.

A node automatically generates and runs recurring instances within its scheduling validity period. Outside this period, it cannot generate recurring instances and be automatically scheduled.

None

Modified task

This shows the number of recurring schedule tasks modified today.

  • Modifications: Modifications include code changes, scheduling configuration changes, node status changes, and node owner changes.

  • Scope: The scope includes changes to production tasks made through the task publishing process after changes in Data Studio, along with changes made directly to auto triggered tasks in the production environment.

Note

When you switch to the My view, the system counts the number of modified nodes for tasks that you own.

None

O&M overview for recurring instances and auto triggered tasks

The following table describes the O&M overview for recurring instances and auto triggered tasks.

O&M category

Description

Diagram

Recurring instance status distribution

  • Scope: Collects statistics on the status distribution of recurring instances in the current workspace or instances you own for a specified data timestamp. The data reflects the status at the time of the page request.

  • How to view: Click a segment in the pie chart to view the number and percentage of instances in the corresponding state.

  • Instance statuses that require attention:

    • Failed: A failed instance may block its descendant nodes.

    • Frozen: A frozen instance will not run and will block its descendant nodes.

    • Running slow: An instance in the running state is considered slow if its runtime is 15 minutes longer than the average of the last 10 days. If there are fewer than 4 historical instances, an instance is considered slow if its runtime exceeds half an hour.

Note

Only Normal tasks are counted here. Dry-run and frozen tasks are not included.

实例运行状态分布

Recurring instance completion status

  • Scope: Collects statistics on the completion status (number of successful or not-run instances and their fluctuations) of recurring instances in the current workspace for yesterday, today, and the historical average, within the time range of 00:00 to 23:00 on the day of the page request.

  • Display method: A line chart shows the completion status for yesterday, today, and the historical average. If the three lines deviate significantly, it indicates an abnormality during a certain period that requires further investigation.

  • Task type: You can specify the task type to view.

  • Historical average: The historical average here refers to the completion status of instances over the last 10 days.

周期实例完成情况

Recurring instance and auto triggered task trends

Scope: Collects statistics on the change trends in the number of auto triggered tasks and recurring instances in the production environment over a specified range of data timestamps. You can view data for up to the last year.

周期实例与周期任务趋势

Auto triggered task distribution

  • Scope: Collects statistics on the number and percentage of auto triggered tasks by different dimensions (node type, scheduling cycle) at the time of the page request.

  • Display method: The pie chart has a display limit. If the number of statistical types exceeds the limit, they will be merged for display.

Note

In the All my workspaces view, you can view the distribution of auto triggered tasks by workspace.

任务分布情况

Scheduling resource group usage

This section shows the usage rate, which is the percentage of resources used by instances running on the resource group, and the trend in the number of instances running on the selected scheduling resource group at different times within a specified period.

Note
  • Data for up to 7 days is supported.

  • If resource group usage exceeds 80%, you should scale out the resource group to prevent resource shortages from affecting task execution.

  • The statistics for resource group usage and the number of running instances are at the resource group level. For example, if the exclusive resource group for scheduling that you use is shared by multiple workspaces, the statistics show the total resource usage rate and instance number trend for that resource group across all workspaces.

调度资源组使用情况

Recurring instance runtime and error rankings

实例运行及出错排行

  • Yesterday's recurring instance rankings

    This section ranks the top 30 recurring instances from the previous day by runtime, resource wait time, and slowdown duration. You can use the rankings to quickly find time-consuming tasks. Click an instance ID to go to the instance details page and view the running details through the run diagnostics.

    Note

    Slowdown duration: The difference between the previous day's runtime and the historical average runtime, sorted in descending order.

  • Recurring instance error rankings for the last month

    This section ranks the top 30 recurring instances with errors over the last month. You can quickly locate tasks with high error rates in the last month, view task details, and identify the cause of the errors.

View O&M information for one-time tasks

On the One-time Task tab, you can view the running status of manually triggered workflows and inner node instances.

One-time task overview

This section shows the total number of manually triggered workflows and inner node instances that have run since a specified date and the percentage of successful runs.

image

Workflow instance status

O&M category

Description

Diagram

Workflow instance status distribution

A pie chart shows the status distribution of manually triggered workflow instances for the specified run date.

  • Click a segment to go to the details page for tasks in that state to view and handle any issues. Pay close attention to Failed tasks.

  • Data for up to 7 days is supported.

  • When you switch to the My view, the system shows the status distribution for manually triggered workflow instances that you own.

image

Workflow rankings

This ranks workflows with long runtimes and high failure rates for a specified run date.

  • Use the rankings to quickly find workflows that are time-consuming or have high failure rates. Click a Task ID to go to the Manually Triggered Workflow Instance details page. On the details page, view the Run Diagnostics for specific instances in the workflow DAG to understand their running status.

  • Only the top 30 workflows are shown.

image

Internal task instance status

O&M category

Description

Diagram

Internal Task Distribution

A pie chart shows the real-time distribution of inner node instances in the Operation Center, categorized by Node Type and Owner.

image

Internal Task Leaderboard

This ranks inner node instances with long runtimes and high failure rates for a specified run date.

  • Use the rankings to quickly find inner node instances that are time-consuming or have high failure rates. Click a Task ID to go to the Manually Triggered Workflow Instance details page. On the details page, view the Run Diagnostics for specific instances in the workflow DAG to understand their running status.

  • Only the Top30 internal tasks are displayed.

image

View O&M information for data integration tasks

On the Data Integration tab, you can view the overview and resource group usage for data integration sync tasks from Yesterday or Today.

Data Integration resource group usage

This section shows the resource details for all data integration tasks in the current workspace, including Running Tasks, Resource Usage, and Expired At. Based on the resource group usage and task volume, you can decide whether to perform operations, such as scaling, to allocate resources reasonably.独享数据集成资源组使用情况

Note

Data Integration sync task status distribution

A pie chart shows the status distribution of sync tasks in the current workspace. You can click a segment to go to the details page for tasks in that state to view and handle any issues. Pay close attention to Abnormal and Failed tasks because they usually block downstream task execution.运行状态分布

Offline sync task status

The following table describes the offline sync task status.

O&M category

Description

Diagram

Data synchronization progress

This shows the total data volume and total traffic usage for offline synchronization within the selected data timestamp.

数据同步速度

Data synchronization volume statistics

This shows the data pull and write curves for the synchronized data volume by data source type for the selected data timestamp. You can quickly view DPI engine tasks with large data synchronization volumes and consider allocating more resources to them.

离线数据同步任务数据统计量

Latest Top 10 rankings

This shows the 10 most recent Latest Failed Instances and Latest Successful Instances so you can get a global view of the latest sync task statuses. Use the error messages to quickly find the cause of instance failures and resolve them.

离线任务同步榜单

Data synchronization task execution details

You can filter by conditions such as Commit Time, Task Status, and Task Name to quickly search for task instances and view their running details.

离线同步任务详情

Real-time sync task status

The following table describes the real-time sync task status.

O&M category

Description

Diagram

Data synchronizationoverview

This shows the sum of the data speed and record speed for all real-time sync tasks in the current workspace.

同步速度

Top 10 task latency

This shows the 10 real-time sync tasks with the highest latency, so you can quickly locate and optimize them.

任务延迟

Alert information

This shows the alert information generated by real-time sync tasks recently, so you can quickly catch and resolve exceptions.

报警信息

Failover information

This shows the Failover messages for real-time sync tasks within a specified time, providing an overview of the task Failover status. For more information about Failover, see Run and manage real-time sync tasks.

failover