All Products
Search
Document Center

DataWorks:Data asset pivoting from the R&D link perspective

Last Updated:Apr 01, 2025

Data Asset Governance allows you to view and analyze the status and resource consumption of DataWorks batch synchronization tasks and scheduling tasks in a workspace from the R&D link perspective, which involves data synchronization and data development. This way, you can learn the types of resources that incur the highest costs and identify tasks that fail to run. Then, you can adjust the proportion of resources to purchase, identify the issues that block the execution of tasks, and fix the issues at the earliest opportunity.

Permissions

  • To view the resource usage in any workspace, you must meet one of the following requirements:

    • You have an Alibaba Cloud account.

    • You have created a Resource Access Management (RAM) user and the AliyunDataWorksFullAccess policy is attached to the RAM user.

    • You are a tenant administrator.

    • You are a tenant-level data governance administrator.

  • Other users can select only the workspaces to which they are added as the members of the workspaces.

Go to the Development page

  1. Go to the Data Asset Governance page.

    Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose Data Governance > Data Asset Governance. On the page that appears, click Go to Data Asset Governance.

  2. In the left-side navigation pane, choose Assets > Asset Analysis > Development.

Data synchronization: View the resource consumption of DataWorks batch synchronization tasks

You can view the resource consumption of DataWorks batch synchronization tasks in the selected workspace on the specified date. The detailed information about a batch synchronization task includes the following fields: Tag, Average Waiting Time in 7 Days, Average Running Duration in 7 Days, Waiting Time Fluctuation Rate, Running Duration Fluctuation Rate, Owner, Task Running Duration, and Total Volume of Synchronized Data.

Note

The tags that are added to nodes shown in the following figure are offline snapshots and cannot be modified. After you modify or add a new tag on the Tag Management page, snapshots are not updated in real time. Tag data takes effect on the next day. For more information, see Manage tags.

image

Data development: View the resource consumption of DataWorks scheduling tasks

You can view the resource consumption of DataWorks scheduling tasks in the selected workspace on the specified date. The detailed information about a DataWorks scheduling task includes the following fields: Tag, Average Waiting Time in 7 Days, Average Running Duration in 7 Days, Waiting Time Fluctuation Rate, Running Duration Fluctuation Rate, Owner, Task Running Duration, Runtime Environment (development or production), and Task Type, such as auto-triggered tasks and data backfill tasks.

Note

The tags that are added to nodes shown in the following figure are offline snapshots and cannot be modified. After you modify or add a new tag on the Tag Management page, snapshots are not updated in real time. Tag data takes effect on the next day. For more information, see Manage tags.

image

View DataWorks tasks and costs

You can click the Statistical Metric tab to view the following information in the specified workspace: the overall cost trend and distribution of costs, the trends of the number of tasks and the number of instances, the status of tasks and instances, and the parallelism trend of tasks.

image

  • Consumption Trend: displays the overall cost trend of DataWorks in the selected workspace.

  • Consumption Distribution: displays the distribution of costs of different DataWorks billable items in the selected workspace.

  • Running Status Distribution: displays the distribution of tasks in different states in the selected workspace. You can view the proportion of tasks that fail to run. You need to identify causes and fix issues at the earliest opportunity.

  • Task Type Distribution: displays the distribution of tasks of different types in the selected workspace.

  • Task Quantity Trend: displays the trend of changes in the number of tasks in the selected workspace. You can view the number of tasks based on the type of resource group that is used to run the tasks. You can view data within the previous seven days.

  • Instance Quantity Trend: displays the trend of changes in the number of instances in the selected workspace. You can view the number of instances based on the type of resource group that is used to run the instances. You can view data within the previous seven days.

  • Parallelism Trend of Task: displays the trend of changes in the number of tasks that are run in parallel in the selected workspace. You can view the total number of scheduling tasks that are run in parallel, the total number of batch synchronization tasks that are run in parallel, and the trend of changes in the number of tasks that are run based on different types of resource groups at specific points in time. You can select only one day from the previous seven days in the date picker.