All Products
Search
Document Center

DataWorks:Release notes

Last Updated:Mar 20, 2026

This topic covers feature and documentation updates for DataWorks.

2026

February 2026

Feature

Description

Release date

Region

Applies to

Documentation

Quality Rules for DataStudio SQL Nodes

You can configure Quality Rules for MaxCompute SQL Nodes in DataStudio to validate the Data Tables they generate during development and in production. By deeply integrating quality testing into the SQL development workflow, this feature addresses common issues such as delayed rule configuration, late discovery of data issues, and high maintenance costs.

2026-02-20

All Regions

All DataWorks users

Configure data quality tests

Proactive Data Governance for DataStudio SQL Nodes

SQL Nodes in DataStudio support in-depth, proactive Data Governance checks. This feature uses AI to help you define custom rules that identify and fix code issues in real time as you write, improving both code quality and Data Security.

2026-02-20

All Regions

All DataWorks users

Node development diagnosis and governance

Data O&M Agent

The Data O&M Agent provides AI-powered, end-to-end diagnostics and generates structured Diagnostic Reports. You can execute O&M operations, such as Reruns and Resource Group modifications, directly from the conversational interface with manual confirmation, to improve O&M Efficiency.

2026-02-20

China (Beijing), China (Zhangjiakou), China (Ulanqab), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Japan (Tokyo)

All DataWorks users

Data O&M Agent

Asset Transfer

DataStudio supports Asset Transfer, a feature for smoothly transferring Data Development Assets when an employee leaves.

2026-02-20

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Ulanqab), China (Shenzhen), China (Chengdu), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Thailand (Bangkok), Germany (Frankfurt), UK (London), US (Silicon Valley), US (Virginia)

All DataWorks users

/

Batch O&M for O&M Assistant

The O&M Assistant supports Batch O&M. You can perform operations on multiple Instances at once by manually entering them or by uploading a file that contains the target instances.

2026-02-20

China (Ulanqab)

All DataWorks users

/

2025

December 2025

Feature

Description

Release date

Region

Available to

Documentation

Data Integration enhancements

  • Supports real-time full database synchronization from PostgreSQL to Lindorm. After the initial full synchronization, real-time incremental updates start automatically.

  • Supports real-time full database synchronization from PostgreSQL to AnalyticDB for MySQL. After the initial full synchronization, real-time incremental updates start automatically.

  • Supports full database batch synchronization from Hive to Data Lake Formation (DLF), including one-time or periodic full and incremental synchronization.

2025-12-20

All regions

All DataWorks users

Data Map now supports metadata collection for Paimon Catalog and MongoDB

DataWorks now supports metadata collection for Paimon Catalog. This allows users who manage their own Paimon Catalog on an Object Storage Service (OSS) FileSystem (in non-DLF scenarios) to collect, manage, and view metadata. This release also adds support for periodic metadata collection from MongoDB. Data developers can now quickly discover, understand, and manage collections and field schemas in MongoDB. This enhances the visibility and governance of semi-structured data assets.

2025-12-20

All regions

All DataWorks users

Metadata Collection

Open Data adds a Data Quality view

DataWorks Open Data now includes the Quality Instance View. Use this view with MaxCompute SQL to query daily details and statistical metrics of data quality instances for custom, multi-dimensional analysis.

2025-12-20

All regions

DataWorks Enterprise Edition users

Open Data Table Structure Details

November 2025

Feature

Description

Release date

Region

Available to

Documentation

Concurrency support for for-each nodes

You can now choose between Serial Execution and Parallel Execution for for-each nodes.

2025-11-27

All Regions

All DataWorks users

for-each Node

New Dependent Node

The Dependent Node lets you configure complex cross-cycle dependencies.

2025-11-27

All Regions

All DataWorks users

Dependent Node

Notebook supports AnalyticDB for MySQL Spark compute resources

In a DataWorks Notebook, use Magic Commands to connect to an AnalyticDB for MySQL Spark Compute Resource for PySpark data development.

2025-11-27

All Regions

All DataWorks users

Advanced Notebook Development

Export Data Analysis query results to Object Storage Service (OSS)

After you run a query in Data Analysis, click Export > Object Storage Service (OSS) to export the results for archival and reuse.

2025-11-27

All Regions

All DataWorks users

Export and Share

Data Integration supports Public Data Sources as a source

You can now use Public Data Sources as a source in Data Integration. You can perform offline, single-table synchronization to any supported Destination.

2025-11-20

All Regions

All DataWorks users

Public Dataset Data Source

Data Integration enhancements

  • Real-time, single-table synchronization from Hologres to a Lindorm wide table.

  • Offline, single-table writes to AWS S3.

  • Real-time, full-database synchronization from PostgreSQL to a MaxCompute DeltaTable.

  • Offline, single-table reads from a Databricks Data Source.

  • Real-time, single-table synchronization from Kafka to Lindorm.

  • Real-time, full-database synchronization from OceanBase to Hologres.

2025-11-20

All Regions

All DataWorks users

Data Integration Overview

October 2025

Feature

Description

Release date

Region

Available to

Documentation

Connect Data Studio to your local IDE

You can connect Data Studio to your local Integrated Development Environment (IDE). This lets you work in your preferred editor while leveraging cloud-based collaboration and version management.

2025-10-25

China (Zhangjiakou), China (Ulanqab), China (Beijing), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), Malaysia (Kuala Lumpur), Indonesia (Jakarta), Thailand (Bangkok), Japan (Tokyo), US (Silicon Valley), Germany (Frankfurt), UK (London), US (Virginia), China (Hong Kong), Singapore

Users of Data Studio

/

Git integration for code synchronization

Data Studio supports automatic code synchronization to Git and merging code from Git. This enables an end-to-end DevOps workflow and enhances team collaboration.

2025-10-25

China (Zhangjiakou), China (Ulanqab), China (Beijing), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), Malaysia (Kuala Lumpur), Indonesia (Jakarta), Thailand (Bangkok), Japan (Tokyo), US (Silicon Valley), Germany (Frankfurt), UK (London), US (Virginia), China (Hong Kong), Singapore

Users of Data Studio

Git code synchronization and merge

Serverless Spark and StarRocks Nodes

Data Studio is fully compatible with OpenLake workspaces and introduces Serverless Spark and Serverless StarRocks Nodes. This delivers a seamless and efficient development experience for OpenLake users.

2025-10-25

China (Zhangjiakou), China (Ulanqab), China (Beijing), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), Malaysia (Kuala Lumpur), Indonesia (Jakarta), Thailand (Bangkok), Japan (Tokyo), US (Silicon Valley), Germany (Frankfurt), UK (London), US (Virginia), China (Hong Kong), Singapore

Users of Data Studio

Data Studio Agent

The Data Studio Agent provides end-to-end autonomous development. Based on a submitted requirements document, it can automatically plan Tasks, generate code and workflows, configure scheduling, and manage the release process. This streamlines the ETL development workflow, improves the developer experience, reduces code errors, and accelerates project delivery.

2025-10-25

China (Zhangjiakou), China (Beijing), China (Ulanqab), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Japan (Tokyo)

Users of Data Studio

Data Studio Agent

Copilot Rule Engine

The Copilot Rule Engine lets you customize the Agent's behavior to align with your team's standards and preferences. This flexible configuration meets both standardized and personalized requirements in enterprise scenarios.

2025-10-25

China (Zhangjiakou), China (Beijing), China (Ulanqab), China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Japan (Tokyo)

Users of Data Studio

Configure rules

Multimodal dataset search

DataWorks supports a tenant-level search across all your managed multimodal datasets, including those created in DataWorks and PAI. For PAI datasets that have been tagged and indexed, you can also perform semantic searches on dataset details.

2025-10-17

All Regions

All users

/

Large Language Model (LLM) Node

The Large Language Model (LLM) Node integrates large language models to provide intelligent data processing and analysis. This significantly lowers the technical barrier, empowering business users to work directly with data.

2025-10-17

China (Hangzhou), China (Shanghai), China (Beijing), China (Ulanqab), China (Shenzhen), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), US (Silicon Valley), and US (Virginia)

Users of Data Studio

Large Language Model (LLM) Node

DataWorks OpenData

DataWorks OpenData provides a centralized collection of metadata, including detailed information about objects such as tables, Nodes, instances, workspaces, members, and projects within your tenant. After you install OpenData in a workspace that is bound to a MaxCompute compute resource, you can use MaxCompute Package views to authorize and share metadata. This lets you quickly access standardized and traceable metadata for more efficient data governance and analysis.

2025-10-14

All Regions

DataWorks Enterprise Edition users

September 2025

Feature

Description

Release date

Region

Available to

Documentation

Enhanced Data Integration capabilities

  • Supports single-table real-time synchronization from Hologres to MaxCompute.

  • Supports single-table offline reads from Lindorm Iceberg tables, enabling you to use data from your data lake in other applications.

  • Supports single-table offline reads from Data Lake Formation (DLF) tables, enabling you to use data from your data lake in other applications.

  • Supports single-table offline synchronization from OceanBase sharded databases and tables to MaxCompute.

2025-09-22

All regions

All users in the available regions

Data Integration Overview

Data Integration alert rule templates

Data Integration supports alert rule templates. This feature lets you configure a template once and apply it to multiple tasks, improving alert configuration efficiency.

2025-09-22

All regions

All users in the available regions

Common Alert Rules

Custom model support for AI-assisted processing

The AI-assisted processing feature in Data Integration supports custom Large Language Models (LLMs) for single-table offline tasks, enabling you to use your own fine-tuned models for AI-powered data processing.

2025-09-22

All regions

All users in the available regions

AI-assisted Processing

Embedding support in Data Integration

Data Integration supports Embedding vectorization for data processing in single-table offline synchronization tasks. You can extract data from sources like Object Storage Service (OSS), MaxCompute, and Hadoop Distributed File System (HDFS), transform it into vectors, and load it into destinations with vector storage capabilities. Supported destinations include vector databases like Milvus, Elasticsearch, and OpenSearch, as well as Hologres vector tables. This simplifies ETL workflows, provides efficient knowledge vectorization, and supports AI applications like Retrieval-Augmented Generation (RAG).

2025-09-22

All regions

All users in the available regions

Vectorization

Deploy and use Large Language Model (LLM) services in DataWorks resource groups

The DataWorks Large Language Model (LLM) service offers a unified solution for efficient model deployment, secure communication, and simplified invocation. You can deploy models using a DataWorks serverless resource group and call them directly in data development tasks. All traffic is transmitted through a PrivateLink channel, which ensures that your data remains within your domain and enhances data security.

2025-09-22

China (Hangzhou), China (Shanghai), China (Beijing), China (Ulanqab), China (Shenzhen), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), US (Silicon Valley), and US (Virginia)

All users in the available regions

August 2025

Feature

Description

Release date

Available regions

Available to

Documentation

E-MapReduce (EMR) Serverless Spark Compute Resources now support OpenLDAP and EMR Kyuubi

E-MapReduce (EMR) Serverless Spark Compute Resources now support OpenLDAP account mapping and Kyuubi Configurations. You can now manage permissions more efficiently and develop EMR Kyuubi node tasks.

2025-08-14

All regions

Users of the new Data Studio

Bind an E-MapReduce (EMR) Serverless Spark compute resource

Notebook Nodes now support Magic Commands for quick connections

In Notebooks, you can now use Magic Commands in Python cells to quickly create and start a Livy Service. This enables you to efficiently develop and debug on MaxCompute Spark and E-MapReduce (EMR) Serverless Spark Compute Resources.

2025-08-07

All regions

Users of the new Data Studio

Develop with Notebooks

July 2025

Feature

Description

Release date

Region

Available to

Documentation

Compute quota in node scheduling configuration

Data Studio now supports setting a Compute Quota in the scheduling configuration for MaxCompute SQL and MaxCompute Script Nodes.

2025-07-31

All regions

Users of the new Data Studio

Configure node scheduling

Clone and version rollback for data objects

Data Studio now supports cloning and Version Rollback for data objects such as workflows, nodes, and components. This feature lets you quickly reuse these objects and easily restore previous versions.

2025-07-31

All regions

Users of the new Data Studio

Use datasets

Data Studio now supports Datasets in Shell nodes, Python nodes, Notebooks, and the personal development environment.

2025-07-28

All regions

Users of the new Data Studio

Use datasets

Manage datasets

DataWorks now supports the Dataset feature, which lets you manage versions of Unstructured Data such as images and documents.

2025-07-28

All regions

All users

Manage datasets

Data Umbrella entry point update

The entry point for the Data Umbrella feature has been updated.

Before: All Products > Data Governance > Data Umbrella

After: All Products > Data Governance > Security Center. In the left-side navigation pane of Security Center, click Sensitive Data Management.

2025-07-10

China (Shanghai), China (Shenzhen), China (Beijing), China (Chengdu)

All users in the available regions

-

Triggered workflows in Data Studio

Triggered Workflows are ideal for scenarios that do not have a fixed scheduling cycle. You can run them manually or trigger them based on events.

2025-07-07

China (Hangzhou), China (Ulanqab), China (Shenzhen)

Users of the new Data Studio

June 2025

Feature

Description

Release date

Region

Available to

Documentation

Convert Pay-as-you-go Resource Groups to Subscription

You can now convert Pay-as-you-go Serverless Resource Groups to the Subscription billing model.

June 19, 2025

All Regions

All users

Use Serverless Resource Groups

Data Synchronization

For real-time, full-database Synchronization Tasks from MySQL or PolarDB to MaxCompute, you can now use Function Expressions to assign values to target table fields.

June 11, 2025

All Regions

All users

Assign Values to Target Table Fields Using Function Expressions

PAI Flow

PAI Flow enables end-to-end Machine Learning Workflow development. It offers the same workflow features as Visualized Modeling (Designer) in Platform for AI (PAI) and supports periodic scheduling.

June 10, 2025

China (Hangzhou), China (Shanghai), China (Beijing), China (Ulanqab), China (Shenzhen), China (Hong Kong), Singapore, Indonesia (Jakarta), Japan (Tokyo), Germany (Frankfurt), US (Silicon Valley), and US (Virginia)

Users of the new Data Studio

How to Configure PAI Flow

Smoke Testing

The new Data Studio now supports Smoke Testing. This feature validates the parameter substitution logic and execution results of a scheduled node task to prevent basic configuration errors from affecting production data.

June 10, 2025

All Regions

All users

Smoke Testing

Manual Task Support in O&M Dashboard

You can now view the status of manual business workflows and Manual Task instances on the O&M Dashboard.

June 4, 2025

All Regions

All users

View Statistics on the O&M Dashboard

May 2025

Feature

Description

Release date

Region

Scope

Documentation

Personal Development Environment

When you create a DataWorks Custom Image in the Personal Development Environment, DataWorks automatically creates a corresponding MaxCompute Custom Image.

2025-05-23

All Regions

Users of the new DataStudio

Build a MaxCompute Custom Image in a Personal Development Environment

Code Review

Code Review improves the quality and compliance of production Code through manual checks. You can require a manual review before Task Deployment to meet your governance needs. If this review is mandatory, only tasks that pass it can be deployed.

2025-05-23

All Regions

Users of the new DataStudio

Code Review

April 2025

Feature

Description

Release date

Region

Scope

Documentation

DataWorks and Lindorm Integration

You can now associate a Lindorm Compute Resource with a Workspace. This allows you to develop Tasks using Lindorm Spark and Lindorm Spark SQL Nodes, and manage Lindorm Data Lineage in Data Map.

2025-04-18

China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen)

Users of the new version of Data Studio

Data Integration

Data Integration now supports MongoDB Data Source versions 6.x and 7.x.

2025-04-14

All Regions

All users

MongoDB Data Source

MaxCompute and Hologres Integration

Data Studio now supports Metadata mapping and Data Synchronization between MaxCompute and Hologres.

2025-04-10

All Regions

All users

DataWorks Agent

The DataWorks Agent is now available. You can use it to develop data Tasks using natural language.

2025-04-08

All Regions

All users

DataWorks Agent with Third-Party Clients

March 2025

Feature

Description

Release date

Region

Scope

References

Data Integration

Data Integration now supports the Milvus data source.

2025-03-27

All Regions

All users

Milvus

Data Integration

Data Integration adds support for real-time synchronization of an entire MySQL database to LogHub (SLS).

2025-03-12

All Regions

All users

Real-time synchronization of an entire MySQL database to LogHub (SLS)

Operation Center

The Operation Center now features Copilot intelligent search and allows you to save custom views.

2025-03-03

All Regions

All users

February 2025

Feature

Description

Release date

Region

Scope

References

Operation Center

You can now access the E-MapReduce (EMR) web UI from Intelligent Diagnosis task run logs over the public network, even if the UI is on a private network.

2025-02-26

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), US (Silicon Valley), US (Virginia), and UAE (Dubai)

Users of DataWorks Professional Edition or a more advanced edition

Intelligent Diagnosis

Operation Center

When a Baseline is enabled for an E-MapReduce (EMR) node in Data Studio, you can now configure the ENABLE_TASK_PRIORITY parameter to control the YARN queue's scheduling priority.

2025-02-14

All regions

All users

Configure a priority mapping between a baseline and a YARN queue

Data Studio

You can now upgrade the PyODPS version for PyODPS 3 nodes in Data Studio.

2025-02-14

All regions

All users

DataWorks Copilot

DataWorks Copilot now supports the following DeepSeek models: DeepSeek-R1-671B and DeepSeek-R1-Distill-Qwen-32B.

2025-02-14

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), and China (Chengdu)

All users

DataWorks Copilot code programming assistant

DataWorks Copilot

DataWorks Copilot now supports code optimization and code testing.

2025-02-14

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), Singapore, Malaysia (Kuala Lumpur), and Indonesia (Jakarta)

All users

DataWorks Copilot code programming assistant

Personal Development Environment

  • You can now create a Personal Development Environment instance from a GPU image, which is ideal for deep learning frameworks such as TensorFlow and PyTorch.

  • You can now configure a workspace-level auto-shutdown policy and set a scheduled shutdown time for Personal Development Environment instances.

2025-02-12

All regions

Users of the new Data Studio

Personal development environment

Operation Center

Automated O&M now includes an automatic rerun policy.

2025-02-10

All regions

All users

Automated O&M

New node in Data Studio

The SUB_PROCESS node is a special node type that lets you reference one workflow from within another. This lets you break down complex tasks into smaller, independently defined subtasks, improving workflow maintainability and reusability.

2025-02-06

All regions

Users of the new Data Studio

SUB_PROCESS node

January 2025

Feature

Description

Release date

Region

Scope

References

Data Source

DataWorks now supports PolarDB-O (Sharding) as a data source.

2025-01-14

All regions

All users

Configure a batch sharding synchronization task

Availability Zone replacement for legacy Resource Groups

This feature enhances disaster recovery capabilities for resources, enabling a faster and more efficient response to failures.

2025-01-02

All regions

All users

2024

December

Feature

Description

Release date

Region

Scope

References

New node type in Data Studio

Data Studio adds the ADB Spark SQL node. Use this node to develop, periodically schedule, and integrate AnalyticDB for MySQL Spark SQL tasks with other task types.

Dec 19, 2024

All regions

Users of the new Data Studio

ADB Spark SQL node

New node type in Data Studio

Data Studio adds the ADB Spark node. Use this node to develop, periodically schedule, and integrate AnalyticDB for MySQL Spark tasks with other task types.

Dec 19, 2024

All regions

Users of the new Data Studio

ADB Spark node

Hologres dynamic tables

DataWorks data catalogs now integrate with the Hologres dynamic table engine. This integration provides visual tools to manage dynamic tables, configure scheduling dependencies, and maintain tasks.

Dec 13, 2024

All regions

Users of the new Data Studio

Use Hologres dynamic tables

New data source type

DataWorks now supports the TiDB data source.

Dec 4, 2024

All regions

All users

TiDB data source

Data Quality

Data Quality now supports format-matching Quality Rule Templates. You can define data formats with custom regular expressions or use built-in templates to validate common formats like email addresses, phone numbers, and ID card numbers.

Dec 2, 2024

All regions

All users

View built-in rule templates

November

Feature

Description

Release date

Region

Scope

References

Data Integration

Data Integration now supports real-time synchronization of entire ApsaraDB for OceanBase databases to MaxCompute for tenants using the MySQL protocol.

2024-11-21

All regions

All users

Real-time synchronization of entire ApsaraDB for OceanBase databases to MaxCompute

Data Map

Data Map now supports collecting and managing Metadata from AnalyticDB for Spark.

2024-11-21

All regions

Users of the new Data Studio

Metadata collection

Data Map

You can now create a Data Insight from the details page of a MaxCompute table in Data Map to gain in-depth statistics and distribution analysis.

2024-11-21

China (Hangzhou), China (Shanghai), China (Shenzhen), China (Chengdu), China (Ulanqab), and China (Beijing)

All users

MaxCompute table data

Data Asset Governance

The Data Governance Center is upgraded to Data Asset Governance. This feature automatically identifies issues with data storage, task computation, code development, and data quality based on pre-configured governance plans. It uses a Health Score to quantify governance results from multiple perspectives. Additionally, it provides features like business asset management, asset analysis, resource consumption details for tasks, and cost estimation for comprehensive resource management.

2024-11-14

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), US (Silicon Valley), and US (Virginia)

All users

Data Asset Governance

Compute Resources

You can now associate AnalyticDB for Spark compute resources with a workspace.

2024-11-05

All regions

Users of the new Data Studio

Associate a computing resource

Git integration for Personal Development Environment

The Personal Development Environment now integrates with Git repositories, simplifying code version management and team collaboration.

2024-11-04

All regions

Users of the new Data Studio

Connect a personal development environment to a Git repository

October

Feature

Description

Release date

Region

Scope

References

Image management

DataWorks now lets you build a Custom Image into a persistent Image. This avoids the need to redeploy the image environment for each task. Using the same runtime environment for every run ensures Consistency, reduces task completion time, and lowers computing cost and traffic costs.

Oct 18, 2024

China (Beijing), China (Shanghai), China (Shenzhen), China (Hangzhou), China (Hong Kong), China (Zhangjiakou), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Japan (Tokyo), Germany (Frankfurt), UK (London), US (Silicon Valley), and US (Virginia)

All DataWorks users

Custom images

Serverless synchronization task

Data Integration introduces the Serverless synchronization task. This fully managed task type operates without a resource group, letting you focus on your synchronization logic instead of infrastructure management.

Oct 12, 2024

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Hong Kong), UK (London), US (Silicon Valley), US (Virginia), Japan (Tokyo), Germany (Frankfurt), and Malaysia (Kuala Lumpur)

All DataWorks users

Serverless synchronization task

September

Feature

Description

Release date

Region

Users

References

Real-time synchronization tasks

DataWorks now supports real-time single table data ingestion from Simple Log Service (SLS) to Data Lake Formation (DLF) 2.0. This feature writes data in the Paimon format and supports simple data processing during synchronization.

Sep 13, 2024

All regions

All DataWorks users

Real-time single-table ingestion from Simple Log Service (SLS) to Data Lake Formation

Real-time synchronization tasks

DataWorks now supports auto-scaling for real-time synchronization tasks. You can create an adjustment schedule to dynamically adjust resources for a running task without downtime.

Sep 13, 2024

All regions

All DataWorks users

Conditions for resource auto-scaling

August

Feature

Description

Release date

Region

Scope

References

Real-time synchronization task

DataWorks now supports real-time synchronization from a full MySQL database to SelectDB or Apache Doris.

August 29, 2024

All regions

All DataWorks users

Real-time synchronization from a full MySQL database to SelectDB

Hologres data access control

The DataWorks Security Center now lets you manage permissions for accessing Hologres data. You can specify authorized identities, request permissions, and approve requests. You can also view permission request records and request processing records.

August 22, 2024

All regions

All DataWorks users

Hologres data access control

Export SQL query results to a DingTalk sheet

After you run an SQL query in DataWorks, you can export the results directly to a DingTalk sheet. This avoids the security risks of downloading query results as local Excel files.

August 14, 2024

China (Zhangjiakou)

China (Chengdu)

All DataWorks users

Export SQL query results to a DingTalk sheet

Lowered permission requirements for modifying the default access identity of a MaxCompute data source

Fewer permissions are now required to modify the default access identity of a MaxCompute data source. If the default access identity is a RAM User, the user now only needs the admin or super_administrator role for the MaxCompute project. Previously, the AdministratorAccess policy for RAM was required.

August 8, 2024

All regions

All DataWorks users

Associate a MaxCompute computing resource

Support for CloudSSO in DataWorks Enterprise Edition

DataWorks Enterprise Edition now supports CloudSSO. CloudSSO lets you use a third-party or self-managed Identity Provider (IdP) to log on to DataWorks.

August 8, 2024

All regions

All DataWorks users

Features by edition

July

Feature

Description

Release date

Region

Scope

References

RAM policy update

The dataworks:ListUserResources permission has been added to the AliyunDataWorksReadOnlyAccess policy, allowing users with this policy to view user resource information.

Jul 10, 2024

All regions

All DataWorks users

E-MapReduce cluster registration enhancements

When registering an E-MapReduce cluster in DataWorks, you can now:

  • Configure custom Kyuubi connection information.

  • Register an E-MapReduce Serverless Spark cluster.

Jul 10, 2024

E-MapReduce Serverless Spark is available only in the China North 3 (Zhangjiakou) region.

All DataWorks users

New node type in DataStudio

DataStudio now supports CDH Spark SQL nodes. You can use this node type to develop and periodically schedule CDH Spark SQL tasks and integrate them with other tasks.

Jul 10, 2024

All regions

All DataWorks users

Create a CDH Spark SQL node

June

Feature

Description

Release date

Region

Scope

References

Data Integration: MySQL to StarRocks Synchronization

Data Integration now supports both batch and real-time synchronization of entire MySQL databases to StarRocks.

June 28, 2024

All regions

All DataWorks users

DataStudio supports Data Push Nodes

In a workflow, Data Push Nodes allow you to periodically send the output from upstream data processing tasks as message cards to DingTalk or Lark groups.

Note

To use Data Push Nodes, submit a ticket to contact technical support and request an upgrade for your Exclusive Resource Group for Scheduling.

June 28, 2024

  • China (Hangzhou)

  • China (Shanghai)

  • China (Beijing)

  • China (Shenzhen)

  • China (Chengdu)

  • China (Hong Kong)

  • Singapore

  • Malaysia (Kuala Lumpur)

  • US (Silicon Valley)

  • US (Virginia)

  • Germany (Frankfurt)

All DataWorks users

Configure data push nodes

Introducing Serverless Resource Groups

To centralize resource management and unify the user experience, DataWorks introduces Serverless Resource Groups. A Serverless Resource Group consolidates the core features of the legacy Exclusive Resource Group for Scheduling, Exclusive Resource Group for Data Integration, and Exclusive Resource Group for DataService Studio. You can now use a single resource group for data synchronization, task scheduling, and API management.

June 11, 2024

  • China (Beijing)

  • China (Shanghai)

  • China (Shenzhen)

  • China (Hangzhou)

  • China (Hong Kong)

  • China (Zhangjiakou)

  • Singapore

  • Malaysia (Kuala Lumpur)

  • Indonesia (Jakarta)

  • Japan (Tokyo)

  • Germany (Frankfurt)

  • UK (London)

  • US (Silicon Valley)

  • US (Virginia)

All DataWorks users

New Best Practice: Developing Tasks on the Lindorm Compute Engine

The Lindorm Compute Engine is compatible with Cloudera's Distribution Including Apache Hadoop (CDH). You can register a CDH cluster in DataWorks and configure the Lindorm Compute Engine connection information to perform interactive SQL queries, develop SQL tasks, and run JAR tasks.

June 5, 2024

All regions

All DataWorks users

Develop tasks based on LDPS

New Data Source for Data Integration

Data Integration now supports Azure Blob Storage as a data source.

June 3, 2024

All regions

All DataWorks users

Azure Blob Storage

May

Parameter

Description

Release date

Region

Scope

References

Reading MySQL binary logs from Object Storage Service (OSS)

Data Integration now supports reading MySQL binary logs from Object Storage Service (OSS).

When you add a MySQL data source, if you set Configuration Mode to Alibaba Cloud Instance Mode and the RDS for MySQL instance is in the same region as the DataWorks workspace, you can enable Enable Binary Log Reading from OSS. After you enable this feature, if DataWorks cannot access the RDS binary logs, it attempts to obtain the binary logs from OSS. This prevents real-time synchronization tasks from being interrupted.

May 24, 2024

All Regions

All DataWorks users

MySQL

Data Quality module redesign

The redesigned Data Quality module streamlines the data quality monitoring workflow. You can now validate specific data ranges in a table by using monitoring rules.

May 21, 2024

The new version of Data Quality is rolling out to Regions in phases. For availability in your Region, check the Console. If the new features are not yet available in your Region, see the Data Quality (Legacy) documentation.

All DataWorks users

Data Quality (new version)

New Data Integration synchronization path

Data Integration now supports batch synchronization of a full Hologres database to another Hologres database.

May 20, 2024

All Regions

All DataWorks users

Batch synchronization from a full Hologres database to Hologres

Remotely trigger server scripts

The DataWorks SSH node lets you remotely run scripts on a host by using an SSH data source.

May 15, 2024

All Regions

All DataWorks users

DataStudio now supports EMR Kyuubi nodes

You can use the DataWorks EMR Kyuubi node to develop, periodically schedule, and integrate Kyuubi tasks with other jobs.

May 11, 2024

All Regions

All DataWorks users

EMR Kyuubi node

DataStudio now supports multiple new database node types

DataStudio now supports multiple database node types, including DRDS, PolarDB for MySQL, and Doris. You can use these nodes to develop and periodically schedule database tasks and integrate them with other jobs.

May 11, 2024

All Regions

All DataWorks users

April

Feature

Description

Release date

Region

Scope

References

SSL authentication for PostgreSQL data sources

Data Integration now supports SSL authentication when you add a PostgreSQL data source.

Apr 26, 2024

All Regions

All DataWorks users

PostgreSQL

Support for Hologres data sources in Data Governance Center

The Data Governance Center now supports Hologres data sources.

Before you can use a Hologres data source in Data Governance Center, you must collect metadata of Hologres in Data Map. For more information, see Metadata collection.

Apr 24, 2024

Hologres data sources support Data Governance Center only in the following regions: China (Beijing), China (Shanghai), China (Hangzhou), and China (Shenzhen).

All DataWorks users

Data Governance Center overview

New materialized view feature in Data Governance Center

DataWorks provides an intelligent, automated solution for frequent big data computing tasks that involve many similar subqueries. After you enable this feature, DataWorks automatically detects and categorizes similar subqueries in MaxCompute and generates recommendations for materialized views. You can create materialized views with a single click to significantly improve computational efficiency and save computing resources.

Apr 12, 2024

All Regions

All DataWorks users

Automated governance of materialized views

OpenLDAP authentication for DataWorks on CDH/CDP

When you register a CDH/CDP cluster in DataWorks, you can configure a custom mapping between an Alibaba Cloud account or a RAM user and an OpenLDAP account in the cluster. After the mapping is configured, the mapped OpenLDAP account runs tasks submitted by the Alibaba Cloud account or RAM user. This provides permission isolation when different Alibaba Cloud accounts or RAM users access data within the CDH/CDP cluster.

Apr 8, 2024

China (Beijing), China (Shanghai), China (Hangzhou), China (Shenzhen), China (Zhangjiakou), and China (Chengdu)

All DataWorks users

Configure mappings between tenant member accounts and CDH or CDP cluster accounts

March

Feature

Description

Release date

Region

Scope

References

Data backfill feature

Once deployed, an auto triggered task runs based on its configured schedule. data backfill lets you run the task for a specific past or future time range, writing data to the corresponding time partition. This feature supports the following methods:

Mar 28, 2024

All regions

All DataWorks users

Manage data backfill instances

Develop and deploy extensions with Function Compute

You can now develop and deploy DataWorks extensions using Function Compute. This feature lets you define custom logic to manage user actions, such as intercepting an event message to block an unwanted operation. With this deployment method, event messages are sent directly to the corresponding Function Compute Service. Key points:

  • The deployment is simple and requires only a single function.

  • Function Compute usage is billed separately. For more information, see Billing overview.

  • Currently, an extension deployed using this method can only process a pre-event for data download.

Mar 19, 2024

  • China (Beijing)

  • China (Hangzhou)

  • China (Shanghai)

  • China (Zhangjiakou)

  • China (Shenzhen)

  • China (Chengdu)

  • US (Silicon Valley)

  • US (Virginia)

  • Germany (Frankfurt)

  • Japan (Tokyo)

  • China (Hong Kong)

  • Singapore

DataWorks Enterprise Edition users

Develop and deploy extensions based on Function Compute

Data Modeling supports custom model publishing policies

After you enable a policy, you can select the corresponding publishing mode when publishing a model.

Mar 12, 2024

All regions

DataWorks users who purchase the Intelligent Data Modeling service

Publishing policy management

February

Feature

Description

Release date

Region

Availability

References

New guide for using CDP/CDH in DataWorks

This guide describes the basic development process for using CDP/CDH in DataWorks, covering billing, environment preparation, and Access Control.

Feb 21, 2024

All regions

All DataWorks users

DataWorks on CDP/CDH guide

DataService Studio supports StarRocks data sources in instance mode

After you create an E-MapReduce (EMR) Serverless StarRocks cluster, you can add it as a StarRocks data source in DataWorks by using the Alibaba Cloud instance mode. You can then use DataService Studio to create data APIs from the StarRocks data source for data sharing and access.

Feb 20, 2024

All regions

All DataWorks users

Configure data sources

Data Map now supports DataStudio code search

Data Map now supports code search in DataStudio. This feature lets you search for specific code across workspaces by using keywords. This improves development efficiency and reduces project redundancy.

Feb 20, 2024

All regions

Users of DataWorks Standard Edition or a more advanced edition

Metadata retrieval

New data upload and download feature

You can use the data upload and download feature to upload local CSV files and files from Object Storage Service (OSS) to MaxCompute for processing and analysis. You can also manage lists of uploaded and downloaded files from other modules, simplifying data management.

Feb 20, 2024

All regions

All DataWorks users

DataStudio now supports CDH cluster nodes

You can now use DataWorks to develop and periodically schedule CDH-related tasks for various engines, including Hive, Spark, MR, Presto, and Impala.

Feb 19, 2024

All regions

All DataWorks users

New System Configuration page for Data Security Guard

Use the new System Configuration page to:

  • Configure the content and scope for data identification.

  • Set the retention period for watermarked files.

  • Specify whether to display the security level of identified data.

  • Configure email addresses and webhook URLs for receiving Alert Notifications.

These settings help you identify and handle potential security risks.

Feb 6, 2024

All regions

All DataWorks users

System configuration

January

Feature

Description

Release date

Region

Scope

References

Data masking for query results in DataStudio and DataAnalysis

Data Security Guard now supports data classification, sensitive data identification, and data masking for data in E-MapReduce (EMR) tables.

This feature strengthens your enterprise's data security.

Jan 25, 2024

All Regions

All DataWorks users

Data Lineage for Real-time Synchronization links in Data Map

Data Map now parses and displays Data Lineage for the following Real-time Synchronization links:

  • Real-time Synchronization from MySQL to MaxCompute or Hologres

  • Real-time Synchronization from Kafka to MaxCompute or Hologres

  • Real-time Synchronization from LogHub to MaxCompute or Hologres

  • Real-time Synchronization from PolarDB to MaxCompute

By combining Real-time and Batch Synchronization lineage, you can gain a more comprehensive view of your data flow.

Jan 15, 2024

All Regions

All DataWorks users

View data lineages

2023

December

Feature

Description

Release date

Region

Users

References

Associate a Data Source with DataStudio

Before performing data modeling, data development, or using the Operation Center for periodic scheduling in DataWorks, associate your Data Source or Cluster with the DataStudio module. Once associated, you can read data from the Data Source or Cluster and perform development tasks.

Dec 29, 2023

All regions

All DataWorks users

Preparing for data development: Associate a Compute Resource or a Cluster with DataStudio

Compute Resource consolidation

To unify the user experience, DataWorks now manages MaxCompute, Hologres, AnalyticDB for PostgreSQL, AnalyticDB for MySQL, and ApsaraDB for ClickHouse engines as Compute Resources. E-MapReduce (EMR) and CDH/CDP engines are now managed as Open Source Clusters. After this update, operations on the original Compute Engines, such as creating and editing, are now performed on the Compute Resource or Open Source Clusters page.

Dec 29, 2023

All regions

All DataWorks users

New Extension Point Events

  • DeleteProject: A pre-event that is triggered before a Workspace is deleted.

  • ProjectDeleted: A post-event that is triggered after a Workspace is deleted.

  • DownloadResources: An event that is triggered before a data download.

Dec 27, 2023

All regions

All DataWorks users

New application scopes for Extension Point Events

Extension Point Events now support the following application scopes:

  • Tenant level: The event applies to the entire Tenant.

  • Workspace level: The event applies only to the target Workspace.

When you register an Extension, you can select only one type of Extension Point Event.

Dec 22, 2023

  • China (Beijing)

  • China (Hangzhou)

  • China (Shanghai)

  • China (Zhangjiakou)

  • China (Shenzhen)

  • China (Chengdu)

  • US (Silicon Valley)

  • US (Virginia)

  • Germany (Frankfurt)

  • Japan (Tokyo)

  • China (Hong Kong)

  • Singapore

All DataWorks users

Data Governance Center introduces SQL efficiency optimization checks

The Data Governance Center introduces five new checks for ODPS, Hive, and Spark SQL, including Cartesian product detection, ineffective full table joins, and Brute-force Scans. This feature lets you proactively check and optimize code during development to improve computing efficiency, prevent resource waste, and ensure timely data output.

Dec 22, 2023

All regions

All DataWorks users

Configure check items

DataWorks now fully supports StarRocks Data Sources

DataWorks now provides full support for StarRocks Data Sources across the following services:

  • Data Integration: You can now synchronize StarRocks data.

  • DataStudio: You can now create and periodically schedule StarRocks tasks.

  • Data Analysis: You can now query and analyze StarRocks data.

  • DataService Studio: You can now expose StarRocks tables as APIs.

  • Data Map: You can now manage, search, and view StarRocks Metadata.

Dec 15, 2023

All regions

All DataWorks users

Support for additional E-MapReduce (EMR) Hadoop Cluster versions

DataWorks now supports over ten additional E-MapReduce (EMR) Hadoop Cluster versions, including:

  • EMR-3.26.3

  • EMR-3.27.2

  • EMR-3.29.0

  • EMR-3.32.0

  • EMR-3.35.0

  • EMR-3.38.2

  • EMR-3.38.3

  • EMR-4.3.0

  • EMR-4.4.1

  • EMR-4.5.0

  • EMR-4.5.1

  • EMR-4.6.0

  • EMR-4.8.0

  • EMR-4.9.0

  • EMR-5.2.1

  • EMR-5.4.3

  • EMR-5.6.0

Dec 15, 2023

All regions

All DataWorks users

Usage notes for development of EMR nodes in DataWorks

Check Node adds support for new object types

Use the Check Node to verify the availability of a target object, such as a MaxCompute partitioned table, an FTP file, or an OSS file. The Check Node returns a success status after the check policy is met.

If a task depends on a target object, use a Check Node to verify its availability and then configure the task as a Downstream Task of the Check Node. Once the check policy is met, the Check Node runs successfully and triggers the Downstream Task.

Dec 8, 2023

All regions

All DataWorks users

Check Node

DataStudio introduces the PAI DLC Node

You can now use the PAI DLC Node to periodically schedule and run PAI DLC tasks.

Dec 8, 2023

All regions

All DataWorks users

Create and use a PAI DLC node

Security Center introduces Risk Identification Rules

The Security Center now lets administrators register risk identification capabilities as Extensions in DataWorks. These extensions function as Risk Identification Rules to detect risks in user operations.

You can use default or custom extensions to identify risks in data download operations and configure a blocking or approval response as needed.

Dec 8, 2023

All regions

All DataWorks users

Risk Identification Rule

November

Feature

Description

Release date

Region

Scope

References

Check node now available in DataStudio

The DataStudio Check node verifies if a MaxCompute partitioned table is available by checking that the target partition exists and its data has been written. When a downstream task depends on a partitioned table, use this node to ensure data readiness. This helps prevent errors from processing incomplete or incorrect data.

November 20, 2023

  • China (Chengdu)

  • China (Zhangjiakou)

  • China (Beijing)

  • China (Shanghai)

  • Malaysia (Kuala Lumpur)

All DataWorks users

Check node

August

Feature

Description

Release date

Region

Scope

References

Custom scheduling cycles

Scheduling calendars extend the existing DataWorks scheduling cycles. You can now define a custom scheduling cycle by marking dates on a calendar as either scheduling or non-scheduling days.

Aug 24, 2023

All Regions

Users of DataWorks Enterprise Edition

Configure scheduling calendars

Data Governance Center now governs E-MapReduce Data Lakes

The DataWorks Data Governance Center now proactively analyzes the Data Lake development pipeline, which involves E-MapReduce, Data Lake Formation (DLF), and DataWorks. Capabilities include:

  • Governance health score assessment.

  • Automatic discovery of governance issues in development and storage.

  • Proactive issue checks for Hive SQL and Spark SQL.

Aug 24, 2023

  • China North 2 (Beijing) Ali Gov Cloud

  • China East 2 (Shanghai) Finance Cloud

  • China East 2 (Shanghai)

  • China East 1 (Hangzhou)

  • China North 2 (Beijing)

  • China South 1 (Shenzhen)

  • China Southwest 1 (Chengdu)

  • China (Hong Kong)

  • Singapore

  • US (Silicon Valley)

  • Germany (Frankfurt)

  • Indonesia (Jakarta)

Users of DataWorks Enterprise Edition and above

Data Governance Center overview

June

Feature

Description

Release date

Region

Scope

References

Real-time ETL from Kafka to Hologres

  • Data Integration supports real-time ETL from Kafka data sources to Hologres, which includes JSON parsing and other basic data processing during synchronization.

  • You can extract key-values from a specified JSON path and use dynamic key-value extension, which is ideal for handling changing message formats from the Kafka source.

  • During task configuration, the Simulated Run feature lets you verify the data transformation before writing to the destination.

Jun 1, 2023

All regions

All DataWorks users

Kafka

Real-time Full Database Synchronization from MySQL to a Data Lake on OSS in Hudi format

Data Integration supports real-time Full Database Synchronization from MySQL to a Data Lake on Object Storage Service (OSS), storing the data in Hudi format.

  • This feature automatically integrates with Data Lake Formation (DLF) to generate and manage Metadata.

  • It supports Instance-level synchronization, allowing you to select multiple databases from the source MySQL Instance.

  • You can select source MySQL databases and tables by using Regular Expressions.

  • New databases and tables added to the MySQL source are automatically discovered and synchronized to OSS without requiring manual intervention.

Jun 1, 2023

All regions

All DataWorks users

OSS

Support for Amazon Relational Database Service (Amazon RDS) data sources

You can configure an Amazon Relational Database Service (Amazon RDS) data source just as you would a MySQL data source. All capabilities available for MySQL also apply to Amazon RDS.

Jun 1, 2023

All regions

All DataWorks users

MySQL

April

Feature

Description

Release date

Region

Scope

References

Save Data Analysis results directly to MaxCompute

You can save Data Analysis results directly to a MaxCompute table without writing any code. This simplifies subsequent Query and joint analyses.

Apr 20, 2023

All regions

All DataWorks users

SQL Query (legacy version)

Download millions of Query results from Data Analysis

The default download limit for SQL Query results from Data Analysis is 10,000 rows. Administrators can configure higher limits in the Security Center based on the DataWorks edition: up to 200,000 for Standard Edition, 2,000,000 for Professional Edition, and 5,000,000 for Enterprise Edition and Ultimate Edition. Administrators can also disable the download feature.

Apr 18, 2023

All regions

All DataWorks users

SQL Query (legacy version)

New Public Datasets for big data analysis

DataWorks and MaxCompute provide seamless access to terabyte-scale Public Datasets for big data and AI analysis. These datasets include data from Taobao, Fliggy, Ali Music, GitHub, and TPC benchmarks.

Apr 11, 2023

All regions

All DataWorks users

SQL Query (legacy version)

March

Feature

Description

Release date

Region

Scope

References

Notifications for governance issues

Administrators and users can configure notifications for unresolved daily governance issues. These notifications are sent to designated recipients via system alerts, email, a DingTalk group, or a Webhook. This feature ensures responsible parties are promptly informed of new issues, enabling them to access the system for quick resolution.

Mar 15, 2023

All regions

All DataWorks users

Configure a periodic notification for governance issues

New governance item for long-lifecycle storage

This governance item lets you configure an appropriate Lifecycle for MaxCompute partitioned tables to reduce wasted storage resources.

Mar 15, 2023

All regions

All DataWorks users

Handle governance issues

Acceleration Service for DataService Studio is now commercially available

The Acceleration Service for DataService Studio generates online APIs for MaxCompute data sources. This service provides high query performance for online applications without needing to export data from MaxCompute.

Mar 1, 2023

China (Shanghai), China (Beijing), China (Hangzhou), and China (Shenzhen)

All DataWorks users

Acceleration Service

January

Feature

Description

Release date

Region

Scope

References

Centralized Resource Management

DataWorks now lets you manage all your active resources from a single location. This simplifies operations such as upgrading or downgrading specifications, renewals, and refunds.

Jan 11, 2023

All Regions

All DataWorks users

Billing overview

Batch graceful undeployment for tasks in Data Governance Center

This feature introduces:

  • A scenario-based governance plan for the secure, batch undeployment of invalid or duplicate tasks.

  • A graceful undeployment governance plan that lets administrators select target objects and quickly confirm the impact on users and assets.

  • A step-by-step process for orderly task undeployment, covering node suspension, delays, undeployment, and status notifications for each phase.

Jan 9, 2023

All Regions

All DataWorks users

Graceful undeployment

Code Review support in Basic Mode for DataStudio

DataStudio's Basic Mode now supports Code Review. You can enable this feature to ensure that Node tasks are deployed to the Production Environment only after passing review.

Jan 5, 2023

All Regions

All DataWorks users

Code Review

2022

November

Feature

Description

Release date

Region

Scope

References

Create APIs for development and production environments

In a Standard Mode Workspace, DataService Studio now lets you create APIs for specific environments. You can:

  • Configure advanced API parameters based on the environment type (Development Environment or Production Environment) of your Data Source.

  • Test APIs in a Development Environment and deploy them to a Production Environment. This separates your development and testing workflow from your production deployment.

2022-11-29

All regions

All DataWorks users

Create an API in wizard mode

Request permissions for Hive Tables in Data Map

A new Request Permissions button on the DataWorks Data Map > EMR Hive Table Details page allows you to go to Security Center to request table permissions.

2022-11-29

All regions

All DataWorks users

Which types of Hive Tables can be previewed in Data Map?

Data Map introduces Data Albums for data organization

DataWorks Data Map has added a Data Album page. The features are as follows:

  • Organize and manage data tables from a business perspective, based on categories and sensitivity levels.

  • Add frequently used tables, team tables, or popular tables to a Data Album for quick and easy retrieval.

2022-11-16

All regions

All DataWorks users

Data Album

DataAnalysis upgrade delivers a new SQL Query experience

The upgraded DataAnalysis provides a more unified and powerful SQL Query experience. You can now:

  • Centrally manage all your SQL files and frequently used data tables.

  • Extract business data with SQL statements, based on your permissions.

  • Perform post-processing on SQL Query results and visualize them as charts.

2022-11-15

All regions

All DataWorks users

SQL Query (legacy version)

DataService Studio now supports one-click parameter parsing for APIs

DataService Studio now simplifies API creation in advanced script mode. You can now:

  • Automatically Parse parameters from your SQL with the one-click parsing feature in the Request Parameter and Response Parameter panels.

  • Eliminate manual parameter entry.

2022-11-10

All regions

All DataWorks users

None

Recent Updates

Feature

Description

Release date

Region

Scope

References

Data Modeling now supports E-MapReduce Hive

DataWorks adds the following two features to Intelligent Data Modeling > Dimensional Modeling:

  • Publish models to E-MapReduce Hive and generate a corresponding ETL code framework.

  • Perform Reverse Modeling on existing E-MapReduce Hive tables to generate models.

November 25, 2022

All Regions

All DataWorks users

Data Modeling now supports Version Management

DataWorks adds the following two features to Intelligent Data Modeling > Dimensional Modeling:

  • Version Management for models. Only submitted models can be published.

  • Version comparison and Rollback across different versions of the same model.

November 25, 2022

All Regions

All DataWorks users

Publish a model

DataService Studio now displays API invocation URLs by domain name

The API details page now displays the invocation URLs generated for an API categorized by Internet, VPC, and Independent domain names. This lets you use different domain names to call the API.

October 21, 2022

All Regions

All DataWorks users

View the details of an API

Enhanced Data Lineage visualization in Data Map

The enhanced Data Lineage feature provides an improved analysis experience. From the lineage details page, you can:

  • View Upstream and Downstream Nodes for tables and columns.

  • Trace the original source and final destination of table data.

  • Perform Impact Analysis across different lineage levels.

October 21, 2022

All Regions

All DataWorks users

View the details of a table

Data Governance Center introduces four new development check items

The Data Governance Center introduces four new check items:

  • Type consistency in JOIN conditions.

  • Prohibited use of specific assets.

  • Use of UDFs with the same name.

  • Write restrictions in the Development Environment.

Key features include:

  • Manageable governance checks.

    You can enable and configure new check items on the Setting tab and view usage details in the Knowledge tab.

  • Proactive data governance for tasks during commit and deployment.

October 20, 2022

All Regions

All DataWorks users

Configure check items

Support for Code Review in DataStudio Basic Mode

In Basic Mode, you can now enable mandatory Code Review. When enabled, code can be deployed to the Production Environment only after it passes the review.

September 22, 2022

All Regions

All DataWorks users

Code review

August

Feature

Description

Release date

Region

Scope

References

Workflow-based task management in Operation Center

Operation Center now lets you view task status by Workflow and rerun, freeze, or terminate tasks.

Aug 22, 2022

All regions

All DataWorks users

View auto triggered task instances

Query Acceleration for MaxCompute data sources in DataService Studio

DataService Studio supports Query Acceleration for MaxCompute data sources. This feature lets you create online APIs to query MaxCompute data directly, enabling high-performance online queries without data export. The following acceleration solutions are available:

  • Acceleration based on Hologres Foreign Tables.

  • Acceleration based on MaxCompute Query Acceleration (MCQA).

Aug 17, 2022

China (Shanghai) and China (Shenzhen)

All DataWorks users

Acceleration service

Intelligent diagnostics and call chain analysis for APIs in DataService Studio

DataService Studio supports analyzing API Call Logs. Use this feature to trace the call chain of a single Request, detect abnormal requests, and quickly locate issues, receiving diagnostic results and recommendations.

Aug 7, 2022

All regions

All DataWorks users

View and Analyze API Call Logs (Public Preview)

Fine-grained Permission Management at the Project and Table levels in Data Map

Data Map provides Fine-grained Permission Management for Metadata. You can configure policies to control access at different levels, including:

  • Visibility of the current Project's Metadata to members of other Projects.

  • Visibility of Table Metadata to non-Project Members, non-Table owners, and Workspace Administrators.

Aug 5, 2022

All regions

All DataWorks users

Overview of permission management in Data Map

Codeless batch synchronization for Dameng databases in Data Integration

Data Integration now lets you create Batch Synchronization tasks for Dameng databases in the Codeless UI. This method is simpler than the Script Mode.

Aug 2, 2022

All regions

All DataWorks users

Use the codeless UI

July

Feature

Description

Release date

Region

Scope

References

Support for dimensional modeling in Intelligent Data Modeling

Intelligent Data Modeling now supports the following features:

  • In template design, you can now directly reference the Column and partition information of existing Hologres tables or views as Columns in the current model.

  • You can now use one-click filling for Columns that have an empty display name or description.

    Physical tables often include Column descriptions. You can use this feature to quickly populate missing display names and improve modeling efficiency.

  • In template development, you can create new DataStudio nodes or associate existing ones to streamline ETL development for your models.

Jul 29, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, China East 2 Finance, China South 1 Finance, China North 2 Ali Gov, Germany (Frankfurt), and US (Silicon Valley)

All DataWorks users

View associated table information in Intelligent Data Modeling

From the configuration tab of a derived metric or an atomic metric, you can now view the associated model table and Column names in the right-side navigation pane. You can also navigate directly to the target table's configuration tab to manage the association.

Jul 29, 2022

All DataWorks users

Derived metrics

Configure a naming rule checker for models and metrics in Intelligent Data Modeling

In a data layer, you can configure a naming checker for model or metric types. When you design models and metrics, the checker constrains and validates entity names to ensure consistent naming throughout the development process.

Naming rules:

  • Rule strength: A Strong Rule provides name suggestions and enforces the naming rule. A Weak Rule only provides name suggestions.

  • Rule definition: Defines the structure and order of elements that compose a name.

Jul 29, 2022

All DataWorks users

data layer

Configure exclusive resource groups for Data Analysis

An Alibaba Cloud account can now configure Data Analysis settings in System Management. You can run SQL queries on a specified exclusive resource group.

Jul 29, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), Germany (Frankfurt)

All DataWorks users

System Management

Enhanced SSL support for PostgreSQL in Data Integration

Postgres databases are supported, and for SSL authentication, you can use the two-file method with .crt and .key files.

Jul 26, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), UK (London), US (Silicon Valley), US (Virginia), UAE (Dubai)

All DataWorks users

Data Integration

DataWorks now supports E-MapReduce (EMR) Data Lake clusters

DataWorks now supports the E-MapReduce (EMR) Data Lake compute engine, enabling management of the entire data lifecycle. You can now use the E-MapReduce (EMR) engine for Data Integration, Intelligent Data Modeling, data development and scheduling, Data Quality, Data Map, data security, Data Analysis (on exclusive resource groups), and data services.

Jul 8, 2022

China (Chengdu), China (Zhangjiakou), China (Shenzhen), US (Silicon Valley), China (Beijing), China (Shanghai), Japan (Tokyo), Germany (Frankfurt), US (Virginia), Indonesia (Jakarta), UK (London), China (Hangzhou), Singapore, China (Hong Kong), Malaysia (Kuala Lumpur), UAE (Dubai)

All DataWorks users

Usage notes for development of EMR nodes in DataWorks

DataStudio intelligent code editor supports visual Column insertion and permission checks

  • The intelligent code editor automatically identifies table queries in your code. You can hover over a table name, select the desired Columns, and click Confirm to automatically insert them into your code.

  • The intelligent code editor also provides table permission verification, allowing you to request permissions directly from the prompt.

Jul 2, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong)

All DataWorks users

DataStudio

June

Feature

Description

Release date

Region

Users

References

Data Governance Center announces general availability

The Data Governance Center has the following features:

  • It automatically discovers and prevents data governance issues by calculating a health score from five dimensions: storage, Compute, development, quality, and security.

  • It provides detailed Resource Consumption breakdowns, overall resource consumption trends, and cost estimates for individual tasks to help you optimize and control resource costs.

Note
  • The Data Governance Center will be generally available on July 5, 2022, with a one-month limited-time trial.

  • After August 5, 2022, all its capabilities will be available only in DataWorks Enterprise Edition.

June 27, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen), China (Chengdu), Singapore, and US (Silicon Valley)

All DataWorks users

Data Governance Center overview

Data Governance Center introduces the Task 360 page

The new Task 360 page provides a panoramic view of your tasks. It centralizes key information such as associated governance issues, change event records, affected Baselines, and task execution details. This helps you govern your scheduled tasks.

June 24, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen), China (Chengdu), Singapore, and US (Silicon Valley)

All DataWorks users

Obtain a panoramic view of a task

Data Modeling now supports View creation and referencing

  • During the design process, you can now reference Column and partition information from existing Views to add as Columns in your current Model.

  • After completing the design, you can materialize a Model into a View.

June 22, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, US (Silicon Valley), Germany (Frankfurt), China East 2 Finance, China South 1 Finance, and China North 2 Ali Gov 1

All DataWorks users

Publish a model

Reverse Modeling using table name keywords

You can now use Reverse Modeling to generate a logical Model based on a Fuzzy Match of table name keywords.

June 19, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, US (Silicon Valley), Germany (Frankfurt), China East 2 Finance, China South 1 Finance, and China North 2 Ali Gov 1

All DataWorks users

Reverse modeling: Generate dimensional models from physical tables

Approval Center now supports Data Integration policies

To secure data transfers, the Approval Center now lets you define policies based on source and destination combinations. These policies require approval before you can save a Data Integration task, which provides more flexible control over the Data Integration process.

June 15, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), China (Hong Kong), Singapore, Indonesia (Jakarta), Malaysia (Kuala Lumpur), US (Silicon Valley), US (Virginia), and Germany (Frankfurt)

All DataWorks users

Approval policies for Data Integration tasks

Data Security Guard introduces sensitive Data Lineage graphs

The new sensitive data lineage visualization graph provides the following features:

  • It parses lineage relationships between sensitive Columns from the data production lineage. The system can then propagate identification results across Columns of the same sensitive data type, improving identification efficiency.

  • It automatically visualizes the identified lineage relationships to provide a clear map of data flow from source to destination.

Note

This feature is available only in DataWorks Enterprise Edition.

June 14, 2022

China (Hangzhou) and China (Shanghai)

All DataWorks users

Data Lineage (Public Preview)

Data Security Guard adds abnormal lineage analysis

The newly added Anomaly Data Lineage Analysis feature includes the following:

  • The system automatically analyzes abnormal associations between sensitive Columns based on their lineage to prevent users from bypassing sensitive data identification and usage audits by concatenating or splitting strings.

  • It helps you identify Columns that are related to the queried Column but have a different sensitive data type.

June 14, 2022

China (Hangzhou) and China (Shanghai)

All DataWorks users

Data Lineage (Public Preview)

May

Feature

Description

Release date

Region

Scope

References

Redesigned Risk Identification in Data Security Guard (Migration Required)

The redesigned Risk Identification feature uses built-in Scenarios to identify risks across multiple dimensions, using criteria such as Data Classification and Grading, operation methods, and user permissions. It improves alert aggregation to reduce false positives and provides fine-grained risk management for high, medium, and low-level risks. This provides a comprehensive view of data risks in your enterprise.

Note
  • The Risk Identification Management feature is available only in DataWorks Professional Edition or higher.

  • The legacy Risk Identification Management feature will be discontinued on June 30, 2022. After this date, the system will automatically clear all rules and risk data from the legacy version. Export and back up any required rules and risk data before this date.

  • Users of the legacy version must migrate to access the new features.

May 16, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), and China (Hong Kong)

All DataWorks users.

Risk Identification Management

April

Feature

Description

Release date

Region

Scope

References

DataStudio streamlines the data development workflow

  • You can directly click New Node, and the system will recommend recently used node types, eliminating the need to manually search for each required node.

  • The My Favorites feature lets you save frequently used nodes for quick access and collaborative editing.

  • The task status display in the directory tree is optimized. The Commit and Deploy buttons now appear next to uncommitted tasks for quick deployment.

April 7, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), and China (Hong Kong)

All DataWorks users.

DataStudio feature guide

Data Quality adds bulk management for global Quality Rules

  • You can now view all global Quality Rules in your workspace to perform batch operations, such as enabling, disabling, subscribing, associating with scheduling, and setting rule strength.

  • Combined with the existing option to configure rules based on a rule template, this feature lets you create and manage Quality Rules in bulk to address enterprise-wide data quality issues.

April 11, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), and China (Hong Kong)

All DataWorks users.

Configure a Quality Rule in Data Quality

Operation Center enhances Smart Baseline with flexible alert management

The Smart Baseline feature is upgraded to include the following capabilities:

  • It provides unified management of baselines, baseline instances, and events.

  • You can now configure separate alert rules for each baseline, with options for SMS, email, or phone notifications. You can also associate a baseline directly with a shift schedule to simplify owner management and reduce operational complexity.

April 26, 2022

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), and China (Hong Kong)

All DataWorks users.

Smart Baseline overview

March

Feature

Description

Release date

Region

Audience

References

Cross-workspace deployment and enhanced management

You can now deploy objects such as Tasks, Resources, and Functions between Workspaces. This feature simplifies deployment workflows across multiple Workspaces.

Mar 2, 2022

All regions

This feature is for users who require strong control over Deployments, such as those in the finance and public sectors.

Cross-workspace deployment

Audit logging for Data Analysis operations in ActionTrail

Data Analysis is now integrated with ActionTrail. The following operations are logged for security audits:

  • Running a MaxCompute SQL Statement.

  • Downloading SQL execution results.

  • Downloading a Workbook.

Mar 20, 2022

All regions

Available to all DataWorks users.

Enhanced governance leaderboard in Data Governance Center

The governance leaderboard in the Data Governance Center now provides the following capabilities:

  • Filter Governance Items by role.

  • Sort Governance Items and Check Items by different dimensions.

  • View governance rankings for all members in a Workspace.

  • View details of pending governance issues.

  • View historical trends for resource consumption and detailed MaxCompute usage.

Mar 21, 2022

All regions

Available to users participating in the invitation-only beta for the Data Governance Center.

View governance results

Performance optimization for Data Integration tasks

When you use an Exclusive Resource Group for Data Integration, you can now synchronize more than 1,000 Tables in a single Task. This improves the efficiency of real-time synchronization to MaxCompute and real-time synchronization to Hologres Tasks.

Mar 25, 2022

All regions

For users who need to synchronize a large number of Tables, such as those on SaaS platforms or in the finance sector.

Prepare a PostgreSQL environment

2021

December

Feature

Description

Release date

Region

References

Batch configuration of Data Quality rules using templates

DataWorks Data Quality now provides rule templates to simplify the bulk configuration of Data Quality rules. This feature lets you:

  • Select a table-level rule template to configure rules for multiple tables at the same time.

  • Select a field-level rule template to configure rules for multiple fields at the same time.

2021-12-14

All regions.

Configure rules in batches using templates

Resource usage analysis in Data Governance Center

The Data Governance Center now includes resource usage analysis to provide an overview, identify changes, and show details about resource consumption across four dimensions: MaxCompute storage consumption, MaxCompute computing consumption, DataWorks task scheduling consumption, and DataWorks batch synchronization consumption.

2021-12-09

All regions.

Data pivoting from the resource type perspective

November

Feature

Description

Release date

Region

References

Resource group orchestration in DataStudio

DataStudio now supports resource group orchestration. This feature allows you to change the resource group for scheduling for multiple nodes in a workflow at once. If you have multiple resource groups for scheduling in your workspace, you can use this feature to reassign them quickly and improve resource utilization.

2021-11-30

All regions.

Change resource groups for scheduling for nodes

Batch operations in DataStudio

DataWorks now supports batch operations on nodes, resources, and functions, such as changing the owner. You can also commit and deploy these changes to the production environment in a single operation.

2021-11-11

All regions.

Perform operations on multiple DataWorks objects at the same time

October

Feature

Description

Release date

Region

References

Data Modeling introduces reverse modeling and naming dictionary

  • The naming dictionary lets you manage the roots and morphemes of business terms, physical tables, and fields, as well as their standardized translations.

  • The reverse modeling feature lets you import models created in other tools into the dimensional modeling module of DataWorks.

2021-10-30

Available in public preview in the following regions: China (Beijing), China (Shanghai), China (Hangzhou), China (Shenzhen), China (Zhangjiakou), China (Chengdu), Singapore, US (Silicon Valley), Germany (Frankfurt), China (Hong Kong), China East 2 (Shanghai) Finance, and China South 1 (Shenzhen) Finance.

Code search in DataStudio

DataStudio now includes a Code search feature for finding code snippets in nodes by keyword. Search results list all matching nodes with their details. This is useful for tracing the source of data changes in a target table.

2021-10-27

All regions.

Code search

September

Feature

Description

Release date

Region

References

DataService Studio API assets integrated into Data Map

Data service APIs (including wizard-generated, scripted, and registered APIs) are now integrated into Data Map. This enables enterprise-wide API discovery and management, including global API search, popular API statistics, dedicated API detail pages, and viewing APIs by data source.

2021-09-30

All regions.

Introducing the new Data Governance Center

The Data Governance Center automatically discovers and quantifies data governance issues from global, workspace, and personal perspectives. It evaluates issues across storage, compute, development, Data Quality, and data security dimensions. The service uses a health score model and presents governance results through reports and leaderboards to help you effectively resolve issues and meet your governance goals.

2021-09-12

This feature is in public preview in the following regions: China (Shanghai), China (Hangzhou), China (Beijing), and China (Shenzhen).

Data Governance Center overview

August

Feature

Description

Release date

Region

References

Exclusive resource group for DataService Studio now available in new regions

The exclusive resource group for DataService Studio is now available in the China (Hangzhou) and China (Shanghai) regions. In use cases where API calls require high QPS and SLA guarantees, these dedicated resources ensure successful execution. An exclusive resource group for DataService Studio supports high-concurrency, frequent API calls with fast response times.

2021-08-06

China (Hangzhou) and China (Shanghai) regions.

Exclusive resource groups for DataService Studio

Migration Assistant is now commercially available

Migration Assistant helps you quickly migrate data development objects across different DataWorks editions, Alibaba Cloud accounts, regions, and workspaces. Migration Assistant supports migrating objects such as auto-triggered tasks, manually triggered tasks, resources, functions, data sources, table metadata, ad-hoc queries, and components. You can perform full, incremental, or custom exports based on your business needs.

2021-08-01

All regions.

Migration Assistant

July

Feature

Description

Release date

Region

References

Approval Center introduced for data governance

The DataWorks Approval Center is a new module for managing data authorization and sensitive operations. It provides core features like defining approval scopes and workflows to meet various enterprise compliance requirements.

2021-07-16

All regions.

Approval Center overview

Dispatching tasks to E-MapReduce (EMR) gateway nodes

DataWorks can now dispatch tasks to E-MapReduce (EMR) gateway nodes. You can use advanced task parameters to enable E-MapReduce (EMR) load balancing. Workspace-level settings for task submission will be available in a future release.

2021.07

All regions.

Create an EMR Hive node

June

Feature

Description

Release date

Region

References

Development and O&M for E-MapReduce (EMR) real-time tasks

DataWorks now supports development and O&M for E-MapReduce (EMR) Spark Streaming and EMR Streaming SQL tasks.

Supported capabilities include real-time task development, trial runs, deployment to the production environment, retries, status monitoring, starting/stopping/undeploying tasks, and alerting for task failures.

2021.06

All regions.

Create an EMR Spark Streaming node

One-click migration for E-MapReduce (EMR) data development tasks

Migration Assistant provides two methods for migrating workflows (nodes and scheduling configurations), manually triggered tasks, resources, and data sources from an E-MapReduce (EMR) cluster to a DataWorks workspace. You can view the migration progress, results, and reports in the Migration Assistant console.

2021.06

All regions.

Migrate EMR projects to DataWorks

Resource O&M in Operation Center with resource utilization monitoring

The Resource O&M feature monitors the usage of resource groups that run tasks in DataWorks.

2021-06-09

All regions.

Resource O&M

API generation from MaxCompute data sources in DataService Studio

DataService Studio now supports creating APIs directly from MaxCompute tables. These API calls leverage MaxCompute's query acceleration (MCQA) capability to execute queries within the engine, ensuring fast and efficient responses. Note: This feature is available only in an exclusive resource group for DataService Studio.

2021.06

All regions.

None

Alert Contacts in DataWorks

You can use the Alert Contacts page to add a RAM user or RAM role as an alert contact. When a task fails, DataWorks sends alert notifications to the specified contact, helping you identify and resolve issues promptly.

2021.06

All regions.

View and set alert contacts

May

Feature

Description

Release date

Region

References

Data Integration adds real-time synchronization to AnalyticDB for MySQL 3.0.

DataWorks now supports real-time synchronization from databases such as MySQL, OceanBase, and PolarDB to AnalyticDB for MySQL 3.0. You can perform an initial full synchronization and then start real-time incremental synchronization for continuous data updates. This feature also automatically handles schema changes, such as adding a new column to the source, by propagating the change to the destination table.

2021-05-25

All regions.

Resource planning and configuration

The Open Message service is now in public preview.

DataWorks now provides the Open Message service. You can enable the message subscription feature in the DataWorks Open Platform. This feature is available for free to DataWorks Enterprise Edition users during the public preview. You can use Open Message to get metadata and task change events from DataWorks, enabling deep integration with your own systems.

2021-05-21

China (Beijing), China (Hangzhou), China (Shenzhen), and China (Shanghai) regions.

Overview of OpenEvent

Task scheduling adds yearly and end-of-month cycles

Task scheduling for auto-triggered tasks now supports new yearly and end-of-month cycles. You can now schedule tasks to run annually, quarterly, or on the last day of a specified month. DataWorks supports scheduling cycles of minute, hour, day, week, month, and year.

2021-05-19

All regions.

Configure time properties

DataWorks now supports the ClickHouse engine

DataWorks now provides ETL operations and management capabilities for the ClickHouse engine, including Data Integration, DataStudio, task scheduling, and task O&M.

  • You can now associate a ClickHouse cluster with a workspace by using the E-MapReduce (EMR) instance mode or a JDBC connection string. You can also add a ClickHouse data source by using a JDBC connection string.

  • You can now use Data Integration to read data from or write data to ClickHouse.

  • DataWorks provides a ClickHouse SQL node, which uses a distributed SQL query engine to process structured data for more efficient job execution.

2021-05-15

All regions.

April

Feature

Description

Release date

Region

References

Data Integration adds multi-table real-time synchronization to AnalyticDB for MySQL 3.0.

You can now create a task to synchronize real-time data from multiple tables to an AnalyticDB for MySQL 3.0 data source in a single run. This feature automatically supports DDL synchronization for new columns, meaning columns added to the source are also added to the destination table.

2021-04-20

All regions.

Create a real-time synchronization solution to synchronize data to AnalyticDB for MySQL 3.0

DataStudio adds the FTP Check node

The FTP Check node periodically checks for a specified file via FTP. If the file exists, its descendant nodes are triggered. Otherwise, the node retries at a configured interval until a stop condition is met. This node is typically used to signal between the DataWorks scheduling system and other scheduling systems.

2021-04-15

China (Beijing), China (Shanghai), China (Hangzhou), China (Shenzhen), China (Zhangjiakou), China (Chengdu), and Singapore.

FTP Check node

March

Feature

Description

Release date

Region

References

Custom roles in DataWorks Enterprise Edition

DataWorks Enterprise Edition now supports custom roles. This feature allows you to create roles with specific permissions tailored to your business requirements.

2021-03-22

All regions.

Workspace-level module permission control

Kerberos authentication in Data Integration

Data Integration now supports Kerberos authentication via file upload. For data sources like Hive and Kafka that require Kerberos, you can upload the necessary authentication files during configuration to ensure secure access.

2021-03-16

All regions.

Appendix: Configure Kerberos authentication

New Security Center in the DataWorks console

The new Security Center is now available in the DataWorks console. Security Center helps you quickly build security capabilities for data content and personal privacy, meeting various enterprise compliance requirements such as auditing. You can use this feature without any additional configuration.

2021-03-13

All regions.

Overview

DAG aggregation view and ancestor/descendant analysis in Operation Center

Operation Center introduces a new DAG aggregation view and ancestor/descendant analysis. You can now aggregate nodes in a DAG by dimensions such as workspace, owner, or priority to view the total node count. You can also analyze the ancestor nodes and descendant nodes for a specific node to quickly locate blockers and understand the overall task execution status.

2021-03-10

China (Shenzhen) region.

Manage auto-triggered tasks

February

Feature

Description

Release date

Region

References

Batch creation of metadata crawlers in Data Discovery

The Data Map > data discovery feature now supports creating multiple metadata crawlers at once. This helps you quickly visualize table structures and their relationships within Data Map.

2021-02-17

All regions.

Collect metadata from an EMR data source

Migration Assistant now supports Airflow

Migration Assistant now supports migrating tasks from the Airflow scheduling system to DataWorks.

2021-02-16

All regions.

Export tasks from open-source engines

Metering API in DataService Studio

DataService Studio introduces a metering API, which includes a metering dashboard and detailed statistics. This provides various charts and metrics, such as the total number of APIs and calls in a workspace, giving you a global view of API usage. You can also view monitoring charts for individual APIs to get detailed information like API gateway status codes and DataService Studio error codes.

2021-02-16

China (Beijing).

Open Platform in the DataWorks console

The Open Platform is now available in the DataWorks console. It provides metering reports for the OpenAPI, allowing you to view detailed call information for a specified date.

2021-02-13

All regions.

DataWorks Open Platform

January

Feature

Description

Release date

Region

References

Support for HTTP data sources in Data Integration

Data Integration can now read from and write to HTTP data sources in batch mode, a feature designed for sources that expose data only through a REST API.

2021-01-04

All regions.

RestAPI Reader

2020

December

Feature

Description

Release date

Region

References

Full and incremental data synchronization to Elasticsearch

This feature enables one-time, full synchronization of all or specific tables in a database to Elasticsearch. It also supports real-time, incremental synchronization of new data.

2020-12-30

All regions

Synchronize an entire MySQL database to Elasticsearch

September

Feature

Description

Release date

Region

References

Real-time synchronization in Data Integration

Data Integration now supports real-time data synchronization. This feature synchronizes data changes from selected or all tables in a source database to a destination database in real time, ensuring data consistency. This solution provides both full and incremental synchronization.

2020-09-15

All regions

July

Feature

Description

Release date

Region

References

Public preview of OpenAPI

DataWorks now provides OpenAPI for multiple modules, including Tenants, Metadata, DataStudio, Operation Center, Data Quality, and DataService Studio.

Note

You must subscribe to DataWorks Enterprise Edition or a more advanced edition to use OpenAPI.

2020-07-16

China (Hangzhou), China (Shanghai), China (Shenzhen), China (Beijing), and China (Zhangjiakou)

Overview of DataWorks OpenAPI

Public preview of Migration Assistant

Migration Assistant supports migrating objects such as Auto-triggered tasks, Manually triggered tasks, resources, functions, data sources, table metadata, ad hoc queries, and components. You can export your DataWorks objects in full, incrementally, or through a custom selection to meet your business requirements.

2020-07-01

China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Chengdu), and Singapore

Migration Assistant

Upgrade of DataService Studio

DataService Studio now features a new directory tree structure.

2020-07-28

  • Functions and filters are available only in the China (Shanghai) region and require DataWorks Professional Edition or a higher edition.

  • Service orchestration is available only in the China (Shanghai) region and requires DataWorks Enterprise Edition or a higher edition.

DataService Studio

June

Feature

Description

Release date

Region

References

Data source query

When editing a Workbook, use the Data source query feature to quickly read and analyze data.

2020-06-09

China (Shanghai)

Analyze data

April

Feature

Description

Release date

Region

References

Phone call alerts in Operation Center

Operation Center now supports three alert methods: text message, email, and phone call.

Important

Phone call alerts require DataWorks Professional Edition or a higher edition.

2020-04-15

All regions

Create a custom alert rule