This topic describes the change history of DataWorks documentation. You can learn the new features and feature changes of DataWorks.

Note DataWorks can be automatically updated, and the update has no impact on existing users.

Changes in August 2020

Date Feature Change type Description Documentation
August 07, 2020 Custom resource group for scheduling Experience optimization A topic is added to describe how to create a custom resource group for scheduling and change the resource group for a node to the created custom resource group for scheduling. DataWorks provides you with custom resource groups for scheduling and custom resource groups for Data Integration to ensure the flexibility of node scheduling and the timeliness of data synchronization. Create custom resource groups for scheduling
August 07, 2020 Hive connection New connection A topic is added to describe how to configure a Hive connection. A Hive connection allows you to read data from and write data to Hive by using Hive Reader and Writer. You can use the codeless user interface (UI) or code editor to configure sync nodes for Hive. Configure a Hive connection
August 07, 2020 Gbase8a connection New connection A topic is added to describe how to configure a Gbase8a connection. A Gbase8a connection allows you to read data from and write data to Gbase8a by using Gbase8a Reader and Writer. You can use the codeless UI or code editor to configure sync nodes for Gbase8a. Configure a GBase 8a connection
August 07, 2020 Hologres connection New connection A topic is added to describe how to configure a Hologres connection. A Hologres connection allows you to read data from and write data to Hologres by using Hologres Reader and Writer. You can use the codeless UI or code editor to configure sync nodes for Hologres. Configure a Hologres connection
August 07, 2020 HBase connection New connection A topic is added to describe how to configure an HBase connection. An HBase connection allows you to read data from and write data to HBase by using HBase Reader and Writer. You can use the code editor to configure sync nodes for HBase. Configure an HBase connection
August 07, 2020 Elasticsearch connection New connection A topic is added to describe how to configure an Elasticsearch connection. An Elasticsearch connection allows you to read data from and write data to Elasticsearch by using Elasticsearch Reader and Writer. You can use the code editor to configure sync nodes for Elasticsearch. Configure an Elasticsearch connection
August 07, 2020 Connections Experience optimization A topic is added to describe how to troubleshoot issues related to connectivity, parameters, and permissions when you create connections in DataWorks. Troubleshooting for connections
August 07, 2020 EMR Presto node New feature A topic is added to describe how to create an EMR Presto node. EMR Presto nodes allow you to perform interactive analysis and query on large-scale structured and unstructured data. EMR Presto node
August 05, 2020 SDK for Java New feature A topic is added to describe how to install Alibaba Cloud DataWorks SDK for Java by using Maven dependencies. Install the Alibaba Cloud SDK for Java
August 05, 2020 Release notes of key features Experience optimization A topic is added to describe the release notes of key DataWorks features. Release notes of key features

Changes in June 2020

Date Feature Change type Description Documentation
June 30, 2020 DataWorks services Experience optimization Topics are added to describe the FAQ about DataWorks services, including Data Integration, Data Analytics, custom resource groups, exclusive resource groups, dependencies, Intelligent Monitor, and DataService Studio. FAQ
June 28, 2020 Exclusive resource group New feature A topic is added to describe how to add a route for your exclusive resource group to connect to a data store in a VPC or a data center. Add a route
June 28, 2020 Data migration Experience optimization A best practice is added to describe how to use an exclusive resource group for Data Integration to migrate data from a user-created MySQL database on an Elastic Compute Service (ECS) instance to MaxCompute. Migrate data from a user-created MySQL database on an ECS instance to MaxCompute
June 28, 2020 Data analysis Experience optimization A best practice is added to describe how to use Artificial Intelligence Recommendation to provide a personalized recommendation service for developers to increase the customer purchase rate and order conversion rate. This feature is based on the cutting edge big data and artificial intelligence (AI) technologies of Alibaba, and years of experience in the e-commerce industry. Intelligently recommend items on e-commerce websites
June 28, 2020 Data security Experience optimization A best practice is added to describe how to grant access to a specific user-defined function (UDF) only to a specified user. This best practice involves data encryption and decryption algorithms. It relates to data security. Grant access to a specific UDF to a specified user
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to build a data warehouse for an enterprise based on AnalyticDB for MySQL, and use the data warehouse to perform O&M and manage metadata. Build a data warehouse for an enterprise based on AnalyticDB for MySQL
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to use a PyODPS node in DataWorks to segment Chinese text based on Jieba, an open-source segmentation tool, and write the segmented words and phrases to a new table. After you read this topic, you will also know how to use closure functions to segment Chinese text based on a custom dictionary. Use a PyODPS node to segment Chinese text based on Jieba
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to use a PyODPS node that is run on an exclusive resource group to send emails. Use a PyODPS node to send emails
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to connect DataV to DataWorks DataService Studio and then call API operations of DataService Studio to obtain data from MaxCompute. DataV can display the data analysis results on dashboards. Connect DataV to DataWorks DataService Studio
June 28, 2020 Data migration Experience optimization A best practice is added to describe how to automatically synchronize Internet of Things (IoT) data to the cloud. The IoT is a network that carries data based on the Internet and traditional telecommunication networks and it enables connections among all physical objects that are independently addressable. Automatically synchronize IoT data to the cloud
June 16, 2020 DataWorks services Experience optimization A tutorial is added to describe how to use big data services of Alibaba Cloud to build an online operation analysis platform. Business scenarios and development process
June 15, 2020 Data Integration New feature A topic is added to describe how to create, commit, and manage real-time sync nodes. Create, commit, and manage real-time sync nodes
June 15, 2020 PolarDB Reader New reader or writer A topic is added to describe how to configure PolarDB Reader. PolarDB Reader can read data only from PolarDB for MySQL databases but not from PolarDB for PostgreSQL databases. ApsaraDB for PolarDB reader
June 15, 2020 MaxCompute Writer New reader or writer A topic is added to describe how to configure MaxCompute Writer. MaxCompute Writer allows you to import large amounts of data to MaxCompute for fast computing. MaxCompute writer
June 15, 2020 Hologres Writer New reader or writer A topic is added to describe how to configure Hologres Writer. Hologres Writer allows you to build a real-time data warehouse based on the real-time writing capability of Hologres. Hologres writer
June 15, 2020 ApsaraDB for OceanBase connection New connection A topic is added to describe how to configure an ApsaraDB for OceanBase connection. An ApsaraDB for OceanBase connection allows you to read data from and write data to ApsaraDB for OceanBase by using ApsaraDB for OceanBase Reader and Writer. You can use the code editor to configure sync nodes for ApsaraDB for OceanBase. Configure an ApsaraDB for OceanBase connection
June 15, 2020 Vertica connection New connection A topic is added to describe how to configure a Vertica connection. A Vertica connection allows you to read data from and write data to Vertica by using Vertica Reader and Writer. You can use the code editor to configure sync nodes for Vertica. Configure a Vertica connection
June 15, 2020 Gbase8a Reader New reader or writer A topic is added to describe the data types and parameters that Gbase8a Reader supports and how to configure it by using the code editor. Gbase8a Reader
June 15, 2020 Hologres Reader New reader or writer A topic is added to describe the parameters that Hologres Reader supports and how to configure it by using the codeless UI and code editor. You can use Hologres Reader to read data from the tables of Hologres data stores and write the data to other types of data stores. Hologres Reader
June 15, 2020 TSDB Reader New reader or writer A topic is added to describe the data types and parameters that Time Series Database (TSDB) Reader supports and how to configure it by using the code editor. TSDB Reader
June 15, 2020 Hologres Writer New reader or writer A topic is added to describe the working mechanism and parameters that Hologres Writer supports and how to configure it by using the codeless UI and code editor. Hologres Writer allows you to import data from multiple data stores to Hologres for real-time data analysis. Hologres Writer
June 15, 2020 Resource group configuration New configuration A topic is added to describe how to configure a resource group for node scheduling. You can configure the resource group for scheduling a node when you configure the scheduling properties of the node. Configure the resource group
June 15, 2020 Scheduling dependency logic Experience optimization A topic is added to describe the logic of scheduling dependencies. To ensure that business data is effectively produced in a timely manner, you must configure correct dependencies for nodes. Logic of scheduling dependencies
June 15, 2020 Exclusive resource group for scheduling New feature A topic is added to describe how to create and use exclusive resource groups for scheduling. DataWorks allows you to bind an exclusive resource group for scheduling to a VPC so that the resource group can connect to data stores in the VPC. Add and use exclusive resource groups for scheduling

Changes in May 2020

Date Feature Change type Description Documentation
May 27, 2020 Resource group usage Experience optimization A topic is added to describe the scenarios and methods of using shared resource groups, exclusive resource groups, and custom resource groups in DataWorks. DataWorks resource groups
May 27, 2020 Report template New feature A topic is added to describe how to manage report templates. You can create a template of data quality reports on the Report Template Management page. DataWorks Data Quality can periodically generate and send data quality reports based on the template. Manage report templates
May 27, 2020 Rule template New feature A topic is added to describe how to manage rule templates. In Data Quality, you can manage a set of custom rule templates and use the rule templates to improve the efficiency of rule configuration. Manage rule templates
May 27, 2020 Built-in rule template New feature A topic is added to describe the verification logic of Data Quality and the built-in rule templates that DataWorks provides for monitoring offline data. Built-in rule templates for offline data

Changes in April 2020

Date Feature Change type Description Documentation
April 19, 2020 Operation Center Update in DataWorks V3.0 Topics are added to describe how to use Operation Center. In Operation Center, you can view the dashboard, manage auto triggered nodes and manually triggered nodes, and monitor nodes. Operation Center
April 18, 2020 Data warehouse Update in DataWorks V3.0 Topics are added to describe the overall process of building a MaxCompute data warehouse. Build and optimize a data warehouse
April 18, 2020 Data Integration Update in DataWorks V3.0 Topics are added to describe how to use Data Integration. Data Integration is a stable, efficient, and scalable data synchronization service. It is designed to migrate and synchronize data between a wide range of heterogeneous data stores fast and stably in complex network environments. Data Integration
April 08, 2020 Plug-ins for real-time synchronization Update in DataWorks V3.0 A topic is added to describe the readers, writers, and transformation plug-ins that DataWorks supports for real-time data synchronization. Supported data stores
April 08, 2020 Data analytics and O&M Update in DataWorks V3.0 A quick start is added to guide you through a complete process of data analytics and O&M. Overview
April 08, 2020 DataWorks services Update in DataWorks V3.0 Topics are added to provide an overview of DataWorks, including the basic concepts, usage scenarios, and data analytics processes. What is DataWorks?

Changes in March 2020

Date Feature Change type Description Documentation
March 26, 2020 DataWorks for E-MapReduce Experience optimization A tutorial is added to describe how to use E-MapReduce in DataWorks for data computing. DataWorks for EMR Workshop
March 17, 2020 Data analytics Update in DataWorks V3.0 A topic is added to describe the updated data analytics mode. In updated data analytics mode, you can group multiple workflows in a solution of a workspace. Solution
March 17, 2020 Node types Update in DataWorks V3.0 Topics are added to describe how to create various types of nodes in DataWorks, including batch sync nodes, MaxCompute nodes, EMR nodes, general nodes, and custom nodes. Node types
March 17, 2020 Setup Update in DataWorks V3.0 Topics are added to describe how to configure various objects in DataStudio. For example, you can configure code templates, table folders, and table levels. Setup
March 02, 2020 DataWorks console Update in DataWorks V3.0 Topics are added to provide an overview of the DataWorks console. You can view the workspaces, resource groups, and compute engines in the DataWorks console. DataWorks console overview

Changes in February 2020

Date Feature Change type Description Documentation
February 28, 2020 App Studio New feature A topic is added to describe the Projects page in App Studio. You can create and manage projects on the Projects page. Projects
February 28, 2020 App Studio New feature A topic is added to describe the Apps page in App Studio. You can view applications that are created by you, shared by you, and shared to you on the Apps page. Apps
February 28, 2020 App Studio New feature A topic is added to describe the Templates page in App Studio. You can view all templates that are created based on projects on the Templates page. Templates
February 28, 2020 App Studio New feature A topic is added to describe how to create an application in App Studio and deploy it in the production environment to make it accessible on the Internet. App deployment
February 02, 2020 DataAnalysis New feature Topics are added to describe how to use the DataAnalysis service. DataAnalysis allows you to collaboratively edit and analyze workbooks, manage MaxCompute tables by using dimension tables, and generate and share visual reports. DataAnalysis

Changes in December 2019

Date Feature Change type Description Documentation
December 31, 2019 De-identification New feature A topic is added to describe how to customize de-identification rules in Data Security Guard so that DataWorks can dynamically de-identify the results of ad hoc queries. Customize desensitization rules

Changes in October 2019

Date Feature Change type Description Documentation
October 31, 2019 Requirements Management New feature Topics are added to describe how to use the Requirements Management service. The Requirements Management service helps users of Alibaba Cloud big data services develop data in a standardized and cost-effective way. Requirements Management

Changes in August 2019

Date Feature Change type Description Documentation
August 21, 2019 WYSIWYG designer Experience optimization Topics are added to describe how to use the WYSIWYG designer. The WYSIWYG designer of App Studio is provided to assist in developing frontend pages. It provides common web page components that allow you to create frontend pages by using simple drag-and-drop operations. WYSIWYG designer
August 21, 2019 Bill details Experience optimization A topic is added to describe how to view the details of each billing item for DataWorks that you activated in pay-as-you-go mode. View spending details