This topic describes the change history of DataWorks documentation. You can learn the new features and feature changes of DataWorks.

Note DataWorks can be automatically upgraded, and the upgrade has no impact on existing users.

Changes in June 2020

Date Feature Change type Description Documentation
June 30, 2020 DataWorks services Experience optimization The FAQ about DataWorks services is provided, including Data Integration, Data Analytics, custom resource groups, exclusive resource groups, dependencies, Intelligent Monitor, and DataService Studio. FAQ
June 28, 2020 Exclusive resource group New feature A topic is added to describe how to add a route for your exclusive resource group to access a data store in a virtual private cloud (VPC) or a data center. Add a route
June 28, 2020 Data migration Experience optimization A best practice is added to describe how to use an exclusive resource group for Data Integration to migrate data from a user-created MySQL database on an Elastic Compute Service (ECS) instance to MaxCompute. Migrate data from a user-created MySQL database on an ECS instance to MaxCompute
June 28, 2020 Data analysis Experience optimization A best practice is added to describe how to use Artificial Intelligence Recommendation to provide a personalized recommendation service for developers to increase the customer purchase rate and order conversion rate. This feature is based on the cutting edge big data and artificial intelligence (AI) technologies of Alibaba, and years of experience in the e-commerce industry. Intelligently recommend items on e-commerce websites
June 28, 2020 Data security Experience optimization A best practice is added to describe how to grant access to a specific user-defined function (UDF) only to a specified user. This best practice involves data encryption and decryption algorithms. It relates to data security. Grant access to a specific UDF to a specified user
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to build a data warehouse for an enterprise based on AnalyticDB for MySQL, and use the data warehouse to perform O&M and manage metadata. Build a data warehouse for an enterprise based on AnalyticDB for MySQL
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to use a PyODPS node in DataWorks to segment Chinese text based on Jieba, an open-source segmentation tool, and write the segmented words and phrases to a new table. After you read this topic, you will also know how to use closure functions to segment Chinese text based on a custom dictionary. Use a PyODPS node to segment Chinese text based on Jieba
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to use a PyODPS node that is running on an exclusive resource group to send emails. Use a PyODPS node to send emails
June 28, 2020 Data analytics Experience optimization A best practice is added to describe how to connect DataV to DataWorks DataService Studio and then call API operations of DataService Studio to obtain data from MaxCompute. DataV can display the data analysis results on dashboards. Connect DataV to DataWorks DataService Studio
June 28, 2020 Data migration Experience optimization A best practice is added to describe how to automatically synchronize Internet of Things (IoT) data to the cloud. The IoT is a network that carries data based on the Internet and traditional telecommunication networks and it enables connections among all physical objects that are independently addressable. Automatically synchronize IoT data to the cloud
June 16, 2020 Data quality Experience optimization A tutorial is added to describe how to guarantee data quality. Data quality is the basis for effective and accurate data analysis. Overview
June 16, 2020 DataWorks services Experience optimization A tutorial is added to describe how to use big data services of Alibaba Cloud to build an online operation analysis platform. Business scenario and development process
June 15, 2020 Data Integration New feature A topic is added to describe how to create, commit, and manage real-time sync nodes. Create, commit, and manage real-time sync nodes
June 15, 2020 PolarDB Reader New feature A topic is added to describe how to configure PolarDB Reader. PolarDB Reader can only read data from PolarDB for MySQL databases but not from PolarDB for PostgreSQL databases. ApsaraDB for POLARDB reader
June 15, 2020 MaxCompute Writer New feature A topic is added to describe how to configure MaxCompute Writer. MaxCompute Writer allows you to import large amounts of data to MaxCompute for fast computing. MaxCompute writer
June 15, 2020 Hologres Writer New feature A topic is added to describe how to configure Hologres Writer. Hologres Writer allows you to build a real-time data warehouse with the real-time writing capability of Hologres. Hologres writer
June 15, 2020 ApsaraDB for OceanBase connection New connection A topic is added to describe how to configure an ApsaraDB for OceanBase connection. An ApsaraDB for OceanBase connection allows you to read data from and write data to ApsaraDB for OceanBase by using ApsaraDB for OceanBase Reader and Writer. You can configure sync nodes for ApsaraDB for OceanBase by using the codeless user interface (UI) or code editor. Configure an ApsaraDB for OceanBase connection
June 15, 2020 Vertica connection New connection A topic is added to describe how to configure a Vertica connection. A Vertica connection allows you to read data from and write data to Vertica by using Vertica Reader and Writer. You can configure sync nodes for Vertica by using the codeless UI or code editor. Configure a Vertica connection
June 15, 2020 Gbase8a Reader New reader or writer A topic is added to describe the data types and parameters supported by Gbase8a Reader and how to configure it by using the code editor. Gbase8a Reader
June 15, 2020 Hologres Reader New reader or writer A topic is added to describe the parameters supported by Hologres Reader and how to configure it by using the codeless UI and code editor. You can use Hologres Reader to read data from the tables of Hologres data stores and write the data to other types of data stores. Hologres Reader
June 15, 2020 TSDB Reader New reader or writer A topic is added to describe the data types and parameters supported by Time Series Database (TSDB) Reader and how to configure it by using the code editor. TSDB Reader
June 15, 2020 Hologres Writer New reader or writer A topic is added to describe the working mechanism and parameters supported by Hologres Writer and how to configure it by using the codeless UI and code editor. Hologres Writer allows you to import data from multiple data stores to Hologres for real-time data analysis. Hologres Writer
June 15, 2020 Resource group configuration New configuration A topic is added to describe how to configure a resource group for node scheduling. You can configure the resource group for scheduling a node when you configure the scheduling properties of the node. Configure the resource group
June 15, 2020 Scheduling dependency logic Experience optimization A topic is added to describe the logic of scheduling dependencies. To guarantee that business data is effectively produced in a timely manner, you must configure correct dependencies for nodes. Logic of scheduling dependencies
June 15, 2020 Exclusive resource group for scheduling New feature A topic is added to describe how to add and use exclusive resource groups for scheduling. DataWorks allows you to bind an exclusive resource group for scheduling to a VPC so that the resource group can access data stores in the VPC. Add and use exclusive resource groups for scheduling

Changes in May 2020

Date Feature Change type Description Documentation
May 27, 2020 Resource group usage Experience optimization A topic is added to describe the scenarios and methods of using the default resource group, exclusive resource groups, and custom resource groups in DataWorks. DataWorks resource groups
May 27, 2020 Report template New feature A topic is added to describe how to manage report templates. You can create a template of data quality reports on the Report Template Management page. DataWorks Data Quality can periodically generate and send data quality reports based on the template. Manage report templates
May 27, 2020 Rule template New feature A topic is added to describe how to manage rule templates. In Data Quality, you can manage a set of custom rule templates and use the rule templates to improve the efficiency of rule configuration. Manage rule templates
May 27, 2020 Built-in rule template New feature A topic is added to describe the verification logic of Data Quality and the built-in rule templates provided for monitoring offline data. Built-in rule templates for offline data

Changes in April 2020

Date Feature Change type Description Documentation
April 19, 2020. Operation Center Upgrade in DataWorks V3.0 Topics are added to describe how to use Operation Center. In Operation Center, you can view the dashboard, manage auto triggered nodes and manually triggered nodes, and monitor nodes. Operation Center
April 18, 2020 Data warehouse Upgrade in DataWorks V3.0 Topics are added to describe the overall process of building a MaxCompute data warehouse. Build and optimize a data warehouse
April 18, 2020 Data Integration Upgrade in DataWorks V3.0 Topics are added to describe how to use Data Integration. Data Integration is a stable, efficient, and scalable data synchronization service. It is designed to migrate and synchronize data between a wide range of heterogeneous data stores fast and stably in complex network environments. Data Integration
April 8, 2020 Plug-ins for real-time synchronization Upgrade in DataWorks V3.0 A topic is added to describe the readers, writers, and transformation plug-ins supported for real-time data synchronization. Supported data stores
April 8, 2020 Data analytics and O&M Upgrade in DataWorks V3.0 A quick start is added to guide you through a complete process of data analytics and O&M. Overview
April 8, 2020 DataWorks services Upgrade in DataWorks V3.0 Topics are added to provide an overview of DataWorks, including the basic concepts, usage scenarios, and data analytics processes. What is DataWorks?

Changes in March 2020

Date Feature Change type Description Documentation
March 26, 2020 DataWorks for E-MapReduce Experience optimization A tutorial is added to describe how to use E-MapReduce in DataWorks for data computing. DataWorks for EMR Workshop
March 17, 2020 Data analytics Upgrade in DataWorks V3.0 A topic is added to describe the upgraded data analytics mode. In the upgraded data analytics mode, you can group multiple workflows in a solution of a workspace. Solution
March 17, 2020 Node types Upgrade in DataWorks V3.0 Topics are added to describe how to create various types of nodes in DataWorks, including batch sync nodes, MaxCompute nodes, E-MapReduce nodes, general nodes, and custom nodes. Node types
March 17, 2020 Setup Upgrade in DataWorks V3.0 Topics are added to describe how to configure various objects in DataStudio. For example, you can configure code templates, table folders, and table levels. Setup
March 2, 2020 DataWorks console Upgrade in DataWorks V3.0 Topics are added to provide an overview of the DataWorks console. You can view the workspaces, resource groups, and computing engines in the DataWorks console. DataWorks console overview

Changes in February 2020

Date Feature Change type Description Documentation
February 29, 2020 Data migration Experience optimization A best practice is added to describe how to use the data synchronization feature of DataWorks to migrate data from Oracle to MaxCompute. Migrate data from Oracle to MaxCompute
February 28, 2020 App Studio New feature A topic is added to describe the Projects page in App Studio. You can create and manage projects on the Projects page. Projects
February 28, 2020 App Studio New feature A topic is added to describe the Apps page in App Studio. You can view applications that are created by you, shared by you, and shared to you on the Apps page. Apps
February 28, 2020 App Studio New feature A topic is added to describe the Templates page in App Studio. You can view all templates that are created based on projects on the Templates page. Templates
February 28, 2020 App Studio New feature A topic is added to describe how to create an application in App Studio and deploy it in the production environment to make it accessible on the Internet. App deployment
February 2, 2020 DataAnalysis New feature Topics are added to describe how to use the DataAnalysis service. DataAnalysis allows you to collaboratively edit and analyze workbooks, manage MaxCompute tables by using dimension tables, and generate and share visual reports. DataAnalysis

Changes in December 2019

Date Feature Change type Description Documentation
December 31, 2019 De-identification New feature A topic is added to describe how to customize de-identification rules in Data Security Guard so that DataWorks can dynamically de-identify the results of ad hoc queries. Customize desensitization rules

Changes in October 2019

Date Feature Change type Description Documentation
October 31, 2019 Requirements Management New feature Topics are added to describe how to use the Requirements Management service. The Requirements Management service helps users of Alibaba Cloud big data services develop data in a standardized and cost-effective way. Requirements Management

Changes in August 2019

Date Feature Change type Description Documentation
August 21, 2019 WYSIWYG designer Experience optimization Topics are added to describe how to use the WYSIWYG designer. The WYSIWYG designer of App Studio is provided to assist in developing frontend pages. It provides common web page components that allow you to create frontend pages by using simple drag-and-drop operations. WYSIWYG designer
August 21, 2019 Bill details Experience optimization A topic is added to describe how to view the details of each billing item for DataWorks that you activated in the pay-as-you-go mode. View spending details