This topic describes the basic scenarios of DataWorks modules.

Usage description

ModuleDescriptionReferences
Data ModelingDataWorks Data Modeling allows you to plan and design a data warehouse, formulate and summarize data standards, perform dimensional modeling, and define data metrics. Data Modeling is used to structure and manage huge amount of disordered and complex data. You can use Data Modeling to build a data warehouse from scratch. You can also use Data Modeling to generate standard data warehouse models based on existing data tables in an efficient manner, which resolves the cold start issue of data warehouses. Overview of DataWorks Data Modeling
Data IntegrationDataWorks Data Integration supports data synchronization in complex network environments. You can create a batch synchronization node on the DataStudio page to periodically synchronize offline data or create a real-time synchronization node on the DataStudio page to synchronize incremental data from a single table or a database in real time. DataWorks allows you to create various data synchronization solutions in Data Integration, such as a data synchronization solution used to synchronize both full and incremental data and a data synchronization solution used to batch synchronize data from a database. Overview
DataStudio and Operation CenterDataWorks DataStudio allows you to develop data based on nodes. Overview
DataWorks Operation Center allows you to manage, monitor, and perform O&M operations on nodes in the production environment. Overview
DataAnalysisDataWorks DataAnalysis allows you to analyze, edit, and share data online. Overview of DataAnalysis
Data GovernanceDataWorks Data Governance Center is a platform used to detect and govern issues.
  • Issue detection: Data Governance Center can automatically detect issues that are brought about when you use DataWorks. Data Governance Center provides health scores based on the health assessment model and visualizes the governance results from multiple perspectives. This helps you effectively resolve governance issues and achieve governance goals.
  • In terms of cost governance, Data Governance Center provides features such as node resource consumption details, overall trend of resource consumption, and cost estimation for a single node. These features help you effectively optimize resource utilization and reduce costs of various types of resources.
Overview
DataWorks Data Quality provides 35 built-in table-level and field-level rule templates to monitor all fields or a specific field in a table. You can also create rule templates based on your business requirements. Data Quality allows you to create monitoring rules to detect changes in source data and dirty data that is generated during the extract, transform, and load (ETL) process at the earliest opportunity. Data Quality blocks the execution of nodes that involve dirty data and effectively stops the spread of dirty data to descendant nodes. Overview
DataWorks DataMap is a module used to manage data directories of enterprises based on metadata. The module provides various features, such as globally searching for data, viewing the details of metadata, previewing data, viewing data lineage, and managing data categories. DataMap can help you search for, understand, and use data. Overview
DataWorks Security Center allows you to build a security system that can secure data and personal privacy in an efficient manner. Security Center can meet various security requirements, such as auditing, control for data access behaviors, and control for sensitive behaviors in high-risk scenarios. You can use Security Center without the need to perform additional configurations. Overview of Security Center
DataWorks Data Security Guard is a module that ensures data security. The module provides various features, such as identifying and masking sensitive data, adding watermarks to data, managing data permissions, identifying and auditing data risks, and tracing leak sources. Overview of Data Security Guard
DataService StudioDataWorks DataService Studio provides comprehensive data sharing capabilities. The module implements data value generation and data sharing and openness from various aspects such as processing of API publish requests, authorization management, calculation of the number of API calls, and resource isolation. You can register APIs based on data sources or register existing APIs in DataService Studio. Overview of DataService Studio
Open PlatformDataWorks Open Platform provides the OpenAPI, OpenEvent, and Extensions modules. You can use the modules to integrate DataWorks with your applications and subscribe to event messages. These modules facilitate process management of data processing, data governance, and data O&M, and allow you to identify important changes in DataWorks and respond to the changes at the earliest opportunity. Overview
Migration AssistantDataWorks Migration Assistant allows you to migrate jobs of open source scheduling engines to DataWorks. Migration Assistant also allows you to migrate data objects within DataWorks across clouds, regions, or accounts. This way, you can quickly clone and deploy jobs in DataWorks. To quickly migrate data and jobs to the cloud, you can obtain help from the DataWorks team and the big data service team of Alibaba Cloud. Overview of Migration Assistant