This topic introduces Alibaba Cloud DataWorks, including its features and limits.

DataWorks is an important platform as a service (PaaS) of Alibaba Cloud. It offers all-around services, such as Data Integration, DataStudio, Data Map, Data Quality, and DataService Studio. In addition, it provides an end-to-end data development and management console to help enterprises mine and explore data value.

DataWorks supports multiple types of compute and storage engines, such as MaxCompute, E-MapReduce (EMR), Realtime Compute for Apache Flink, Machine Learning Platform for AI, Graph Compute, and Hologres. It also allows you to use custom computing and storage services. DataWorks provides end-to-end big data services and allows you to perform artificial intelligence (AI) development and data governance.

DataWorks simplifies operations such as data transmission, conversion, and integration. You can import data from various data sources, convert, develop, and process the data, and then transmit the data to other data systems. Architecture

Limits

DataWorks supports only Google Chrome 69 or later.

Features

  • DataWorks is hosted on the cloud.
    • DataWorks provides powerful scheduling capabilities. For more information, see Schedule.
      • In DataWorks, nodes can be triggered by time- or dependency-based scheduling configurations. For more information, see Configure time properties and Instructions to configure scheduling dependencies.
      • DataWorks enables tens of millions of nodes to accurately run on time every day based on node relationships in directed acyclic graphs (DAGs).
      • DataWorks allows you to run nodes at custom intervals in minutes, hours, days, weeks, or months.
    • DataWorks is a fully managed service that frees you from server deployment.
    • DataWorks provides the isolation feature to ensure that nodes of different tenants do not affect each other.
  • DataWorks supports multiple types of nodes, such as batch synchronization node, Shell node, MaxCompute SQL node, and MaxCompute MR node. It analyzes and processes complex data based on the dependencies between nodes.
    • Data conversion: DataWorks ensures superior performance in analyzing and processing big data based on the powerful computing capabilities of MaxCompute.
    • Data integration: DataWorks supports more than 20 types of data sources and provides stable, efficient data transmission features based on the Data Integration service. For more information, see Data Integration.
  • DataWorks provides visualized code development.

    DataWorks provides a graphical user interface (GUI) for you to develop code and design workflows. You can perform simple drag-and-drop operations to create complex data analytics nodes without the need to use development tools. For more information, see GUI elements.

    A browser with Internet access enables you to develop code anytime and anywhere.

  • DataWorks supports monitoring and alerting.

    Operation Center provides a visualized node monitoring and management tool and displays the overall node running status in DAGs. For more information, see Operation Center.

    You can configure various alert notification methods to promptly notify relevant personnel when a node error occurs. This ensures normal business running. For more information, see Monitor.

Features provided by DataWorks Professional Edition

  • Resource optimization: reduces computing and storage costs.
  • Mobile O&M: implements easy, rapid node O&M.
  • Running diagnosis: helps you quickly locate difficult issues.
  • Intelligent monitoring: improves productivity and provides intelligent solutions for alerts.
  • Field-level data lineage: helps you quickly locate the source of dirty data.
  • Multiple types of control nodes: meet the logic of complex business scenarios.
  • Data Security Guard: protects the security of your data.
  • Development of real-time stream computing nodes: enables you to easily use new technologies in DataWorks.
DataWorks Professional Edition provides more features and greatly increases the efficiency of big data governance. For more information, see DataWorks Professional Edition.