DataWorks provides powerful basic capabilities that help improve work efficiency, ensure the timely generation of data, facilitate data governance, and allow you to construct data services at minimum costs.

Low learning costs

Common users other than technical personnel can have a good command of data development and governance procedures within 1 to 2 hours and no longer need to use traditional command line tools to perform development operations. This greatly reduces learning costs.

DataWorks allows you to organize nodes that are run by using various heterogeneous compute engines and to configure dependencies between the nodes in the same directed acyclic graph (DAG). This way, you do not need to separately maintain different technology stacks, and node organization efficiency is improved. The nodes include data synchronization nodes, SQL nodes, MR nodes, ODPS Spark nodes, real-time computing nodes, and Machine Learning Platform for AI (PAI) nodes.

Reduced labor costs

You can activate the DataWorks service by performing only simple configurations. After the service is activated, you can use the out-of-the-box features provided by this service to build data warehouses. This frees you from heavy development, deployment, and maintenance work and significantly reduces O&M costs.

Comprehensive features

DataWorks provides comprehensive features that can be used in data transmission, data development, data production, data governance, and data security scenarios. The whole lifecycle of big data in each scenario is covered by the related features. This helps address issues encountered by enterprises in data warehouse building, data mid-end building, and digital transformation.

  • Data synchronization from data sources that reside in complex network environments, and real-time and batch synchronization of full and incremental data are supported.
  • Scheduling for tens of millions of nodes for a single user is supported. Data processing is more fluent.