DataWorks works with compute engines to support end-to-end big data development and governance. You can also add data sources to Data Integration and then use Data Integration to transfer data between them. This topic lists the services that DataWorks works with in typical scenarios.

Supported compute engines

DataWorks allows you to associate compute engine instances with your DataWorks workspaces. After you associate a compute engine instance with a workspace, you can create nodes of the same type as that instance in the DataWorks console and have the system schedule them periodically. DataWorks supports the following compute engines:
  • MaxCompute
  • E-MapReduce
  • Hologres
  • ADB for PostgreSQL
  • ADB for MySQL
  • CDH
  • ClickHouse
For more information about how to associate a compute engine instance with a workspace, see Create and manage workspaces.
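
The relationship between a workspace, its associated compute engines, and the node types it can run can be illustrated with a small model. This is only a conceptual sketch in Python, not the DataWorks API: the `Workspace` class and its `associate_engine` and `create_node` methods are hypothetical names introduced for illustration, and only the engine names come from the list above.

```python
from dataclasses import dataclass, field

# Engine names copied from the list above. Everything else in this
# sketch is hypothetical and does not correspond to a DataWorks API.
SUPPORTED_ENGINES = {
    "MaxCompute", "E-MapReduce", "Hologres", "ADB for PostgreSQL",
    "ADB for MySQL", "CDH", "ClickHouse",
}


@dataclass
class Workspace:
    name: str
    engines: set = field(default_factory=set)
    nodes: list = field(default_factory=list)

    def associate_engine(self, engine: str) -> None:
        """Associate a supported compute engine with this workspace."""
        if engine not in SUPPORTED_ENGINES:
            raise ValueError(f"unsupported compute engine: {engine}")
        self.engines.add(engine)

    def create_node(self, node_name: str, engine: str) -> None:
        """Create a node only if its engine type is already associated."""
        if engine not in self.engines:
            raise ValueError(f"{engine} is not associated with {self.name}")
        self.nodes.append((node_name, engine))


ws = Workspace("demo_workspace")
ws.associate_engine("MaxCompute")
ws.create_node("daily_report", "MaxCompute")   # allowed
# ws.create_node("olap_query", "Hologres")     # would raise ValueError
```

The point of the sketch is the ordering constraint: a node type becomes available in a workspace only after the matching compute engine instance has been associated with it.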

Supported data sources

DataWorks can synchronize batch or real-time data between different data sources. You can configure clusters or instances of Alibaba Cloud services and self-managed services as DataWorks data sources, including databases, unstructured storage, big data services, and message queues. You must configure a data source before you can use DataWorks to integrate its data.
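
The requirement that a data source be configured before it can take part in synchronization can be sketched as follows. This is a conceptual Python model rather than the Data Integration API: the `DataSourceRegistry` class, its methods, and the source names `orders_db` and `warehouse` are all assumptions made for illustration.

```python
class DataSourceRegistry:
    """Hypothetical model of data sources configured for synchronization."""

    MODES = ("batch", "real-time")

    def __init__(self):
        self._sources = {}

    def add_data_source(self, name: str, kind: str) -> None:
        """Register a data source under a name, with a free-form kind label."""
        self._sources[name] = kind

    def plan_sync(self, source: str, destination: str, mode: str = "batch") -> dict:
        """Describe a sync task; both endpoints must already be registered."""
        if mode not in self.MODES:
            raise ValueError(f"mode must be one of {self.MODES}")
        missing = [n for n in (source, destination) if n not in self._sources]
        if missing:
            raise LookupError(f"data sources not configured: {missing}")
        return {"from": source, "to": destination, "mode": mode}


registry = DataSourceRegistry()
registry.add_data_source("orders_db", "database")
registry.add_data_source("warehouse", "big data")
task = registry.plan_sync("orders_db", "warehouse", mode="batch")
```

Calling `plan_sync` with a name that was never registered raises `LookupError` in this model, which mirrors the rule that integration is possible only after the data source is configured.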