DataWorks can work with compute engines to support end-to-end big data development and governance. DataWorks allows you to add data sources to Data Integration and then use Data Integration to transmit data between the data sources. This topic provides the services that can work with DataWorks in typical scenarios.
Supported compute engines
DataWorks allows you to associate compute engine instances with your DataWorks workspaces. After you associate a compute engine instance with a DataWorks workspace, you can create nodes of the same type as the compute engine instance in the DataWorks console and then enable the system to periodically schedule the nodes. DataWorks supports the following compute engines:
- MaxCompute
- E-MapReduce
- Hologres
- ADB for PostgreSQL
- ADB for MySQL
- CDH
- ClickHouse
Supported data sources
DataWorks can synchronize batch data or real-time data between different data sources. You can configure clusters or instances in the following services as the data sources of DataWorks: Alibaba Cloud services and self-managed services that are related to databases, unstructured storage, big data, and message queues. You can use DataWorks to integrate data only after you configure the data source.
- For more information about data sources that support batch synchronization, see Supported data source types, readers, and writers.
- For more information about data sources that support real-time synchronization, see Data source types that support real-time synchronization.