When you use DataWorks to synchronize data, you can use only exclusive resource groups for Data Integration to run real-time sync nodes. This topic describes the resources and configurations required to run real-time sync nodes.

Background information

  • Resource planning and preparation

    Before you use a data synchronization node to synchronize data, you must purchase an exclusive resource group for data integration and add the resource group to DataWorks for subsequent use.

    For more information about exclusive resource groups for Data Integration, see Exclusive resource groups for Data Integration.

  • Network connectivity

    An exclusive resource group for Data Integration is essentially a group of Elastic Compute Service (ECS) instances. After you purchase an exclusive resource group for Data Integration, it is isolated from other services. You must associate the resource group with a virtual private cloud (VPC) to ensure network connectivity between the resource group and data sources during subsequent data synchronization.

What to do next

After you plan and configure resources, you can configure data sources. You must configure network connections for the data sources and permissions to access the data sources. This facilitates the creation of a real-time sync node. DataWorks allows you to synchronize data to Kafka only from a MySQL data source in real time. For more information about how to configure a MySQL data source, see Configure data sources for data synchronization from MySQL.