When you use DataWorks to synchronize data, you can use only exclusive resource groups for Data Integration to run real-time sync nodes. This topic describes the resources and configurations required to run real-time sync nodes.

Background information

  • Resource planning and preparation

    Before you use a data synchronization node to synchronize data, you must purchase an exclusive resource group for data integration and add the resource group to DataWorks for subsequent use.

    For more information about exclusive resource groups for Data Integration, see Exclusive resource groups for Data Integration.

  • Network connections

    An exclusive resource group for Data Integration is essentially a group of Elastic Compute Service (ECS) instances. After you purchase and create such an exclusive resource group, it is isolated from other services. You must associate the resource group with a virtual private cloud (VPC) to ensure network connectivity between the resource group and data sources during subsequent data synchronization.

What to do next

After you plan and configure resources, you can configure data sources. You must configure network connections for the data sources and permissions to access the data sources. This facilitates the creation of a real-time sync node. You can synchronize data only from PolarDB, ApsaraDB for OceanBase, MySQL, or Oracle to DataHub. For more information about how to configure a data source, see Configure a data source (PolarDB), Configure a data source (ApsaraDB for OceanBase), Configure data sources for data synchronization from MySQL, and Configure data sources for data synchronization from Oracle.