When you use DataWorks to synchronize data, you can use only exclusive resource groups for data integration to run real-time data synchronization nodes. This topic describes the resources and configurations required to run a real-time data synchronization node.

Background information

  • Resource planning and preparation

    Before you use a data synchronization node to synchronize data, you must purchase an exclusive resource group for data integration and add the resource group to DataWorks for subsequent use.

    For more information about exclusive resource groups for data integration, see Exclusive resources for Data Integration.

  • Network connections

    An exclusive resource group for data integration is essentially a group of resource instances. After you purchase such an exclusive resource group, it is isolated from other services. You must bind the resource group to a virtual private cloud (VPC) to ensure the network connectivity between the resource group and data sources during subsequent data synchronization.

What to do next

After you plan and configure resources, you can configure data sources. You must configure network connectivity for the data sources and permissions to access the data sources. This facilitates the creation of a real-time data synchronization node. You can synchronize data only from PolarDB, ApsaraDB for OceanBase, or MySQL to DataHub. You can select a data source based on your business requirements. For more information about how to configure a data source, see Configure a data source (PolarDB), Configure a data source (ApsaraDB for OceanBase), and Configure a data source (MySQL).