When you use DataWorks to synchronize data, you can use only exclusive resource groups for Data Integration to run Data Integration nodes. In addition, you can select a shared or exclusive resource group for scheduling based on your business requirements. This topic describes the resources that are used for data synchronization and how to configure the resources.
- Resource planning and preparation
When you synchronize data, Data Integration nodes run on resources in resource groups for Data Integration and resource groups for scheduling. You can use only exclusive resource groups for Data Integration. Before you synchronize data, you must purchase an exclusive resource group for Data Integration and add the exclusive resource group to your DataWorks workspace.
For more information about exclusive resource groups for Data Integration, see Exclusive resource groups for Data Integration.
- Network connections
An exclusive resource group for Data Integration is essentially a group of ECS instances. After you purchase such an exclusive resource group, it is isolated from other services. You must associate the resource group with a virtual private cloud (VPC) to ensure network connectivity between the resource group and data sources during subsequent data synchronization.
What to do next
After you plan and configure resources, you can configure data sources. You must connect the exclusive resource group for Data Integration to the source and destination. You must also create an account and grant the required permissions to the account. This account is used to access the source and destination. The preceding operations help create a data synchronization node. For more information about how to configure data sources, see Configure a data source (MySQL) and Configure the source (PolarDB).