If you use sync solutions of DataWorks to synchronize data, you can use only exclusive resource groups for Data Integration to run Data Integration nodes. However, you can select a shared or exclusive resource group for scheduling based on your business requirements. This topic describes the resources that are used for data synchronization and how to configure the resources.
Background information
- Resource planning and preparation
When you synchronize data, Data Integration nodes run on resources in resource groups for Data Integration and resource groups for scheduling. You can use only exclusive resource groups for Data Integration. Before you synchronize data, you must purchase an exclusive resource group for Data Integration and add the exclusive resource group to your DataWorks workspace.
For more information about exclusive resource groups for Data Integration, see Overview of exclusive resource groups for Data Integration.
- Network connectivity
An exclusive resource group for Data Integration is essentially a group of Elastic Compute Service (ECS) instances. After you purchase such an exclusive resource group, it is isolated from other services. You must associate the resource group with a virtual private cloud (VPC) to ensure network connectivity between the resource group and data sources during subsequent data synchronization.
Associate the exclusive resource group with a VPC
- Log on to the DataWorks console.
- In the left-side navigation pane, click Resource Groups. On the Exclusive Resource Groups tab of the Resource Groups page, find the created resource group and click Network Settings in the Actions column. On the page that appears, you can associate the resource group with a VPC. Before you associate the exclusive resource group with a VPC, you must log on to the RAM console with your Alibaba Cloud account and authorize DataWorks to access your cloud resources. You can go to the Cloud Resource Access Authorization page to authorize DataWorks to access your cloud resources. You can also authorize DataWorks to access your cloud resources by clicking the related button in the dialog box that is displayed the first time you log on to the DataWorks console with your Alibaba Cloud account.
- Associate the exclusive resource group with a VPC. Note If your data source and the exclusive resource group reside in different regions or belong to different Alibaba Cloud accounts, you must add a route that points to the IP address of your data source after you associate the exclusive resource group with a VPC.
- Optional:Add host configurations. You may fail to access your data source by using IP addresses. For example, you can access your data source only by using hostnames. In this case, you must perform the following steps to add host configurations. Otherwise, the connectivity test fails when you add the data source by using its hostnames.
What to do next
After you plan and configure resources, you can configure data sources. You must connect the exclusive resource group for Data Integration to the source and destination. You must also create an account and grant the required permissions to the account. This account is used to access the source and destination. The preceding operations help create a data sync node. For more information about how to configure data sources, see Configure data sources for data synchronization from MySQL, Configure data sources for data synchronization from PolarDB, and Configure data sources for data synchronization from Oracle.