This topic describes how to use DataWorks to synchronize data from DRDS to Hologres. You must refer to the operations in this topic to configure the network, whitelists, and permissions for data sources to implement data synchronization.

Prerequisites

Before you configure data sources, make sure that the following operations are performed:
  • Data sources are purchased. A DRDS data source (source) and a Hologres data source (destination) are created.
    Note Limits on the DRDS data source:
    • The instance must be a non-read-only instance of DRDS 1.0.
    • The storage type must be PolarDB for MySQL (for tenants) and ApsaraDB RDS (excluding ApsaraDB RDS for MySQL). ApsaraDB RDS can be used only for existing DRDS instances and cannot be used for newly purchased DRDS instances.
    For information about how to create DRDS 1.0 instances, see Buy a DRDS instance in the PolarBD-X documentation.Step 1: Create a DRDS instance.
  • Plan and prepare resources: An exclusive resource group for Data Integration is purchased and configured. For more information, see Plan and configure resources.
  • Evaluate and plan the network environment: Before you perform data integration, connect data sources to exclusive resource groups for Data Integration based on your business requirements. After data sources and exclusive resource groups for Data Integration are connected, you can refer to the operations in this topic to configure access settings such as vSwitches and whitelists.
    • If data sources and exclusive resource groups for Data Integration reside in the same region and virtual private cloud (VPC), they are automatically connected.
    • If data sources and exclusive resource groups for Data Integration reside in different network environments, you must connect data sources and resource groups by using methods such as a VPN gateway.

Background information

Before you synchronize data from the source to the destination, make sure that the data sources and exclusive resource groups for Data Integration are connected. In addition, you must create an account and authorize the account to access the data sources.
  • Configure whitelists for the data sources
    If the data sources and the exclusive resource group for Data Integration reside in the same VPC, you must add the CIDR block of the vSwitch that is bound to the exclusive resource group for Data Integration during network configuration to the whitelists of the data sources. This ensures that the exclusive resource group for Data Integration can be used to access the data sources. VPC connection
  • Create an account and grant permissions the account

    You must create an account that can be used to access the data sources, read data from the source, and write data to the destination during the data synchronization process.

Procedure

  1. Configure a whitelist for the DRDS instance.
    Add the CIDR block of the VPC where the exclusive resource group resides to a whitelist of the PloarDB-X instance.
    1. View and record the elastic IP address (EIP) and CIDR block of the exclusive resource group.
    2. Add the elastic IP address (EIP) and CIDR block of the exclusive resource group for Data Integration to a whitelist of the PloarDB-X instance.
      For more information, see Set an IP address whitelist.
  2. Create an account and grant the required permissions to the account.
    You need to create an account to log on to the PloarDB-X database for subsequent operations. For more information, see Manage accounts.

What to do next

After data sources are configured, the source data source, destination data source, and exclusive resource group for Data Integration are connected. Then, the exclusive resource group for Data Integration can be used to access the data sources. You can add the source data source and destination data source to DataWorks, and associate them with a data synchronization solution when you create the solution.

For more information about how to add a data source, see Add a data source.