To synchronize data from PolarDB-X 1.0 to Hologres, use PolarDB-X 1.0 as the source type and Hologres as the destination type. Before you run a data synchronization node, complete the preparations described in this topic, including the network environment, the whitelists of the data sources, and the accounts used for data synchronization.

Prerequisites

Before you configure a data source, make sure that the following operations are completed:
  • Prepare data sources: A PolarDB-X 1.0 data source and a Hologres data source are created.
    Note The PolarDB-X 1.0 instance must meet the following requirements:
    • The instance must be a non-read-only PolarDB-X 1.0 instance.
    • The instance must be added in DataWorks in Alibaba Cloud instance mode. If you add the instance in connection string mode and use the data source in a data synchronization node, the node fails.
    • The storage type of the instance must be PolarDB for MySQL or ApsaraDB RDS (excluding ApsaraDB RDS for MySQL). ApsaraDB RDS can be used only for existing PolarDB-X 1.0 instances and cannot be used for newly purchased PolarDB-X 1.0 instances.
    For more information about how to create a PolarDB-X 1.0 instance, see Create an instance.
  • Plan and prepare resources: An exclusive resource group for Data Integration is purchased and configured. For more information, see Plan and configure resources.
  • Evaluate and plan the network environment: Before you perform data integration, you must select a network connection method based on your business requirements and use the method to connect the data sources to the exclusive resource group for Data Integration. After the data sources and the exclusive resource group for Data Integration are connected, you can refer to the operations described in this topic to configure access settings such as vSwitches and whitelists.
    • If the data sources and the exclusive resource group for Data Integration reside in the same region and virtual private cloud (VPC), they are automatically connected.
    • If the data sources and the exclusive resource group for Data Integration reside in different network environments, you must connect the data sources and the resource group by using methods such as a VPN gateway.

Background information

Before you synchronize data from the source to the destination, make sure that network connections between the data sources and the exclusive resource group for Data Integration are established. You must also create an account and authorize the account to access the data sources.
  • Configure whitelists for the data sources
    If the data sources and the exclusive resource group for Data Integration reside in the same VPC, you must add the CIDR block of the exclusive resource group for Data Integration to the whitelists of the data sources. This ensures that the exclusive resource group for Data Integration can be used to access the data sources.
  • Create an account and grant permissions to the account

    You must create an account that can be used to access the data sources, read data from the source, and write data to the destination during the data synchronization process.
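
The whitelist requirement above can be checked offline before you run a node. The following is a minimal sketch that uses only the Python standard library to test whether an address is covered by a list of whitelist entries. The function name is_covered and all addresses are hypothetical values chosen for illustration. They are not taken from DataWorks or from a real instance.

```python
import ipaddress


def is_covered(address, whitelist):
    """Return True if `address` falls inside any entry of `whitelist`.

    Each entry is either a single IP address or a CIDR block.
    """
    ip = ipaddress.ip_address(address)
    for entry in whitelist:
        # strict=False also accepts entries written with host bits set,
        # such as 192.168.0.5/24, and treats them as 192.168.0.0/24.
        network = ipaddress.ip_network(entry, strict=False)
        if ip.version == network.version and ip in network:
            return True
    return False


# Hypothetical whitelist: one vSwitch CIDR block and one EIP.
whitelist = ["192.168.0.0/24", "203.0.113.10"]
print(is_covered("192.168.0.37", whitelist))  # address inside the CIDR block
print(is_covered("10.0.0.5", whitelist))      # address outside every entry
```

A result of False for an address that the exclusive resource group for Data Integration uses suggests that the corresponding whitelist entry is missing.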

Limits

The real-time synchronization feature cannot synchronize the effects of XA ROLLBACK statements. Transaction data on which XA PREPARE statements are executed is synchronized to the destination. If XA ROLLBACK statements are later executed on that data, the rollback changes are not synchronized to the destination. If the tables that you want to synchronize include tables on which XA ROLLBACK statements are executed, you must remove those tables and then add them again to initialize the full data from the source and resume synchronization of incremental data.

Procedure

  1. Configure the whitelist of the PolarDB-X 1.0 data source.
    Add the elastic IP address (EIP) of the exclusive resource group for Data Integration and the CIDR block of the vSwitch with which the exclusive resource group for Data Integration is associated to the whitelist of the PolarDB-X 1.0 data source. To view and add the EIP and CIDR block to the whitelist, perform the following steps:
    1. View and record the EIP and CIDR block of the exclusive resource group for Data Integration.
      1. Log on to the DataWorks console.
      2. In the left-side navigation pane, click Resource Groups.
      3. On the Exclusive Resource Groups tab, find the exclusive resource group for Data Integration and click View Information in the Actions column.
      4. In the Exclusive Resource Groups dialog box, view and record the values of the EIPAddress and CIDR Blocks parameters.
      5. On the Exclusive Resource Groups tab, find the exclusive resource group for Data Integration and click Network Settings in the Actions column.
      6. On the VPC Binding tab of the page that appears, view and record the CIDR block of the vSwitch with which the exclusive resource group for Data Integration is associated.
    2. Add the EIP and CIDR block that you recorded to the whitelist of the PolarDB-X 1.0 data source.
      For more information, see Set an IP address whitelist.
  2. Create an account and grant the required permissions to the account.
    You need to create an account that is used to log on to the databases of the PolarDB-X 1.0 instance for subsequent operations and grant the required permissions to the account. For more information, see Manage accounts.
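
Before you add the recorded values to the whitelist in step 1, it can help to validate them so that a copy error does not end up in the whitelist. The following is a minimal sketch under stated assumptions: the helper name build_whitelist is hypothetical, the addresses are placeholders, and the comma-separated output format is an assumption about what the whitelist input accepts. Check the format against Set an IP address whitelist.

```python
import ipaddress


def build_whitelist(eip, cidr_blocks):
    """Validate the recorded EIP and CIDR blocks and join them into one string.

    Raises ValueError if the EIP is not a valid address or if a CIDR block
    is malformed (for example, has host bits set).
    """
    # Normalize the EIP as a plain IP address.
    entries = [str(ipaddress.ip_address(eip.strip()))]
    for block in cidr_blocks:
        # strict=True rejects blocks such as 192.168.0.5/24.
        entries.append(str(ipaddress.ip_network(block.strip(), strict=True)))
    # Remove duplicates while preserving the original order.
    unique_entries = list(dict.fromkeys(entries))
    # Assumption: the whitelist accepts comma-separated entries.
    return ",".join(unique_entries)


# Hypothetical recorded values: one EIP and two CIDR blocks, one repeated.
print(build_whitelist("203.0.113.10", ["192.168.0.0/24", " 172.16.1.0/24", "192.168.0.0/24"]))
```

Strict parsing is used here on purpose: a block with host bits set is more likely a transcription mistake than an intended entry, so the sketch rejects it instead of silently correcting it.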

What to do next

After the data sources are configured, the source, destination, and exclusive resource group for Data Integration are connected. Then, the exclusive resource group for Data Integration can be used to access the data sources. You can add the source and destination to DataWorks, and associate them with a data synchronization solution when you create the solution.

For more information about how to add a data source, see Add a data source.