Before you use DataWorks to synchronize data from ApsaraDB for OceanBase to DataHub, you must refer to the operations in this topic to prepare the configurations, such as network environments, whitelists, and permissions, for both the source and destination.
Prerequisites
- Prepare data sources: An ApsaraDB for OceanBase cluster and a DataHub project are prepared.
- Plan and prepare resources: An exclusive resource group for data integration is purchased and configured. For more information, see Plan and configure resources.
- Evaluate and plan the network environment: Before you perform data integration, connect
data sources to exclusive resource groups for data integration based on your business
requirements. After data sources and exclusive resource groups for data integration
are connected, you can refer to the operations in this topic to configure access settings
such as vSwitches and whitelists.
- If data sources and exclusive resource groups for data integration reside in the same region and virtual private cloud (VPC), they are automatically connected.
- If data sources and exclusive resource groups for data integration reside in different network environments, you must connect data sources and resource groups by using methods such as a VPN gateway.
Background information
- Configure whitelists for data sources
If the data sources and exclusive resource group for data integration reside in the same VPC, you must add the CIDR block of the exclusive resource group for data integration to the whitelists of the data sources. This ensures that the exclusive resource group for data integration can be used to access the data sources.
- Create an account and authorize the account
You must create an account that can be used to access data sources, read data from the source data source, and write data to the destination data source in the data integration process.
Limits
ApsaraDB for OceabBase is a distributed relational database that can integrate data distributed in multiple physical databases into a unified logical database. However, you can synchronize data of only one physical ApsaraDB for OceanBase database to DataHub in real time.
Procedure
What to do next
After data sources are configured, the source data source, destination data source, and exclusive resource group for data integration are connected. Then, the exclusive resource group for data integration can be used to access data sources. You can add the source data source and destination data source to DataWorks, and associate them with a data synchronization solution when you create the solution.
For more information about how to add a data source, see Add a data source.