When you use DataWorks to synchronize data from a PostgreSQL database to Hologres, you can refer to the operations in this topic to configure the network, whitelists, and permissions for data sources to implement data synchronization.
Prerequisites
- Data sources are purchased. A source PostgreSQL data source and a destination Hologres data source are purchased.
- Plan and prepare resources: An exclusive resource group for Data Integration is purchased and configured. For more information, see Plan and configure resources.
- Evaluate and plan the network environment: Before you perform data integration, connect
data sources to exclusive resource groups for Data Integration based on your business
requirements. After data sources and exclusive resource groups for Data Integration
are connected, you can refer to the operations in this topic to configure access settings
such as vSwitches and whitelists.
- If data sources and exclusive resource groups for Data Integration reside in the same region and virtual private cloud (VPC), they are automatically connected.
- If data sources and exclusive resource groups for Data Integration reside in different network environments, you must connect data sources and resource groups by using methods such as a VPN gateway.
Background information
- Configure whitelists for the data sources
If the data sources and the exclusive resource group for Data Integration reside in the same VPC, you must add the CIDR block of the vSwitch that is bound to the exclusive resource group for Data Integration during network configuration to the whitelists of the data sources. This ensures that the exclusive resource group for Data Integration can be used to access the data sources.
- Create an account and grant permissions the account
You must create an account that can be used to access the data sources, read data from the source, and write data to the destination during the data synchronization process.
- Check whether the database version is the version supported by real-time synchronization
nodes of Data Integration in DataWorks.
The following PostgreSQL database versions are supported: PostgreSQL
10
,11
, and12
. You can execute the following statement to view the version of the PostgreSQL database:show server_version
Limits
- Data Integration supports
ADD COLUMN
statements.- An
ADD COLUMN
statement cannot be combined with other DDL statements, such asDROP COLUMN
.Notice If you execute anADD COLUMN
statement together with anALTER COLUMN
statement, such as aDROP COLUMN or RNAME COLUMN
statement on a data synchronization node, the node cannot normally work. - DDL statements except
ADD COLUMN
statements cannot be identified.
- An
ALTER TABLE and CREATE TABLE
statements are not supported.- TEMPRORARY tables and UNLOGGED tables cannot be replicated. The PostgreSQL database does not provide a mechanism for log parsing subscriptions to these two types of tables.
- Sequences cannot be replicated (
serial, bigserial, and identity
). - TRUNCATE statements are not supported.
- Large objects cannot be replicated (BYTEA).
- Views, materialized views, and foreign tables cannot be replicated.