A single-table real-time synchronization task creates a destination table in the destination Doris data source based on the schema of the source Hologres table and writes data from the source table to the destination table. This topic describes how to synchronize data from a Hologres table to Doris in real time.
Limits
The version of your Hologres instance must be V2.1 or later.
Incremental synchronization of data from a Hologres partitioned table is not supported.
Messages for DDL changes on a Hologres table cannot be synchronized.
Incremental data of the following data types can be synchronized from Hologres:
INTEGER, BIGINT, TEXT, CHAR(n), VARCHAR(n), REAL, JSON, SERIAL, OID, INT4[], INT8[], FLOAT8[], BOOLEAN[], and TEXT[].
Prerequisites
You have purchased a serverless resource group.
You have created Hologres and Doris data sources. For more information, see Create a data source for Data Integration.
You have established network connectivity between the resource group and data sources. For more information, see Network connectivity solutions.
To synchronize a Hologres table in real time, you must enable Hologres Binlog for the source Hologres database table. For more information, see Subscribe to Hologres Binlog.
Procedure
1. Select a synchronization task type
Go to the Data Integration page.
Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose . On the page that appears, select the desired workspace from the drop-down list and click Go to Data Integration.
In the left-side navigation pane, click Sync Tasks. Then, click Create Sync Task at the top of the page to go to the page for creating a synchronization task. Configure the following basic information:
Data Source And Destination:
Hologres→DorisNew Task Name: Customize a name for the synchronization task.
Sync Type:
Single-table Real-time.Sync Steps: Select
Full Sync.
2. Configure network and resources
In the Network And Resources section, select a Resource Group for the synchronization task. You can allocate the number of compute units (CUs) for Task Resource Usage.
Source Data Source Select the added
Hologresdata source, and for Destination Data Source select the addedDorisdata source, then click Test Connectivity.
After you confirm that both the source and destination data sources are connected, click Next.
3. Configure the synchronization link
1. Configure the Hologres source
Click the Hologres data source at the top of the page and edit the Holo Source Information.

In the Holo Source Information section, select the schema that contains the Hologres table to be read and the source table.
Click Data Sampling in the upper-right corner.
In the Data Output Preview dialog box, specify the Number Of Samples and click the Start Collection button to sample data from the specified Hologres table and preview the data in Hologres. This provides input for subsequent visual configuration.
3. Configure the Doris destination
Click the Doris data destination at the top of the page and edit the Doris Destination Information.

In the Doris Destination Information section, select the Database that contains the Doris table to be written and specify whether the destination table is Auto Table Creation or Use Existing Table.
If you select Auto Table Creation, a table with the same name as the source table is created by default. You can manually modify the destination table name.
If you select Use Existing Table, select the destination table for synchronization from the drop-down list.
(Optional) Modify the schema of a destination table.
If you select Auto Table Creation, you need to click the Edit Table Structure button and edit the destination table structure in the dialog box. You can also click the Regenerate Table Structure Based On Upstream Node Output Columns button to automatically generate the table structure based on the output columns of the upstream node. You can select a column in the automatically generated table structure as the primary key and bucketing column.
NoteThe destination table must have a primary key and a bucketing column. Otherwise, the configuration cannot be saved.
Set the Processing Strategy For Dynamic Fields From Upstream.
When an upstream field does not have a corresponding field in Doris, you can select Ignore, Report Error, or Replay.
Configure mappings between fields in the source and fields in the destination.
After you complete the preceding configuration, the system automatically establishes mappings between fields in the source and fields in the destination based on the Map Fields with Same Name principle. You can modify the mappings based on your business requirements. One field in the source can map to multiple fields in the destination. Multiple fields in the source cannot map to the same field in the destination. If a field in the source has no mapped field in the destination, data in the field in the source is not synchronized to the destination.
4. Configure alert rules
To prevent the failure of the synchronization task from causing latency on business data synchronization, you can configure different alert rules for the synchronization task.
In the upper-right corner of the page, click Configure Alert Rule to go to the Alert Rule Configurations for Real-time Synchronization Subnode panel.
In the Configure Alert Rule panel, click Add Alert Rule. In the Add Alert Rule dialog box, configure the parameters to configure an alert rule.
NoteThe alert rules that you configure in this step take effect for the real-time synchronization subtask that will be generated by the synchronization task. After the configuration of the synchronization task is complete, you can refer to Run and manage real-time synchronization tasks to go to the Real-time Synchronization Task page and modify alert rules configured for the real-time synchronization subtask.
Manage alert rules.
You can enable or disable alert rules that are created. You can also specify different alert recipients based on the severity levels of alerts.
5. Configure advanced parameters
DataWorks allows you to modify the configurations of specific parameters. You can change the values of these parameters based on your business requirements.
To prevent unexpected errors or data quality issues, we recommend that you understand the meanings of the parameters before you change the values of the parameters.
In the upper-right corner of the configuration page, click Configure Advanced Parameters.
In the Configure Advanced Parameters panel, change the values of the desired parameters.
6. Configure resource groups
You can click Configure Resource Group in the upper-right corner of the page to view and change the resource groups that are used to run the current synchronization task.
7. Execute the synchronization task
After the configuration of the synchronization task is complete, click Complete in the lower part of the page.
In the page of page, find the created synchronization task and click Start in the Operation column.
Click the Name or ID of the synchronization task in the Tasks section and view the detailed running process of the synchronization task.
Perform O&M operations on the synchronization task
View the status of the synchronization task
After the synchronization task is created, you can go to the Synchronization Task page to view all synchronization tasks that are created in the workspace and the basic information of each synchronization task.

You can Start or Stop a synchronization task in the Actions column. You can also Edit or View a synchronization task by clicking More.
For a started task, you can view the basic running status in Execution Overview. You can also click the corresponding overview area to view execution details.

A single-table real-time synchronization task from Hologres to Doris consists of three steps:
Structure Migration: includes the creation method of the destination table (existing table or auto table creation). If you select auto table creation, the DDL statement for creating the table will be displayed.
Full Initialization: If you select Full Sync for Sync Steps, the progress of full initialization will be displayed here.
Real-time Data Synchronization: includes statistics information for real-time synchronization, such as real-time read and write traffic, dirty data, failover, and operation logs.
Rerun the synchronization task
In some special cases, if you want to modify the fields to synchronize, the fields in a destination table, or table name information, you can also click Rerun in the Operation column of the desired synchronization task. This way, the system synchronizes the changes that are made to the destination. Data in the tables that are already synchronized and are not modified will not be synchronized again.
Directly click Rerun without modifying the configurations of the synchronization task to enable the system to rerun the synchronization task.
Modify the configurations of the synchronization task and then click Complete. Click Apply Updates that is displayed in the Operation column of the synchronization task to rerun the synchronization task for the latest configurations to take effect.