The Data Integration service of DataWorks allows you to use a synchronization solution
to synchronize data from a source to a destination in real time or in batch mode.
You can use a synchronization solution to synchronize multiple tables at a time or
synchronize both full and incremental data. If you want to synchronize both full and
incremental data, you can synchronize the incremental data after the full data is
synchronized.
Go to the Tasks page
- Log on to the DataWorks console.
- In the left-side navigation pane, click Workspaces.
- In the top navigation bar, select the region in which your workspace resides. Find
the workspace and click Data Integration in the Actions column.
- In the left-side navigation pane, click Synchronization task. The Tasks page appears.
You can create synchronization nodes and view the status of the created synchronization
nodes on this page.
When you create a synchronization node, you can select a synchronization solution
as needed. For more information, see the
Select a synchronization solution section of this topic. A synchronization node can be in the following states:
- Not Running: The synchronization node is not run. You can click Submit and Run in the Operation column of the synchronization node to run the synchronization node.
Note You can click Modify Configuration to modify a synchronization node only if the synchronization node is in the Not Running state. If you click Modify Configuration in the Operation column of a synchronization
node in another state, you can view only the information about that synchronization
node.
- Running: The synchronization node is running and cannot be terminated. You must wait until
the synchronization node is complete.
- Exception: An error occurred while running the synchronization node. You can click Execution details in the Operation column of the synchronization node to troubleshoot the error.
- Succeeded: The synchronization node is complete. You can click Execution details in the Operation column of the synchronization node to view the results of the synchronization
node.
Select a synchronization solution
- On the Tasks page, click New task in the upper-right corner.
- On the Create Data Synchronization Solution page, select the source and destination
for data synchronization, and then select a synchronization solution.
For more information about the supported source and destination data sources, see
Supported data sources. The following figure shows the supported synchronization solutions.
DataWorks supports the following synchronization solutions, which are classified based
on the destination data source:
- One-click real-time synchronization to DataHub: used to synchronize data to DataHub.
- One-click realtime synchronization to Elasticsearch: used to synchronize data to Elasticsearch.
- One-click real-time synchronization to Hologres: used to synchronize data to Hologres.
- One-click realtime synchronization to AnalyticDB MySQL 3.0: used to synchronize data to AnalyticDB for MySQL V3.0.
- One-click real-time synchronization to Kafka: used to synchronize data to Kafka.
- Synchronization solutions for synchronizing data to MaxCompute, including:
- One-click real-time synchronization to MaxCompute
- One-click batch synchronization to MaxCompute (Cyclical Full)
- One-click batch synchronization to MaxCompute (Cyclical Increment)
- One-click batch synchronization to MaxCompute (Once Full)
- One-click batch synchronization to MaxCompute (Once Increment)
- One-click batch synchronization to MaxCompute (Once Full then cyclical increment)