The Data Integration service of DataWorks provides synchronization solutions that synchronize data from a source to a destination in real time or in batch mode. A synchronization solution can synchronize multiple tables at a time, and can synchronize both full and incremental data. If you synchronize both full and incremental data, the incremental data is synchronized after the full data is synchronized.

Go to the Tasks page

  1. Log on to the DataWorks console.
  2. In the left-side navigation pane, click Workspaces.
  3. In the top navigation bar, select the region in which your workspace resides. Find the workspace and click Data Integration in the Actions column.
  4. In the left-side navigation pane, click Synchronization task. The Tasks page appears.
    You can create synchronization nodes and view the status of the created synchronization nodes on this page.
    When you create a synchronization node, you can select a synchronization solution as needed. For more information, see the Select a synchronization solution section of this topic. A synchronization node can be in the following states:
    • Not Running: The synchronization node has not been run. You can click Submit and Run in the Operation column of the synchronization node to run it.
      Note: You can click Modify Configuration to modify a synchronization node only if the node is in the Not Running state. If you click Modify Configuration in the Operation column of a synchronization node in any other state, you can only view the information about that node.
    • Running: The synchronization node is running and cannot be terminated. You must wait until the synchronization node is complete.
    • Exception: An error occurred while running the synchronization node. You can click Execution details in the Operation column of the synchronization node to troubleshoot the error.
    • Succeeded: The synchronization node is complete. You can click Execution details in the Operation column of the synchronization node to view the results of the synchronization node.
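
The four states and the operations available in each can be summarized in a small lookup table. The following is a minimal sketch that only models the behavior described above. It does not call any DataWorks API, and the names NodeState, OPERATIONS, available_operations, and can_modify are hypothetical names chosen for illustration.

```python
from enum import Enum


class NodeState(Enum):
    """States a synchronization node can be in, as listed on the Tasks page."""
    NOT_RUNNING = "Not Running"
    RUNNING = "Running"
    EXCEPTION = "Exception"
    SUCCEEDED = "Succeeded"


# Hypothetical lookup table: operations this topic mentions for each state.
# A running node cannot be terminated, so no operation is listed for it.
OPERATIONS = {
    NodeState.NOT_RUNNING: ("Submit and Run", "Modify Configuration"),
    NodeState.RUNNING: (),
    NodeState.EXCEPTION: ("Execution details",),
    NodeState.SUCCEEDED: ("Execution details",),
}


def available_operations(state: NodeState) -> tuple:
    """Return the operations described in this topic for the given state."""
    return OPERATIONS[state]


def can_modify(state: NodeState) -> bool:
    """Configuration is editable only in the Not Running state.

    In every other state, Modify Configuration opens a read-only view.
    """
    return state is NodeState.NOT_RUNNING


if __name__ == "__main__":
    for state in NodeState:
        print(state.value, available_operations(state), can_modify(state))
```

The table keeps the read-only behavior of Modify Configuration separate from the list of operations, because the button remains clickable in every state and only its effect changes.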

Select a synchronization solution

  1. On the Tasks page, click New task in the upper-right corner.
  2. On the Create Data Synchronization Solution page, select the source and destination for data synchronization, and then select a synchronization solution.
    For more information about the supported source and destination data sources, see Supported data sources. The following figure shows the supported synchronization solutions.
    Figure: Supported synchronization solutions
    DataWorks supports the following synchronization solutions, which are classified based on the destination data source:
    • One-click real-time synchronization to DataHub: used to synchronize data to DataHub.
    • One-click real-time synchronization to Elasticsearch: used to synchronize data to Elasticsearch.
    • One-click real-time synchronization to Hologres: used to synchronize data to Hologres.
    • One-click real-time synchronization to AnalyticDB MySQL 3.0: used to synchronize data to AnalyticDB for MySQL V3.0.
    • One-click real-time synchronization to Kafka: used to synchronize data to Kafka.
    • Synchronization solutions for synchronizing data to MaxCompute, including:
      • One-click real-time synchronization to MaxCompute
      • One-click batch synchronization to MaxCompute (Cyclical Full)
      • One-click batch synchronization to MaxCompute (Cyclical Increment)
      • One-click batch synchronization to MaxCompute (Once Full)
      • One-click batch synchronization to MaxCompute (Once Increment)
      • One-click batch synchronization to MaxCompute (Once Full then cyclical increment)
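
Because the solutions are classified by destination, the list above can be read as a mapping from a destination to the solutions offered for it. The following is a minimal sketch of that mapping, assuming the solution names exactly as listed in this topic. It is not a DataWorks API, and SOLUTIONS_BY_DESTINATION and solutions_for are hypothetical names used only for illustration.

```python
# Hypothetical catalog built from the list in this topic: destination -> solutions.
SOLUTIONS_BY_DESTINATION = {
    "DataHub": ["One-click real-time synchronization to DataHub"],
    "Elasticsearch": ["One-click real-time synchronization to Elasticsearch"],
    "Hologres": ["One-click real-time synchronization to Hologres"],
    "AnalyticDB for MySQL V3.0": [
        "One-click real-time synchronization to AnalyticDB MySQL 3.0"
    ],
    "Kafka": ["One-click real-time synchronization to Kafka"],
    "MaxCompute": [
        "One-click real-time synchronization to MaxCompute",
        "One-click batch synchronization to MaxCompute (Cyclical Full)",
        "One-click batch synchronization to MaxCompute (Cyclical Increment)",
        "One-click batch synchronization to MaxCompute (Once Full)",
        "One-click batch synchronization to MaxCompute (Once Increment)",
        "One-click batch synchronization to MaxCompute (Once Full then cyclical increment)",
    ],
}


def solutions_for(destination: str) -> list:
    """Return the solutions listed for a destination, or an empty list if none."""
    return SOLUTIONS_BY_DESTINATION.get(destination, [])


def batch_solutions_for(destination: str) -> list:
    """Return only the batch synchronization solutions for a destination."""
    return [name for name in solutions_for(destination) if " batch " in name]


if __name__ == "__main__":
    for name in solutions_for("MaxCompute"):
        print(name)
```

In this list, MaxCompute is the only destination with batch solutions. Every other destination has a single real-time solution.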