Currently, Batch Sync nodes support various data stores, including MaxCompute, MySQL, Distribute Relational Database Service (DRDS), SQL Server, PostgreSQL, Oracle, MongoDB, Db2, Table Store, Object Storage Server (OSS), FTP, HBase, LogHub, Hadoop Distributed File System (HDFS), and Stream.

When you enter a table name, the drop-down list displays all matched tables. Currently, only exact match is supported. You must enter a complete table name. Tables are labeled as unsupported if they are not supported by Batch Sync nodes.

If you move the pointer over a table in the list, the details of the table appear, including the database, IP address, and owner of the table. After you select a table, the column information is automatically entered. You can add, move, and delete columns.

Create a Batch Sync node

  1. Log on to the DataWorks console. In the left-side navigation pane, click Workspaces. On the Workspaces page, find the target workspace and click Data Analytics in the Actions column.
  2. Move the pointer over the Create icon and choose Data Integration > Batch Synchronization.

    You can also find the target workflow, right-click Data Integration, and choose Create > Batch Synchronization.

  3. In the Create Node dialog box that appears, enter the node name, select the target folder, and click Commit.
    Note A node name can be up to 128 characters in length.
  4. Configure the Batch Sync node. For more information.
  5. Configure the node properties.

    Click the Properties tab in the right-side navigation pane. On the Properties tab that appears, set the relevant parameters. For more information, see Properties.

  6. Commit the node.

    After the node properties are configured, click the Save icon in the upper-left corner. Then, commit or commit and unlock the node to the development environment.

  7. Deploy the node.

    For more information, see Deploy a node.

  8. Test the node in the production environment.