DataHub is a platform designed to process streaming data. You can publish and subscribe to streaming data in DataHub and distribute the data to other platforms. DataHub allows you to analyze streaming data and build applications based on the streaming data.

Prerequisites

A reader or transformation node is configured. For more information, see Supported data stores for real-time synchronization.

Background information

DataHub Writer writes data to DataHub by using the DataHub SDK for Java. The following code shows the version of the DataHub SDK for Java.
<dependency>
    <groupId>com.aliyun.datahub</groupId>
    <artifactId>aliyun-sdk-datahub</artifactId>
    <version>2.5.1</version>
</dependency>

Procedure

  1. Go to the DataStudio page.
    1. Log on to the DataWorks console.
    2. In the left-side navigation pane, click Workspaces.
    3. Select the region where the required workspace resides, find the workspace, and then click Data Analytics.
  2. Move the pointer over the Create icon and choose Data Integration > Real-time synchronization.
    Alternatively, you can click the required workflow, right-click Data Integration, and then choose Create > Real-time synchronization.
  3. In the Create Node dialog box, set the Node Name and Location parameters.
    Notice The node name must be 1 to 128 characters in length. It can contain letters, digits, underscores (_), and periods (.).
  4. Click Commit.
  5. On the configuration tab of the real-time sync node, drag DataHub under Output to the canvas on the right. Connect the new node to a reader or transformation node.
  6. Click the new DataHub node. In the configuration pane that appears, set the parameters in the Node configuration section.
    DataHub Writer
    Parameter Description
    Data source The connection to the DataHub data store. In this example, you can select only a DataHub connection.

    If no connection is available, click New data source on the right to create one on the Data Source page. For more information, see Configure a DataHub connection.

    Topic The name of the topic to which data is written in DataHub. You can click Data preview on the right to preview the selected topic.
    Batch number The number of records that are written at a time.
    Field Mapping The mappings between fields in the source and destination data stores. DataWorks synchronizes data based on the field mappings.
  7. Click the Save icon in the toolbar.