Data Transmission Service: Configure a source database

Last Updated: Oct 18, 2023

This topic describes how to configure a source database for an extract, transform, and load (ETL) task.

Prerequisites

  • An ETL task is created in one of the following regions: China (Hangzhou), China (Shanghai), China (Qingdao), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Guangzhou), and China (Hong Kong).

  • The source database belongs to one of the following types: MySQL, PolarDB for MySQL, Oracle, PostgreSQL, iSeries DB2 (AS/400), Db2 for LUW, PolarDB-X 1.0, PolarDB for PostgreSQL, MariaDB, PolarDB for Oracle, SQL Server, and PolarDB-X 2.0.

  • The database that you want to use as the source database is created.
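The first two prerequisites can be checked mechanically before you open the console. The following Python sketch is illustrative only: the function name `check_prerequisites` and the constant names are hypothetical, the sets simply restate the regions and database types listed above, and no DTS API is called.

```python
from typing import List

# Values copied from the prerequisites above; the names are hypothetical.
SUPPORTED_REGIONS = {
    "China (Hangzhou)", "China (Shanghai)", "China (Qingdao)", "China (Beijing)",
    "China (Zhangjiakou)", "China (Shenzhen)", "China (Guangzhou)", "China (Hong Kong)",
}
SUPPORTED_SOURCE_TYPES = {
    "MySQL", "PolarDB for MySQL", "Oracle", "PostgreSQL", "iSeries DB2 (AS/400)",
    "Db2 for LUW", "PolarDB-X 1.0", "PolarDB for PostgreSQL", "MariaDB",
    "PolarDB for Oracle", "SQL Server", "PolarDB-X 2.0",
}


def check_prerequisites(region: str, source_type: str) -> List[str]:
    """Return a list of problems; an empty list means both checks pass."""
    problems = []
    if region not in SUPPORTED_REGIONS:
        problems.append(f"Region not supported for ETL tasks: {region}")
    if source_type not in SUPPORTED_SOURCE_TYPES:
        problems.append(f"Source database type not supported: {source_type}")
    return problems


print(check_prerequisites("China (Hangzhou)", "MySQL"))  # prints []
```

An empty list means that the region and database type match the prerequisites. The check does not verify that the database itself exists.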

Procedure

Note

In this example, a self-managed MySQL database is used.

  1. Go to the Streaming ETL page.

    1. Log on to the Data Management (DMS) console.

    2. In the top navigation bar, move your pointer over DTS.

    3. Choose Data integration > Streaming ETL.

  2. In the upper-left corner of the Streaming ETL page, click Create Data Flow. In the Create Data Flow dialog box, enter a name for the ETL task in the Data Flow Name field and set the Development Method parameter to DAG.

  3. Click OK.

  4. On the left side of the canvas, drag an Input/Dimension Table MySQL node to the blank area of the canvas.

  5. Click the Input/Dimension Table MySQL-1 node on the canvas to configure the source database.

    1. On the Node Settings tab, configure the parameters that are described in the following list.

      • Data Source Name: The name of the data source. Data Transmission Service (DTS) automatically generates a data source name. We recommend that you specify a descriptive name for easy identification. The name does not need to be unique.

      • Region: The region in which the source database resides.

        Note: You can create an ETL task in one of the following regions: China (Hangzhou), China (Shanghai), China (Qingdao), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Guangzhou), and China (Hong Kong).

      • Instances: The name of the database instance to which the source database belongs. You can also click Create Instance to create a database instance. For more information, see Databases supported by DMS.

      • Node Type: The type of the source table. Valid values: Stream Table and Dimension Table.

      • Convert Format: This parameter is required if you set the Node Type parameter to Stream Table. Valid values: Upsert Stream and Append-Only Stream.

      • Select Databases and Tables: The source database and the tables that you want to transform.

        Note: After you select the source database and tables, the Output Fields tab is displayed.

    2. On the Output Fields tab, select the fields in the Column Name column based on your business requirements.

    3. Optional. If you set the Node Type parameter to Stream Table, click the Time Attribute tab and configure the parameters that are described in the following list.

      • Event Time Watermark: The field that represents the time when the data is generated.

      • Latency of Event Time Watermark: The maximum data latency that you can accept.

      • Processing Time: The field that represents the time when the data is processed.
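The relationship between the Event Time Watermark and its latency can be illustrated with a small simulation. This is a sketch under an assumption: it uses the simplified bounded-latency model that is common in stream processors, in which the watermark trails the largest event time seen so far by the configured latency, and a record whose event time is not later than the current watermark counts as late. This topic does not state that DTS applies exactly this rule. The function name `classify_events` and the sample values are hypothetical.

```python
from typing import Iterable, List, Tuple


def classify_events(event_times: Iterable[int], latency: int) -> List[Tuple[int, str]]:
    """Label each event 'on time' or 'late' under a bounded-latency watermark.

    Assumption (not confirmed for DTS): watermark = largest event time seen - latency.
    """
    results = []
    max_seen = None  # largest event time observed so far
    for t in event_times:
        # The watermark is computed from events that arrived before this one.
        watermark = None if max_seen is None else max_seen - latency
        if watermark is not None and t <= watermark:
            results.append((t, "late"))
        else:
            results.append((t, "on time"))
        max_seen = t if max_seen is None else max(max_seen, t)
    return results


# Event times in seconds, arriving out of order; the accepted latency is 5 seconds.
for event_time, label in classify_events([10, 12, 9, 20, 14, 16], latency=5):
    print(event_time, label)
```

In this run, only the event with time 14 is labeled late: it arrives after the event with time 20, when the watermark has already advanced to 15. A larger latency tolerates more disorder at the cost of waiting longer for results.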

Result

If the exclamation mark icon is not displayed on the right side of the node, the source database is configured.
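The dependencies between the settings in this procedure can also be summarized as a checklist in code. The following Python sketch is hypothetical: the `SourceNodeSettings` class, its field names, and the `validate` function are invented for illustration and do not correspond to a DTS API or to the logic behind the icon. It encodes only rules stated in this topic (the valid Node Type values, Convert Format being required for a stream table, and time attributes applying only to a stream table), plus one labeled assumption that at least one table must be selected.

```python
from dataclasses import dataclass, field
from typing import List, Optional

NODE_TYPES = {"Stream Table", "Dimension Table"}
CONVERT_FORMATS = {"Upsert Stream", "Append-Only Stream"}


@dataclass
class SourceNodeSettings:
    """Hypothetical container mirroring the Node Settings and Time Attribute tabs."""
    data_source_name: str
    region: str
    instance: str
    node_type: str
    tables: List[str] = field(default_factory=list)
    convert_format: Optional[str] = None
    event_time_watermark: Optional[str] = None  # name of the event-time field


def validate(s: SourceNodeSettings) -> List[str]:
    """Return the rule violations found; an empty list means none."""
    problems = []
    if s.node_type not in NODE_TYPES:
        problems.append(f"Unknown Node Type: {s.node_type}")
    if s.node_type == "Stream Table" and s.convert_format not in CONVERT_FORMATS:
        problems.append("Convert Format is required for a Stream Table")
    if s.node_type != "Stream Table" and s.event_time_watermark is not None:
        problems.append("Time attributes apply only to a Stream Table")
    if not s.tables:  # assumption: at least one table must be selected
        problems.append("Select at least one table")
    return problems


# Placeholder values; a stream table is declared without a Convert Format.
settings = SourceNodeSettings(
    data_source_name="orders_source",
    region="China (Hangzhou)",
    instance="my-mysql-instance",
    node_type="Stream Table",
    tables=["orders"],
)
print(validate(settings))  # prints ['Convert Format is required for a Stream Table']
```

Keeping the rules in one function makes the dependency between Node Type and the other settings explicit, which the console otherwise reveals one tab at a time.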

What to do next

Configure transformation components