All Products
Search
Document Center

Data Lake Formation:Access DLF from DataWorks

Last Updated:Sep 23, 2025

This topic describes how to use DataWorks Data Integration to import data into a Data Lake Formation (DLF) catalog using Paimon REST.

Prerequisites

A DataWorks workspace is required. If you have not created one, see Create a workspace.

Create a DLF Paimon catalog

See Get started with DLF.

Add a data source

  1. Go to the Data Sources page.

    1. Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose More > Management Center. On the page that appears, select the desired workspace from the drop-down list and click Go to Management Center.

    2. In the left-side navigation pane of the SettingCenter page, click Data Sources.

  1. On the Data Sources page, click Add Data Source. In the Add Data Source dialog box, search for Data Lake Formation and click Data Lake Formation 2.0 (DLF 2.0). On the Add Data Lake Formation 2.0 (DLF 2.0) Data Source page, configure the parameters that are described in the following table to add a DLF 2.0 data source.

    Parameter

    Description

    Data Source Name

    Specify a name for the data source based on your business requirements. The name must be unique within the current workspace. The name can contain only letters, digits, and underscores (_) and must start with a letter.

    Configuration Mode

    The mode in which you want to add the data source. The value of this parameter can be only Alibaba Cloud Instance Mode.

    Access Identity

    The identity that you want to use to access the data source in DataWorks. Valid values:

    • Alibaba Cloud Account

    • Alibaba Cloud RAM User

    • Alibaba Cloud RAM Role

    You can select a value based on your business requirements.

    Note

    If you select a RAM account or RAM role as the access identity, do the following:

    • Grant the RAM account or RAM role the permissions to perform required actions using REST APIs.

    • In the Data Lake Formation console, grant the Data Editor permission on the tables to be synchronized.

    DLF Catalog

    The name of a DLF catalog. Select DLF catalogs that reside in the same region as the DataWorks workspace.

    Database Name

    The name of a database that belongs to the DLF catalog.

    After the parameters are configured, you must test the network connectivity between the data source and a serverless resource group in the Connection Configuration section. If the network connectivity test is successful, you can click Complete Creation. If the network connectivity test fails, you can refer to the topics in the Network connectivity directory for troubleshooting.

Create a synchronization task

After a DLF 2.0 data source is added to DataWorks, you can configure a synchronization task to synchronize data to the data source. For more information, see the topics in the Synchronize data to DLF 2.0 directory.