This topic describes how to configure an HDFS connection in DataWorks.

Procedure

  1. Log on to the DataWorks console. In the Workspaces section, find the target workspace and click Data Integration.
  2. On the Data Integration page, click Connection in the left-side navigation pane. On the page that appears, click Add Connection.
  3. In the Add Connection dialog box, click HDFS.
  4. In the Add HDFS Connection dialog box, configure the parameters as prompted.
    Create a connection
    Parameter Description
    Connection Name The name of the connection. The name can contain letters, digits, and underscores (_). It cannot start with a digit or underscore (_).
    Description The description of the connection. The description can be up to 80 characters in length.
    DefaultFS The address of the NameNode of the HDFS, in the hdfs://ServerIP:Port format.
  5. Click Test Connection.
  6. After the connection passes the connectivity test, click Complete.

    Description of connectivity test

    • If the user-created data source is hosted on an ECS instance in a classic network, network connectivity is not guaranteed when nodes are deployed in the default resource group. We recommend that you use custom resource groups.
    • Connectivity testing is not supported for data sources in VPCs. You can click Complete without testing the connectivity.