All Products
Search
Document Center

E-MapReduce:Data source management

Last Updated:Aug 16, 2023

You can use the data source management features of E-MapReduce (EMR) Workflow to configure data sources to meet your different requirements for data storage and access. This topic describes how to create, modify, and delete a data source.

Limits

The cluster in which the data source resides and the cluster used to run a workflow must be deployed in the same virtual private cloud (VPC).

Create a data source

  1. Go to the Datasource page.

    1. Log on to the EMR console.

    2. In the left-side navigation pane, choose EMR Studio > Workflow.

    3. Click the Datasource tab.

  2. On the Datasource tab, click Create DataSource.

  3. In the CreateDataSource dialog box, configure the parameters. The following table describes the parameters.

    Parameter

    Required

    Description

    DataSource

    Yes

    The type of the data source. Valid values: HIVE/IMPALA and PRESTO.

    Datasource Name

    Yes

    The name of the data source.

    Description

    No

    The description of the data source.

    IP

    Yes

    The IP address of the data source.

    Port

    Yes

    The port number of the data source. Default value: 10000.

    User Name

    Yes

    The username that is used to connect to the data source.

    Password

    No

    The password that is used to connect to the data source.

    Catalog Name

    No

    The name of the catalog that is used to connect to the data source.

    Note

    This parameter is required only if you set the DataSource parameter to PRESTO.

    Database Name

    Yes

    The name of the database that is connected in the data source.

    jdbc connect parameters

    No

    The parameter settings that are used to connect to the data source. Specify the parameters in the format of {"key1":"value1","key2":"value2"...}.

  4. Click Confirm.

    You can perform the following operations on the created data sources:

    1. Modify a data source: Click the image..png icon in the Operation column of the data source to be modified.

    2. Delete a data source: Click the image..png icon in the Operation column of the data source to be deleted.