All Products
Search
Document Center

DataWorks:Configure the Kyuubi connection information

Last Updated:Mar 26, 2026

After you register an E-MapReduce (EMR) cluster with DataWorks, configure the Kyuubi connection information to control how DataWorks authenticates when running Kyuubi tasks. You can use the default EMR identity or a custom username and password.

Background information

Apache Kyuubi is a distributed and multi-tenant gateway that provides query services, such as SQL queries, for data lake query engines such as Spark, Flink, and Trino. For more information, see Kyuubi.

Prerequisites

Before you begin, make sure you have:

  • Added the Kyuubi service to your EMR cluster. For more information, see Add the Kyuubi service.

  • Attached the EMR cluster as a DataWorks computing resource and completed resource group initialization. For more information, see Data Development (new version): Attach an EMR computing resource.

Note

You must complete resource group initialization when attaching the EMR computing resource. Without it, the Kyuubi configuration page is unavailable.

Configure the Kyuubi connection information

  1. Go to the Kyuubi configuration page.

    1. Log on to the DataWorks console. In the top navigation bar, select a region. In the left-side navigation pane, choose More > Management Center.

    2. On the page that appears, select the target workspace from the drop-down list and click Go to Management Center.

    3. In the left-side navigation pane, click Computing Resources.

    4. Find the target EMR cluster and click Kyuubi Configuration > Edit Kyuubi Configuration.

  2. Select a connection mode.

    Connection mode Description When to use
    Connection Information of Alibaba Cloud EMR Cluster Uses the Default Access Identity you specified when registering the EMR cluster. Default. Use this when the cluster's built-in identity is sufficient for your tasks.
    Custom Configuration Information Uses a custom username and password to log on to Kyuubi via JDBC. Use this when you need a dedicated identity or custom credentials for Kyuubi access.
  3. (Optional) If you selected Custom Configuration Information, configure the JDBC URL. The JDBC URL format is:

    jdbc:hive2://host:port/;user=<logon username>;password=<logon password>
    Note

    The first time you select Custom Configuration Information, the JDBC URL is automatically populated based on the account information you configured when registering the EMR cluster. You can modify the pre-filled URL.

    How `DATAWORKS_PROXY_USER` affects the JDBC URL If you selected Pass Proxy User Information when registering the EMR cluster, DataWorks appends hive.server2.proxy.user configuration to the JDBC URL each time an EMR task runs. The behavior depends on whether you include the DATAWORKS_PROXY_USER placeholder in the URL:

    Scenario Behavior
    DATAWORKS_PROXY_USER is not in the JDBC URL DataWorks appends the hive.server2.proxy.user value to the end of the URL at task runtime.
    DATAWORKS_PROXY_USER is in the JDBC URL DataWorks replaces the placeholder with the hive.server2.proxy.user value at task runtime.

What's next

Follow the Data development process guide to configure component environments and run data development tasks in DataWorks.