The DLF-Auth component is provided by Data Lake Formation (DLF). You can use DLF-Auth to enable the data permission feature of DLF. DLF-Auth allows you to implement fine-grained permission control on databases, tables, columns, and functions. This way, you can manage data permissions on data lakes in a centralized manner. This topic describes how to enable DLF-Auth to manage permissions.
Background information
DLF is a fully managed service that helps you build cloud-based data lakes. DLF provides centralized permission management and metadata management for cloud-based data lakes. For more information about DLF, see
Overview.

Prerequisites
- An E-MapReduce (EMR) cluster is created. For more information, see Create a cluster.
Note When you create an EMR cluster, select DLF Unified Metadata as Metadata in the Software Configuration step.
- The data permission management feature of DLF is enabled. To enable the feature, you need to enable permission control on related catalogs after you activate DLF.
Limits
- DLF allows only RAM users to manage permissions. Therefore, you must use the user management feature to add a RAM user in the EMR console.
- For more information about the regions in which you can use the data permission management feature of DLF, see Supported regions and endpoints.
- If you enable Hive or Spark for DLF-Auth, you cannot enable or disable Hive or Spark for Ranger. If you enable Hive or Spark for Ranger, you cannot enable or disable Hive or Spark for DLF-Auth.
- The following table describes the EMR versions and compute engines that are supported by DLF-Auth.
EMR major version Hive Spark Presto Impala EMR V3.X EMR V3.39.0 and earlier minor versions Not supported Not supported Not supported Not supported EMR-3.40.0 Supported Supported Supported Not supported EMR V3.41.0 to EMR V3.43.1 Supported Supported Not supported Not supported EMR V5.X EMR V5.5.0 and earlier minor versions Not supported Not supported Not supported Not supported EMR-5.6.0 Supported Supported Supported Not supported EMR V5.7.0 to EMR V5.9.1 Supported Supported Not supported Not supported
Procedure
This section describes how to enable DLF-Auth to implement fully managed and centralized permission management on data lakes.
- Step 1: Enable DLF-Auth to manage Hive permissions
- Step 2: Add a RAM user
- Step 3: Authenticate the permissions of the RAM user
- (Optional) Step 4: Enable LDAP authentication for Hive
After you enable DLF-Auth to manage permissions, we recommend that you enable Lightweight Directory Access Protocol (LDAP) authentication for Hive. This way, all users who connect to Hive can run related scripts after they pass LDAP authentication.
Step 1: Enable DLF-Auth to manage Hive permissions
- Go to the DLF-Auth service page.
- Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.
- In the top navigation bar, select the region where your cluster resides and select a resource group based on your business requirements.
- Click the Cluster Management tab.
- On the EMR on ECS page, find the cluster that you want to manage and click Services in the Actions column.
- On the Services tab, click Status in the DLF-Auth section.
- Enable DLF-Auth to manage Hive permissions.
- In the Components section of the DLF-Auth service page, find DLFAuthRuntime and click enableHive in the Actions column.
- In the dialog box that appears, enter a reason in the Execution Reason field and click OK.
- In the Confirm message, click OK.
- Restart HiveServer.
- On the Services tab, click the HIVE service.
- In the Components section of the Hive service page, find HiveServer, move the pointer over the icon in the Actions column, and then select Restart.
- In the dialog box that appears, enter a reason in the Execution Reason field and click OK.
- In the Confirm message, click OK.
Step 2: Add a RAM user
You can add a RAM user by using the user management feature.
Step 3: Authenticate the permissions of the RAM user
- Authenticate the permissions of the RAM user before you grant permissions to the RAM user.
- Grant permissions to the RAM user.
- Authenticate the permissions that are granted to the RAM user.
Query the information about the table by referring to Step 1. The query is successful because the RAM user is granted the Select permission.
(Optional) Step 4: Enable LDAP authentication for Hive
- Go to the Services tab.
- Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.
- In the top navigation bar, select the region where your cluster resides and select a resource group based on your business requirements.
- On the EMR on ECS page, click Services in the Actions column of the cluster that you want to manage.
- Enable Lightweight Directory Access Protocol (LDAP) authentication.
FAQ
Q: If I use multiple catalogs, how do I configure the ID of each catalog in DLF-Auth?
A: You can perform the following steps to configure the ID of a catalog in DLF-Auth.
Note The Presto compute engine is not supported in this version. Therefore, you do not need to configure the Presto compute engine.
- Go to the DLF-Auth service page.
- Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.
- In the top navigation bar, select the region where your cluster resides and select a resource group based on your business requirements.
- Click the Cluster Management tab.
- On the EMR on ECS page, find the cluster that you want to manage and click Services in the Actions column.
- On the Services tab, click Configure in the DLF-Auth section.
- Configure the Hive compute engine.
You can configure the Hive compute engine based on your business requirements.
- Configure the Spark compute engine.
You can configure the Spark compute engine based on your business requirements.