Data Lake Analytics (DLA) CU Edition allows you to access a self-managed Hive metastore. This topic describes how to use DLA to access and query Hive metastore data from Hadoop Distributed File System (HDFS).
Prerequisites
- DLA CU Edition is activated. For more information, see Use the DLA Presto-compatible Presto CU edition.
Note
- When you create a virtual cluster (VC), make sure that the network of the data source, the Hive metastore, and the HDFS cluster reside in the same virtual private cloud (VPC).
- DLA is not allowed to access HDFS clusters for which Kerberos authentication is enabled. If Kerberos authentication is enabled for your HDFS cluster, submit a ticket.
- A database and a table are created in the Hive metastore. Data is inserted into the
table. Sample statements:
CREATE DATABASE testDb; CREATE EXTERNAL TABLE if not exists testDb.testTable( id int, name string); insert into testDb.testTable(id, name) values (1, "jack");