All Products
Search
Document Center

Quick BI:Add a Hive Data Source

Last Updated:Apr 28, 2024

This topic describes how to create a Hive data source.

Limits

Apache hive 2.0.1 to 2.3.3 is supported.

Prerequisites

  • Your network meets the following requirements:

    • If you Quick BI connect to the Hive database over the Internet, add the Quick BI IP address to the whitelist of the Hive database. For more information, see Add security group rules.

    • If you want to connect Quick BI to the Hive database over an internal network, you can deploy a jump server and access the database over an SSH tunnel.

  • A Hive database is created.

  • The username and password that you use to access the Hive database are obtained.

Procedure

  1. Log on to the Quick BI console.

  2. Follow these steps to add a data source.

    1. Go to the Create Data Source page.

    2. Click Show More.

      image

    3. Select Hive Data Source.

      image

  3. In the Configure Connection dialog box, set the following parameters.

    Feature

    Description

    Name

    The name of the data source. The name is displayed in the data source list.

    The name cannot contain special characters or start or end with spaces.

    Database address

    The address where the Hive database is deployed, including the IP address or domain name.

    Port Number

    The port number of the database that you want to access. Default value: 10000.

    Instance

    The name of the database that you want to access.

    Username

    The username that is used to connect to the database.

    Password

    The password that is used to connect to the database.

    SSH

    Specifies whether to access the data source over an SSH tunnel based on your business requirements. If you select SSH, you must configure the following parameters:

    • SSH Host: hostname or IP address.

    • SSH Username: the username that is used to log on to the SSH host.

    • SSH Password: the password that is used to log on to the SSH host.

    • SSH Port Number: the port number that is used to connect to the SSH host. Default value: 22.

    Initialize SQL statements

    If you select this check box, you can set the initialization SQL.

    The SQL statement that is initialized and executed after each data source connection. Only SET statements are allowed. The statements are separated by semicolons.

    VPC Data Source

    If the database is deployed on a ECS and is deployed in a VPC, select VPC data source and configure the parameters. If you select VPC Data Source, you must specify the internal endpoint and port number of the database.

    • AccessKey ID and AccessKey: the AccessKey ID and AccessKey secret of the purchased instance.

    • Instance ID: the ID of the ECS instance.

    • Region: the region in which the ECS instance is deployed.

  4. Click Test Connection to verify that the data source can be connected.

    image

  5. Click OK.

What to do next

After you add a data source, you can create a dataset and analyze data.