This topic describes how to create a Hive data source.
Limits
Apache hive 2.0.1 to 2.3.3 is supported.
Prerequisites
Your network meets the following requirements:
If you Quick BI connect to the Hive database over the Internet, add the Quick BI IP address to the whitelist of the Hive database. For more information, see Add security group rules.
If you want to connect Quick BI to the Hive database over an internal network, you can deploy a jump server and access the database over an SSH tunnel.
A Hive database is created.
The username and password that you use to access the Hive database are obtained.
Procedure
Log on to the Quick BI console.
Follow these steps to add a data source.
Go to the Create Data Source page.
Click Show More.
Select Hive Data Source.
In the Configure Connection dialog box, set the following parameters.
Feature
Description
Name
The name of the data source. The name is displayed in the data source list.
The name cannot contain special characters or start or end with spaces.
Database address
The address where the Hive database is deployed, including the IP address or domain name.
Port Number
The port number of the database that you want to access. Default value: 10000.
Instance
The name of the database that you want to access.
Username
The username that is used to connect to the database.
Password
The password that is used to connect to the database.
SSH
Specifies whether to access the data source over an SSH tunnel based on your business requirements. If you select SSH, you must configure the following parameters:
SSH Host: hostname or IP address.
SSH Username: the username that is used to log on to the SSH host.
SSH Password: the password that is used to log on to the SSH host.
SSH Port Number: the port number that is used to connect to the SSH host. Default value: 22.
Initialize SQL statements
If you select this check box, you can set the initialization SQL.
The SQL statement that is initialized and executed after each data source connection. Only SET statements are allowed. The statements are separated by semicolons.
VPC Data Source
If the database is deployed on a ECS and is deployed in a VPC, select VPC data source and configure the parameters. If you select VPC Data Source, you must specify the internal endpoint and port number of the database.
AccessKey ID and AccessKey: the AccessKey ID and AccessKey secret of the purchased instance.
Instance ID: the ID of the ECS instance.
Region: the region in which the ECS instance is deployed.
Click Test Connection to verify that the data source can be connected.
Click OK.
What to do next
After you add a data source, you can create a dataset and analyze data.
For more information about how to add data tables in the Hive database or self-managed SQL statements to the Quick BI, see Create and manage datasets.
You can add charts for data analysis. For more information, see Create a dashboard and Overview of visualization charts.
Drill down the data for in-depth data analysis. For more information, see the Configure the drilling feature section of the "Drilling" topic.