This topic describes how to add a self-managed Impala data source. Only Quick BI Enterprise Standard and Quick BI Pro allow you to add self-managed Impala data sources to group workspaces.

Prerequisites

  • Make sure that your network meets the following requirements:
    • If you use the Internet to connect Quick BI to your Impala database, you must add the CIDR blocks of Quick BI to the whitelist of the Impala database. For more information, see Add security group rules.

      The CIDR blocks of Quick BI are 10.152.69.0/24, 10.152.163.0/24, and 139.224.4.0/24.

    • If you use an internal network to connect Quick BI to the Impala database, you must use one of the following methods to ensure the connectivity between Quick BI and the self-managed Impala data source:
      • If the Impala database is deployed on an Elastic Compute Service (ECS) instance, you can use a virtual private cloud (VPC) to connect to the data source.
      • You can deploy a jump server and connect to the database over an SSH tunnel.
  • A self-managed Impala database is created.
  • The username and password that are used to log on to the Impala database are obtained.
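
The whitelist requirement for Internet connections can be checked before you open the Quick BI console. The following is a minimal sketch that uses only the Python standard library and the three CIDR blocks listed above. The function names `is_quick_bi_address` and `missing_cidrs` are hypothetical helpers for illustration and are not part of Quick BI or Impala. The sketch assumes that your whitelist is available as a list of IPv4 addresses or CIDR strings.

```python
import ipaddress

# CIDR blocks of Quick BI, copied from the prerequisites above.
QUICK_BI_CIDRS = ["10.152.69.0/24", "10.152.163.0/24", "139.224.4.0/24"]


def is_quick_bi_address(ip):
    """Return True if the IP address falls inside a Quick BI CIDR block."""
    addr = ipaddress.ip_address(ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in QUICK_BI_CIDRS)


def missing_cidrs(whitelist):
    """Return the Quick BI CIDR blocks that no whitelist entry covers.

    Assumption: each whitelist entry is an IPv4 address or CIDR string.
    Non-IPv4 entries are ignored.
    """
    allowed = [ipaddress.ip_network(entry, strict=False) for entry in whitelist]
    allowed = [net for net in allowed if net.version == 4]
    return [
        cidr
        for cidr in QUICK_BI_CIDRS
        if not any(ipaddress.ip_network(cidr).subnet_of(net) for net in allowed)
    ]
```

For example, `missing_cidrs(["10.152.0.0/16"])` reports that `139.224.4.0/24` still needs to be added, because the `/16` entry covers only the two `10.152.x.0/24` blocks.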

Procedure

  1. Add a data source, as shown in the following figure.
    Figure: Add a data source
  2. In the Add Impala Database dialog box, configure the parameters.
    Parameter | Description
    Name | The name of the data source. The name appears in the list of data sources. For example, enter self-managed data source Impala.
    Database Address | The IP address of the database.
    Port Number | The port number that is used to connect to the database. Default value: 21050.
    Database | The name of the database to connect to. For example, enter Impala.
    Username | The username that is used to log on to the database. For example, enter root.
    Password | The password that is used to log on to the database.
    User-created ECS Data Source (VPC) | Select this option if you use a VPC to connect to the data source. If you select User-created ECS Data Source (VPC), configure the following parameters:
      • AccessKey ID: the AccessKey ID that you used to purchase the instance on which the database is deployed.
      • AccessKey Secret: the AccessKey secret that you used to purchase the instance.
      • Instance ID: the ID of the instance.
      • Region: the region in which the instance is deployed. For example, select cn-qingdao.
    ssh | Select this option if you want to connect to the data source over an SSH tunnel. If you select ssh, configure the following parameters:
      • SSH Host: the hostname or IP address of the SSH host.
      • SSH Username: the username that is used to log on to the SSH host.
      • SSH Password: the password that is used to log on to the SSH host.
      • SSH Port Number: the port number that is used to connect to the SSH host. Default value: 22.
      Note: Only Quick BI Enterprise Standard allows you to connect to self-managed data sources in group workspaces over SSH tunnels.
    Auth | Username and Password are required only if you select this option.
  3. Click Test Connection to verify that Quick BI can connect to the data source.
    Figure: Verify that the data source can be connected
  4. Click OK.
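
The dependencies between the parameters in step 2 can be sketched as a small validation routine. The following example is illustrative only: `ImpalaSourceConfig` and `missing_parameters` are hypothetical names, not a Quick BI API. The sketch assumes that Name, Database Address, and Database are always required, that Username and Password are required only when Auth is selected, and that the VPC and SSH parameters are required only when the corresponding option is selected. The default port numbers are taken from the parameter descriptions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ImpalaSourceConfig:
    """Hypothetical container that mirrors the dialog box parameters."""
    name: str
    database_address: str
    database: str
    port: int = 21050                      # default Port Number
    auth: bool = False                     # the Auth option
    username: Optional[str] = None
    password: Optional[str] = None
    use_vpc: bool = False                  # User-created ECS Data Source (VPC)
    access_key_id: Optional[str] = None
    access_key_secret: Optional[str] = None
    instance_id: Optional[str] = None
    region: Optional[str] = None
    use_ssh: bool = False                  # the ssh option
    ssh_host: Optional[str] = None
    ssh_username: Optional[str] = None
    ssh_password: Optional[str] = None
    ssh_port: int = 22                     # default SSH Port Number


def missing_parameters(cfg: ImpalaSourceConfig) -> List[str]:
    """Return the labels of parameters that are needed but left empty."""
    required = {
        "Name": cfg.name,
        "Database Address": cfg.database_address,
        "Database": cfg.database,
    }
    if cfg.auth:
        required.update({"Username": cfg.username, "Password": cfg.password})
    if cfg.use_vpc:
        required.update({
            "AccessKey ID": cfg.access_key_id,
            "AccessKey Secret": cfg.access_key_secret,
            "Instance ID": cfg.instance_id,
            "Region": cfg.region,
        })
    if cfg.use_ssh:
        required.update({
            "SSH Host": cfg.ssh_host,
            "SSH Username": cfg.ssh_username,
            "SSH Password": cfg.ssh_password,
        })
    # Dictionaries keep insertion order, so labels follow the dialog box order.
    return [label for label, value in required.items() if not value]
```

A configuration that sets only the name, address, and database passes the check with the default ports. Selecting ssh without filling in the SSH credentials causes `missing_parameters` to list the empty SSH fields.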

What to do next

After you add a data source, you can create a dataset and analyze data.
  • You can add tables from the self-managed Impala data source to Quick BI, or use user-defined SQL statements for ad hoc analysis. For more information, see Create a dataset.
  • For more information about how to add visual charts and analyze data, see Create a dashboard and Overview.
  • For more information about how to drill into data for in-depth analysis, see Configure the drilling feature.