All Products
Search
Document Center

Dataphin:Add a Hologres data source

Last Updated:May 29, 2025

Creating a Hologres data source enables Dataphin to read business data from Hologres and write data to Hologres. In scenarios where you need to import business data from Hologres to Dataphin or write data from Dataphin to Hologres, you must first create a Hologres data source. This topic describes how to create a Hologres data source.

Background information

Hologres is an interactive analytics service developed by Alibaba. If you use Hologres and want to connect it to Dataphin for data development, you must first create a Hologres data source.

Permissions

Only users who have the Create Data Source permission point in a custom global role and users who have the super administrator, data source administrator, domain architect, or project administrator role can create data sources.

Procedure

  1. In the top navigation bar of the Dataphin homepage, choose Management Center > Datasource Management.

  2. On the Datasource page, click +Create Data Source.

  3. On the Create Data Source page, select Hologres in the Big Data section.

    If you have recently used Hologres, you can also select Hologres in the Recently Used section. You can also enter keywords in the search box to quickly filter for Hologres.

  4. On the Create Hologres Data Source page, configure the connection parameters.

    1. Configure the basic information of the data source.

      Parameter

      Description

      Datasource Name

      Enter a name for the data source. The name must meet the following requirements:

      • The name can contain only Chinese characters, letters, digits, underscores (_), and hyphens (-).

      • The name cannot exceed 64 characters in length.

      Datasource Code

      After you configure the data source code, you can reference tables in the data source in a Flink_SQL task by using the data source code.table name or data source code.schema.table name format. If you want to automatically access the data source in the corresponding environment based on the current environment, use the variable format ${data source code}.table or ${data source code}.schema.table. For more information, see Development method for Dataphin data source tables.

      Important
      • The data source code cannot be modified after it is configured successfully.

      • After the data source code is configured successfully, you can preview data on the object details page in the asset directory and asset inventory.

      • In Flink SQL, only MySQL, Hologres, MaxCompute, Oracle, StarRocks, Hive, and SelectDB data sources are currently supported.

      Data Source Description

      Enter a brief description of the data source. The description cannot exceed 128 characters in length.

      Data Source Configuration

      Select the data source that you want to configure:

      • If your business data source distinguishes between production and development data sources, select Production + Development Data Source.

      • If your business data source does not distinguish between production and development data sources, select Production Data Source.

      Tag

      You can categorize and tag data sources based on tags. For information about how to create tags, see Manage data source tags.

    2. Configure the connection parameters between the data source and Dataphin.

      If you select Production + Development data source for your data source configuration, you need to configure the connection information for the Production + Development data source. If your data source configuration is Production data source, you only need to configure the connection information for the Production data source.

      Note

      In most cases, the production data source and development data source should be configured as different data sources to isolate the development environment from the production environment and reduce the impact of the development data source on the production data source. However, Dataphin also supports configuring them as the same data source with identical parameter values.

      Parameter

      Description

      Endpoint

      The endpoint of the Hologres instance. Select the appropriate endpoint based on your network environment and access method. For information about how to obtain the endpoint, see Endpoints.

      DBName

      You can view the database name on the Database Management page in the Hologres console. For more information about how to obtain the database name, see Manage databases.

      Schema

      Enter the schema that you want to access.

      Access Id, Access Key

      Enter the authentication Access ID and Access Key. To ensure that tasks can be executed properly, make sure you have the required data permissions.

    3. Configure advanced settings for the connection between the data source and Dataphin.

      Connection Retries: If the database connection times out, the system automatically retries the connection until the specified number of retries is reached. If the connection still fails after the maximum number of retries, the connection fails.

      Note
      • The default number of retries is 1, and you can configure a value between 0 and 10.

      • The connection retry count is applied by default to offline integration tasks and global quality (requires the Data Quality module to be activated). You can configure task-level retry counts separately in offline integration tasks.

  5. Select a Default Resource Group, which is used to run tasks related to the current data source, including database SQL, offline database migration, data preview, and more.

  6. Click Test Connection or directly click OK to save and complete the creation of the Hologres data source.

    When you click Test Connection, the system tests whether the data source can connect to Dataphin properly. If you directly click OK, the system automatically tests the connection for all selected clusters. However, even if all selected clusters fail the connection test, the data source can still be created normally.