Dataphin: Create a SelectDB data source

Last Updated: Feb 12, 2026

By creating a SelectDB data source, you can enable Dataphin to read business data from SelectDB or write data to SelectDB. This topic describes how to create a SelectDB data source.

Background information

SelectDB is a data warehouse widely used for real-time report analysis, ad hoc multidimensional analysis, log retrieval and analysis, data federation, and query acceleration. It aims to provide high-performance, easy-to-use data analysis services.

Permissions

Only custom global roles with the Create Data Source permission and system roles such as super administrator, data source administrator, domain architect, and project administrator can create data sources.

Procedure

  1. On the Dataphin homepage, click Management Hub > Datasource Management in the top navigation bar.

  2. On the Datasource page, click +Create Data Source.

  3. On the Create Data Source page, select SelectDB from the Big Data section.

    If you have recently used SelectDB, you can also select it from the Recently Used section, or enter SelectDB in the search box to find it quickly.

  4. On the Create SelectDB Data Source page, configure the connection parameters.

    1. Configure the basic information of the data source.

      Parameter

      Description

      Datasource Name

      The name must meet the following requirements:

      • It can contain only Chinese characters, letters, digits, underscores (_), and hyphens (-).

      • It cannot exceed 64 characters in length.

      Datasource Code

      After you configure the data source code, you can reference tables in the data source in Flink_SQL tasks using the format data_source_code.table or data_source_code.schema.table. To automatically access the data source in the corresponding environment based on the current environment, use the format ${data_source_code}.table or ${data_source_code}.schema.table. For more information, see Flink_SQL task development.

      Important
      • The data source code cannot be modified after it is configured.

      • You can preview data on the object details page in the asset directory and asset checklist only after the data source code is configured.

      • In Flink SQL, only MySQL, Hologres, MaxCompute, Oracle, StarRocks, Hive, SelectDB, and GaussDB data warehouse service (DWS) data sources are currently supported.

      Kernel Version

      You can select Cloud Core V3.0.x (based On Apache Doris V2.0.x) or Cloud Core V4.0.x (based On Apache Doris V3.0.x).

      Data Source Description

      Enter a brief description of the data source. The description cannot exceed 128 characters.

      Data Source Configuration

      Select the data source to configure:

      • If your business data source distinguishes between production and development data sources, select Production + Development Data Source.

      • If your business data source does not distinguish between production and development data sources, select Production Data Source.

      Tag

      Use tags to categorize data sources. For information on how to create tags, see Manage data source tags.
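The naming rules and table reference formats described above can be sketched in Python as follows. This is an illustrative sketch, not Dataphin's own validation logic: the function names are hypothetical, "Chinese characters" is approximated by the basic CJK Unified Ideographs range, and rejecting an empty name is an assumption.

```python
import re
from typing import Optional

# Assumption: "Chinese characters" is approximated by U+4E00..U+9FFF.
# Letters, digits, underscores, and hyphens are allowed; 1 to 64 characters.
_NAME_RE = re.compile(r"[\u4e00-\u9fffA-Za-z0-9_-]{1,64}")


def is_valid_datasource_name(name: str) -> bool:
    """Return True if the name satisfies the documented character and length rules."""
    return _NAME_RE.fullmatch(name) is not None


def table_reference(code: str, table: str,
                    schema: Optional[str] = None,
                    env_aware: bool = False) -> str:
    """Build a Flink SQL table reference from a data source code.

    With env_aware=True the code is wrapped as ${code}, the documented form
    for resolving the data source that matches the current environment.
    """
    prefix = "${" + code + "}" if env_aware else code
    parts = [prefix]
    if schema:
        parts.append(schema)
    parts.append(table)
    return ".".join(parts)
```

For example, with a hypothetical data source code `sdb`, `table_reference("sdb", "orders")` gives `sdb.orders`, and `table_reference("sdb", "orders", schema="sales", env_aware=True)` gives `${sdb}.sales.orders`.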

    2. Configure the connection parameters between the data source and Dataphin.

      If you selected Production + Development Data Source for Data Source Configuration, configure the connection information for both the production and the development data source. If you selected Production Data Source, configure the connection information for the production data source only.

      Note

      Typically, production and development data sources should be configured as separate data sources to achieve environment isolation and reduce the impact of development activities on production. However, Dataphin also supports configuring them as the same data source with identical parameter values.

      Parameter

      Description

      JDBC URL

      Enter the JDBC URL for SelectDB, in the format jdbc:mysql://host:port/dbname.

      Username, Password

      The username and password for logging in to the database.

      FE Node URL

      The host and HTTP port of each frontend (FE) node, in the format fe_host:http_port. If there are multiple FE nodes, separate them with commas (,), for example fe_host:http_port,fe_host:http_port. The default HTTP port for SelectDB is 8030.
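The two address formats above can be illustrated with a short Python sketch. The helper names and the example hosts are hypothetical, and the strictness of the parsing (for example, rejecting entries without a numeric port) is an assumption rather than documented Dataphin behavior.

```python
from typing import List, Tuple


def build_jdbc_url(host: str, port: int, dbname: str) -> str:
    """Assemble a JDBC URL in the documented form jdbc:mysql://host:port/dbname."""
    return f"jdbc:mysql://{host}:{port}/{dbname}"


def parse_fe_nodes(value: str) -> List[Tuple[str, int]]:
    """Split a comma-separated FE Node URL value into (host, http_port) pairs."""
    nodes = []
    for item in value.split(","):
        item = item.strip()
        # Split on the last colon so the port is always the final component.
        host, sep, port = item.rpartition(":")
        if not sep or not host or not port.isdigit():
            raise ValueError(f"expected fe_host:http_port, got {item!r}")
        nodes.append((host, int(port)))
    return nodes
```

For instance, `parse_fe_nodes("fe1.example.com:8030,fe2.example.com:8030")` yields two pairs, each with port 8030. Checking the values this way before pasting them into the form can catch a missing port or a stray separator early.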

    3. Configure advanced settings for the data source.

      Parameter

      Description

      connectTimeout

      The connection timeout for the database, in milliseconds. The default is 900000 milliseconds (15 minutes).

      Note
      • If the JDBC URL already includes a connectTimeout setting, the value in the JDBC URL is used.

      • For data sources created before Dataphin V3.11, the default connectTimeout is -1, which means no timeout limit.

      socketTimeout

      The socket timeout for the database, in milliseconds. The default is 1800000 milliseconds (30 minutes).

      Note
      • If the JDBC URL already includes a socketTimeout setting, the value in the JDBC URL is used.

      • For data sources created before Dataphin V3.11, the default socketTimeout is -1, which means no timeout limit.

      Connection Retries

      If the database connection times out, the system will automatically retry the connection until the specified number of retries is reached. If the connection still fails after the maximum number of retries, the connection is considered failed.

      Note
      • The default number of retries is 1. You can configure a value between 0 and 10.

      • By default, the connection retry count applies to offline integration tasks and to global quality (which requires the asset quality function module to be enabled). In offline integration tasks, you can configure a separate task-level retry count.
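The interaction between these settings can be sketched in Python. This is a hedged illustration, not Dataphin's implementation: the function names are hypothetical, the `configured` argument stands in for values entered in the advanced settings, and treating "N retries" as one initial attempt plus up to N further attempts is an assumption.

```python
from typing import Callable, Dict, Optional, TypeVar
from urllib.parse import parse_qs

T = TypeVar("T")

# Documented defaults, in milliseconds.
DEFAULT_TIMEOUTS = {"connectTimeout": 900_000, "socketTimeout": 1_800_000}


def effective_timeouts(jdbc_url: str,
                       configured: Optional[Dict[str, int]] = None) -> Dict[str, int]:
    """Resolve timeouts: defaults, then configured values, then JDBC URL parameters."""
    result = dict(DEFAULT_TIMEOUTS)
    result.update(configured or {})
    # Everything after the first "?" is treated as the URL's query string.
    _, _, query = jdbc_url.partition("?")
    params = parse_qs(query)
    for key in DEFAULT_TIMEOUTS:
        if key in params:
            result[key] = int(params[key][0])  # a value in the URL wins
    return result


def connect_with_retries(connect: Callable[[], T], retries: int = 1) -> T:
    """Call connect(); on TimeoutError, retry up to `retries` more times."""
    if not 0 <= retries <= 10:
        raise ValueError("retries must be between 0 and 10")
    attempts = 0
    while True:
        attempts += 1
        try:
            return connect()
        except TimeoutError:
            if attempts > retries:
                raise  # retries exhausted: the connection is considered failed
```

Under these assumptions, a URL ending in `?connectTimeout=5000` resolves to a 5000 ms connection timeout regardless of the configured value, while socketTimeout keeps its configured or default value.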

  5. Select a Default Resource Group. This resource group is used to run tasks related to the current data source, including database SQL, offline database migration, data preview, and more.

  6. Click Test Connection, or click OK directly to save the settings and finish creating the SelectDB data source.

    Test Connection checks whether Dataphin can connect to the data source. If you click OK directly, the system automatically tests the connection for all selected clusters. The data source is still created even if every selected cluster fails the connection test.