Dataphin: Initialize the metadata warehouse using SelectDB or Doris as the metadata warehouse engine

Last Updated: Aug 13, 2025

The Dataphin metadata warehouse is a data warehouse that centrally manages Dataphin's internal business metadata and the metadata of the corresponding compute engine. It resides in a Dataphin project within the metadata warehouse tenant (OPS tenant) and consists of a series of periodic data integration nodes, SQL script nodes, and Shell nodes. Metadata warehouse initialization is the process of configuring the compute engine type for the Dataphin system and initializing its metadata. This topic describes how to initialize the metadata warehouse using SelectDB or Doris as the metadata warehouse compute engine.

Limits

Only accounts with the metadata warehouse tenant super administrator or system administrator role can initialize the system.

Important

Keep the account and password of the metadata warehouse tenant super administrator or system administrator secure. Proceed with caution whenever you are logged on to the system as the metadata warehouse tenant super administrator.

Procedure

  1. In the top menu bar of the Dataphin homepage, choose Management Hub > System Settings.

  2. In the navigation pane on the left, choose System O&M > Metadata Warehouse Settings.

  3. On the Metadata Warehouse Settings configuration wizard page, click Start.

  4. In the step to select the initialization engine type, select SelectDB or Doris as the engine type.

    Important

    If the metadata warehouse has already been initialized, the engine type from the previous successful initialization is selected by default.

  5. Click Next.

  6. On the Parameter Checking page, configure the SelectDB or Doris compute engine parameters.

    Parameter

    Description

    Version

    • SelectDB cluster: You can select Cloud Core V3.0.x (based on Apache Doris V2.1.x) or Cloud Core V4.0.x (based on Apache Doris V3.0.x).

    • Doris cluster: You can select Apache Doris V2.1.x or Apache Doris V3.0.x.

    JDBC URL

    Enter the JDBC connection URL of the target database, in the format jdbc:mysql://host:port/dbname.

    Username, Password

    The username and password of the target database. To ensure that tasks run properly, make sure the user has the required database permissions.

    FE Node URL

    Enter the address of the FE node, which is used to access the FE node through its web server. The format is <FE IP>:<HTTP Port> (the HTTP port is 8030 by default). To configure multiple FE nodes, separate them with commas (,).

    Metadata Production Project

    The logical project space used for metadata production and processing. We recommend that you set it to dataphin_meta. When you reinitialize the metadata warehouse, keep the name unchanged. Otherwise, the initialization fails.

  7. Click Test Connection. After the connection test passes, click Next.

  8. On the initialization page, click Start.

    Note

    System initialization takes approximately 15 minutes. Wait until it is complete.

  9. After the page indicates successful execution, click Complete to finish the configuration.
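The JDBC URL and FE Node URL formats described in step 6 can be checked before you enter them in the wizard. The following Python snippet is a minimal sketch, not part of Dataphin: the helper names parse_jdbc_url and parse_fe_nodes are hypothetical, the host addresses and port 9030 are example values only, and the checks cover only the formats stated in the parameter table (IPv6 addresses are not handled).

```python
import re
from typing import Dict, List, Tuple, Union

# Hypothetical helpers for illustration only; they are not a Dataphin API.
# Format from the parameter table: jdbc:mysql://host:port/dbname
# (an optional ?key=value suffix is tolerated here as an assumption).
_JDBC_PATTERN = re.compile(
    r"jdbc:mysql://(?P<host>[^:/\s]+):(?P<port>\d+)/(?P<dbname>[^/?\s]+)(\?\S*)?"
)


def _check_port(port_text: str) -> int:
    """Convert a port string to int and make sure it is a valid TCP port."""
    port = int(port_text)
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return port


def parse_jdbc_url(url: str) -> Dict[str, Union[str, int]]:
    """Split a jdbc:mysql://host:port/dbname URL into its parts."""
    match = _JDBC_PATTERN.fullmatch(url.strip())
    if match is None:
        raise ValueError(f"expected jdbc:mysql://host:port/dbname, got: {url!r}")
    return {
        "host": match.group("host"),
        "port": _check_port(match.group("port")),
        "dbname": match.group("dbname"),
    }


def parse_fe_nodes(value: str) -> List[Tuple[str, int]]:
    """Parse a comma-separated list of <FE IP>:<HTTP Port> entries."""
    nodes = []
    for item in value.split(","):
        host, sep, port_text = item.strip().partition(":")
        # Both the address and the HTTP port must be present in each entry.
        if not sep or not host or not port_text.isdigit():
            raise ValueError(f"expected <FE IP>:<HTTP Port>, got: {item!r}")
        nodes.append((host, _check_port(port_text)))
    return nodes


if __name__ == "__main__":
    # Example values only; replace them with your own cluster addresses.
    print(parse_jdbc_url("jdbc:mysql://10.0.0.1:9030/dataphin_meta"))
    print(parse_fe_nodes("10.0.0.1:8030, 10.0.0.2:8030"))
```

A check like this only catches formatting mistakes. Whether the database is reachable and the user has the required permissions is still determined by Test Connection in step 7.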

What to do next

After completing the system metadata initialization, you can set up the compute engine for the Dataphin instance. For more information, see Compute settings.