The Dataphin metadata warehouse is a data warehouse that centrally manages Dataphin's internal business metadata and the corresponding compute engine metadata. It resides in a Dataphin project within the Dataphin metadata warehouse tenant (OPS tenant) and consists of a series of periodic data integration nodes, SQL script nodes, and Shell nodes. Metadata warehouse initialization is the process of configuring the compute engine type for the Dataphin system and initializing its metadata. This topic describes how to initialize the metadata warehouse with SelectDB or Doris as the metadata warehouse compute engine.
Limits
Only accounts with the metadata warehouse tenant super administrator or system administrator role can initialize the system.
Keep the account and password of the metadata warehouse tenant super administrator or system administrator secure. Operate with caution whenever you are logged on to the system as the metadata warehouse tenant super administrator.
Procedure
In the top menu bar of the Dataphin homepage, choose Management Hub > System Settings.
In the navigation pane on the left, choose System O&M > Metadata Warehouse Settings.
On the Metadata Warehouse Settings configuration wizard page, click Start.
In the step to select the initialization engine type, select SelectDB or Doris as the engine type.
Important: If the metadata warehouse has already been initialized, the engine type used in the previous successful initialization is selected by default.
Click Next.
On the Parameter Checking page, configure the SelectDB or Doris compute engine parameters.
Parameter
Description
Version
SelectDB cluster: You can select Cloud Core V3.0.x (based on Apache Doris V2.1.x) or Cloud Core V4.0.x (based on Apache Doris V3.0.x).
Doris cluster: You can select Apache Doris V2.1.x or Apache Doris V3.0.x.
JDBC URL
Enter the JDBC connection address of the target database. For example, jdbc:mysql://host:port/dbname.
Username, Password
The username and password of the target database. To ensure that tasks run properly, make sure the user has the required database permissions.
FE Node URL
Enter the address of the FE node, which is used to access the FE node through a web server. The format is <FE IP>:<HTTP Port>, where the HTTP port is 8030 by default. You can configure multiple FE nodes separated by commas (,).
Metadata Production Project
The logical project space used for metadata production and processing. We recommend that you set it to dataphin_meta. When you reinitialize the metadata warehouse, keep the name unchanged. Otherwise, the initialization fails.
Click Test Connection. After the connection test passes, click Next.
On the initialization page, click Start.
Note: System initialization takes approximately 15 minutes. Please wait patiently.
After the page indicates successful execution, click Complete to finish the configuration.
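The JDBC URL and FE Node URL values entered on the Parameter Checking page follow fixed formats, so it can help to check them locally before clicking Test Connection. The following is a minimal sketch in Python, not part of Dataphin: the function names parse_jdbc_url and parse_fe_nodes, the sample addresses, the database name example_db, and port 9030 are illustrative assumptions. The sketch only checks the text format described in the table above and does not contact the database or the FE nodes.

```python
import re

# Assumed shape of the JDBC URL from the parameter table:
#   jdbc:mysql://host:port/dbname
# An optional "?key=value" suffix is tolerated (assumption).
JDBC_RE = re.compile(
    r"^jdbc:mysql://(?P<host>[^/:?\s]+):(?P<port>\d{1,5})/(?P<db>[^/?\s]+)(\?\S*)?$"
)


def _check_port(text, context):
    """Convert a port string to int and verify it is in the valid TCP range."""
    if not text.isdigit():
        raise ValueError(f"port is not a number in {context!r}")
    port = int(text)
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range in {context!r}")
    return port


def parse_jdbc_url(url):
    """Return (host, port, dbname) for a jdbc:mysql://host:port/dbname URL."""
    url = url.strip()
    match = JDBC_RE.match(url)
    if match is None:
        raise ValueError(f"not in jdbc:mysql://host:port/dbname form: {url!r}")
    port = _check_port(match.group("port"), url)
    return match.group("host"), port, match.group("db")


def parse_fe_nodes(value):
    """Split a comma-separated list of <FE IP>:<HTTP Port> entries.

    Returns a list of (host, port) tuples and raises ValueError on any
    entry that does not match the documented format.
    """
    nodes = []
    for item in value.split(","):
        item = item.strip()
        if not item:
            raise ValueError("empty FE node entry")
        host, sep, port_text = item.rpartition(":")
        if not sep or not host:
            raise ValueError(f"expected <FE IP>:<HTTP Port>, got {item!r}")
        nodes.append((host, _check_port(port_text, item)))
    return nodes


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(parse_jdbc_url("jdbc:mysql://10.0.0.1:9030/example_db"))
    print(parse_fe_nodes("10.0.0.1:8030, 10.0.0.2:8030"))
```

A value that passes these checks can still fail the connection test, for example because of network restrictions or missing database permissions, so treat the sketch as a quick format check only.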
What to do next
After completing the system metadata initialization, you can set up the compute engine for the Dataphin instance. For more information, see Compute settings.