A project in Dataphin serves as the fundamental organizational unit, delineating the scope for multi-user isolation and access control. To develop tags, it is necessary to first establish the corresponding tag project. This topic outlines the steps to create a tag project.
Prerequisites
To engage in standardized modeling development, you must first create the relevant data block before initiating a tag item. Absent this, only general development is possible. For more information, see: Create a data block.
Prior to creating a tag item, the corresponding computing source must be established. For details, see Compute engines.
Limits
Dataphin permits only super administrators, system administrators, or block architects to create tag projects.
Only project super administrators, system administrators, or block architects can request permissions to enable or disable data table read and write operations in Dataphin.
Step 1: Select the project type
Navigate to the Dataphin home page and click Planning on the top menu bar.
On the Data Warehouse Planning page, use the operation guide depicted below to access the Tag Project dialog box to create a new project.

Step 2: Configure the project's affiliated block and basic information
In the New Project dialog box, set the Affiliated Block, Basic Information, and Business Information parameters for the tag project.
NoteThe tag project is designated for the Prod environment, akin to a Prod project. It is referred to as a Basic tag project due to the distinct data development processes involved. Within this project, you can create, develop, and submit data objects, which can then be updated to production scheduling and monitoring upon submission.
Area
Parameter Name
Description
Affiliated Block
Data Block
Choose the data block to which the project will be linked. Selecting an affiliated block enables standardized modeling development. Without this selection, only general development is possible.
NoteIn the Dev-Prod mode, standardized modeling objects cannot be created within a Basic project associated with a Prod business block. This restriction ensures the standardization of public layer data and prevents data production issues.
Basic Information
Project English Name
Input the project's English name, adhering to the following naming conventions:
Include letters, numbers, and underscores (_).
Avoid starting with LD_.
Limit to 64 characters or fewer.
Project Name
Enter the project's name, which may include Chinese characters, numbers, letters, underscores (_), or hyphens (-), and must not exceed 64 characters.
May include Chinese characters, numbers, letters, underscores (_), or hyphens (-).
Must not exceed 64 characters in length.
Computing Source Type
Select the type of computing source and then choose the specific source.
ImportantA computing source already associated with a project cannot be linked to another project.
Dataphin integrates with Machine Learning PAI to provide algorithm scheduling capabilities. When setting up a workspace for visual modeling on the machine learning platform, the resource selection is a computing resource group based on MaxCompute. For more information, see Overview of Resource Management. If machine learning is enabled, configure the following parameters:
PAI Region: Choose the region where the Dataphin instance is located.
AccessKey ID, AccessKey Secret: Provide the AccessKey ID and AccessKey Secret for PAI account access.
PAI Project Name: Select the PAI project.
Suggestion: It is advisable to keep the MaxCompute project tied to the current Dataphin project consistent with the MaxCompute project linked to PAI.
Description Information
Provide a concise description of the project.
Business Information
Space Type
Identify the project's development tasks and data characteristics. The default space type is the application layer, which includes:
Intermediate Layer: Typically stores data that has been processed for consistency, accuracy, and cleanliness.
Source Layer: Generally holds raw data from business systems, serving as the foundation for further processing and development.
Application Layer: Tailored to business needs, it defines personalized and diverse data metrics for various scenarios.
General Layer: Commonly used for storing general aggregated data, such as aggregated data for a specific dimension within a subject area.
Task Parameter Configuration
Flink Task Default Parameter Configuration
Once the real-time engine is enabled, you may input Flink-related parameters in the text box as configurations. These parameters will serve as default settings for any future Flink tasks created within this project. Parameters should be formatted as key-value pairs, such as
key:value. An example would betaskmanager.numberOfTaskSlots:1.Click OK to finalize the creation of the tag project.
What to do next
Once the tag project is created, you can associate it with the corresponding marketplace during marketplace creation. For more information, see: Create a marketplace.