All Products
Search
Document Center

Dataphin:Create a tag item

Last Updated:Jan 21, 2025

A project in Dataphin serves as the fundamental organizational unit, delineating the scope for multi-user isolation and access control. To develop tags, it is necessary to first establish the corresponding tag project. This topic outlines the steps to create a tag project.

Prerequisites

  • To engage in standardized modeling development, you must first create the relevant data block before initiating a tag item. Absent this, only general development is possible. For more information, see: Create a data block.

  • Prior to creating a tag item, the corresponding computing source must be established. For details, see Compute engines.

Limits

  • Dataphin permits only super administrators, system administrators, or block architects to create tag projects.

  • Only project super administrators, system administrators, or block architects can request permissions to enable or disable data table read and write operations in Dataphin.

Step 1: Select the project type

  1. Navigate to the Dataphin home page and click Planning on the top menu bar.

  2. On the Data Warehouse Planning page, use the operation guide depicted below to access the Tag Project dialog box to create a new project.

    image

Step 2: Configure the project's affiliated block and basic information

  1. In the New Project dialog box, set the Affiliated Block, Basic Information, and Business Information parameters for the tag project.

    Note

    The tag project is designated for the Prod environment, akin to a Prod project. It is referred to as a Basic tag project due to the distinct data development processes involved. Within this project, you can create, develop, and submit data objects, which can then be updated to production scheduling and monitoring upon submission.

    Area

    Parameter Name

    Description

    Affiliated Block

    Data Block

    Choose the data block to which the project will be linked. Selecting an affiliated block enables standardized modeling development. Without this selection, only general development is possible.

    Note

    In the Dev-Prod mode, standardized modeling objects cannot be created within a Basic project associated with a Prod business block. This restriction ensures the standardization of public layer data and prevents data production issues.

    Basic Information

    Project English Name

    Input the project's English name, adhering to the following naming conventions:

    • Include letters, numbers, and underscores (_).

    • Avoid starting with LD_.

    • Limit to 64 characters or fewer.

    Project Name

    Enter the project's name, which may include Chinese characters, numbers, letters, underscores (_), or hyphens (-), and must not exceed 64 characters.

    • May include Chinese characters, numbers, letters, underscores (_), or hyphens (-).

    • Must not exceed 64 characters in length.

    Computing Source Type

    Select the type of computing source and then choose the specific source.

    Important
    • A computing source already associated with a project cannot be linked to another project.

    Dataphin integrates with Machine Learning PAI to provide algorithm scheduling capabilities. When setting up a workspace for visual modeling on the machine learning platform, the resource selection is a computing resource group based on MaxCompute. For more information, see Overview of Resource Management. If machine learning is enabled, configure the following parameters:

    • PAI Region: Choose the region where the Dataphin instance is located.

    • AccessKey ID, AccessKey Secret: Provide the AccessKey ID and AccessKey Secret for PAI account access.

    • PAI Project Name: Select the PAI project.

    • Suggestion: It is advisable to keep the MaxCompute project tied to the current Dataphin project consistent with the MaxCompute project linked to PAI.

    Description Information

    Provide a concise description of the project.

    Business Information

    Space Type

    Identify the project's development tasks and data characteristics. The default space type is the application layer, which includes:

    • Intermediate Layer: Typically stores data that has been processed for consistency, accuracy, and cleanliness.

    • Source Layer: Generally holds raw data from business systems, serving as the foundation for further processing and development.

    • Application Layer: Tailored to business needs, it defines personalized and diverse data metrics for various scenarios.

    • General Layer: Commonly used for storing general aggregated data, such as aggregated data for a specific dimension within a subject area.

    Task Parameter Configuration

    Flink Task Default Parameter Configuration

    Once the real-time engine is enabled, you may input Flink-related parameters in the text box as configurations. These parameters will serve as default settings for any future Flink tasks created within this project. Parameters should be formatted as key-value pairs, such as key:value. An example would be taskmanager.numberOfTaskSlots:1.

  2. Click OK to finalize the creation of the tag project.

What to do next

Once the tag project is created, you can associate it with the corresponding marketplace during marketplace creation. For more information, see: Create a marketplace.