Dataphin: Real-time mode dependency configuration

Last Updated: Jan 21, 2025

This topic explains how to configure dependencies for real-time tasks so that their upstream and downstream tasks are clearly identified during debugging.

Procedure

  1. Follow the real-time mode configuration entry instructions to open the Real-time Mode configuration page for the real-time task.

  2. In the Dependency section of the configuration panel, enter the dependency details for the task in real-time mode.

    Important
    • Dependencies in real-time mode do not create actual scheduling dependencies.

    • Offline mode, by contrast, requires scheduling dependencies to be configured.

    Parameter

    Description

    Start Parsing

    For SQL node tasks, click Start Parsing to parse the code. The system identifies the tables referenced in the code and matches them against node output names to determine the upstream dependencies of the current node.

    To keep scheduling stable, the system parses with the production project name when the code references a project variable or does not specify a project. For example, if the development project is named onedata_dev:

    • Code select * from s_order results in a dependency of onedata.s_order.

    • Code select * from ${onedata}.s_order also results in a dependency of onedata.s_order.

    • Code select * from onedata.s_order results in a dependency of onedata.s_order.

    • Code select * from onedata_dev.s_order results in a dependency of onedata_dev.s_order.
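The four cases above can be sketched as a small resolution function. This is only an illustration of the documented behavior, not Dataphin's parser: the function name `resolve_dependency` and its parameters are hypothetical, and the sketch assumes that a project variable such as ${onedata} is named after the production project.

```python
import re

# Hypothetical helper, not a Dataphin API: it mirrors the four documented cases.
_PROJECT_VAR = re.compile(r"^\$\{(\w+)\}$")


def resolve_dependency(table_ref: str, production_project: str) -> str:
    """Return the dependency name that parsing is described as producing."""
    project, sep, table = table_ref.rpartition(".")
    if not sep:
        # No project specified: default to the production project name.
        return f"{production_project}.{table}"
    match = _PROJECT_VAR.match(project)
    if match:
        # Project variable: assumed to be named after the production project.
        return f"{match.group(1)}.{table}"
    # An explicit project name (production or development) is kept as written.
    return f"{project}.{table}"


for ref in ("s_order", "${onedata}.s_order", "onedata.s_order", "onedata_dev.s_order"):
    print(ref, "->", resolve_dependency(ref, production_project="onedata"))
```

Only the explicit onedata_dev.s_order reference keeps the development project name; the other three resolve to onedata.s_order.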

    Upstream Dependency

    To add upstream nodes for node task scheduling, perform the following:

    1. Click Manually Add Upstream.

    2. The Create New Upstream Dependency dialog box offers two methods to search for dependency nodes:

      • Search for nodes by entering the output name of the dependent node as the keyword.

      • Enter virtual to search for virtual nodes. Each tenant or enterprise is assigned a root node upon initialization.

      Note

      Node output names are globally unique and case-insensitive.

    3. Click Confirm Addition.

    To delete an added dependency, click the delete icon in the Actions column.
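As a rough illustration of what "globally unique and case-insensitive" implies for keyword search, the sketch below keeps output names in an in-memory registry. The class `OutputNameRegistry`, its methods, and the node IDs are hypothetical. How Dataphin actually stores and searches output names is not described in this topic.

```python
class OutputNameRegistry:
    """Hypothetical in-memory registry of node output names."""

    def __init__(self) -> None:
        # Maps the case-folded output name to the ID of the node that owns it.
        self._nodes: dict[str, str] = {}

    def register(self, output_name: str, node_id: str) -> None:
        key = output_name.casefold()
        if key in self._nodes:
            # Names that differ only in case count as the same name.
            raise ValueError(f"output name already in use: {output_name}")
        self._nodes[key] = node_id

    def search(self, keyword: str) -> list[str]:
        # Case-insensitive substring match on the output name.
        needle = keyword.casefold()
        return sorted(name for name in self._nodes if needle in name)


registry = OutputNameRegistry()
registry.register("onedata.s_order", "node_1")
print(registry.search("S_ORDER"))
```

Under these assumptions, registering ONEDATA.S_ORDER for a second node would raise an error, because it differs from the existing name only in case.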

    Current Node

    To set the output names for the current node, which other nodes can use as dependencies, follow these steps:

    1. Click Manually Add Output.

    2. In the Add Current Node Output dialog box, enter an output name that follows a consistent naming convention, such as project name.table name. Output names are case-insensitive.

      For example, for a development project named onedata_dev, the recommended output name is onedata.s_order. If you set the output name to onedata_dev.s_order instead, only code that explicitly references onedata_dev.s_order is parsed as depending on this node.

    3. Click Confirm Addition.

    You can also perform the following operations on output names already associated with the current node:

    • To delete an added output name, click the delete icon in the Actions column.

    • If the node has been submitted or published and has downstream dependencies (with submitted tasks), click the icon in the Actions column to view the dependent nodes.
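The recommendation to use the production project name in output names can be checked with a minimal sketch. It assumes that a parsed dependency matches a node when it equals one of the node's output names, ignoring case. The function `parses_as_upstream` is hypothetical and only models that comparison.

```python
def parses_as_upstream(dependency: str, output_names: list[str]) -> bool:
    """Return True if a parsed dependency equals one of a node's output names, ignoring case."""
    wanted = dependency.casefold()
    return any(wanted == name.casefold() for name in output_names)


# Per the parsing rules, "select * from s_order" in onedata_dev yields onedata.s_order.
print(parses_as_upstream("onedata.s_order", ["onedata.s_order"]))
print(parses_as_upstream("onedata.s_order", ["onedata_dev.s_order"]))
```

With the output name onedata.s_order, code that omits the project or uses a project variable resolves to this node. With onedata_dev.s_order, only an explicit onedata_dev.s_order reference would match.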

  3. Click Confirm to finalize the real-time mode dependency configuration.