The task orchestration feature of Data Management (DMS) provides multiple types of task nodes, such as Cross Database SQL, Single Instance SQL, Script, Conditional Branch, and SQL Assignment for Single Instance. You can orchestrate multiple types of task nodes to implement complex scheduling. This improves data development efficiency.

The following list describes each task node type and its configuration items.
Cross Database SQL: Uses dynamic SQL (DSQL) statements to query data across database instances. You can use this node to analyze data across database instances and to migrate small amounts of data.
  • Metadata
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • SQL statement
Single Instance SQL: Executes SQL statements in a specific relational database.
  • Database
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • SQL statement
Script: Creates script tasks based on Database Gateway.
  • Variable
    • Optional: Node variable
    • Optional: Task flow variable
    • Optional: Input variable
    • Output variable
  • Script
For more information about Script nodes, see Create a script task.
Audit Task: Checks the data quality of a table. After you specify a quality rule for the table and a scheduling cycle for the audit task, DMS checks the data quality of the table and generates a report.
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • Data quality rule
DLA Spark SQL: Uses SQL statements to submit jobs to the Spark clusters of Data Lake Analytics (DLA).
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • Basic configuration
  • Job configuration
DLA Serverless Spark: Configures Spark jobs based on the serverless Spark engine of DLA. For more information, see Create and run Spark jobs.
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • Basic configuration
  • Job configuration
Conditional Branch: Evaluates conditions in task flows. When a task flow runs, the Conditional Branch node checks whether the conditions specified on the node are met. If the conditions are met, the task flow continues to run. Otherwise, the task flow stops.
  • Optional: Input variable
  • Conditional expression
For more information about how to configure Conditional Branch nodes, see Configure Conditional Branch nodes.
SQL Assignment for Single Instance: Assigns the data returned by a SELECT statement to the output variables of the node. The output variables can be used as the input variables of the downstream node.
  • Database
  • Variable
    • Optional: Node variable
    • Optional: Task flow variable
    • Optional: Input variable
    • Output variable
  • SQL statement
For more information about how to configure SQL Assignment for Single Instance nodes, see Configure SQL assignment nodes.
Lindorm File Check: Checks whether a file exists in an ApsaraDB for Lindorm instance that supports Hadoop Distributed File System (HDFS).
  • Optional: Variable
    • Node variable
    • Task flow variable
    • Input variable
  • Lindorm file engine
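
The interplay between SQL Assignment for Single Instance nodes and Conditional Branch nodes can be illustrated with a small simulation. The following Python sketch is not DMS code. The `Node` class, the `run_flow` and `build_flow` functions, and the variable name `row_count` are hypothetical. The sketch only models the behavior stated above: the output variables of an upstream node become the input variables of the downstream node, and a Conditional Branch node stops the task flow if its condition is not met.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Variables = Dict[str, object]


@dataclass
class Node:
    """Hypothetical stand-in for a task node in a linear task flow."""
    name: str
    # Assignment-style nodes: return output variables for the downstream node.
    action: Optional[Callable[[Variables], Variables]] = None
    # Conditional Branch-style nodes: return True if the flow may continue.
    condition: Optional[Callable[[Variables], bool]] = None


def run_flow(nodes: List[Node]) -> List[str]:
    """Run the nodes in order and return the names of the nodes that executed."""
    executed: List[str] = []
    inputs: Variables = {}
    for node in nodes:
        executed.append(node.name)
        if node.condition is not None and not node.condition(inputs):
            break  # Condition not met: the task flow stops here.
        if node.action is not None:
            # The outputs of this node become the inputs of the next node.
            inputs = node.action(inputs)
    return executed


def build_flow(row_count: int) -> List[Node]:
    """Build a three-node flow: assignment, conditional branch, downstream work."""
    return [
        Node("assign", action=lambda _: {"row_count": row_count}),
        Node("branch", condition=lambda v: v["row_count"] > 0),
        Node("load", action=lambda v: {}),
    ]


print(run_flow(build_flow(5)))  # ['assign', 'branch', 'load']
print(run_flow(build_flow(0)))  # ['assign', 'branch']
```

In the second run, the assignment node reports zero rows, so the branch condition fails and the downstream `load` node never executes.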

Common configurations

  • Select a database. Click the node that you want to manage on the canvas. In the drop-down list on the Node Information tab, enter a keyword to search for a database, and then select the database from the matched results.
    Note
    • You must have the required permissions on the selected database. For more information about how to apply for permissions, see t1919605.html#section_5j7_n0q_x03.
      • For a Single Instance SQL node, you must have the change permissions on the selected database.
      • For an SQL Assignment for Single Instance node, you must have the query permissions on the selected database.
    • To view table schemas in the database, click the Metadata tab below the drop-down list.
  • Configure node variables. Click the node that you want to manage on the canvas. On the Node Information tab, click the Node Variable tab and configure node variables. For more information, see Configure time variables.
    Note Click the Info icon on the Variable Setting tab to view the help documentation about variable configuration.
  • Configure task flow variables. Click the node that you want to manage on the canvas. On the Node Information tab, click the Task Flow Variable tab and configure task flow variables. For more information, see Configure time variables.
    Note
    • Click the Info icon on the Variable Setting tab to view the help documentation about variable configuration.
    • You need to configure task flow variables on only one task node in a task flow. The configuration is then synchronized to the other task nodes in the task flow.
  • View input variables. Click the node that you want to manage on the canvas. On the Node Information tab, click the Input Variables tab and view the input variables.
    Note
    • Input variables are derived from the output variables of the upstream node and are obtained automatically by the system. You can view input variables but cannot modify them.
    • For more information about variables, see Variables.
  • Enter SQL statements. Click the node that you want to manage on the canvas. In the SQL section of the Node Information tab, enter the SQL statements.
    Note The syntax of the SQL statements that you enter must be consistent with that of the selected database.
    • To preview the SQL statements that you entered, click SQLPreview.
    • To check the validity of the SQL statements, click Validity Check.
    • To save the SQL statements, click Save.
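
To show how the variables configured above might end up in an SQL statement, here is a minimal Python sketch. It assumes that variables are referenced as `${name}` placeholders. That placeholder syntax, the `render_sql` function, and the table and variable names are assumptions for illustration and are not taken from this topic. This topic does not state how name conflicts between variable types are resolved, so the sketch rejects duplicate names instead of choosing a precedence.

```python
from string import Template
from types import MappingProxyType
from typing import Dict, Mapping


def render_sql(
    sql: str,
    node_vars: Mapping[str, str],
    flow_vars: Mapping[str, str],
    input_vars: Mapping[str, str],
) -> str:
    """Substitute ${name} placeholders (assumed syntax) with variable values."""
    merged: Dict[str, str] = {}
    for source in (node_vars, flow_vars, input_vars):
        for name, value in source.items():
            if name in merged:
                # Precedence is unknown here, so a conflict is treated as an error.
                raise ValueError(f"duplicate variable name: {name}")
            merged[name] = value
    # substitute() raises KeyError if the statement uses an undefined variable.
    return Template(sql).substitute(merged)


# Input variables come from the upstream node and can only be viewed,
# so they are modeled as a read-only mapping.
input_vars = MappingProxyType({"max_id": "1000"})

statement = render_sql(
    "DELETE FROM orders_${suffix} WHERE gmt_create < '${bizdate}' AND id <= ${max_id}",
    node_vars={"bizdate": "2024-01-01"},
    flow_vars={"suffix": "archive"},
    input_vars=input_vars,
)
print(statement)
```

Plain text substitution is used only to keep the sketch short. It does not escape values and is not a model of how DMS itself processes variables.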