You can organize different types of nodes in a workflow based on business types. This way, you can develop code by business. This topic describes how to create, design, commit, and view a workflow.
Background information
A workspace supports multiple types of compute engines and can contain multiple workflows. A workflow is a collection of multiple types of objects. The object types include data integration, data analytics, table, resource, function, and algorithm.
Each type of object has an independent folder. You can also create subfolders in the folder for each type of object. To facilitate management, we recommend that you create a maximum of four levels of subfolders. If the number of levels exceeds four, the structure of your planned workflow may be too complex. We recommend that you split the workflow into two or more workflows and add related workflows to a solution for management to improve work efficiency.
Create a workflow
Design a workflow
- Data Integration
Find the required workflow and double-click Data Integration to view all data integration nodes of the workflow. For more information, see Create a batch sync node.
- MaxComputeNote The MaxCompute folder is displayed on the current page only after you add a MaxCompute compute engine instance on the Workspace Management page. For more information, see Configure a workspace.The MaxCompute compute engine allows you to create data analytics nodes such as ODPS SQL, SQL component, ODPS Spark, PyODPS, ODPS Script, and ODPS MR nodes. You can also view and create tables, resources, and functions.
- Data Analytics
Find the required workflow and right-click MaxCompute to create a data analytics node. For more information, see Create an ODPS SQL node, Create an SQL component node, Create an ODPS Spark node, Create a PyODPS 2 node,Create a PyODPS 3 node, Create an ODPS Script node, and Create an ODPS MR node.
- Table
Find the required workflow, right-click MaxCompute, and then choose to create a table. You can also view all the tables that are created in the current MaxCompute compute engine. For more information, see Create a MaxCompute table.
- Resource
Find the required workflow, right-click MaxCompute, and then choose to create a resource. You can also view all the resources that are created in the current MaxCompute compute engine. For more information, see Create MaxCompute resources.
- Function
Find the required workflow, right-click MaxCompute, and then choose to create a function. You can also view all the functions that are created in the current MaxCompute compute engine. For more information, see Create a MaxCompute function.
- Data Analytics
- AnalyticDB for PostgreSQLYou can create AnalyticDB for PostgreSQL nodes and tables. For more information, see Create an ADB for PostgreSQL node and Create an AnalyticDB for PostgreSQL table.Note The AnalyticDB for PostgreSQL folder is displayed on the current page only after you add an AnalyticDB for PostgreSQL compute engine instance on the Workspace Management page. For more information, see Configure a workspace.
- EMRThe E-MapReduce compute engine allows you to create data analytics nodes such as EMR Hive, EMR MR, EMR Spark SQL, and EMR Spark nodes. You can also view and create E-MapReduce resources.Note The E-MapReduce folder is displayed on the current page only after you add an E-MapReduce compute engine instance on the Workspace Management page. For more information, see Configure a workspace.
- Data Analytics
Find the required workflow, click EMR, and then right-click Data Analytics to create a data analytics node. For more information, see Create an EMR Hive node, Create an EMR MR node, Create an EMR Spark SQL node, and Create an EMR Spark node.
- Resource
Find the required workflow, click EMR, and then right-click Resource to create a resource. You can also view all the resources that are created in the current E-MapReduce compute engine.
- Data Analytics
- Algorithm
Find the required workflow and right-click Algorithm to create a Machine Learning Platform for AI (PAI) node. You can also view all the PAI nodes that are created in the current workflow. For more information, see Create a Machine Learning (PAI) node.
- General
Find the required workflow and right-click General to create a node. For more information, see OSS Object Inspection node, Create and configure a for-each node, Configure a do-while node, MERGE node, Configure a branch node, Configure an assignment node, Create a Shell node, Create a zero load node, and Cross-tenant collaboration node.
Note You can create cross-tenant collaboration nodes and Object Storage Service (OSS) object inspection nodes in DataWorks of all editions. However, to create other types of nodes, you must activate DataWorks Standard Edition or a more advanced edition. If required, you can click Upgrade Now to upgrade the edition. - UserDefined
Find the required workflow and right-click UserDefined to create a node. For more information, see Create a Hologres development node, Create a Data Lake Analytics node, and Create an AnalyticDB for MySQL node.
Commit a workflow
View all workflows

