DataWorks task scheduling is widely used in machine learning scenarios. It allows
you to periodically run DataWorks tasks to update your model, which helps you create
a model training pipeline. Machine Learning Platform for AI (PAI) can work with DataWorks
to periodically schedule machine learning experiments.
Background information
After all nodes in an experiment run as expected, you can deploy the experiment
to DataWorks and use DataWorks to run the experiment periodically.
Note: Before you schedule nodes offline, make sure that all nodes in the experiment run
as expected and that DataWorks is activated. For more information, see Create a workspace.
Procedure
- Go to the Experiments page of the PAI console.
- Log on to the PAI console.
- In the left-side navigation pane, go to the PAI Visualization Modeling page.
When you create a project, we recommend that you select By usage for the project.
PAI-TensorFlow tasks can run only by using GPU resources.

- Find the required project and click Machine Learning in the Operation column.
- In the left-side navigation pane, click Experiments. On the page that appears, find
the required experiment and double-click it. The Heart Disease Prediction experiment
is used in this example.
- On the tab that appears, go to the Data Analytics page of DataWorks.

- Create a PAI node.
- In the Create Node dialog box, configure Node Name and Location.
If you do not create a node in the dialog box that appears, you can create one later:
move the pointer over the icon on the Data Analytics page and choose the menu option for the node type.
Note: The node name can be up to 128 characters in length and can contain only letters,
digits, underscores (_), and periods (.).
- Click Commit.
- On the tab that appears, select the PAI experiment that you created from the Experiment drop-down list.
To edit the loaded PAI experiment, click Edit in PAI Console.
- In the right-side navigation pane, click the Properties tab. In the panel that appears, configure the properties for the node. For more information,
see Basic properties.
Configure task scheduling parameters, including the recurrence, input parameters,
and output parameters.
- Save and commit the node.
Notice: You must configure Rerun and Parent Nodes before you commit the node.
- Click the save icon in the toolbar to save the node.
- Click the commit icon in the toolbar.
- In the Commit Node dialog box, set Change description.
- Click OK.
In a workspace in standard mode, you must click Publish in the upper-right corner
after you commit the node. For more information, see Deploy a node.
- In the upper-right corner, click Operation to view the running status and system logs of the PAI task.
You can also perform other operations, such as generating retroactive data and testing
the experiment. For more information, see
View auto triggered nodes.
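
The node naming rule from the Create a PAI node step can be checked before you type a name into the Create Node dialog box. The following Python snippet is a minimal sketch, not part of PAI or DataWorks: the function name `is_valid_node_name` is hypothetical, and it assumes that "letters" means ASCII letters and that an empty name is not allowed. The console may accept a different character set.

```python
import re

# Limit taken from the note in this topic: at most 128 characters.
MAX_NODE_NAME_LENGTH = 128

# Assumption: "letters" is read as ASCII letters only.
_NODE_NAME_PATTERN = re.compile(r"[A-Za-z0-9_.]+")


def is_valid_node_name(name: str) -> bool:
    """Return True if name follows the documented node naming rule.

    The name must be 1 to 128 characters long and contain only
    letters, digits, underscores (_), and periods (.).
    """
    if not 0 < len(name) <= MAX_NODE_NAME_LENGTH:
        return False
    return _NODE_NAME_PATTERN.fullmatch(name) is not None
```

For example, `is_valid_node_name("heart_disease.v1")` returns `True`, whereas a name that contains a space or is longer than 128 characters returns `False`.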