All Products
Search
Document Center

DataWorks:Configure scheduling settings

Last Updated:Aug 15, 2023

To run auto triggered nodes as scheduled, you must go to the Scheduling Settings tab in DataStudio to enable periodic scheduling. This topic describes how to enable periodic scheduling and modify the default scheduling settings for auto triggered nodes.

Limits

The default scheduling settings take effect only for new auto triggered nodes.

Modify the default scheduling settings for auto triggered nodes

  1. Go to the DataStudio page.

    Log on to the DataWorks console. In the left-side navigation pane, choose Data Modeling and Development > DataStudio. On the page that appears, select the desired workspace from the drop-down list and click Go to DataStudio.

  2. In the lower-left corner of the DataStudio page, click the Settings icon. On the page that appears, click the Scheduling Settings tab.

  3. On the Scheduling Settings tab, click Change to configure the parameters.

    Scheduling SettingsThe following table describes the parameters.

    Parameter

    Description

    Periodic scheduling

    Specifies whether to enable periodic scheduling. Auto triggered nodes in the current workspace can be run as scheduled only if you turn on this switch.

    Note

    If you turn off this switch, the instances that have been generated for auto triggered nodes on the current day are still run as scheduled. However, DataWorks does not generate auto triggered node instances that are scheduled to run on the next day at night on the current day.

    Time Zone for Scheduling

    The time zone that is used for node scheduling. By default, the time zone that is used for node scheduling is the time zone of the server. If you select a region outside mainland China and you want to set the time zone that is used for node scheduling to UTC+8, submit a ticket.

    Note

    Before you modify this parameter, refer to Create a workspace to understand the impact imposed by the modification.

    Default Scheduling Cycle

    The time when the node is scheduled to run. By default, the scheduling time is randomly generated within the range from 00:00 to 00:30. You can specify a custom scheduling time based on your business requirements.

    Resource group

    The default resource group for scheduling that is used to schedule nodes.

    Data integration resource group

    The default resource group for Data Integration that is used to schedule Data Integration nodes.

    Rerun

    The default rerun policy for auto triggered nodes.

    Note

    If you set the Rerun parameter to Allow Regardless of Running Status or Allow upon Failure Only, make sure that the data idempotence of auto triggered nodes is not affected. Otherwise, data quality issues may occur after multiple reruns.

    Number of reruns

    The number of times that an auto triggered node can be rerun after it fails to run as scheduled.

    Valid values: 1 to 10. You can change the value of this parameter based on your business requirements.

    Rerun interval

    The default interval between two consecutive reruns.

    Valid values: 1 to 30. Unit: minutes.

    Enable auto parsing when submitting a file

    Specifies whether to enable automatic parsing for auto triggered nodes. If you enable automatic parsing, after a node is committed, DataWorks parses the output names of the node and its ancestor nodes based on the latest code.

  4. Click Save.

    Then, new nodes in the current workspace use the modified scheduling settings.