An auto triggered node instance is a snapshot that is taken for an auto triggered node at the time when the node is scheduled to run. You can view the details about and perform the related operations on an auto triggered node instance in the list of auto triggered node instances or in the directed acyclic graph (DAG) of the instance.

Limits

  • Only users of DataWorks Professional Edition or a more advanced edition can use the intelligent diagnosis feature. If you use another edition, you can have a trial use of the feature for free. However, we recommend that you upgrade the DataWorks service to DataWorks Professional Edition to use more features. For more information, see Instance diagnosis.
  • Only users of DataWorks Professional Edition or a more advanced edition can use the aggregation, upstream analysis, and downstream analysis features provided by DAGs. For more information, see Billing of DataWorks advanced editions.
  • Some features can be used only by users who are granted O&M permissions. If the entry point of a feature is dimmed or is not displayed, you can check whether you have the required O&M permissions on the Manage Members tab of the User Management page. For more information, see Manage workspace-level roles and members.
  • You cannot manually delete an auto triggered node instance. DataWorks deletes an auto triggered node instance approximately 30 days after the instance expires. If you no longer need to run an auto triggered node instance, you can freeze it.
  • Node instances that are run on the shared resource group for scheduling are retained for one month (30 days), and the run logs for the node instances are retained for one week (7 days).
  • Node instances that are run on an exclusive resource group for scheduling are retained for one month (30 days), and the run logs for the node instances are also retained for one month (30 days).
  • The system regularly clears excess run logs every day when the size of run logs generated for the auto triggered node instances that finish running exceeds 3 MB.

Precautions

  • Instances are generated as scheduled for auto triggered nodes. Each generated instance runs the most recent code. If you modify and recommit the code of a node after instances are generated for the node, the instances that are not run will run the most recent code.
  • If you want to monitor a node for which instances are generated, you must configure monitoring rules for the node first. If a node for which monitoring rules are configured fails to run and you do not receive an alert notification, check whether your mobile phone number and email address are configured on the Alert Contacts page. For more information, see View alert details.
  • The time at which instances are generated for an auto triggered node varies based on the value of the Instance Generation Mode parameter. The valid values of the Instance Generation Mode parameter include Next Day and Immediately After Deployment. For more information, see Instance generation modes.
    Note Nodes that are manually rerun do not trigger alerts generated based on custom rules.

Node instance running and issue troubleshooting

An auto triggered node instance can be scheduled to run only when the following conditions are met: Ancestor nodes of the node for which the auto triggered node instance is generated are successfully run, the scheduling time of the auto triggered node instance has arrived, scheduling resources are sufficient, and the auto triggered node instance is not frozen. For more information, see What are the conditions that are required for a node to successfully run?.

If an auto triggered node instance fails to run, you can use the ancestor node analysis feature on the Upstream Analysis tab of the DAG page to quickly identify ancestor node instances that block the running of the current node instance. Then, you can use the intelligent diagnosis feature to diagnose failure causes or related issues of the ancestor node instances. The intelligent diagnosis feature can also be used to quickly troubleshoot issues when dependencies between the current node instance and ancestor node instances are complex. This improves O&M efficiency.

View auto triggered node instances

Cycle Instance page
Operation Description
Filter Allows you to specify conditions to search for auto triggered node instances in the area marked with 1 in the preceding figure.

You can search for auto triggered node instances by node name or node ID. You can also specify the following conditions to search for auto triggered node instances: Data Timestamp, Data Timestamp, Node Type, Run At, Solution, Workflow, Region, Scheduling Resource Group, Engine Type, Engine Instance, Baseline, Owner, Scheduling Type, Status, My Nodes, My Nodes with Errors, My Incomplete Nodes, Rerun Nodes, Slow Nodes, and Nodes that generated alarms in the past 24 hours.

Note
  • By default, the data timestamp is set to the day before the current day.
  • You can search for your desired instance by instance ID. You can determine whether to use a node ID or an instance ID to search for your desired instance based on your business requirements.
    • If you want to search for all the auto triggered node instances that are generated on the current day by an auto triggered node scheduled by hour or minute, you can enter the ID of the node.
    • If you want to search for a specific auto triggered node instance that is generated on the current day by an auto triggered node scheduled by hour or minute, you can enter the ID of the auto triggered node instance.
  • For instances for which monitoring rules are configured, you can select the Nodes that generated alarms in the past 24 hours check box to search for the instances for which alerts are triggered within 24 hours from the current point in time. You can click the Alert icon in the DAG of an instance for which an alert is triggered within 24 hours from the current point in time to view the alert details about the instance. For more information about how to view the alert details about an instance in the DAG of the instance.
List of instances Allows you to view the auto triggered node instances that meet the specified conditions in the area marked with 2 in the preceding figure.
Actions Allows you to perform operations on an auto triggered node instance in the area marked with 3 in the preceding figure.
  • DAG: You can perform this operation to view the dependencies of an auto triggered node instance. On the DAG page, you can right-click the instance in the DAG to perform related operations. For more information, see Manage instances in a DAG.
  • To diagnose: You can perform this operation to carry out end-to-end analysis on an auto triggered node instance. If the auto triggered node instance is not run as expected, you can click To diagnose to troubleshoot issues. For more information, see Instance diagnosis.
  • Rerun: You can perform this operation to rerun an auto triggered node instance that is in the Succeeded or Failed state. After the auto triggered node instance is successfully run, its descendant instances that are in the Pending (Ancestor) state can be scheduled to run. This operation is used to process an auto triggered node instance that fails to be run or an auto triggered node instance that is not run as scheduled.
    Note Only the auto triggered node instances that are in the Succeeded or Failed state can be rerun.
  • More:
    • Rerun Descendant Nodes: You can perform this operation to rerun the descendant instances of an auto triggered node instance that is in the Succeeded or Failed state. You can select the descendant instance that you want to rerun. After the selected instance is successfully run, its descendant instances that are in the Pending (Ancestor) state can be scheduled to run. This operation is used to recover data.
      Note Only descendant instances in the Succeeded or Failed state can be selected. The value No appears in the Meet Rerun Condition column of instances in other states, and you cannot select the instances.
    • Set Status to Successful: You can perform this operation to set the status of an auto triggered node instance that fails to be run to Succeeded. You can perform this operation if you do not want an auto triggered node instance that fails to be run to block the running of its descendant instances. This operation is used to process an auto triggered node instance that fails to be run.
    • Stop: You can perform this operation to stop an auto triggered node instance that does not need to be run. After you perform this operation on an auto triggered node instance, the auto triggered node instance fails to be run and exits.
      Note Only auto triggered node instances in the Pending (Schedule), Pending (Resources), or Running state can be stopped.
    • Freeze: You can perform this operation if an auto triggered node instance and its descendant instances do not need to be run for a certain period of time. The freeze operation takes effect only on the current auto triggered node instance that is in the Running state. A frozen auto triggered node instance cannot be scheduled as expected and does not generate data. After an auto triggered node instance is frozen, its descendant instances cannot be scheduled and run as expected.
      Note Do not perform this operation on the projectname_root node, which is the root node of your workspace. All the instances of auto triggered nodes depend on this node. If this node is frozen, the instances of auto triggered nodes cannot be run.
    • Unfreeze: You can perform this operation to unfreeze an auto triggered node instance that is frozen.
      • If the auto triggered node instance is not run, it is automatically run after its ancestor instances are successfully run.
      • If all the ancestor instances of the auto triggered node instance are successfully run, the state of the auto triggered node instance is directly set to Failed. You must manually rerun the auto triggered node instance.
      Note The unfreeze operation takes effect only on the current auto triggered node instance. If the auto triggered node for which the instance is generated is frozen, instances that are scheduled to run on the next day are also frozen.
    • View Lineage: You can perform this operation to view the lineage of an auto triggered node instance.
    • View Node Details: You can perform this operation to view the basic information of an auto triggered node instance.
    • View Runtime Log: You can perform this operation to view the run logs of an auto triggered node instance.
Batch operations Allows you to perform operations on multiple auto triggered node instances at the same time. You can click the following buttons in the area marked with 4 in the preceding figure to perform operations on multiple auto triggered node instances at the same time: Stop, Rerun, Set Status to Successful, Freeze, and Unfreeze.
Items per page Allows you to select the number of items that you want to view on a single page in the area marked with 5 in the preceding figure. The maximum number that you can select is 200.

Manage auto triggered node instances in a DAG

Cycle Instance page
Functionality Description
Operations that you can perform on the DAG page You can click DAG in the Actions column of an auto triggered node instance to open the DAG of the auto triggered node instance. You can perform the following operations in the DAG of the auto triggered node instance:
  • Node aggregation: If an auto triggered node instance has multiple ancestor and descendant instances or the ancestor and descendant instances are distributed at multiple levels, you can aggregate the instances. The instances can be aggregated from dimensions such as instance status, workspace, owner, and priority. Then, you can view the number of instances from your required dimension. This allows you to have a command of the number of instances from different dimensions and helps the system run the instances.
  • Ancestor node analysis: In most cases, an auto triggered node instance has both upstream and downstream relationships. If an auto triggered node instance is not run for a long period of time, you can analyze the ancestor node instances of the instance. You can view the ancestor node instances that block the running of the instance in the DAG of the instance, and quickly locate and troubleshoot the issue. This improves running efficiency of the instance.
  • Descendant node analysis: If an auto triggered node instance has multiple descendant instances or the descendant instances of an auto triggered node instance are distributed at multiple levels, you can analyze the descendant instances of the auto triggered node instance. You can aggregate the descendant instances by instance status, workspace, owner, or priority. Then, you can view the number of instances at different levels from your required dimension or the total number of instances at all levels from your required dimension.
  • DAG display style: In the area marked with 3 in the preceding figure, you can adjust the display style of a DAG based on your business requirements by clicking the icons. The icons from left to right are Enable alarm status/Turn off alarm status, Update Node Status, Group/Ungroup, Zoom In, Zoom Out, 1:1, Fit Screen, and Toggle Full Screen View.
Operations on a single instance You can right-click your desired auto triggered node instance in a DAG and perform operations on the instance.
  • Show Ancestor Nodes: You can perform this operation to view ancestor node instances of the current auto triggered node instance. You can select this option to have a command of the instances that affect data output of the current node instance. You can view ancestor node instances of an auto triggered node instance by level. A maximum of six levels of ancestor node instances can be displayed at the same time.
  • Show Descendant Nodes: You can perform this operation to view descendant instances of the current auto triggered node instance. You can select this option to have a command of the instances whose data output is affected by the current node instance. You can view descendant instances of an auto triggered node instance by level. A maximum of six levels of descendant instances can be displayed at the same time.
  • View Code: You can perform this operation to view the code of the node for which the current auto triggered node instance is generated in the production environment. If the code of the node does not meet your expectations, you must check whether the latest code of the node is successfully deployed to the production environment.
  • Edit Node: You can perform this operation to go to the configuration tab of the node for which the current auto triggered node instance is generated on the DataStudio page.
  • Resume: You can perform this operation to allow the current auto triggered node instance to resume running from the position where it is stopped. For example, if an instance is run by executing multiple SQL statement segments, the instance resumes running from the SQL statement segment in which the SQL statements fail to be executed.
    Note
    • Only SQL nodes that are run based on a MaxCompute compute engine instance support this operation.
    • If you want to perform this operation on an auto triggered node instance that is run on an exclusive resource group for scheduling that you purchased after January 2021, submit a ticket for an upgrade.
  • Emergency Operations: You can perform emergency operations on the current auto triggered node instance. The emergency operations take effect only on the current node instance once.
    • Delete Dependencies: You can perform this operation to urgently delete dependencies for the current auto triggered node instance. In most cases, you can delete dependencies for an auto triggered node instance by clicking Delete Dependencies if the ancestor node instances of the instance fail to be run and the ancestor node instances do not affect data output of the instance.
      Note You must check whether this operation affects data output based on the code of the node for which the instance is generated and lineage of the instance.
    • Force Rerun: You can perform this operation to forcefully rerun the current auto triggered node instance. You can perform this operation on an auto triggered node instance that is in the Succeeded, Failed, or Pending (Ancestor) state. This operation is often performed to recover data.
    • Force Heavy Run Downstream: You can perform this operation to forcefully rerun the descendant instances of an auto triggered node instance whose data timestamp is the previous day or the day before the previous day. You can perform this operation on an auto triggered node instance that is in the Succeeded or Failed state. This operation is often performed to recover data.
      Note You can use only a workspace administrator, tenant administrator, or Alibaba Cloud account to forcefully rerun the descendant instances of an auto triggered node instance.
Note For information about other operations, see View auto triggered node instances.
Details about a single instance In the area marked with 5 in the preceding figure, you can perform the following operations:
  • View Logs: You can perform this operation to view the execution details about an auto triggered node instance.
  • Show Details: You can perform this operation to view detailed information about an auto triggered node instance on the following tabs: General, Context, Runtime Log, Operation Log, and Code. For more information, see View the details about an auto triggered node instance.

View the details about an auto triggered node instance

View the details about an auto triggered node instance
Tab Description
General On this tab, you can view the scheduling properties of an auto triggered node instance in the production environment. For more information about the basic parameters, see Configure basic properties.
Note
  • Relationship between a node ID and an instance ID:

    If you want to search for all the auto triggered node instances that are generated on the current day for an auto triggered node scheduled by hour or minute, you can enter the ID of the node. If you want to search for a specific auto triggered node instance that is generated on the current day for an auto triggered node scheduled by hour or minute, you can enter the ID of the auto triggered node instance.

  • Instance status interpretation: If the instance is in the Pending (Ancestor), Pending (Schedule), Pending (Resources), or Freeze state, you can use the intelligent diagnosis feature to quickly troubleshoot issues.
  • Time spent for waiting for resources: If an auto triggered node instance is in the Pending (Resources) state for a long period of time, you can use the intelligent diagnosis feature to identify the node instances that occupy resources at the time when the current node instance is waiting for resources. Then, you can quickly identify the instances on which exceptions occur and troubleshoot issues.
  • Long running duration: If the running duration of an auto triggered node instance is much longer than the average running duration over a historical period of time, you can troubleshoot the issue based on node types:
    • Non-synchronization node: You can consult the owner of the compute engine instance on which the auto triggered node instance generated for a non-synchronization node is run.
    • Batch synchronization node: The running speed of an auto triggered node instance that is generated for a batch synchronization node may be slow in a specific phase or the instance is in the Pending (Resources) state for a long period of time. For more information, see What do I do if a batch synchronization node runs for an extended period of time?.
Context On this tab, you can view all input and output parameters of the node for which an auto triggered node instance is generated. For more information, see Configure input and output parameters.
Runtime Log On this tab, you can view the running details about an auto triggered node instance.
Operation Log On this tab, you can view the operation records of a node or an instance, including the operation time, operator, and specific operations.
Code On this tab, you can view the latest code of the node for which the current auto triggered node instance is generated in the production environment. If the code of the node does not meet your expectations, you must check whether the latest code of the node is successfully deployed to the production environment. For more information, see Deploy nodes.

FAQ