The Stream Task page of the DataWorks console lists all real-time computing nodes together with their basic information. On this page, you can view the details of a node and configure alert rules for it, so that you can quickly locate and fix errors that occur on the node.

Limits

DataWorks supports O&M only for EMR Spark Streaming and EMR Streaming SQL nodes.

Go to the Stream Task page

  1. Log on to the DataWorks console.
  2. In the left-side navigation pane, click Workspaces.
  3. Select the region in which the workspace that you want to manage resides, find the workspace, and click Data Analytics in the Actions column.
  4. On the DataStudio page, click the icon in the upper-left corner and choose All Products > Task Operation > Operation Center.
  5. On the Operation Center page, choose RealTime Task Maintenance > Stream Task to go to the Stream Task page.

View the real-time computing nodes in the node list

The Stream Task page displays real-time computing nodes in the production environment. You can view the basic information of a real-time computing node, and you can start, stop, or undeploy the node. You can also configure alert rules for the node.
Figure: Manage real-time computing nodes
Section Parameter and description
1 In this section, you can search for real-time computing nodes by node ID or node name. You can also filter the nodes by Status, Task type, Engine Type, Engine Instance, and Responsible Person.
Note
  • If you search for nodes by node name or node ID, the search result is affected by other filter conditions that you specified. Only nodes that meet all the filter conditions that you specify are displayed.
  • If you search for nodes by node name, fuzzy match is supported. After you enter a keyword, all real-time computing nodes whose names contain the keyword are displayed.
2 In this section, you can view the basic information and details of a real-time computing node.
  • Node Name: the name of the real-time computing node. You can click the node name to go to the details page of the node.
  • Node ID: the ID of the real-time computing node.
  • Status: the status of the real-time computing node. The node can be in one of the following states: Not Running, Generating, Pending (Resources), Starting, Running, Stopped, Failed, Restarting, and Undeploying.
  • Start running: the time when the real-time computing node started to run.
  • End Running: the time when the real-time computing node finished running.
  • Runtime: the running duration of the real-time computing node. Unit: seconds.
  • Task type: the type of the real-time computing node.
    Note

    DataWorks supports O&M only for EMR Spark Streaming and EMR Streaming SQL nodes.

  • Engine Type: the type of the engine used to run the real-time computing node. DataWorks supports O&M only for real-time computing nodes that are run by using the EMR engine.
  • Engine Instance: the name of the EMR compute engine instance that is associated with the workspace to which the real-time computing node belongs.
  • Deployed At: the time when the real-time computing node was deployed. The time is in the yyyy-MM-dd HH:mm:ss format.
  • Responsible Person: the owner of the workspace to which the real-time computing node belongs.
Note You can sort all real-time computing nodes in ascending or descending order of Start running, End Running, or Runtime.
3 In this section, you can perform operations on a single real-time computing node, such as starting, stopping, or undeploying the node, or configuring alert rules for it.
4 In this section, you can perform an operation on multiple real-time computing nodes at a time. You can select multiple nodes and click Start, Terminate, Undeploy, or Monitoring Setting to perform the operation on these nodes.
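
The search and sort behavior described in the table can be illustrated with a minimal Python sketch. This is not the DataWorks implementation or API. The `StreamNode` class, the `search_nodes` and `sort_by_runtime` functions, and the sample node names are hypothetical and exist only to show how the filter conditions combine: a node is displayed only if it meets every condition that is set, and a node name matches if it contains the keyword. Case-sensitive matching is an assumption; the page does not state how letter case is handled.

```python
from dataclasses import dataclass


@dataclass
class StreamNode:
    """Hypothetical record holding a few of the columns shown in the node list."""
    node_id: int
    name: str
    status: str
    task_type: str
    runtime_seconds: int  # the Runtime column is measured in seconds


def search_nodes(nodes, keyword=None, node_id=None, status=None, task_type=None):
    """Return the nodes that meet every filter condition that is set."""
    matches = []
    for node in nodes:
        # Fuzzy match on the name: the name only has to contain the keyword.
        # Case-sensitive comparison is an assumption of this sketch.
        if keyword is not None and keyword not in node.name:
            continue
        if node_id is not None and node.node_id != node_id:
            continue
        if status is not None and node.status != status:
            continue
        if task_type is not None and node.task_type != task_type:
            continue
        matches.append(node)
    return matches


def sort_by_runtime(nodes, descending=False):
    """Sort nodes by running duration, in ascending or descending order."""
    return sorted(nodes, key=lambda node: node.runtime_seconds, reverse=descending)


# Hypothetical sample data.
sample_nodes = [
    StreamNode(1, "orders_stream", "Running", "EMR Spark Streaming", 3600),
    StreamNode(2, "orders_sql", "Stopped", "EMR Streaming SQL", 120),
    StreamNode(3, "clicks_stream", "Running", "EMR Streaming SQL", 900),
]

# A keyword alone matches every node whose name contains it.
by_name = search_nodes(sample_nodes, keyword="orders")

# Adding a status filter narrows the same search: both conditions must hold.
running_orders = search_nodes(sample_nodes, keyword="orders", status="Running")

# Running nodes, longest-running first.
longest_first = sort_by_runtime(
    search_nodes(sample_nodes, status="Running"), descending=True
)
```

In this sketch, the keyword `orders` alone returns nodes 1 and 2, while combining it with the Running status returns only node 1, which mirrors the note that other filter conditions affect the search result.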

View the details of a real-time computing node

In the node list, find the real-time computing node that you want to view and click the node name to open the details panel of the node. In the details panel, you can view the details, runtime logs, operation logs, and code of the node on the Details, Runtime Log, Operations Log, and Code tabs. This helps you quickly review the operations that were performed on the node, obtain details about the errors reported for the node, and fix the errors.
Figure: Details page of a real-time computing node