This topic describes how to view job instance details in MaxCompute Studio.
View job instance details
- Use a Logview URL or a local offline Logview file to open the details of a job instance in read-only mode.
Logview is a common method to view details about job instances. It allows you to view the status of tasks that are submitted by other users in other projects. You can view details about a job instance by entering a valid Logview URL in MaxCompute Studio.
In the top menu bar of IntelliJ IDEA, open the Open job detail by logview dialog box. In this dialog box, you can enter a valid Logview URL or select a local offline Logview file.
- In Job Explorer, double-click a MaxCompute job instance to view its details.
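Before you paste a Logview URL, a quick check can confirm that it is complete. The following is a minimal sketch, not part of MaxCompute Studio. It assumes that a Logview URL carries the endpoint, project, instance ID, and token in the query parameters h, p, i, and token. The host names and values in the example URL are placeholders.

```python
from urllib.parse import urlparse, parse_qs

# Assumed query keys of a Logview URL:
# h = endpoint, p = project, i = instance ID, token = access token.
REQUIRED_KEYS = ("h", "p", "i", "token")


def parse_logview_url(url):
    """Return the assumed Logview parameters, or raise ValueError if any is missing."""
    query = parse_qs(urlparse(url).query)
    missing = [key for key in REQUIRED_KEYS if key not in query]
    if missing:
        raise ValueError(f"incomplete Logview URL, missing parameters: {missing}")
    return {key: query[key][0] for key in REQUIRED_KEYS}


# Hypothetical URL with placeholder values.
example = (
    "http://logview.example.com/logview/"
    "?h=http://service.example.com/api&p=demo_project"
    "&i=20240101000000000example&token=PLACEHOLDER"
)
info = parse_logview_url(example)
print(info["p"], info["i"])
```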
Job details view
- Execution view: displays the overall information for a job in the form of a directed acyclic graph (DAG). You can view the dependencies and detailed execution plans for each subtask.
- Details: displays the details about a job as a table. These details include the subtask list, worker list for each subtask, amount of data that is processed by the workers, execution time, and job status.
- Script: displays the SQL statement and parameter settings that are configured for a job that is submitted.
- Result: displays the execution results of a job.
- Timeline view: displays the execution timeline of a job. This view provides multiple filters and allows you to view the timeline based on different granularities.
- Summary (JSON): displays the running details about a job in the JSON format.
- Analysis: provides scatter charts, long tail distributions, and skewed data charts to show the execution results of a job.
- Expand or collapse icons: expand or collapse the views in the left and right panes so that you can focus on a specific view.
- Stop icon: stops a running job. Only the project owner or an administrator has the permission to stop running jobs.
- Refresh icon: refreshes the job details. Basic information such as the status and quota of a job is refreshed automatically, but the detail views in the right pane are not. To view up-to-date details, refresh them manually.
- Copy icon: copies the Logview URL of the job.
- Browser icon: opens the job details in a browser. Click this icon to generate a Logview URL and open the URL in a browser.
- Save icon: saves the job details as a local file.
- Auto refresh icon: enables or disables the auto refresh feature. If this feature is enabled, MaxCompute Studio refreshes all details about a running job on a scheduled basis.
Basic information page
This page displays the basic information about a job, including its ID, owner, status, start time and end time, computing resource usage, input tables, and output tables. The basic information about a running job is automatically refreshed on a scheduled basis.
You can double-click a table name to view the basic information about the table.
The Execution view is used to display the dependencies in the Job Scheduler Job, Job Scheduler Task, and Operation layers. It also provides a series of auxiliary tools such as job playback, progress view, and heat map view. The Execution view is useful for troubleshooting.
- Click to go to another layer.
- Zoom in and zoom out.
- The dependency table.
- The Job Scheduler task worker.
- The thumbnail.
- Click to display the job dependencies.
- The dependencies in the Job Scheduler Task layer are displayed by default.
- Job Scheduler Job layer
Click MaxCompute Job to open the Job Scheduler Job layer. Workers in the Job Scheduler Job layer include the name, start time, and end time of each Job Scheduler task.
- Job Scheduler Task layer
Double-click a Job Scheduler job worker to go to the Job Scheduler Task layer.
If multiple Job Scheduler jobs exist, the Job Scheduler Task layer of the last Job Scheduler job is opened by default. This layer displays the dependencies, input tables, output tables, and partitions of Job Scheduler tasks. After a job is completed, you can use the drop-down list in the toolbar to switch between Progress Chart, Input Heat Chart, Output Heat Chart, TaskTime Heat Chart, and InstanceTime Heat Chart. The progress chart displays the task progress for each worker. The heat charts use different colors to distinguish the heat of different workers.
A Job Scheduler task worker shows the following information:
- Instance Count: expressed in the format of a/b/c. This expression indicates the status of the subtask instances at a specific point in time. The value of a represents the number of running subtask instances, the value of b represents the number of completed subtask instances, and the value of c represents the total number of subtask instances.
- I/O Records: displays the numbers of input and output records at a specific point in time.
- Percentage and orange progress bar: indicates the running status of the task. The percentage is obtained based on the analysis of running subtask instances.
- Line that connects subtasks: shows the number of output records. The arrow indicates the direction of data flow.
- Operation layer
Double-click a Job Scheduler task to open the Operation layer.
The Operation layer reveals how Job Scheduler tasks run internally. You can click a worker to view all Operation layer information.
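The Instance Count value a/b/c shown on a Job Scheduler task worker can be read programmatically, for example when you copy values out of the view for a report. The following is a minimal sketch. The function and class names are hypothetical, and the "not started" figure is derived here as the total minus the running and completed instances rather than taken from MaxCompute Studio.

```python
from typing import NamedTuple


class InstanceCount(NamedTuple):
    running: int    # a: running subtask instances
    completed: int  # b: completed subtask instances
    total: int      # c: total subtask instances


def parse_instance_count(text: str) -> InstanceCount:
    """Parse an Instance Count string in the a/b/c format."""
    parts = text.strip().split("/")
    if len(parts) != 3:
        raise ValueError(f"expected a/b/c, got {text!r}")
    running, completed, total = (int(part) for part in parts)
    return InstanceCount(running, completed, total)


def not_started(count: InstanceCount) -> int:
    """Instances that are neither running nor completed (derived value)."""
    return count.total - count.running - count.completed


count = parse_instance_count("3/5/10")
print(count, not_started(count))
```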
The Details view is used to display details for SQL data manipulation language (DML) jobs. This view displays the Job Scheduler task lists and compute worker lists of SQL DML jobs on the compute cluster. Typically, each job is associated with one or more Job Scheduler jobs. Each Job Scheduler job is divided into multiple Job Scheduler tasks (stages), and each Job Scheduler task contains multiple Job Scheduler instances (workers). You can right-click a Job Scheduler instance to view the standard output, standard errors, and debugging information.
- The Job Scheduler job tab.
- The Job Scheduler task list.
- The details and compute worker list of each Job Scheduler task.
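The hierarchy that the Details view presents (a job, its Job Scheduler jobs, their tasks, and the instances of each task) can be modeled as nested records. The following is an illustrative sketch only. All class names, field names, task names, and status values are hypothetical and do not reflect the internal data model of MaxCompute Studio.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Instance:
    """One Job Scheduler instance (worker)."""
    instance_id: str
    status: str


@dataclass
class Task:
    """One Job Scheduler task (stage) with its instances."""
    name: str
    instances: List[Instance] = field(default_factory=list)


@dataclass
class SchedulerJob:
    """One Job Scheduler job with its tasks."""
    name: str
    tasks: List[Task] = field(default_factory=list)


@dataclass
class MaxComputeJob:
    """A MaxCompute job associated with one or more Job Scheduler jobs."""
    job_id: str
    scheduler_jobs: List[SchedulerJob] = field(default_factory=list)

    def worker_count(self) -> int:
        # Count every instance across all Job Scheduler jobs and tasks.
        return sum(
            len(task.instances)
            for scheduler_job in self.scheduler_jobs
            for task in scheduler_job.tasks
        )


job = MaxComputeJob(
    "demo_instance",
    [
        SchedulerJob(
            "job_0",
            [
                Task("M1", [Instance("M1#0", "done"), Instance("M1#1", "running")]),
                Task("R2_1", [Instance("R2_1#0", "waiting")]),
            ],
        )
    ],
)
print(job.worker_count())
```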
MaxCompute Studio provides the job playback feature. You can view the execution history of a job within 12 seconds in the same manner as playing a media file. This feature helps you understand the running status of a MaxCompute job instance at different time points. You can determine the order and the amount of time to run subtasks and obtain the key path to run the job. This allows you to optimize subtasks that run slowly.
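To make the idea of a key path concrete, the following sketch computes the longest chain of dependent subtasks by total duration. It is one common way to define such a path and is not necessarily how MaxCompute Studio derives it. The subtask names, durations, and dependencies are hypothetical.

```python
from functools import lru_cache

# Hypothetical subtasks: name -> (duration in seconds, upstream subtasks).
TASKS = {
    "M1": (30, []),
    "M2": (10, []),
    "J3": (20, ["M1", "M2"]),
    "R4": (5, ["J3"]),
}


@lru_cache(maxsize=None)
def longest_path_to(name):
    """Return (total seconds, path) of the slowest chain that ends at `name`."""
    duration, upstream = TASKS[name]
    best_time, best_path = 0, ()
    for dep in upstream:
        dep_time, dep_path = longest_path_to(dep)
        if dep_time > best_time:
            best_time, best_path = dep_time, dep_path
    return best_time + duration, best_path + (name,)


def key_path():
    """Return the slowest dependency chain in the whole graph."""
    return max((longest_path_to(name) for name in TASKS), key=lambda item: item[0])


total, path = key_path()
print(total, path)
```

In this example, M2 finishes well before M1, so speeding up M2 would not shorten the job. Only the subtasks on the returned path are worth optimizing first.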
The Timeline view displays the details for the distributed execution of a job in a Gantt chart. You can adjust the display granularity to show all compute workers in a Gantt chart. Gantt charts display the time bottlenecks and long-tail workers of running jobs in a clear manner. You can also use multiple filters to select the key path, the largest data worker, and the worker that consumes the longest amount of time during the job execution.
The Analysis view displays the long-tail workers and skewed data workers of jobs. It also displays worker scatter charts and column charts to help diagnose job execution bottlenecks. Scatter charts and column charts allow you to click a worker to open the details view page and view the details about Job Scheduler instances.
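A long-tail worker is one that runs much longer than its peers in the same task. The following sketch flags workers whose duration exceeds a multiple of the median duration. The threshold factor of 2.0 and the worker names are assumptions for illustration. The criterion that MaxCompute Studio applies is not stated in this topic.

```python
from statistics import median


def find_long_tail(durations, factor=2.0):
    """Return the workers whose duration exceeds factor times the median.

    durations maps a worker name to its run time in seconds.
    """
    if not durations:
        return []
    typical = median(durations.values())
    return sorted(name for name, seconds in durations.items() if seconds > factor * typical)


# Hypothetical run times of the workers in one task.
durations = {"M1#0": 12.0, "M1#1": 11.0, "M1#2": 13.0, "M1#3": 48.0}
print(find_long_tail(durations))
```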
The Result view displays information based on the job type and the parameters that are set when you submit the job.
- For SELECT statements that contain odps.sql.select.output.format = HumanReadable, this view displays the result in the text format.
- For SELECT statements that do not specify the output format parameter, this view displays the result in the table format.
- For scripts that are run to export data to a table, this view displays the name of the output table and the link that redirects to the table details.
- For abnormal jobs, this view displays the error details.
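The output format parameter mentioned above is a session-level setting. The following sketch builds a script that places the setting in front of a query. It assumes that the parameter is supplied through a set statement that precedes the SELECT statement. The helper name and the table name demo_table are hypothetical.

```python
def with_session_flags(query, flags):
    """Prefix a query with one set statement per flag (assumed syntax)."""
    lines = [f"set {key}={value};" for key, value in flags.items()]
    return "\n".join(lines + [query])


script = with_session_flags(
    "SELECT * FROM demo_table LIMIT 10;",  # hypothetical table
    {"odps.sql.select.output.format": "HumanReadable"},
)
print(script)
```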