This topic describes the features and usage of Logview V2.0 (hereinafter referred to as Logview). You can access the URL of Logview to view job information.

Overview

Logview is used to record and display the status of MaxCompute jobs. Logview provides the following features:
  • Provides an interactive directed acyclic graph (DAG) to display the logical architecture of job processing. You can also view the operators of a job.
  • Allows you to play back the job running process.
  • Allows you to view memory usage and CPU utilization by using Fuxi Sensor.

Entry point

When you submit a MaxCompute job, the system automatically generates a URL that starts with https://logview.aliyun.com/logview.
  • If you use the MaxCompute client to submit the job, you must copy the URL to the address bar of a browser to access Logview.
  • If you use DataWorks to submit the job, you can click the URL to directly go to the Logview page.
The following figure shows the Logview page. URL
No.Section
1The title and functionality section. For more information, see Title and functionality section.
2The Basic Info section. For more information, see Basic Info section.
3The job details section. For more information, see Job details section.

Title and functionality section

This section shows the job ID and job name. The job ID uniquely identifies a MaxCompute job. The job ID is generated when you submit the job. The job name is displayed only if the job is submitted by using an SDK. You can also click the icons on the right of this section to perform operations.
IconDescription
OpenOpen the Logview_detail.txt file that contains job details. The file is saved on your computer.
BackReturn to the Logview V1.0 page.
SaveSave the job details as a file to your computer.

Basic Info section

This section shows the basic information about a job.
ParameterDescription
MaxCompute ServiceThe endpoint of MaxCompute on which the job runs. For more information, see Endpoints.
ProjectThe name of the MaxCompute project to which the job belongs.
Cloud accountThe Alibaba Cloud account that is used to submit the job.
TypeThe type of the job. Valid values: SQL, SQLRT, LOT, XLib, CUPID, AlgoTask, and Graph.
StatusThe status of the job. Valid values:
  • Success: The job succeeds.
  • Failed: The job fails.
  • Canceled: The job is canceled.
  • Waiting: The job is being processed in MaxCompute but is not submitted to Job Scheduler.
  • Running: The job is being processed in Job Scheduler.
  • Terminated: The job is complete.
Start TimeThe time at which the job was submitted.
End TimeThe time at which the job is complete.
LatencyThe period during which the job is run.
ProgressThe progress of the job.
PriorityThe priority of the job.
QueueThe position of the job in the queue in the resource quota group.

Job details section

In the job details section, you can query details about a job. This section consists of the following tabs:

Job Details

  • Progress chart
    In the upper part of the Job Details tab, the progress chart of a job is displayed. The progress chart shows the subtask dependencies from the following dimensions: Fuxi jobs, Fuxi tasks, and operators. The progress chart also provides a series of tools to help you identify issues. The following figure shows the upper part of the Job Details tab. Progress chart
    No.Description
    1The breadcrumb navigation that is used to switch Fuxi jobs. JOB:_SQL_0_0_0_job_0 is the name of a Fuxi job.
    2The tool that is used to troubleshoot issues. You can use Progress Chart, Input Heat Chart, Output Heat Chart, TaskTime Heart Chart, and InstanceTime Heart Chart for troubleshooting.
    3You can click the Refresh icon icon to refresh the job status and click the Full Screen Display icon to zoom in or out on the progress chart. You can also click the Help icon icon to obtain MaxCompute Studio documentation and the Switch Level icon to switch to the upper level of the job.
    4The zoom tool.
    5The Fuxi task. A MaxCompute job consists of one or more Fuxi jobs. Each Fuxi job consists of one or more Fuxi tasks. Each Fuxi task consists of one or more Fuxi instances. If the amount of input data increases, MaxCompute starts more nodes for each task to process the data. A node is equivalent to a Fuxi instance. For example, a simple MapReduce job generates two Fuxi tasks: map task (M1) and reduce task (R2). If an SQL statement is complex, multiple Fuxi tasks may be generated.

    You can view the name of each Fuxi task on the execution page. For example, M1 indicates a map task. The 3 and 9 fields in R4_3_9 indicate that the map task can be run only after M3 and C9_3 are complete. Similarly, M2_4_9_10_16 indicates that the M2 task can be run only after R4_3_9, C9_3, R10_1_16, and C16_1 are complete. R/W indicates the numbers of rows that the task reads and writes.

    Click or right-click a task to view the operator dependencies and operator graphs of the task.

    You can quickly view the input and output tables.

    6The Fuxi task playback. You can click the Play icon to start or stop the playback. You can also drag the progress bar. The start time and end time are displayed on the sides of the progress bar. The playing time is displayed in the middle.
    7The thumbnail.
    Note
    • The playback feature is not applicable to Fuxi tasks that are in the Running state.
    • An AlgoTask job, such as a Machine Learning Platform for AI (PAI) job, contains only one Fuxi task. Therefore, no progress charts are provided for these jobs.
    • For non-SQL jobs, only Fuxi jobs and Fuxi tasks are displayed.
    • If only one Fuxi job exists, the progress chart shows the dependencies among Fuxi tasks. If multiple Fuxi jobs exist, the progress chart shows the dependencies among the Fuxi jobs.
  • Job status
    In the lower part of the Job Details tab, detailed information about the job is displayed. The following figure shows the lower part of the Job Details tab. Job status
    No.Description
    1The Fuxi Jobs tab. You can switch Fuxi jobs on this tab.
    2The details about the Fuxi tasks of the Fuxi job. Click a Fuxi task to display information about the Fuxi instance of this task. By default, information about the Fuxi instance of the first Fuxi task for the first Fuxi job is displayed.
    For AlgoTask jobs and jobs that run in the Cupid console, the Sensor column appears in this section. You can click the sensor that corresponds to a Fuxi task to view the CPU and memory information of the Fuxi instance. Fuxi jobs are classified into the following types:
    • M: A job whose ID starts with M is a data scanning job.
    • R: A job whose ID starts with R is a Reduce job.
    • J: A job whose ID starts with J is a JOIN job.
    • C: A job whose ID starts with C indicates a virtual node. The node does not involve calculation and is used only for branch selection.
    Note
    • Fuxi Sensor is available in the China (Chengdu), China (Shenzhen), China (Shanghai), China (Hangzhou), China (Zhangjiakou), and China (Beijing) regions.
    • If a Fuxi task is in the Interrupted state, the task may be rerun. In this case, the value in the Progress column indicates only the rerun progress. As a result, the entire job is successful but the progress of this Fuxi task is not 100%. This condition is normal.
    3Logview divides instances into groups based on their status. You can click the value next to Failed to query information about faulty nodes.
    4Fuxi instances.
    • In the preceding example, the ID of the Fuxi instance is M1#0_0. The first 0 in the ID of the Fuxi instance indicates the auto-increment ID of the Fuxi instance. The second 0 in the ID of the Fuxi instance indicates the number of times that the Fuxi task is rerun. The value 0 indicates that the Fuxi task fails to rerun.
    • StdOut and StdErr. You can view the output messages, error messages, and information that you want to display. You can also download the information.
    • Debug. You can debug and troubleshoot errors.
    • Limits on the number of Fuxi instances.
      The number of Fuxi instances is limited to 1 when the SQL statement that you use meets one of the following conditions:
      • The LIMIT clause is used.
      • The PARTITION BY KEY clause is not included in the SQL statement that is used together with a window function.
      • The GROUP BY KEY clause is not included in the SQL statement that is used together with an aggregate function.
      • No equi-join key is included in the SQL statement that is used to perform JOIN operations.
      • The ORDER BY clause is used.

Fuxi Sensor

Fuxi Sensor is a resource view that displays information about a MaxCompute job in all dimensions. You can use Fuxi Sensor to view the memory usage and CPU utilization of a Fuxi instance. You can also use Fuxi Sensor to identify job issues and analyze job performance. For example, you can use Fuxi Sensor in the following scenarios:
  • If an out-of-memory (OOM) error occurs, analyze the size of memory that is used.
  • Compare the numbers of requested and used resources to optimize the resource request process.
For example, you can use Fuxi Sensor to view the resource usage of a Fuxi instance.
  • CPU utilization
    The cpu_usage chart has two lines. One line indicates the number of CPUs requested (cpu_plan), and the other line indicates the number of CPUs used (cpu_usage). In the y-axis, 400 indicates four processors. CPU
  • Memory usage

    The mem_usage chart has two lines. One line indicates the number of memory resources requested (mem_plan), and the other line indicates the number of memory resources used (mem_usage).

    mem_usage contains Resident Set Size (RSS) and PageCache. RSS indicates the memory that is allocated after kernel page faults occur. RSS is used when you call Malloc to request memory by using non-file mappings. If the memory is insufficient, RSS cannot be reclaimed. PageCache is the memory that is occupied by the kernel to cache the files that are required by read and write requests, such as log files. If the memory is insufficient, PageCache can be reclaimed.
    • Memory detailsmem_usage
    • RSS usageRSS
    • PageCache usagePage-cache

Result

The Result tab displays information based on the job status:
  • If the job is successfully run, the job result is displayed.
    • You can use the following command to disable the display of the job result.
      setproject odps.forbid.fetch.result.by.bearertoken=true;
    • You can configure the odps.sql.select.output.format parameter to specify the result display format.
      -- To display the result in the CSV format, set this parameter to csv.
      set odps.sql.select.output.format=csv;
      -- To display the result in the text format, set this parameter to HumanReadable.
      set odps.sql.select.output.format=HumanReadable;
  • If the job fails to run, the cause of the failure is displayed.

SourceXML

  • XML

    Job levels and task-related information are displayed in the XML format on this tab.

  • Settings

    The settings of the current task are displayed on this tab.

  • Command

    The flag settings of the current task are displayed on this tab.

SQL Script

The SQL script for the current task is displayed on this tab.

History

The history of the current task is displayed on this tab.

SubStatusHistory

The status of the current task is displayed on this tab. substatus
Message for status codeDescription
Waiting for schedulingThe job is submitted and is waiting for the MaxCompute framework to schedule. In most cases, the waiting time is short.
Waiting for cluster resourceThe job is waiting for resources because the MaxCompute framework identifies that the resources of the Fuxi computing cluster are insufficient.
Waiting for concurrent task slotThe project-level throttling is triggered. You can configure the number of SQL tasks that can be submitted at the same time for a project.
Waiting for data replicationThe job is waiting for data to be replicated.
Waiting for execution slotThe system-level throttling is triggered.
Waiting for cleaning up of previous task attemptThe job is waiting for the cleanup operation of historical tasks to be complete.
Waiting for executionThe job is waiting to be distributed from the parent process queue to a child process. In most cases, the waiting time is short.
Preparing for executionThe job is going to be distributed to a child process. If the child process is abnormal, the preparation time may be long.
Task is executingThe job is being processed in the MaxCompute framework.
SQLTask is initializingThe SQL task is being initialized.
SQLTask is compiling queryThe SQL task is being compiled.
SQLTask is optimizing queryThe SQL task is optimizing queries. If the execution plan is complex, the optimization time is slightly long. If the optimization time is excessively long, an error may occur.
SQLTask is splitting data sourcesThe SQL task is splitting data sources for optimization.
SQLTask is generating execution planThe SQL task is generating an execution plan. If this process tasks a long period of time, data may be read from an excessive number of partitions or small files.
SQLTask is submitting execution planThe SQL task is submitting the execution plan.
Job has been submittedThe job is submitted to the computing cluster.
Offline Job Waiting for runningBefore the job is submitted to the Fuxi computing cluster, the MaxCompute framework identifies that computing resources are available for the job. However, after the job is submitted, the MaxCompute framework identifies that no computing resources are available in the quota group. As a result, the job waits for computing resources. This status code appears only once and no longer appears even if no computing resources are available for the job.
Offline Job is runningIf no resources are available for the Fuxi job that is running, this message appears. For example, high-priority jobs preempt resources. As a result, specific Fuxi instances cannot run and the instances are in the ready state.
Offline Job is failedThe Fuxi job fails to run.
Offline Job is succeedThe Fuxi job is successful.
SQLTask is updating meta informationThe SQL task is updating the metadata information status and generating dynamic partitions. This process may be time-consuming.
SQLTask is finishingThe SQL task is complete.
Online Job is cancelled by fuxiThe job in service mode is canceled.
Task rerunThe job reruns. The job in service mode may fail to run and change to run in offline mode. Alternatively, data is replicated across different clusters.
Online Job Waiting for runningThe job in service mode is waiting to be run.
Online Job is runningThe job in service mode is running.
Online Job is failedThe job in service mode fails to run.
Online Job is succeedThe job in service mode is successful.
Online Job is cancelled by fuxiThe job in service mode is canceled.
Task key-path executing finishedThe key path of the job is complete, but information such as DetailStatus is not generated.
Task key-path is finishedThe key path of the job is complete.
Instance key-path is finishedThe key path of the instance is complete.
Task execution is finishedThe job is complete, and DetailStatus is generated.
Instance execution is finishedThe job is complete.
Execution failedThe job fails to run.