On the Deployments page of the fully managed Flink console, you can view the numbers and lists of normal jobs, abnormal jobs, and jobs that have risks to monitor job health in real time. This topic describes how to view the numbers and lists of abnormal jobs and jobs that have risks.

Background information

The following list describes abnormal jobs and jobs that have risks.

Job type: Abnormal job
Description: The system refreshes the job status every minute. If the state of a job is FAILED, the job is considered abnormal.

Job type: Job that has risks
Description: The system refreshes the job status every minute. If the system detects a risk for a job that is running, the system displays the risk type for the job on the right side of RUNNING. The risk type is Unstable, Failing, or ClusterUnreachable.
  • Unstable: The job fails to run three times within an hour.
  • Failing: The job fails to run three times within 10 minutes.
  • ClusterUnreachable: The data of the JobManager is missing.
The system classifies jobs that have risks into the following three levels based on the severity of the risks:
  • High: The job has risks that may cause errors or data inaccuracy.
  • Mid: The job may have a performance bottleneck.
  • Low: The job may have low resource utilization. You can configure parameters to increase resource utilization or reduce resources.
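The risk-type rules above can be illustrated with a short sketch. This is not the service's implementation: the function name `classify_risk`, its parameters, the "at least three failures" reading of the thresholds, and the order in which the rules are checked (ClusterUnreachable first, then Failing, then Unstable) are all assumptions made for illustration.

```python
from datetime import datetime, timedelta

# "Three times" from the descriptions above; read here as "at least three" (assumption).
FAILURE_THRESHOLD = 3


def classify_risk(failure_times, now, jobmanager_data_present=True):
    """Return a risk type for a RUNNING job, or None if no rule matches.

    failure_times: datetimes at which the job failed to run (hypothetical input).
    now: the time of the current status refresh.
    jobmanager_data_present: False when JobManager data is missing.
    """
    if not jobmanager_data_present:
        return "ClusterUnreachable"

    def failures_within(window):
        # Count failures that happened in the interval [now - window, now].
        return sum(1 for t in failure_times if timedelta(0) <= now - t <= window)

    # Check the shorter window first so that a burst of failures is
    # reported as Failing rather than Unstable (assumed precedence).
    if failures_within(timedelta(minutes=10)) >= FAILURE_THRESHOLD:
        return "Failing"
    if failures_within(timedelta(hours=1)) >= FAILURE_THRESHOLD:
        return "Unstable"
    return None


now = datetime(2024, 1, 1, 12, 0)
recent = [now - timedelta(minutes=m) for m in (1, 5, 9)]
spread = [now - timedelta(minutes=m) for m in (5, 30, 55)]
print(classify_risk(recent, now))   # three failures inside 10 minutes
print(classify_risk(spread, now))   # three failures inside one hour
```

In this sketch, three failures inside 10 minutes also fall inside the one-hour window, which is why the 10-minute rule is evaluated first.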

Precautions

After you filter jobs that have risks or abnormal jobs, click the name of a job to go to its details page, and then click Diagnosis to view the cause of the risk or exception. Based on the recommendations that the system provides, you can fix the issue or tune the performance of the job to restore it to normal. For more information about how to diagnose a job, see Job diagnostics.

Procedure

  1. Log on to the Realtime Compute for Apache Flink console.
  2. On the Fully Managed Flink tab, find the workspace that you want to manage and click Console in the Actions column.
  3. In the left-side navigation pane, choose Applications > Deployments.
  4. View the following information based on your business requirements.
    • Numbers of jobs that have risks and abnormal jobs
      In the upper part of the Deployments page, view the numbers of jobs that have risks and abnormal jobs.
    • List of jobs that have risks
      In the upper part of the Deployments page, click Risk.
      Note: To remove the filter and display all jobs, click Risk again.
    • List of abnormal jobs
      In the upper part of the Deployments page, click Failed.
      Note: To remove the filter and display all jobs, click Failed again.
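
The counters and filters in the procedure above can be modeled with a minimal sketch. It only illustrates the behavior described in this topic and does not call any console API: the record fields (`name`, `state`, `risk`), the sample job names, and the helper functions are hypothetical. Treating every job that is neither FAILED nor at risk as normal, and switching filters when a different button is clicked, are also assumptions.

```python
# Hypothetical job records; the field names are illustrative only.
jobs = [
    {"name": "orders-etl", "state": "RUNNING", "risk": None},
    {"name": "clicks-agg", "state": "RUNNING", "risk": "Unstable"},
    {"name": "billing-sync", "state": "RUNNING", "risk": "Failing"},
    {"name": "audit-export", "state": "FAILED", "risk": None},
]


def is_abnormal(job):
    # A job whose state is FAILED is considered abnormal.
    return job["state"] == "FAILED"


def has_risk(job):
    # A risk type is shown only for a job that is running.
    return job["state"] == "RUNNING" and job["risk"] is not None


def summarize(jobs):
    """Count normal jobs, jobs that have risks, and abnormal jobs."""
    counts = {"normal": 0, "risk": 0, "abnormal": 0}
    for job in jobs:
        if is_abnormal(job):
            counts["abnormal"] += 1
        elif has_risk(job):
            counts["risk"] += 1
        else:
            counts["normal"] += 1  # assumption: everything else counts as normal
    return counts


def toggle_filter(active, clicked):
    """Clicking the active filter again removes it; otherwise the clicked one applies."""
    return None if active == clicked else clicked


def apply_filter(jobs, active):
    """Return the jobs shown for the active filter ("Risk", "Failed", or None)."""
    if active == "Risk":
        return [job for job in jobs if has_risk(job)]
    if active == "Failed":
        return [job for job in jobs if is_abnormal(job)]
    return list(jobs)


print(summarize(jobs))
active = toggle_filter(None, "Risk")          # click Risk
print([job["name"] for job in apply_filter(jobs, active)])
active = toggle_filter(active, "Risk")        # click Risk again to show all jobs
print(len(apply_filter(jobs, active)))
```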