Manage real-time instances

A real-time instance is created from the data that is generated after a real-time task is submitted to the Operation Center. Dataphin lets you view data from real-time tasks and manage real-time instances. Management operations include viewing tasks, instance code, and instance parameters. This topic describes the statuses of real-time instances and explains how to manage them.

Go to the real-time instance page

In the top navigation bar of the Dataphin homepage, choose Develop > O&M.
In the top navigation bar, select the production or development environment.
In the navigation pane on the left, choose Instance O&M > Real-time Instance to go to the Real-time Instance page.

Introduction to the real-time instance page

The Real-time Instance page lists the real-time instances that the system has generated. You can perform operations and maintenance (O&M) on this page.

Block	Description
① Filter and search area	Enter an instance object name or ID to search for a real-time instance. This feature is useful for quick searches when you know the real-time object name or ID. My Stopped Instances: Instances that have been stopped and for which the current user is the schedule owner. Instances I Operated Today: Instances that the current user has operated on today and for which the current user is the schedule owner. Project: Filter by the project to which the node belongs. You can select multiple projects for which the current user has O&M-Access Directory permissions. Expand Filter: Click Filter to expand all filter options. The options include the following: Subtype: Includes Flink SQL, Flink Template SQL, Flink DataStream, and real-time integration. Running Status: Select one or more running statuses to filter instances. The statuses include Completed, Starting, Startup Failed, Running, Failed, Stopping, and Stopped. Last Operator: Filter by the last user who operated on the instance. Owner: Filter by the O&M owner of the instance. Engine Type: Filter by DPI engine type. The types include Ververica Flink, Blink, Flink, and FusionInsight Flink. Cluster: The cluster ID of the Ververica Flink real-time computing engine. Engine Version: The database engine version used by the instance. Operated On: The time when the instance was operated on. Reset: After you open all filter options, click Reset to quickly clear the selected filter conditions.
② List operations	View Instance Statistics: Click to go to the Instance Statistics page to view details. For more information, see View instance statistics. Export Instance List: Click to export the filtered or searched instances. The exported file is in the .csv format. Customize list header: Click the icon to select the items to display in the list and adjust their order. Note The first column and the Actions column cannot be hidden or reordered. If you clear the browser cache or switch browsers, the header settings are reset to default. Refresh list: Click the icon to refresh the current list.
③ Real-time instance list	The Real-time Instance page lists the real-time instances that the system has generated. The list includes information such as instance object, running status, last operator, last update time, owner, and project. Instance Object: When a real-time task is submitted and published to the Operation Center, a real-time instance object is generated. Its initial running status is Stopped. You can manually start it. Running Status: The current running status of the instance. The statuses include Starting, Running, Stopping, Stopped, Failed, and Startup Failed. For more information, see Real-time instance running statuses. Consumption Latency (s): The latency of the node, which is the latency in data source consumption. Last Operator: The resources consumed by the instance during runtime. Last Updated By: The account that last updated the instance. Owner: The O&M owner of the instance. Project: The project to which the instance belongs. The format is `Project English Name (Project Chinese Name)`. Engine Type: The DPI engine type used by the current real-time instance. Cluster: The cluster in which the real-time instance runs. This applies only to the Ververica Flink real-time engine. For other DPI engine instances, this field is empty. Resource Queue: The resource queue in which the real-time instance runs. This applies only to Blink and Flink real-time engines. For other DPI engine instances, this field is empty. Engine Version: The database engine version on which the real-time instance runs. The Actions column in the list displays the O&M operations supported for real-time instances. For information about individual O&M operations, see Individual operations.
④ Batch operations	Use the Start, Stop, and Unpublish functions in the batch operations area to process real-time instances in batches. This improves operational efficiency. For information about batch O&M operations, see Batch operations.

Real-time instance running status description

Status icon	Status	Description
	Completed	The instance that processes bounded stream data is complete.
	Starting	The instance has started, but its operating system is not running.
	Running	The instance is running.
	Stopping	The stop operation has been triggered, and the system is stopping the instance.
	Stop	The running instance is stopped.
	Failed	The instance failed to run.
	Startup Failed	The instance failed to start.

Single operation

Operation	Description
Start	In the Actions column of the target real-time instance, click the icon to start the instance. For more information, see Start a real-time instance.
Stop	In the Actions column of the target real-time instance, click the icon to stop the instance that is in the Running state. Two stop modes are supported: Stateless Stop and Stop And Retain Current State. Stateless Stop: Does not save the current state (Savepoint). The next time the job starts, it will be a stateless start. Important A stateless stop clears previously saved Savepoints but does not clear Checkpoints. Stop and Retain Current State: Saves the current state (Savepoint). The next time the job starts, you can choose to start it from the saved state. Note The Blink real-time computing engine does not support selecting Stateless Stop and Stop with Current State Preserved. Real-time integration instances support only stateless stop. After you click Stop, the real-time incremental node stops, and all offline nodes that are being synchronized also stop.
Unpublish	In the Actions column of the target real-time instance, click the icon to unpublish the instance from the production environment. You can unpublish instances that are in the Stopped, Failed, or Startup Failed state.
View Running Analysis	In the Actions column of the target real-time instance, click the icon to view the running information, data curves, Failover, and other parameters of the instance. The compute engine is Apache Flink. For more information, see View running analysis. For real-time instances that use the Ververica Flink or Blink real-time engine, see View job details.
View Real-time Task	In the Actions column of the target real-time instance, click the icon and select View Real-time Task. You are navigated to the node page that generated the current instance. On this page, you can view the details of the real-time task.
View Instance Code	In the Actions column of the target real-time instance, click the icon and select View Instance Code to view the node code that generated the current instance. If the compute engine is Flink, click Logical Code or Physical Code to switch between views. Logic Code: The code that you write for a task. Physical Code: Compiled code that can run on the Flink engine.
View Instance Parameters	In the Actions column of the target real-time instance, click the icon and select View Instance Parameters to view the runtime parameters of the instance, such as `metrics.reporter.promappmgr.port`.
View Sync Objects	In the Actions column of the target real-time instance, click the icon and select View Sync Objects to open the Sync Objects tab of the instance details page. The details include Incremental synchronization and Full synchronization. Incremental synchronization: Displays the running status, start time, latency, and latest read offset. You can start, stop, view running logs, and view node code. Running status: The running status of the original real-time integration node. The statuses include Completed, Starting, Startup Failed, Running, Failed, Stopping, and Stopped. If the node fails to run, you can view the error cause. If the real-time task is paused, its status is Running, but the running status here is displayed as Running (Paused). Start time: The start time of the real-time incremental synchronization. Latency: The latency of the real-time incremental node. Latest read offset: The time offset that has been read. Full synchronization: Displays the overall sync status, sync progress, source object, target object, status, sync time, duration, speed, and data volume. You can search by source or target object name and filter by object sync status. Overall sync status: The overall sync status of all source objects in a full synchronization. The statuses include Not Started, Synchronizing, Completed, and Failed. Not Started: The sync status of all source objects is Not Started. In this state, the sync progress is not displayed. Synchronizing: One or more source objects are being synchronized. Completed: The sync statuses of all source objects are Completed, Ignored, or Stopped. Failed: One or more source objects failed to be synchronized. Sync progress: The overall progress of the full synchronization. The format is `Number of completed objects/Total number of objects`. Source object and Target object: The formats are `schema.source/target_table_name` and `source/target_data_source_name`. Status: Includes Not Started, Stopping, Stopped, Ignored, Synchronizing, Failed, and Completed. Not Started: The object is not running. It is waiting for the incremental node to be paused before starting the synchronization. Stopping: The last temporary node that was being synchronized was manually stopped but has not finished stopping. Stopped: The previous synchronization task was manually stopped. Ignored: The node status is displayed as Ignored after you click Ignore when the node has not started or has failed. Syncing: The latest temporary task is running. Synchronization failed: The last temporary task failed. Synchronization complete: The last temporary task was successful. Sync time: The start and end time of the temporary node. The format is `Start time - End time`. If the node has not started or finished, this is displayed as -. Duration: The total time that the temporary task runs. Speed: The synchronization speed. The formula is `Total data volume synchronized by the last completed temporary node/Duration`. Data volume: The total data volume synchronized by the last completed temporary node. Operations: You can view logs, run, stop, and ignore objects only when the Incremental synchronization status is Running. View logs: Click to open the log page of the last temporary node. Run and Batch run: Supported only when the status is Ignored, Stopped, Failed, or Completed. When a full synchronization is performed on a sync object, the data in the corresponding target table is cleared. The schedule resource group for the node is also cleared and can be adjusted. Stop and Batch stop: Supported only for sync objects that are being synchronized. Stopping an object does not block the node from running. Ignore and Batch ignore: Supported only for sync objects that are in the Not Started or Failed state. Ignoring an object does not block the node from running. Note This feature is supported only for real-time integration instances that use the incremental and full synchronization solution in the development environment.
Refresh Sync Objects	In the Actions column of the target real-time instance, click the icon and select Refresh Sync Objects. In the Refresh Sync Objects dialog box, click OK. The instance runs based on the new synchronization scope. Only the sync object scope is changed: New sync objects are added: The incremental synchronization is paused, but the node status remains Running. Temporary full synchronization nodes for the new objects are run. Full synchronization nodes that have run successfully are not run again. After all new temporary full synchronization nodes are complete, the incremental synchronization continues. Sync objects are deleted: The sync objects are removed. The incremental synchronization node continues to run with a smaller synchronization scope. Other changes are made in addition to the sync object scope (such as node configuration adjustments or target table adjustments): The incremental synchronization stops. If the synchronization scope is changed, the synchronization of new objects is completed first. Then, the incremental synchronization node is restarted. For existing full synchronization nodes, you can manually process them by object on the sync objects page as needed. Note This feature is supported only for real-time integration instances that use the incremental and full synchronization solution, are in the Running state, and have updated synchronization content.

Batch operations

Operation	Description
Start	Note The instance statuses that support the start operation vary based on the real-time engine. Flink: Stopped. Ververica Flink: Stopped, Failed, Succeeded. Blink: Stopped, Not Started. FusionInsight Flink: Stopped, Failed. Start Real-time Integration Instances: Select one or more real-time integration instances, or select all instances on the current page. Then, click Start and select Start Real-time Integration Instances. In the Batch Start Real-time Integration Instances dialog box, configure the start information. Start n instances: Displays the number of instances to start and their names. The names are separated by commas (,). Start Scope: Select Incremental + Full (available only for incremental and full real-time integration nodes) or Incremental Only. Full Synchronization: You can configure parameters in this section only if you set Start scope to Incremental + Full. Rerun Mode: Select Initialization or Resume. Initialization: Starts temporary full synchronization nodes for all sync objects. After all temporary nodes are complete, the real-time incremental synchronization node starts. Resume: Starts temporary full synchronization nodes for all sync objects that are not complete. This excludes objects that are already completed, stopped, or ignored. After all temporary nodes are complete, the real-time incremental synchronization node starts. Memory (optional): Do not enter decimals or negative numbers. The maximum memory cannot exceed 8192 MB (8 GB). Rate Limiting (optional): Limits the speed based on the data volume. Do not enter decimals or negative numbers. The default value is 10 MB/s. If you leave this empty, the speed is not limited. Concurrency (optional): Sets the maximum desired concurrency for full synchronization nodes. Do not enter decimals or negative numbers. The default value is 10. If you leave this empty, the concurrency is 3. Note You can configure schedule resource groups, memory, rate limiting, and concurrency only in the production environment. Incremental Synchronization: You can configure parameters in this section only if you set Start scope to Incremental only. Start Mode: Only Stateless Start is supported. This means the instance starts without any initial state. Data Read Start Time: If you set a start time, all source tables that support `startTime` start to read data from that time. These tables include Log Service, Kafka, MySQL, MongoDB, and OceanBase. Note The code parameter settings in SQL nodes take precedence over this specified start time. Click OK to start the instances. Start Real-time Development Instances: Select one or more real-time development instances, or click Select All On This Page. Then, click Start and select Start Real-time Development Instances. In the Batch Start Real-time Development Instances dialog box, configure the start information. Start n instances: Displays the number of instances to start and their names. The names are separated by commas (,). Start Mode: Supports stateless start and start from a valid state. Stateless Start: Starts without any initial state. Start From A Valid State: Starts from an existing valid state. Note If the selected instances include stateless instances, only Stateless start is supported for batch starting. Data Read Start Time: If you set a start time, all source tables that support startTime start to read data from that time. These tables include Log Service, Kafka, MySQL, MongoDB, and OceanBase. Note The code parameter settings in SQL jobs take precedence over this specified start time. Data Timestamp: Assign values to time variables by configuring the data timestamp. Time-type variables are calculated based on the data timestamp, which defaults to the current day. Note The code parameter settings in SQL jobs take precedence over this specified start time. Click OK to start the instances.
Stop	Note The stop operation is supported only for instances in the Failed, Running, Startup Failed, Resume Failed, or Completed state. Stop Real-time Integration Instances: Select one or more real-time integration instances, or select Select All On This Page. Then, click Stop and select Stop Real-time Integration Instances. In the Batch Stop Real-time Integration Instances dialog box, configure the information. Stop n instances: Displays the number of instances to stop and their names. The names are separated by commas (,). Stop Mode: Currently, only Stateless Stop is supported. This means the current state is not saved, and the instance starts without a state next time. Stop Real-time Development Instances: Select one or more real-time development instances, or select Select All On This Page. Then, click Stop and select Stop Real-time Development Instances. In the Batch Stop Real-time Development Instances dialog box, configure the information. Stop n instances: Displays the number of instances to stop and their names. The names are separated by commas (,). Stop Mode: Stateless Stop: Does not save the current state. The next time the job starts, it will be a stateless start. Stop And Retain Current State: Saves the current state. The next time the job starts, you can choose to start it from the saved state.
Unpublish	Unpublish the selected real-time instances from the production environment. Note You can unpublish only real-time instances that are in the Stopped state.

Real-time instance DAG

Click the name of an instance object in the instance list. A directed acyclic graph (DAG) appears on the right side of the page with the current node as the central node. By default, the DAG shows one level of parent and child nodes. You can also adjust the display of the DAG:

Click the icon to hide the real-time instance list and enlarge the DAG.
Click the icon to hide the DAG.
Hover over the icon and drag to expand or shrink the DAG display area.

The DAG provides a dynamic visualization of the node's upstream and downstream dependencies. It also supports O&M operations on upstream and downstream nodes:

Section	Description
① Node information bar	Displays summary information for the selected node. Click View Node Details to see more information. Ververica Flink real-time engine: Includes node details, sync objects (supported only by real-time instances generated from incremental and full real-time integration nodes), running logs, operation logs, node code, and task parameters (supported only by compute task instances). Flink real-time engine: Includes node details, sync objects (supported only by real-time instances generated from incremental and full real-time integration nodes), running analysis, logs, operation logs, node code, and task parameters (supported only by compute task instances). Node details: Contains basic information and running information of the current node. Basic information includes the node name, ID, instance ID, and project. Running information includes the running status and runtime. Sync objects: Contains source and target objects. It displays the status of full and incremental synchronization. When the incremental synchronization node is in the Running state, you can manually Run, Stop, and Ignore the full synchronization. When running, you also need to configure the schedule resource group, memory, rate limiting, concurrency, and shard key (supported only when running a single sync object). Schedule resource group: The schedule resource used by the full synchronization node. By default, the project's default resource group is used. Memory: The channel configuration. Do not enter decimals or negative numbers. The maximum memory cannot exceed 16384 MB (16 GB). The default value is 4096 MB. If you leave this empty, the default value is 1024 MB. Rate limiting: Limits the speed based on the data volume. Do not enter decimals or negative numbers. The default value is 10 MB/s. If you leave this empty, the speed is not limited. Concurrency: The channel configuration. It sets the maximum desired concurrency for full synchronization nodes. Do not enter decimals or negative numbers. The default value is 10. If you leave this empty, the concurrency is 3. Shard key: Chunks data based on the configured field. It is used with concurrency to achieve concurrent reads. This parameter is empty by default. If it is empty, the system automatically identifies a field to use. You can use a column from the source data table as the shard key. The field must be of an integer or datetime type. Otherwise, chunking may be inaccurate. We recommend that you use a primary key or an indexed column as the shard key to ensure transmission performance. Running logs: Displays the running logs of the current node. If the node fails and generates dirty data, you can download the dirty data file. Operation logs: Displays the operation logs of the current node, including the operation time, operator, and specific operation. Running analysis: Uses a data dashboard to display various running data within a selected time period, such as the number of failures and backpressure data. Logs: Displays startup logs, running logs, and error messages. Node code: Displays the code of the current node. Task parameters: Displays the task parameters of the current node.
② Node search and filter area	Quickly set the number of upstream and downstream levels to expand from the central node. If many nodes are expanded and difficult to view, you can search for a node name to quickly locate a specific node within the current DAG display range.
③ Scheduling dependency graph	Displays the scheduling dependency graph of the instance. You can expand more upstream and downstream nodes. You can also perform O&M operations on upstream and downstream nodes. Hover over a node in the DAG to view its name, type, scheduling cycle, owner, and description.
④ Canvas adjustment area	Quickly adjust the DAG display ratio. You can set a specific ratio (default is 100%), zoom in (up to 200%), zoom out (down to 20%), fit to canvas, and enter full screen. This area also displays the node ID and node name of the central node in the current DAG.

Operations supported by real-time instance DAG nodes

Operation	Description
Expand Parent Nodes	Expand dependency nodes at different levels from the central node in the DAG.
Expand Child Nodes
View Real-time Task	Go to the task node DAG that generated the current instance node. You can view task node details, upstream and downstream node information, and perform O&M on task nodes. For more information, see View and manage real-time tasks.
View Node Code	You can view the task code that generated the current instance. If the compute engine is Flink, click Logical Code or Physical Code to switch between views. Logical Code: The node code that was written. Physical Code: The compiled code that is executable on the Flink engine.
View Running Logs	View the running logs of the instance node. If the node has dirty data, click Download Dirty Data File to download the dirty data to your computer. The file includes the generation time, error content, and error cause of the dirty data. By default, the file is retained for 7 days. If the compute engine is Flink, you can view the startup logs, running logs, and error messages separately to help locate and troubleshoot errors. Startup logs: Log information from the process of the Dataphin client submitting the Flink job to YARN. Running logs: Log information from the JobManager and TaskManager during the Flink job execution. Note Logs are stored for 7 calendar days by default. The log output level includes Warning and Error level log information. Error messages: Exception error information generated during node execution. This includes the time of the exception, the name of the exception, the task where the exception occurred, and the hostname or IP address of the TaskManager node. Click the view icon to view the details of the exception history.
View Task Parameters	View the task parameters of the current instance node, such as `execution.checkpointing.interval`.
Edit Development Node	Go to the node editing page in the Dev project for the node that generated the current instance. This is applicable only to the Dev-Prod development mode.
Edit Node	Go to the node editing page for the node that generated the current instance. This is applicable only to the Basic mode.
View Generated Node	View the node configuration in the Prod project for the node that generated the current instance.
View Operation Logs	View the operation logs of the current instance. The logs include the Operation Time, Operator, and Operation content.
View Running Analysis	View the running information, data curves, Failover, and other parameters of the current real-time instance. For real-time instances that use the Ververica Flink or Alibaba Blink real-time engine, see View job details. For real-time instances that use the Flink real-time engine, see View running analysis.
View Sync Objects	In the Sync Objects tab of the current real-time instance details, view the sync object details, including Incremental synchronization and Full synchronization. For field details, see View Sync Objects in the Individual operations section. Note This operation is supported only by real-time instances generated from incremental and full real-time integration nodes.
Refresh Sync Objects	Click Refresh Sync Objects. In the Refresh Sync Objects dialog box, click OK to run the current real-time instance based on the new synchronization scope. For field details, see Refresh Sync Objects in the Individual operations section. Note This operation is supported only by real-time instances generated from incremental and full real-time integration nodes.
Start	Start the current real-time instance. For more information, see Start a real-time instance.
Stop	Stop the current real-time instance. This operation applies to instances in the Running or Completed state. For Completed instances, only stateless stop is supported. Two stop modes are available: Stateless Stop and Stop And Retain Current State. Stateless stop: Does not save the current state (Savepoint). The next time the job starts, it will be a stateless start. Stop and retain current state: Saves the current state (Savepoint). The next time the job starts, you can choose to start it from the saved state. Note The Blink real-time computing engine does not support the Stateless Stop and Stop And Retain Current State modes.
Unpublish	Unpublish the real-time instance from the production environment. You can unpublish instances that are in the Stopped state.