Reference for the events that the Paraformer real-time speech recognition service pushes to clients over WebSocket. This topic documents the data structure and field semantics of the four event types: task-started, result-generated, task-finished, and task-failed.
User guide: For model details and selection guidance, see Speech-to-text.
Event interaction flow: For the event sequence diagram, see WebSocket API.
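As a rough illustration of how a client might tell the four event types apart, the sketch below parses one incoming message and returns its event name. It assumes that each message is a JSON object and that the event name is stored at `header.event`; this topic does not show that layout, so treat the lookup path and the helper name `classify_event` as hypothetical.

```python
import json

# The four event names documented in this topic.
KNOWN_EVENTS = {"task-started", "result-generated", "task-finished", "task-failed"}


def classify_event(raw_message: str) -> str:
    """Return the event name of one server message.

    Assumption: the name lives at header.event. Adjust the lookup if the
    real messages use a different layout.
    """
    message = json.loads(raw_message)
    event = (message.get("header") or {}).get("event")
    if event not in KNOWN_EVENTS:
        raise ValueError(f"unexpected event: {event!r}")
    return event
```

A receive loop could call `classify_event` on every text frame and branch on the returned name.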
task-started
Description: The task has started successfully. The client can begin sending audio data.
| Field | Description |
| header | |
| payload | Always |
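Because audio should only be sent after task-started arrives, a client may want a small gate between its receive loop and its audio sender. The following is a minimal sketch of that idea, not the service's SDK; the class name `AudioGate` is hypothetical, and the `header.event` lookup is an assumption about the message layout.

```python
import json
import threading


class AudioGate:
    """Holds back audio uploads until the server reports task-started."""

    def __init__(self) -> None:
        self._started = threading.Event()

    def on_message(self, raw_message: str) -> None:
        # Assumption: the event name is stored at header.event.
        event = (json.loads(raw_message).get("header") or {}).get("event")
        if event == "task-started":
            self._started.set()

    def may_send_audio(self, timeout: float = 0.0) -> bool:
        # True once task-started has been seen; waits up to `timeout` seconds.
        return self._started.wait(timeout)
```

The receive loop would call `on_message` for each frame, while the sender thread calls `may_send_audio(timeout=...)` before pushing the first audio chunk.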
result-generated
Description: A recognition result. Each event carries either an intermediate result (sentence_end=false) or a final result (sentence_end=true).
| Field | Description |
| header | |
| payload | |
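One way to use the `sentence_end` flag is to keep a single, continually overwritten partial sentence and commit it once the final result arrives. The sketch below illustrates this under assumptions: only `sentence_end` is named in this topic, so the location `payload.output.sentence` and the `text` field are hypothetical, as is the class name `Transcript`.

```python
from typing import List, Optional


class Transcript:
    """Collects final sentences and tracks the latest intermediate one."""

    def __init__(self) -> None:
        self.final_sentences: List[str] = []
        self.partial: Optional[str] = None

    def on_result(self, message: dict) -> None:
        # Assumed layout: payload.output.sentence with `text` and `sentence_end`.
        payload = message.get("payload") or {}
        output = payload.get("output") or {}
        sentence = output.get("sentence") or {}
        text = sentence.get("text", "")
        if sentence.get("sentence_end"):
            # Final result: commit the sentence and clear the partial.
            self.final_sentences.append(text)
            self.partial = None
        else:
            # Intermediate result: replace the previous partial text.
            self.partial = text
```

Overwriting rather than appending intermediate results avoids duplicated text, since each intermediate result is expected to supersede the previous one for the same sentence.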
task-finished
Description: The task ended normally. You can close the connection or reuse it.
| Field | Description |
| header | |
| payload | The contents can be ignored. Typically |
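Since the connection may be either closed or reused after task-finished, a client that runs several tasks in a row needs to make that choice somewhere. The sketch below shows one possible policy: reuse the connection while tasks are queued, and close it when the queue is empty. The class `TaskQueue` and its task labels are hypothetical and only track the decision; they do not perform any network I/O.

```python
from collections import deque
from typing import Deque, Iterable, Optional


class TaskQueue:
    """Decides whether to reuse or close the connection after each task."""

    def __init__(self, task_labels: Iterable[str]) -> None:
        self._pending: Deque[str] = deque(task_labels)
        self.connection_open = True

    def on_task_finished(self) -> Optional[str]:
        """Return the next task to start, or None once the connection is closed."""
        if self._pending:
            # Work remains: keep the connection and start the next task on it.
            return self._pending.popleft()
        # Nothing left to run: mark the connection as closed.
        self.connection_open = False
        return None
```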
task-failed
Description: The task failed. The connection is closed and cannot be reused.
| Field | Description |
| header | |
| payload | Always |
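Because the connection cannot be reused after task-failed, it is often convenient to turn this event into an exception so that the caller abandons the connection instead of sending more data. The sketch below assumes the event name is at `header.event` and that error details are exposed as `header.error_code` and `header.error_message`; none of these field names appear in this topic, so verify them against real messages. `TaskFailedError` and `raise_if_failed` are hypothetical names.

```python
class TaskFailedError(RuntimeError):
    """Raised when the server reports task-failed."""

    def __init__(self, code: str, message: str) -> None:
        super().__init__(f"{code}: {message}")
        self.code = code
        self.message = message


def raise_if_failed(message: dict) -> None:
    """Raise TaskFailedError for a task-failed event; do nothing otherwise."""
    header = message.get("header") or {}
    if header.get("event") != "task-failed":
        return
    # Assumed field names; fall back to placeholders if they are absent.
    raise TaskFailedError(
        header.get("error_code", "UNKNOWN"),
        header.get("error_message", ""),
    )
```

A caller that catches `TaskFailedError` should open a new connection before starting another task.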