Qwen-Omni-Realtime is a real-time audio and video chat model. It processes streaming audio and image inputs, such as continuous image frames extracted from a video stream, and generates text and audio output in real time.
Supported regions: Singapore, Beijing. Use the API key for each region.
How to use
1. Establish a connection
Connect to Qwen-Omni-Realtime using WebSocket. Use native Python or the DashScope SDK.
A single WebSocket session lasts up to 120 minutes before closing automatically.
Native WebSocket connection
You need the following configuration items:
Configuration item | Description |
Endpoint | China (Beijing): wss://dashscope.aliyuncs.com/api-ws/v1/realtime International (Singapore): wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime |
Query parameter | Set the |
Request header | Authenticate with Bearer Token: Authorization: Bearer DASHSCOPE_API_KEY DASHSCOPE_API_KEY is your API key from Model Studio. |
# pip install websocket-client
import json
import websocket
import os
API_KEY=os.getenv("DASHSCOPE_API_KEY")
API_URL = "wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime?model=qwen3.5-omni-plus-realtime"
headers = [
"Authorization: Bearer " + API_KEY
]
def on_open(ws):
print(f"Connected to server: {API_URL}")
def on_message(ws, message):
data = json.loads(message)
print("Received event:", json.dumps(data, indent=2))
def on_error(ws, error):
print("Error:", error)
ws = websocket.WebSocketApp(
API_URL,
header=headers,
on_open=on_open,
on_message=on_message,
on_error=on_error
)
ws.run_forever()DashScope SDK
# SDK version 1.23.9 or later
import os
import json
from dashscope.audio.qwen_omni import OmniRealtimeConversation,OmniRealtimeCallback
import dashscope
# The API keys for Singapore and Beijing differ. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
# If you have not configured an API key, replace the next line with dashscope.api_key = "sk-xxx"
dashscope.api_key = os.getenv("DASHSCOPE_API_KEY")
class PrintCallback(OmniRealtimeCallback):
def on_open(self) -> None:
print("Connected Successfully")
def on_event(self, response: dict) -> None:
print("Received event:")
print(json.dumps(response, indent=2, ensure_ascii=False))
def on_close(self, close_status_code: int, close_msg: str) -> None:
print(f"Connection closed (code={close_status_code}, msg={close_msg}).")
callback = PrintCallback()
conversation = OmniRealtimeConversation(
model="qwen3.5-omni-plus-realtime",
callback=callback,
# The following is the URL for the Singapore region. If you use a model in the Beijing region, replace the URL with wss://dashscope.aliyuncs.com/api-ws/v1/realtime
url="wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime"
)
try:
conversation.connect()
print("Conversation started. Press Ctrl+C to exit.")
conversation.thread.join()
except KeyboardInterrupt:
conversation.close()// SDK version 2.20.9 or later
import com.alibaba.dashscope.audio.omni.*;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.google.gson.JsonObject;
import java.util.concurrent.CountDownLatch;
public class Main {
public static void main(String[] args) throws InterruptedException, NoApiKeyException {
CountDownLatch latch = new CountDownLatch(1);
OmniRealtimeParam param = OmniRealtimeParam.builder()
.model("qwen3.5-omni-plus-realtime")
.apikey(System.getenv("DASHSCOPE_API_KEY"))
// The following is the URL for the Singapore region. If you use a model in the Beijing region, replace the URL with wss://dashscope.aliyuncs.com/api-ws/v1/realtime
.url("wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime")
.build();
OmniRealtimeConversation conversation = new OmniRealtimeConversation(param, new OmniRealtimeCallback() {
@Override
public void onOpen() {
System.out.println("Connected Successfully");
}
@Override
public void onEvent(JsonObject message) {
System.out.println(message);
}
@Override
public void onClose(int code, String reason) {
System.out.println("connection closed code: " + code + ", reason: " + reason);
latch.countDown();
}
});
conversation.connect();
latch.await();
conversation.close(1000, "bye");
System.exit(0);
}
}2. Configure the session
Send the client event session.update:
{
// The ID of this event, generated by the client.
"event_id": "event_ToPZqeobitzUJnt3QqtWg",
// The event type. This is fixed to session.update.
"type": "session.update",
// Session configuration.
"session": {
// Output modalities. Supported values are ["text"] (text only) or ["text","audio"] (text and audio).
"modalities": [
"text",
"audio"
],
// Voice for output audio.
"voice": "Cherry",
// Input audio format. Only pcm is supported.
"input_audio_format": "pcm",
// Output audio format. Only pcm is supported.
"output_audio_format": "pcm",
// System message. Sets the model's goal or role.
"instructions": "You are an AI customer service agent for a five-star hotel. Answer customer inquiries about room types, facilities, prices, and booking policies accurately and friendly. Always respond with a professional and helpful attitude. Do not provide unconfirmed information or information beyond the scope of the hotel's services.",
// Enables voice activity detection. To enable it, pass a configuration object. The server will automatically detect speech start/end based on this object.
// Set to null to let the client decide when to initiate a model response.
"turn_detection": {
// VAD type. Must be set to server_vad.
"type": "server_vad",
// VAD detection threshold. Increase in noisy environments and decrease in quiet ones.
"threshold": 0.5,
// Silence duration to detect speech end. A model response triggers if this value is exceeded.
"silence_duration_ms": 800
}
}
}3. Input audio and images
Send Base64-encoded audio (required) and image (optional) data to the server buffer using the input_audio_buffer.append and input_image_buffer.append events.
Images can come from local files or be captured in real time from a video stream.
When server-side VAD is enabled, the server automatically submits data and triggers a response when speech ends. When VAD is disabled (manual mode), the client must call the input_audio_buffer.commit event to submit data.
4. Receive model responses
The model response format depends on the configured output modalities.
Text only
Receive streaming text via the response.text.delta event. Retrieve the full text with the response.text.done event.
Text and audio
Text: Receive streaming text via the response.audio_transcript.delta event. Retrieve the full text with the response.audio_transcript.done event.
Audio: Retrieve Base64-encoded streaming audio output data via the response.audio.delta event. The response.audio.done event signals audio generation completion.
Model selection
Qwen3.5-Omni-Realtime is in preview. Model invocation is temporarily free, but tool calling still incurs fees, see Billing details.
Qwen3.5-Omni-Realtime is the latest real-time multimodal model. Compared to Qwen3-Omni-Flash-Realtime, it offers:
Intelligence level
Significantly improved intelligence, matching Qwen3.5-Plus.
Web search
Supports web search natively. The model autonomously decides whether to search. See Web search.
Semantic interruption
Automatically identifies conversation intent to avoid interruptions from filler sounds and meaningless background noise.
Voice control
Control volume, speaking rate, and emotion using voice commands such as "speak faster," "speak louder," or "speak cheerfully."
Languages supported
Supports speech recognition in 113 languages and dialects, and speech synthesis in 36 languages and dialects.
Voice options
Offers 55 voices (47 multilingual + 8 dialect-specific). See Voice list.
See Model list for model names, context, pricing, and snapshot versions. For concurrency throttling, see Rate limits.
Getting started
Get an API key and configure the API key as an environment variable.
Select a programming language and follow these steps to start a real-time conversation.
DashScope Python SDK
Prepare the runtime environment
Your Python version must be 3.10 or later.
First, install pyaudio based on your operating system.
macOS
brew install portaudio && pip install pyaudioDebian/Ubuntu
If you are not using a virtual environment, install directly using the system package manager:
sudo apt-get install python3-pyaudioIf you are using a virtual environment, first install compilation dependencies:
sudo apt update sudo apt install -y python3-dev portaudio19-devThen install pyaudio using pip in the virtual environment.
pip install pyaudio
CentOS
sudo yum install -y portaudio portaudio-devel && pip install pyaudioWindows
pip install pyaudioAfter installation, install dependencies using pip:
pip install websocket-client dashscopeSelect an interaction mode
VAD mode (Voice Activity Detection, automatic speech start/end detection)
The server automatically detects when the user starts and stops speaking and responds.
Manual mode (press-to-talk, release-to-send)
The client controls speech timing. After the user finishes speaking, the client sends a message to the server.
VAD mode
Create a file named vad_dash.py with the following code:
Run
vad_dash.py. The system detects speech start/end and responds automatically.Manual mode
Create a file named
manual_dash.pywith the following code:Run
manual_dash.py. Press Enter to speak, then press Enter again to receive the model's audio response.
DashScope Java SDK
Select an interaction mode
VAD mode (Voice Activity Detection, automatic speech start/end detection)
The Realtime API automatically detects speech timing and responds.
Manual mode (press-to-talk, release-to-send)
The client controls speech timing. After the user finishes speaking, the client sends a message to the server.
VAD mode
Run OmniServerVad.main(). The system detects speech start/end and responds automatically.
Manual mode
Run OmniWithoutServerVad.main(). Press Enter to start recording, then press Enter again to stop and send.
WebSocket (Python)
Prepare the runtime environment
Your Python version must be 3.10 or later.
First, install pyaudio based on your operating system.
macOS
brew install portaudio && pip install pyaudioDebian/Ubuntu
sudo apt-get install python3-pyaudio or pip install pyaudioWe recommend using
pip install pyaudio. If installation fails, first install theportaudiodependency for your OS.CentOS
sudo yum install -y portaudio portaudio-devel && pip install pyaudioWindows
pip install pyaudioAfter installation, install WebSocket-related dependencies using pip:
pip install websockets==15.0.1Create the client
Create a file named
omni_realtime_client.pywith the following code:Select an interaction mode
VAD mode (Voice Activity Detection, automatic speech start/end detection)
The Realtime API automatically detects speech timing and responds.
Manual mode (press-to-talk, release-to-send)
The client controls speech timing. After the user finishes speaking, the client sends a message to the server.
VAD mode
In the same directory as
omni_realtime_client.py, createvad_mode.pywith the following code:Run
vad_mode.py. The system detects speech start/end and responds automatically.Manual mode
In the same directory as
omni_realtime_client.py, createmanual_mode.pywith the following code:Run
manual_mode.py. Press Enter to speak, then press Enter again to receive the model's audio response.
Interaction flow
VAD mode
Set session.turn_detection to "server_vad" in session.update to enable VAD mode. The server automatically detects speech start/end and responds. Suitable for voice calls.
The interaction flow is as follows:
The server detects speech start and sends the input_audio_buffer.speech_started event.
The client can send input_audio_buffer.append and input_image_buffer.append events at any time to append audio and images to the buffer.
Before sending an input_image_buffer.append event, you must send at least one input_audio_buffer.append event.
The server detects speech end and sends the input_audio_buffer.speech_stopped event.
The server sends the input_audio_buffer.committed event to commit the audio buffer.
The server sends a conversation.item.created event containing the user message item created from the buffer.
Lifecycle | Client events | Server events |
Session initialization | Session configuration | Session created Session configuration updated |
User audio input | Add audio to the buffer Add an image to the buffer | input_audio_buffer.speech_started Speech start detected input_audio_buffer.speech_stopped Speech end detected Server received the submitted audio |
Server audio output | None | Server starts generating a response New output content during response Conversation item created New output content added to the assistant message response.audio_transcript.delta Incrementally generated transcribed text Incrementally generated audio from the model response.audio_transcript.done Text transcription complete Audio generation complete Streaming of text or audio content for the assistant message is complete Streaming of the entire output item for the assistant message is complete Response complete |
Manual mode
Set session.turn_detection to null in session.update to enable Manual mode. The client explicitly sends input_audio_buffer.commit and response.create to request a response. Suitable for push-to-talk scenarios, such as voice messages in chat applications.
The interaction flow is as follows:
The client can send input_audio_buffer.append and input_image_buffer.append events at any time to append audio and images to the buffer.
Before sending an input_image_buffer.append event, you must send at least one input_audio_buffer.append event.
The client sends the input_audio_buffer.commit event to submit the audio and image buffers, signaling to the server that all user input (audio and images) for the current turn has been sent.
The server responds with an input_audio_buffer.committed event.
The client sends the response.create event, waiting for the server to return the model's output.
The server responds with a conversation.item.created event.
Lifecycle | Client events | Server events |
Session initialization | Session configuration | Session created Session configuration updated |
User audio input | Add audio to the buffer Add an image to the buffer Submit audio and images to the server Create a model response | Server received the submitted audio |
Server audio output | Clear the audio from the buffer | Server starts generating a response New output content during response Conversation item created New output content added to the assistant message item response.audio_transcript.delta Incrementally generated transcribed text Incrementally generated audio from the model response.audio_transcript.done Text transcription complete Audio generation complete Streaming of text or audio content for the assistant message is complete Streaming of the entire output item for the assistant message is complete Response complete |
Web search
Web search lets the model reply using real-time retrieved data for scenarios that need up-to-date information, such as stock prices or weather forecasts. The model autonomously decides whether to search.
Only theQwen3.5-Omni-Realtimemodel supports web search. It is disabled by default. Enable it using thesession.updateevent.
For billing details, see the agent policy in the Billing detailsHow to enable
In the session.update event, add these parameters:
enable_search: Set totrueto enable web search.search_options.enable_source: Set totrueto return a list of search result sources.
For full parameter details, see session.update.
Response format
After you enable web search, the response.done event includes a new plugins field in the usage object. This field records search usage metrics:
{
"usage": {
"total_tokens": 2937,
"input_tokens": 2554,
"output_tokens": 383,
"input_tokens_details": {
"text_tokens": 2512,
"audio_tokens": 42
},
"output_tokens_details": {
"text_tokens": 90,
"audio_tokens": 293
},
"plugins": {
"search": {
"count": 1,
"strategy": "agent"
}
}
}
}Code examples
The following examples show how to enable web search.
DashScope Python SDK
In the update_session call, pass the enable_search and search_options parameters:
import os
import base64
import time
import json
import pyaudio
from dashscope.audio.qwen_omni import MultiModality, AudioFormat, OmniRealtimeCallback, OmniRealtimeConversation
import dashscope
dashscope.api_key = os.getenv('DASHSCOPE_API_KEY')
url = 'wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime'
model = 'qwen3.5-omni-plus-realtime'
voice = 'Tina'
class SearchCallback(OmniRealtimeCallback):
def __init__(self, pya):
self.pya = pya
self.out = None
def on_open(self):
self.out = self.pya.open(format=pyaudio.paInt16, channels=1, rate=24000, output=True)
def on_event(self, response):
if response['type'] == 'response.audio.delta':
self.out.write(base64.b64decode(response['delta']))
elif response['type'] == 'conversation.item.input_audio_transcription.completed':
print(f"[User] {response['transcript']}")
elif response['type'] == 'response.audio_transcript.done':
print(f"[LLM] {response['transcript']}")
elif response['type'] == 'response.done':
usage = response.get('response', {}).get('usage', {})
plugins = usage.get('plugins', {})
if plugins.get('search'):
print(f"[Search] count={plugins['search']['count']}, strategy={plugins['search']['strategy']}")
pya = pyaudio.PyAudio()
callback = SearchCallback(pya)
conv = OmniRealtimeConversation(model=model, callback=callback, url=url)
conv.connect()
conv.update_session(
output_modalities=[MultiModality.AUDIO, MultiModality.TEXT],
voice=voice,
instructions="You are Xiao Yun, a personal assistant",
enable_search=True,
search_options={'enable_source': True}
)
mic = pya.open(format=pyaudio.paInt16, channels=1, rate=16000, input=True)
print("Web search is enabled. Speak into the microphone (press Ctrl+C to exit)...")
try:
while True:
audio_data = mic.read(3200, exception_on_overflow=False)
conv.append_audio(base64.b64encode(audio_data).decode())
time.sleep(0.01)
except KeyboardInterrupt:
conv.close()
mic.close()
callback.out.close()
pya.terminate()
print("\nConversation ended")DashScope Java SDK
In updateSession, pass web search settings through the parameters map:
import com.alibaba.dashscope.audio.omni.*;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.google.gson.JsonObject;
import javax.sound.sampled.*;
import java.nio.ByteBuffer;
import java.util.*;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicBoolean;
public class OmniSearch {
static class SequentialAudioPlayer {
private final SourceDataLine line;
private final Queue<byte[]> audioQueue = new ConcurrentLinkedQueue<>();
private final Thread playerThread;
private final AtomicBoolean shouldStop = new AtomicBoolean(false);
public SequentialAudioPlayer() throws LineUnavailableException {
AudioFormat format = new AudioFormat(24000, 16, 1, true, false);
line = AudioSystem.getSourceDataLine(format);
line.open(format);
line.start();
playerThread = new Thread(() -> {
while (!shouldStop.get()) {
byte[] audio = audioQueue.poll();
if (audio != null) {
line.write(audio, 0, audio.length);
} else {
try { Thread.sleep(10); } catch (InterruptedException ignored) {}
}
}
}, "AudioPlayer");
playerThread.start();
}
public void play(String base64Audio) {
audioQueue.add(Base64.getDecoder().decode(base64Audio));
}
public void close() {
shouldStop.set(true);
try { playerThread.join(1000); } catch (InterruptedException ignored) {}
line.drain();
line.close();
}
}
public static void main(String[] args) {
try {
SequentialAudioPlayer player = new SequentialAudioPlayer();
AtomicBoolean shouldStop = new AtomicBoolean(false);
OmniRealtimeParam param = OmniRealtimeParam.builder()
.model("qwen3.5-omni-plus-realtime")
.apikey(System.getenv("DASHSCOPE_API_KEY"))
.url("wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime")
.build();
OmniRealtimeConversation conversation = new OmniRealtimeConversation(param, new OmniRealtimeCallback() {
@Override public void onOpen() {
System.out.println("Connection established");
}
@Override public void onClose(int code, String reason) {
System.out.println("Connection closed");
shouldStop.set(true);
}
@Override public void onEvent(JsonObject event) {
String type = event.get("type").getAsString();
if ("response.audio.delta".equals(type)) {
player.play(event.get("delta").getAsString());
} else if ("response.audio_transcript.done".equals(type)) {
System.out.println("[LLM] " + event.get("transcript").getAsString());
} else if ("response.done".equals(type)) {
JsonObject response = event.getAsJsonObject("response");
if (response != null && response.has("usage")) {
JsonObject usage = response.getAsJsonObject("usage");
if (usage.has("plugins")) {
JsonObject plugins = usage.getAsJsonObject("plugins");
if (plugins.has("search")) {
JsonObject search = plugins.getAsJsonObject("search");
System.out.println("[Search] count=" + search.get("count").getAsInt()
+ ", strategy=" + search.get("strategy").getAsString());
}
}
}
}
}
});
conversation.connect();
conversation.updateSession(OmniRealtimeConfig.builder()
.modalities(Arrays.asList(OmniRealtimeModality.AUDIO, OmniRealtimeModality.TEXT))
.voice("Tina")
.enableTurnDetection(true)
.enableInputAudioTranscription(true)
.parameters(Map.of(
"instructions", "You are Xiao Yun, a personal assistant",
"enable_search", true,
"search_options", Map.of("enable_source", true)
))
.build()
);
System.out.println("Web search is enabled. Start speaking (press Ctrl+C to exit)...");
AudioFormat format = new AudioFormat(16000, 16, 1, true, false);
TargetDataLine mic = AudioSystem.getTargetDataLine(format);
mic.open(format);
mic.start();
ByteBuffer buffer = ByteBuffer.allocate(3200);
while (!shouldStop.get()) {
int bytesRead = mic.read(buffer.array(), 0, buffer.capacity());
if (bytesRead > 0) {
conversation.appendAudio(Base64.getEncoder().encodeToString(buffer.array()));
}
Thread.sleep(20);
}
conversation.close(1000, "Normal end");
player.close();
mic.close();
} catch (NoApiKeyException e) {
System.err.println("API key not found: Set the DASHSCOPE_API_KEY environment variable");
} catch (Exception e) {
e.printStackTrace();
}
}
}WebSocket (Python)
In the JSON payload for session.update, add the enable_search and search_options fields:
import json
import os
import websocket
import base64
import pyaudio
import threading
API_KEY = os.getenv("DASHSCOPE_API_KEY")
API_URL = "wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime?model=qwen3.5-omni-plus-realtime"
pya = pyaudio.PyAudio()
out_stream = pya.open(format=pyaudio.paInt16, channels=1, rate=24000, output=True)
def on_open(ws):
ws.send(json.dumps({
"type": "session.update",
"session": {
"modalities": ["text", "audio"],
"voice": "Tina",
"instructions": "You are Xiao Yun, a personal assistant",
"input_audio_format": "pcm",
"output_audio_format": "pcm",
"enable_search": True,
"search_options": {
"enable_source": True
}
}
}))
print("Web search is enabled. Speak into the microphone...")
def send_audio():
mic = pya.open(format=pyaudio.paInt16, channels=1, rate=16000, input=True)
try:
while True:
audio = mic.read(3200, exception_on_overflow=False)
ws.send(json.dumps({
"type": "input_audio_buffer.append",
"audio": base64.b64encode(audio).decode()
}))
except Exception:
mic.close()
threading.Thread(target=send_audio, daemon=True).start()
def on_message(ws, message):
event = json.loads(message)
if event["type"] == "response.audio.delta":
out_stream.write(base64.b64decode(event["delta"]))
elif event["type"] == "response.audio_transcript.done":
print(f"[LLM] {event['transcript']}")
elif event["type"] == "response.done":
usage = event.get("response", {}).get("usage", {})
plugins = usage.get("plugins", {})
if plugins.get("search"):
print(f"[Search] count={plugins['search']['count']}, strategy={plugins['search']['strategy']}")
def on_error(ws, error):
print(f"Error: {error}")
headers = ["Authorization: Bearer " + API_KEY]
ws = websocket.WebSocketApp(API_URL, header=headers, on_open=on_open, on_message=on_message, on_error=on_error)
ws.run_forever()API reference
Billing and rate limiting
Billing rules
Qwen-Omni-Realtime bills based on token usage per modality (audio and images). For details, see Model list.
Rate limiting
See Rate limiting.
Error codes
If the model call fails and returns an error message, see Error messages for resolution.
Voice list
Set the voice request parameter to the value in the voice parameter column.qwen3.5-omni-realtime
|
|
Details |
Languages supported |
|
|
Voice name: Tina Description: A voice like warm milk tea—sweet and cozy, yet sharp when solving problems |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Cindy Description: A sweet-talking young woman from Taiwan |
Chinese (Taiwanese accent), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Liora Mira Description: A gentle voice that weaves warmth into everyday life
|
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sunnybobi Description: A cheerful, socially awkward neighbor girl |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Raymond Description: A clear-voiced, takeout-loving homebody |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Ethan Description: Standard Mandarin with a slight northern accent. Bright, warm, energetic, and youthful |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Theo Calm Description: Conveys understanding in silence and healing through words |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Serena Description: A gentle young woman |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Harvey Description: A voice that carries the weight of time—deep, mellow, and scented with coffee and old books |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Maia Description: A blend of intellect and gentleness |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Evan Description: A college student—youthful and endearing |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Qiao Description: Not just cute—she’s sweet on the surface and full of personality underneath |
Chinese (Taiwanese accent), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Momo Description: Playful and mischievous—here to cheer you up |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Wil Description: A young man from Shenzhen who speaks with a Hong Kong–Taiwan accent |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Angel Description: Slightly Taiwanese-accented—and very sweet |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Li Cassian Description: Speaks with restraint—three parts silence, seven parts reading the room |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Mia Description: A lifestyle artist who shares slow-living aesthetics and daily comfort through a soothing voice |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Joyner Description: Funny, exaggerated, and down-to-earth |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Gold Description: A West Coast Black rapper |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Katerina Description: A mature, commanding voice with rich rhythm and resonance |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Ryan Description: High-energy delivery with strong dramatic presence—realism meets intensity |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Jennifer Description: A premium, cinematic-quality American female voice |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Aiden Description: An American young man skilled in cooking |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Mione Description: A mature, intelligent British neighbor girl |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sichuan–Sunny Description: A sweet Sichuan girl who warms your heart |
Chinese (Sichuan dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Beijing–Dylan Description: A youth raised in Beijing’s hutongs |
Chinese (Beijing dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sichuan–Eric Description: A lively Chengdu man from Sichuan |
Chinese (Sichuan dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Tianjin–Peter Description: A Tianjin-style xiangsheng performer—professional foil |
Chinese (Tianjin dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Joseph Chen Description: A longtime overseas Chinese from Southeast Asia with a warm, nostalgic voice |
Chinese (Hokkien), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Shaanxi–Marcus Description: Broad face, few words, sincere heart, deep voice—the true flavor of Shaanxi |
Chinese (Shaanxi dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Nanjing–Li Description: A grumpy uncle |
Chinese (Nanjing dialect), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Cantonese–Rocky Description: A witty and humorous online chat companion |
Chinese (Cantonese), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sohee Description: A warm, cheerful, emotionally expressive Korean unnie |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Lenn Description: Rational at core, rebellious in detail—a German youth who wears suits and listens to post-punk |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Ono Anna Description: A clever, playful childhood friend |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sonrisa Description: A warm, outgoing Latin American woman |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Bodega Description: A warm, enthusiastic Spanish man |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Emilien Description: A romantic French big brother |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Andre Description: A magnetic, natural, and steady male voice |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Radio Gol Description: A passionate football commentator who narrates games with poetic flair |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Alek Description: Cold like the Russian spirit—yet warm as wool beneath a coat |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Rizky Description: A young Indonesian man with a distinctive voice |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Roya Description: A sporty girl with a free-spirited heart |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Arda Description: Neither high nor low—clean, crisp, and gently warm |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Hana Description: A mature Vietnamese woman who loves dogs |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Dolce Description: A laid-back Italian man |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Jakub Description: A charismatic, artistic young man from a Polish town |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Griet Description: A mature, artistic Dutch woman |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Eliška Description: Every word carries Central European craftsmanship and warmth |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Marina Description: A girl raised in a multicultural city |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Siiri Description: Reserved and gentle—with a calm, lake-like speaking pace |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Ingrid Description: A woman from rural Norway |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Sigga Description: An intellectual young woman from an Icelandic town |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Bea Description: A sweet Filipino woman who loves coffee |
Chinese (Mandarin), Chinese, English French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
|
|
Voice name: Chloe Description: A Malaysian office worker |
Chinese (Mandarin), Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean, Thai, Indonesian, Arabic, Vietnamese, Turkish, Finnish, Polish, Hindi, Dutch, Czech, Urdu, Tagalog, Swedish, Danish, Hebrew, Icelandic, Malay, Norwegian, Persian |
qwen3-omni-flash-realtime-2025-12-01
Voice name |
| Voice effect | Description | Languages supported |
Cherry | Cherry | A sunny, positive, friendly, and natural young woman | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Serena | Serena | A gentle young woman | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ethan | Ethan | Standard Mandarin with a slight northern accent. Sunny, warm, energetic, and vibrant | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Chelsie | Chelsie | A two-dimensional virtual girlfriend | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Momo | Momo | Playful and mischievous, cheering you up | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Vivian | Vivian | Confident, cute, and slightly feisty | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Moon | Moon | Effortlessly cool Moon White | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Maia | Maia | A blend of intellect and gentleness | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Kai | Kai | A soothing audio spa for your ears | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Nofish | Nofish | A designer who cannot pronounce retroflex sounds | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Bella | Bella | A little girl who drinks but never throws punches when drunk | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Jennifer | Jennifer | A premium, cinematic-quality American English female voice | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ryan | Ryan | Full of rhythm, bursting with dramatic flair, balancing authenticity and tension | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Katerina | Katerina | A mature-woman voice with rich, memorable rhythm | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Aiden | Aiden | An American English young man skilled in cooking | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Eldric Sage | Eldric Sage | A calm and wise elder—weathered like a pine tree, yet clear-minded as a mirror | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Mia | Mia | Gentle as spring water, obedient as fresh snow | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Mochi | Mochi | A clever, quick-witted young adult—childlike innocence remains, yet wisdom shines through | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Bellona | Bellona | A powerful, clear voice that brings characters to life—so stirring it makes your blood boil. With heroic grandeur and perfect diction, this voice captures the full spectrum of human expression. | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Vincent | Vincent | A uniquely raspy, smoky voice—just one line evokes armies and heroic tales | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Bunny | Bunny | A little girl overflowing with "cuteness" | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Neil | Neil | A flat baseline intonation with precise, clear pronunciation—the most professional news anchor | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Elias | Elias | Maintains academic rigor while using storytelling techniques to turn complex knowledge into digestible learning modules | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Arthur | Arthur | A simple, earthy voice steeped in time and tobacco smoke—slowly unfolding village stories and curiosities | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Nini | Nini | A soft, clingy voice like sweet rice cakes—those drawn-out calls of “Big Brother” are so sweet they melt your bones | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ebona | Ebona | Her whisper is like a rusty key slowly turning in the darkest corner of your mind—where childhood shadows and unknown fears hide | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Seren | Seren | A gentle, soothing voice to help you fall asleep faster. Good night, sweet dreams | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Pip | Pip | A playful, mischievous boy full of childlike wonder—is this your memory of Shin-chan? | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Stella | Stella | Normally a cloyingly sweet, dazed teenage-girl voice—but when shouting “I represent the moon to defeat you!”, she instantly radiates unwavering love and justice | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Bodega | Bodega | A passionate Spanish man | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sonrisa | Sonrisa | A cheerful, outgoing Latin American woman | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Alek | Alek | Cold like the Russian spirit, yet warm like wool coat lining | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Dolce | Dolce | A laid-back Italian man | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sohee | Sohee | A warm, cheerful, emotionally expressive Korean unnie | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ono Anna | Ono Anna | A clever, spirited childhood friend | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Lenn | Lenn | Rational at heart, rebellious in detail—a German youth who wears suits and listens to post-punk | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Emilien | Emilien | A romantic French big brother | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Andre | Andre | A magnetic, natural, and steady male voice | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Radio Gol | Radio Gol | Football poet Radio Gol! Today I’ll commentate on football using my name. | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Shanghai - Jada | Jada | A fast-paced, energetic Shanghai auntie | Shanghainese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Beijing - Dylan | Dylan | A young man raised in Beijing’s hutongs | Beijing dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Nanjing - Li | Li | A patient yoga teacher | Nanjing dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Shaanxi - Marcus | Marcus | Broad face, few words, sincere heart, deep voice—the authentic Shaanxi flavor | Shaanxi dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Southern Min - Roy | Roy | A humorous, straightforward, lively Taiwanese guy | Southern Min, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Tianjin - Peter | Peter | Tianjin-style crosstalk, professional foil | Tianjin dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sichuan - Sunny | Sunny | A Sichuan girl sweet enough to melt your heart | Sichuan dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sichuan - Eric | Eric | A Sichuanese man from Chengdu who stands out in everyday life | Sichuan dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Cantonese - Rocky | Rocky | A humorous, witty A Qiang providing live chat | Cantonese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Cantonese - Kiki | Kiki | A sweet Hong Kong girl best friend | Cantonese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean |
qwen3-omni-flash-realtime, qwen3-omni-flash-realtime-2025-09-15
Voice name |
| Voice effect | Description | Languages supported |
Cherry | Cherry | A sunny, positive, friendly, and natural young woman | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ethan | Ethan | Standard Mandarin with a slight northern accent. Sunny, warm, energetic, and vibrant | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Nofish | Nofish | A designer who cannot pronounce retroflex sounds | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Jennifer | Jennifer | A premium, cinematic-quality American English female voice | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Ryan | Ryan | Full of rhythm, bursting with dramatic flair, balancing authenticity and tension | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Katerina | Katerina | A mature-woman voice with rich, memorable rhythm | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Elias | Elias | Maintains academic rigor while using storytelling techniques to turn complex knowledge into digestible learning modules | Chinese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Shanghai - Jada | Jada | A fast-paced, energetic Shanghai auntie | Shanghainese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Beijing - Dylan | Dylan | A young man raised in Beijing’s hutongs | Beijing dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sichuan - Sunny | Sunny | A Sichuan girl sweet enough to melt your heart | Sichuan dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Nanjing - Li | Li | A patient yoga teacher | Nanjing dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Shaanxi - Marcus | Marcus | Broad face, few words, sincere heart, deep voice—the authentic Shaanxi flavor | Shaanxi dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Southern Min - Roy | Roy | A humorous, straightforward, lively Taiwanese guy | Southern Min, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Tianjin - Peter | Peter | Tianjin-style crosstalk, professional foil | Tianjin dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Cantonese - Rocky | Rocky | A humorous, witty A Qiang providing live chat | Cantonese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Cantonese - Kiki | Kiki | A sweet Hong Kong girl best friend | Cantonese, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean | |
Sichuan - Eric | Eric | A Sichuanese man from Chengdu who stands out in everyday life | Sichuan dialect, English, French, German, Russian, Italian, Spanish, Portuguese, Japanese, Korean |
Qwen-Omni-Turbo-Realtime
Voice name |
| Voice Effect | Description | Languages supported |
Cherry | Cherry | A sunny, positive, friendly, and natural young woman | Chinese, English | |
Serena | Serena | A gentle young woman | Chinese, English | |
Ethan | Ethan | Standard Mandarin with a slight northern accent. Sunny, warm, energetic, and vibrant | Chinese, English | |
Chelsie | Chelsie | A two-dimensional virtual girlfriend | Chinese, English |