Model API calls are billed on a pay-as-you-go basis by default.
Tiered pricing rules
Some Model Studio models use tiered pricing. The unit price is determined by the total number of input tokens in a single request. All tokens in the request are billed at the unit price of the corresponding tier.
For example, suppose a model has two pricing tiers: 0 < tokens ≤ 32K and 32K < tokens ≤ 128K. A request with 100K input tokens falls into the second tier (32K < 100K ≤ 128K), so all of its tokens are billed at the second tier's unit price, including the first 32K.
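The tier rule can be sketched in a few lines of Python. This is an illustrative sketch, not an official billing formula: the tier values are the qwen3-max International prices listed on this page, reading "K" as 1,000 tokens is an assumption, and the names `QWEN3_MAX_TIERS`, `select_tier`, and `request_cost` are hypothetical. It also assumes that output tokens are billed at the output price of the tier selected by the input token count, which is how the tiered tables on this page are laid out.

```python
from decimal import Decimal

# Each tier: (upper bound on input tokens, input price, output price),
# prices in USD per 1 million tokens.
# Values are the qwen3-max International tiers from this page.
# Assumption: "K" means 1,000 tokens.
QWEN3_MAX_TIERS = [
    (32_000, Decimal("1.2"), Decimal("6")),
    (128_000, Decimal("2.4"), Decimal("12")),
    (256_000, Decimal("3"), Decimal("15")),
]

MILLION = Decimal(1_000_000)


def select_tier(input_tokens, tiers=QWEN3_MAX_TIERS):
    """Return the first tier whose upper bound covers the input token count."""
    if input_tokens <= 0:
        raise ValueError("input_tokens must be positive")
    for tier in tiers:
        if input_tokens <= tier[0]:
            return tier
    raise ValueError("input_tokens exceeds the largest tier")


def request_cost(input_tokens, output_tokens, tiers=QWEN3_MAX_TIERS):
    """Bill every token in the request at the selected tier's unit prices."""
    _, input_price, output_price = select_tier(input_tokens, tiers)
    return (input_tokens * input_price + output_tokens * output_price) / MILLION


if __name__ == "__main__":
    # 100K input tokens land in the 32K-128K tier, so all of them use that tier's price.
    print(request_cost(100_000, 2_000))
```

Under these assumptions, a request with 100,000 input tokens and 2,000 output tokens is priced entirely at the second tier, rather than splitting the first 32K tokens off at the cheaper rate.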
Text generation - Qwen
Qwen-Max
You are charged for input tokens and output tokens.
If the model supports batch calls, batch requests are billed at 50% of the real-time inference price for both input and output tokens. If the model supports context caching, only input tokens receive a discount. The two discounts cannot be combined in the same request.
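The interaction between the two discounts can be sketched as follows. This is a rough illustration under stated assumptions: the function name `discounted_cost` and the parameters `cached_tokens` and `cache_factor` are hypothetical, and this page does not state the context caching discount rate or which discount takes precedence, so the sketch takes the rate as a caller-supplied value and rejects requests that ask for both discounts instead of guessing.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)
BATCH_FACTOR = Decimal("0.5")  # batch calls: 50% of the real-time price


def discounted_cost(input_tokens, output_tokens, input_price, output_price,
                    *, batch=False, cached_tokens=0, cache_factor=None):
    """Estimate the cost in USD; prices are per 1 million tokens.

    cached_tokens and cache_factor are hypothetical inputs: the number of
    input tokens served from the context cache and the fraction of the
    normal input price charged for them.
    """
    if batch and cached_tokens:
        # The two discounts cannot apply simultaneously; which one wins
        # is not specified here, so the sketch refuses to choose.
        raise ValueError("batch and context caching discounts cannot be combined")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if cached_tokens and cache_factor is None:
        raise ValueError("cache_factor is required when cached_tokens > 0")

    if batch:
        # Both input and output tokens are discounted.
        total = (input_tokens * input_price + output_tokens * output_price) * BATCH_FACTOR
    else:
        # Only input tokens are discounted; output tokens pay the full price.
        input_cost = (input_tokens - cached_tokens) * input_price
        if cached_tokens:
            input_cost += cached_tokens * input_price * cache_factor
        total = input_cost + output_tokens * output_price
    return total / MILLION
```

For example, with the qwen-max International prices listed on this page ($1.6 input, $6.4 output), a batch call with 1 million input tokens and 1 million output tokens would come to $4 instead of $8.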
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤1M |
$2.5 |
$7.5 |
1 million tokens |
|
qwen3.7-max-2026-06-08 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤1M |
$2.5 |
$7.5 |
1 million tokens |
|
qwen3.7-max-2026-05-20 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤1M |
$2.5 |
$7.5 |
1 million tokens |
|
qwen3.7-max-preview Currently equivalent to qwen3.7-max-2026-05-17 |
International |
Thinking mode only |
0<Token≤1M |
$2.5 |
$7.5 |
1 million tokens |
|
qwen3.7-max-2026-05-17 |
International |
Thinking mode only |
0<Token≤1M |
$2.5 |
$7.5 |
1 million tokens |
|
qwen3.6-max-preview context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤128K |
$1.3 |
$7.8 |
1 million tokens |
|
128K<Token≤256K |
$2 |
$12 |
||||
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
1 million tokens |
|
32K<Token≤128K |
$2.4 |
$12 |
||||
|
128K<Token≤256K |
$3 |
$15 |
||||
|
qwen3-max-2026-01-23 |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
1 million tokens |
|
32K<Token≤128K |
$2.4 |
$12 |
||||
|
128K<Token≤256K |
$3 |
$15 |
||||
|
qwen3-max-2025-09-23 |
International |
Non-Thinking mode only |
0<Token≤32K |
$1.2 |
$6 |
1 million tokens |
|
32K<Token≤128K |
$2.4 |
$12 |
||||
|
128K<Token≤256K |
$3 |
$15 |
||||
|
qwen3-max-preview context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
1 million tokens |
|
32K<Token≤128K |
$2.4 |
$12 |
||||
|
128K<Token≤256K |
$3 |
$15 |
More models
| Model ID | Deployment scope | Mode | Input tokens per request | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|---|---|---|---|---|---|---|
| qwen-max (currently equivalent to qwen-max-2025-01-25; 50% batch inference discount) | International | Non-Thinking mode only | No tiered pricing | $1.6 | $6.4 | 1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 50% batch inference discount context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-06-08 context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-05-20 context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.6-max-preview context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤128K |
$1.238 |
$7.426 |
|
128K<Token≤256K |
$2.063 |
$12.377 |
|||
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 50% batch inference discount context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.359 |
$1.434 |
|
32K<Token≤128K |
$0.574 |
$2.294 |
|||
|
128K<Token≤256K |
$1.004 |
$4.014 |
|||
|
qwen3-max-2026-01-23 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.359 |
$1.434 |
|
32K<Token≤128K |
$0.574 |
$2.294 |
|||
|
128K<Token≤256K |
$1.004 |
$4.014 |
|||
|
qwen3-max-2025-09-23 |
Chinese mainland |
Non-Thinking mode only |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
|||
|
qwen3-max-preview context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
More models
| Model ID | Deployment scope | Mode | Input tokens per request | Input price (per 1 million tokens) | Output price (per 1 million tokens) |
|---|---|---|---|---|---|
| qwen-max (currently equivalent to qwen-max-2024-09-19) | Chinese mainland | Non-Thinking mode only | No tiered pricing | $0.345 | $1.377 |
Hong Kong (China)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-06-08 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount |
Hong Kong (China) |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
|
32K<Token≤128K |
$2.4 |
$12 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
qwen3-max-2026-01-23 |
Hong Kong (China) |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
|
32K<Token≤128K |
$2.4 |
$12 |
|||
|
128K<Token≤256K |
$3 |
$15 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-06-08 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount |
Global |
Non-Thinking mode only |
0<Token≤32K |
$0.359 |
$1.434 |
|
32K<Token≤128K |
$0.574 |
$2.294 |
|||
|
128K<Token≤256K |
$1.004 |
$4.014 |
|||
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 50% batch inference discount context caching discount |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
|
32K<Token≤128K |
$2.4 |
$12 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
qwen3-max-2026-01-23 |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$1.2 |
$6 |
|
32K<Token≤128K |
$2.4 |
$12 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
qwen3-max-2025-09-23 |
Global |
Non-Thinking mode only |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
|||
|
qwen3-max-preview context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
US (Virginia)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-06-08 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3.7-max-2026-05-20 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤1M |
$1.65 |
$4.951 |
|
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount |
Global |
Non-Thinking mode only |
0<Token≤32K |
$0.359 |
$1.434 |
|
32K<Token≤128K |
$0.574 |
$2.294 |
|||
|
128K<Token≤256K |
$1.004 |
$4.014 |
|||
|
qwen3-max-2025-09-23 |
Global |
Non-Thinking mode only |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
|||
|
qwen3-max-preview context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.434 |
$5.735 |
|||
|
128K<Token≤256K |
$2.151 |
$8.602 |
Japan (Tokyo)
| Model ID | Deployment scope | Mode | Input tokens per request | Input price (per 1 million tokens) | Output price (per 1 million tokens; chain of thought + answer) |
|---|---|---|---|---|---|
| qwen3.7-max (currently equivalent to qwen3.7-max-2026-05-20; context caching discount) | Global | Non-Thinking and Thinking modes | 0<Token≤1M | $1.65 | $4.951 |
| qwen3.7-max-2026-05-20 (context caching discount) | Global | Non-Thinking and Thinking modes | 0<Token≤1M | $1.65 | $4.951 |
Qwen-Plus
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
|||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
International |
0<Token≤256K |
$0.4 |
$1.6 |
$1.6 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$4.8 |
$4.8 |
|||
|
qwen3.7-plus-2026-05-26 context caching discount |
International |
0<Token≤256K |
$0.4 |
$1.6 |
$1.6 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$4.8 |
$4.8 |
|||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 |
International |
0<Token≤256K |
$0.5 |
$3 |
$3 |
1 million tokens |
|
256K<Token≤1M |
$2 |
$6 |
$6 |
|||
|
qwen3.6-plus-2026-04-02 |
International |
0<Token≤256K |
$0.5 |
$3 |
$3 |
1 million tokens |
|
256K<Token≤1M |
$2 |
$6 |
$6 |
|||
|
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 |
International |
0<Token≤256K |
$0.4 |
$2.4 |
$2.4 |
1 million tokens |
|
256K<Token≤1M |
$0.5 |
$3 |
$3 |
|||
|
qwen3.5-plus-2026-04-20 |
International |
0<Token≤256K |
$0.4 |
$2.4 |
$2.4 |
1 million tokens |
|
256K<Token≤1M |
$0.5 |
$3 |
$3 |
|||
|
qwen3.5-plus-2026-02-15 |
International |
0<Token≤256K |
$0.4 |
$2.4 |
$2.4 |
1 million tokens |
|
256K<Token≤1M |
$0.5 |
$3 |
$3 |
|||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
International |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
|||
|
qwen-plus-latest |
International |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
|||
|
qwen-plus-2025-12-01 |
International |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
|||
|
qwen-plus-2025-09-11 |
International |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
|||
|
qwen-plus-2025-07-28 |
International |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
|||
|
qwen-plus-2025-07-14 |
International |
No tiered pricing |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
qwen-plus-2025-04-28 |
International |
No tiered pricing |
$0.4 |
$1.2 |
$4 |
1 million tokens |
|
qwen-plus-2025-01-25 |
International |
No tiered pricing |
$0.4 |
$1.2 |
- |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Chinese mainland |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Chinese mainland |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 |
Chinese mainland |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.6-plus-2026-04-02 |
Chinese mainland |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen3.5-plus-2026-04-20 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen3.5-plus-2026-02-15 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-latest |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-12-01 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-09-11 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-07-28 |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-07-14 |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
$1.147 |
|
qwen-plus-2025-04-28 |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
$1.147 |
More models
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-plus-2025-01-25 |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
|
qwen-plus-2025-01-12 |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
|
qwen-plus-2024-12-20 |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
Hong Kong (China)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
Hong Kong (China) |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
|
qwen-plus-2025-12-01 |
Hong Kong (China) |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 |
Global |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen3.5-plus-2026-02-15 |
Global |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
EU |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
|
qwen-plus-2025-12-01 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-12-01 |
EU |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
|
qwen-plus-2025-09-11 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-07-28 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 |
Global |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen3.5-plus-2026-02-15 |
Global |
0<Token≤128K |
$0.115 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.287 |
$1.72 |
$1.72 |
||
|
256K<Token≤1M |
$0.573 |
$3.44 |
$3.44 |
||
|
qwen-plus Currently equivalent to qwen-plus-2025-12-01 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-us |
US |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
|
qwen-plus-2025-12-01 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-12-01-us |
US |
0<Token≤256K |
$0.4 |
$1.2 |
$4 |
|
256K<Token≤1M |
$1.2 |
$3.6 |
$12 |
||
|
qwen-plus-2025-09-11 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
|
qwen-plus-2025-07-28 |
Global |
0<Token≤128K |
$0.115 |
$0.287 |
$1.147 |
|
128K<Token≤256K |
$0.345 |
$2.868 |
$3.441 |
||
|
256K<Token≤1M |
$0.689 |
$6.881 |
$9.175 |
||
Japan (Tokyo)
|
Model ID |
Deployment scope |
Input token range per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Japan |
0<Token≤256K |
$0.4 |
$1.6 |
$1.6 |
|
256K<Token≤1M |
$1.2 |
$4.8 |
$4.8 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Japan |
0<Token≤256K |
$0.4 |
$1.6 |
$1.6 |
|
256K<Token≤1M |
$1.2 |
$4.8 |
$4.8 |
||
|
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.7-plus-2026-05-26 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.101 |
$1.101 |
|
256K<Token≤1M |
$0.826 |
$3.301 |
$3.301 |
||
|
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 context caching discount |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
|
qwen3.6-plus-2026-04-02 |
Global |
0<Token≤256K |
$0.276 |
$1.651 |
$1.651 |
|
256K<Token≤1M |
$1.101 |
$6.602 |
$6.602 |
||
Qwen-Flash
You are charged for input tokens and output tokens.
If the model supports batch calls, batch requests are billed at 50% of the real-time inference price for both input and output tokens. If the model supports context caching, only input tokens receive a discount. The two discounts cannot be combined in the same request.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 50% batch inference discount context caching discount |
International |
0<Token≤256K |
$0.25 |
$1.5 |
1 million tokens |
|
256K<Token≤1M |
$1 |
$4 |
|||
|
qwen3.6-flash-2026-04-16 |
International |
0<Token≤256K |
$0.25 |
$1.5 |
1 million tokens |
|
256K<Token≤1M |
$1 |
$4 |
|||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 50% batch inference discount context caching discount |
International |
0<Token≤1M |
$0.1 |
$0.4 |
1 million tokens |
|
qwen3.5-flash-2026-02-23 |
International |
0<Token≤1M |
$0.1 |
$0.4 |
1 million tokens |
|
qwen-flash Currently equivalent to qwen-flash-2025-07-28 50% batch inference discount context caching discount |
International |
0<Token≤256K |
$0.05 |
$0.4 |
1 million tokens |
|
256K<Token≤1M |
$0.25 |
$2 |
|||
|
qwen-flash-2025-07-28 |
International |
0<Token≤256K |
$0.05 |
$0.4 |
1 million tokens |
|
256K<Token≤1M |
$0.25 |
$2 |
China (Beijing)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 50% batch inference discount context caching discount |
Chinese mainland |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.6-flash-2026-04-16 |
Chinese mainland |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 |
Chinese mainland |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen3.5-flash-2026-02-23 |
Chinese mainland |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount |
Chinese mainland |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
||
|
qwen-flash-2025-07-28 |
Chinese mainland |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
Hong Kong (China)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 |
Global |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 context caching discount |
Hong Kong (China) |
0<Token≤1M |
$0.1 |
$0.4 |
|
qwen3.5-flash-2026-02-23 |
Hong Kong (China) |
0<Token≤1M |
$0.1 |
$0.4 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 |
Global |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.6-flash-2026-04-16 |
Global |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 |
Global |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 context caching discount |
EU |
0<Token≤1M |
$0.1 |
$0.4 |
|
qwen3.5-flash-2026-02-23 |
Global |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen3.5-flash-2026-02-23 |
EU |
0<Token≤1M |
$0.1 |
$0.4 |
|
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount |
Global |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
||
|
qwen-flash-2025-07-28 |
Global |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 |
Global |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.6-flash-2026-04-16 |
Global |
0<Token≤256K |
$0.165 |
$0.99 |
|
256K<Token≤1M |
$0.66 |
$3.961 |
||
|
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 |
Global |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen3.5-flash-2026-02-23 |
Global |
0<Token≤128K |
$0.029 |
$0.287 |
|
128K<Token≤256K |
$0.115 |
$1.147 |
||
|
256K<Token≤1M |
$0.172 |
$1.72 |
||
|
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount |
Global |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
||
|
qwen-flash-us |
US |
0<Token≤256K |
$0.05 |
$0.4 |
|
256K<Token≤1M |
$0.25 |
$2 |
||
|
qwen-flash-2025-07-28 |
Global |
0<Token≤128K |
$0.022 |
$0.216 |
|
128K<Token≤256K |
$0.087 |
$0.861 |
||
|
256K<Token≤1M |
$0.173 |
$1.721 |
||
|
qwen-flash-2025-07-28-us |
US |
0<Token≤256K |
$0.05 |
$0.4 |
|
256K<Token≤1M |
$0.25 |
$2 |
Japan (Tokyo)
| Model ID | Deployment scope | Input token range per request | Input price (per 1 million tokens) | Output price (per 1 million tokens) |
|---|---|---|---|---|
| qwen3.6-flash (currently equivalent to qwen3.6-flash-2026-04-16; context caching discount) | Global | 0<Token≤256K | $0.165 | $0.99 |
| | | 256K<Token≤1M | $0.66 | $3.961 |
| qwen3.6-flash-2026-04-16 | Global | 0<Token≤256K | $0.165 | $0.99 |
| | | 256K<Token≤1M | $0.66 | $3.961 |
Qwen-Turbo
Qwen-Turbo will no longer be updated. We recommend switching to Qwen-Flash.
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
| Model ID | Deployment scope | Input price (per 1 million tokens) | Output price, Non-Thinking mode (per 1 million tokens) | Output price, Thinking mode (chain of thought + answer, per 1 million tokens) | Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|---|---|---|---|---|---|
| qwen-turbo (currently equivalent to qwen-turbo-2025-04-28; 50% batch inference discount) | International | $0.05 | $0.2 | $0.5 | 1 million tokens |
China (Beijing)
| Model ID | Deployment scope | Input price (per 1 million tokens) | Output price, Non-Thinking mode (per 1 million tokens) | Output price, Thinking mode (chain of thought + answer, per 1 million tokens) |
|---|---|---|---|---|
| qwen-turbo (currently equivalent to qwen-turbo-2025-04-28) | Chinese mainland | $0.044 | $0.087 | $0.431 |
QwQ
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
| Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|---|---|---|---|---|
| qwq-plus (currently equivalent to qwq-plus-2025-03-05) | International | $0.8 | $2.4 | 1 million tokens |
China (Beijing)
| Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) |
|---|---|---|---|
| qwq-plus (currently equivalent to qwq-plus-2025-03-05) | Chinese mainland | $0.230 | $0.574 |
Qwen-Long
You are charged for input tokens and output tokens.
China (Beijing)
| Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) |
|---|---|---|---|---|
| qwen-long-latest | International | $0.072 | $0.287 | No free quota |
| qwen-long-2025-01-25 | International | $0.072 | $0.287 | No free quota |
Qwen-Omni
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.
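As a rough illustration of per-modality billing, the sketch below estimates the cost of one request from token counts that are already known. The prices are the qwen3.5-omni-plus International values from the Singapore table in this section. The function name `omni_cost` and its parameters are hypothetical, and the handling of "Audio only billed" is an assumed reading: when a response contains audio, only the audio output tokens are charged and the accompanying text is not. Converting raw audio, image, or video into token counts follows the rules in Billing and rate limits and is not modeled here.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# USD per 1 million tokens, qwen3.5-omni-plus, International deployment scope.
OMNI_PLUS_PRICES = {
    "input_text_image_video": Decimal("1.4"),
    "input_audio": Decimal("11"),
    "output_text": Decimal("8.3"),   # text-only output
    "output_audio": Decimal("44"),   # text + audio output
}


def omni_cost(text_image_video_in, audio_in, text_out, audio_out,
              prices=OMNI_PLUS_PRICES):
    """Estimate the cost of one request from per-modality token counts."""
    cost = (text_image_video_in * prices["input_text_image_video"]
            + audio_in * prices["input_audio"])
    if audio_out > 0:
        # Assumed reading of "Audio only billed": text tokens in a
        # text + audio response are not charged.
        cost += audio_out * prices["output_audio"]
    else:
        cost += text_out * prices["output_text"]
    return cost / MILLION
```

Keeping the price table as a plain dictionary makes it straightforward to swap in the values for another model or deployment scope.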
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
||
|
Text/Image/video |
Audio |
Text Multimodal input |
Text + audio Audio only billed |
|||
|
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15 |
International |
$1.4 |
$11 |
$8.3 |
$44 |
1 million tokens |
|
qwen3.5-omni-plus-2026-03-15 |
International |
$1.4 |
$11 |
$8.3 |
$44 |
1 million tokens |
|
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15 |
International |
$0.4 |
$3 |
$2.2 |
$11.9 |
1 million tokens |
|
qwen3.5-omni-flash-2026-03-15 |
International |
$0.4 |
$3 |
$2.2 |
$11.9 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
||
|
Text/Image/video |
Audio |
Text Multimodal input |
Text + audio Audio only billed |
||
|
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15 |
Chinese mainland |
$0.96 |
$7.29 |
$5.5 |
$29.29 |
|
qwen3.5-omni-plus-2026-03-15 |
Chinese mainland |
$0.96 |
$7.29 |
$5.5 |
$29.29 |
|
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15 |
Chinese mainland |
$0.3 |
$2.48 |
$1.83 |
$9.9 |
|
qwen3.5-omni-flash-2026-03-15 |
Chinese mainland |
$0.3 |
$2.48 |
$1.83 |
$9.9 |
Qwen-Omni-Realtime
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
||
|
Input: text/image |
Input: audio |
Output: text (multimodal input) |
Output: text + audio (audio only billed) |
|||
|
qwen3.5-omni-plus-realtime |
International |
$2.1 |
$16.5 |
$12.4 |
$62 |
1 million tokens |
|
qwen3.5-omni-plus-realtime-2026-03-15 |
International |
$2.1 |
$16.5 |
$12.4 |
$62 |
1 million tokens |
|
qwen3.5-omni-flash-realtime |
International |
$0.55 |
$4.5 |
$3.3 |
$17.7 |
1 million tokens |
|
qwen3.5-omni-flash-realtime-2026-03-15 |
International |
$0.55 |
$4.5 |
$3.3 |
$17.7 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
||
|
Input: text/image |
Input: audio |
Output: text (multimodal input) |
Output: text + audio (audio only billed) |
||
|
qwen3.5-omni-plus-realtime |
Chinese mainland |
$1.38 |
$11 |
$8.25 |
$41.26 |
|
qwen3.5-omni-plus-realtime-2026-03-15 |
Chinese mainland |
$1.38 |
$11 |
$8.25 |
$41.26 |
|
qwen3.5-omni-flash-realtime |
Chinese mainland |
$0.45 |
$3.71 |
$2.75 |
$14.71 |
|
qwen3.5-omni-flash-realtime-2026-03-15 |
Chinese mainland |
$0.45 |
$3.71 |
$2.75 |
$14.71 |
QVQ
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qvq-max Currently equivalent to qvq-max-2025-03-25 |
International |
$1.2 |
$4.8 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qvq-max Currently equivalent to qvq-max-2025-03-25 |
Chinese mainland |
$1.147 |
$4.588 |
|
qvq-plus Currently equivalent to qvq-plus-2025-05-15 |
Chinese mainland |
$0.287 |
$0.717 |
Qwen-VL
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
1 million tokens |
|
32K<Token≤128K |
$0.3 |
$2.4 |
||||
|
128K<Token≤256K |
$0.6 |
$4.8 |
||||
|
qwen3-vl-plus-2025-12-19 |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
1 million tokens |
|
32K<Token≤128K |
$0.3 |
$2.4 |
||||
|
128K<Token≤256K |
$0.6 |
$4.8 |
||||
|
qwen3-vl-plus-2025-09-23 |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
1 million tokens |
|
32K<Token≤128K |
$0.3 |
$2.4 |
||||
|
128K<Token≤256K |
$0.6 |
$4.8 |
||||
|
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
1 million tokens |
|
32K<Token≤128K |
$0.075 |
$0.6 |
||||
|
128K<Token≤256K |
$0.12 |
$0.96 |
||||
|
qwen3-vl-flash-2026-01-22 |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
1 million tokens |
|
32K<Token≤128K |
$0.075 |
$0.6 |
||||
|
128K<Token≤256K |
$0.12 |
$0.96 |
||||
|
qwen3-vl-flash-2025-10-15 |
International |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
1 million tokens |
|
32K<Token≤128K |
$0.075 |
$0.6 |
||||
|
128K<Token≤256K |
$0.12 |
$0.96 |
More models
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-vl-max Currently equivalent to qwen-vl-max-2025-08-13 context caching discount |
International |
No tiered pricing |
$0.8 |
$3.2 |
1 million tokens |
|
qwen-vl-plus Currently equivalent to qwen-vl-plus-2025-08-15 context caching discount |
International |
No tiered pricing |
$0.21 |
$0.63 |
1 million tokens |
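The tiered rows above can be read as a lookup: the number of input tokens in a single request selects the tier, and every input and output token in that request is billed at that tier's prices. The sketch below applies this to the qwen3-vl-plus International prices. The helper name `tiered_cost` is hypothetical, and treating 1K as 1,000 tokens is an assumption, because this page does not state whether K means 1,000 or 1,024.

```python
K = 1_000  # assumption: the page does not define K as 1,000 or 1,024

# (upper bound on input tokens, input USD per 1M, output USD per 1M)
# for qwen3-vl-plus in the International deployment scope.
QWEN3_VL_PLUS_TIERS = [
    (32 * K, 0.2, 1.6),
    (128 * K, 0.3, 2.4),
    (256 * K, 0.6, 4.8),
]


def tiered_cost(input_tokens, output_tokens, tiers=QWEN3_VL_PLUS_TIERS):
    """Estimate the cost in USD of one request under tiered pricing."""
    if input_tokens <= 0:
        raise ValueError("input_tokens must be positive")
    for upper_bound, input_price, output_price in tiers:
        if input_tokens <= upper_bound:
            # All tokens in the request are billed at this tier's prices.
            total = input_tokens * input_price + output_tokens * output_price
            return total / 1_000_000
    raise ValueError("input_tokens exceeds the largest listed tier")


# 100K input tokens fall into the 32K < tokens <= 128K tier.
print(f"${tiered_cost(100 * K, 2 * K):.4f}")  # prints $0.0348
```

Because the whole request is repriced when it crosses a tier boundary, a request just above 32K input tokens costs noticeably more than one just below it.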
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
|||
|
qwen3-vl-plus-2025-12-19 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
|||
|
qwen3-vl-plus-2025-09-23 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
|||
|
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash-2026-01-22 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash-2025-10-15 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
More models
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-vl-max Currently equivalent to qwen-vl-max-2025-08-13 context caching discount |
Chinese mainland |
No tiered pricing |
$0.23 |
$0.574 |
|
qwen-vl-plus Currently equivalent to qwen-vl-plus-2025-08-15 context caching discount |
Chinese mainland |
No tiered pricing |
$0.115 |
$0.287 |
Hong Kong (China)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount |
Hong Kong (China) |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
|
32K<Token≤128K |
$0.3 |
$2.4 |
|||
|
128K<Token≤256K |
$0.6 |
$4.8 |
|||
|
qwen3-vl-plus-2025-12-19 |
Hong Kong (China) |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
|
32K<Token≤128K |
$0.3 |
$2.4 |
|||
|
128K<Token≤256K |
$0.6 |
$4.8 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2025-10-15 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-flash-2026-01-22 |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-flash-2025-10-15 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash-2025-10-15 |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
|||
|
qwen3-vl-plus context caching discount |
EU |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.2 |
$1.6 |
|
32K<Token≤128K |
$0.3 |
$2.4 |
|||
|
128K<Token≤256K |
$0.6 |
$4.8 |
|||
|
qwen3-vl-plus-2025-09-23 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
US (Virginia)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2025-10-15 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash-us context caching discount |
US |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-flash-2026-01-22-us |
US |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-flash-2025-10-15 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.022 |
$0.215 |
|
32K<Token≤128K |
$0.043 |
$0.43 |
|||
|
128K<Token≤256K |
$0.086 |
$0.859 |
|||
|
qwen3-vl-flash-2025-10-15-us |
US |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.05 |
$0.4 |
|
32K<Token≤128K |
$0.075 |
$0.6 |
|||
|
128K<Token≤256K |
$0.12 |
$0.96 |
|||
|
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
|||
|
qwen3-vl-plus-2025-09-23 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.143 |
$1.434 |
|
32K<Token≤128K |
$0.215 |
$2.15 |
|||
|
128K<Token≤256K |
$0.43 |
$4.301 |
Qwen-OCR
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-vl-ocr |
International |
$0.07 |
$0.16 |
1 million tokens |
|
qwen-vl-ocr-2025-11-20 |
International |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3.5-ocr |
Chinese mainland |
$0.069 |
$0.275 |
|
qwen-vl-ocr |
Chinese mainland |
$0.043 |
$0.072 |
|
qwen-vl-ocr-latest |
Chinese mainland |
$0.043 |
$0.072 |
|
qwen-vl-ocr-2025-11-20 |
Chinese mainland |
$0.043 |
$0.072 |
|
qwen-vl-ocr-2025-08-28 |
Chinese mainland |
$0.717 |
$0.717 |
|
qwen-vl-ocr-2025-04-13 |
Chinese mainland |
$0.717 |
$0.717 |
|
qwen-vl-ocr-2024-10-28 |
Chinese mainland |
$0.717 |
$0.717 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-vl-ocr |
Global |
$0.043 |
$0.072 |
|
qwen-vl-ocr-2025-11-20 |
Global |
$0.043 |
$0.072 |
US (Virginia)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-vl-ocr |
Global |
$0.043 |
$0.072 |
|
qwen-vl-ocr-2025-11-20 |
Global |
$0.043 |
$0.072 |
Qwen Math
You are charged for input tokens and output tokens.
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) |
|
qwen-math-plus |
Chinese mainland |
$0.574 |
$1.721 |
No free quota |
|
qwen-math-plus-latest |
Chinese mainland |
$0.574 |
$1.721 |
No free quota |
|
qwen-math-plus-2024-09-19 |
Chinese mainland |
$0.574 |
$1.721 |
No free quota |
|
qwen-math-plus-2024-08-16 |
Chinese mainland |
$0.574 |
$1.721 |
No free quota |
|
qwen-math-turbo |
Chinese mainland |
$0.287 |
$0.861 |
No free quota |
Qwen-Coder
You are charged for input tokens and output tokens.
If the model supports context cache, only input tokens receive a discount.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount |
International |
0<Token≤32K |
$1 |
$5 |
1 million tokens |
|
32K<Token≤128K |
$1.8 |
$9 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
256K<Token≤1M |
$6 |
$60 |
|||
|
qwen3-coder-plus-2025-09-23 |
International |
0<Token≤32K |
$1 |
$5 |
1 million tokens |
|
32K<Token≤128K |
$1.8 |
$9 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
256K<Token≤1M |
$6 |
$60 |
|||
|
qwen3-coder-plus-2025-07-22 |
International |
0<Token≤32K |
$1 |
$5 |
1 million tokens |
|
32K<Token≤128K |
$1.8 |
$9 |
|||
|
128K<Token≤256K |
$3 |
$15 |
|||
|
256K<Token≤1M |
$6 |
$60 |
|||
|
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 |
International |
0<Token≤32K |
$0.3 |
$1.5 |
1 million tokens |
|
32K<Token≤128K |
$0.5 |
$2.5 |
|||
|
128K<Token≤256K |
$0.8 |
$4 |
|||
|
256K<Token≤1M |
$1.6 |
$9.6 |
|||
|
qwen3-coder-flash-2025-07-28 |
International |
0<Token≤32K |
$0.3 |
$1.5 |
1 million tokens |
|
32K<Token≤128K |
$0.5 |
$2.5 |
|||
|
128K<Token≤256K |
$0.8 |
$4 |
|||
|
256K<Token≤1M |
$1.6 |
$9.6 |
China (Beijing)
qwen3-coder series models
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount |
Chinese mainland |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-09-23 |
Chinese mainland |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-07-22 |
Chinese mainland |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 |
Chinese mainland |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
||
|
qwen3-coder-flash-2025-07-28 |
Chinese mainland |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
Legacy qwen-coder series models
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-coder-plus Currently equivalent to qwen-coder-plus-2024-11-06 |
Chinese mainland |
No tiered pricing |
$0.502 |
$1.004 |
|
qwen-coder-turbo Currently equivalent to qwen-coder-turbo-2024-09-19 |
Chinese mainland |
No tiered pricing |
$0.287 |
$0.861 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-09-23 |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-07-22 |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 context caching discount |
Global |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
||
|
qwen3-coder-flash-2025-07-28 |
Global |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-09-23 |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-plus-2025-07-22 |
Global |
0<Token≤32K |
$0.574 |
$2.294 |
|
32K<Token≤128K |
$0.861 |
$3.441 |
||
|
128K<Token≤256K |
$1.434 |
$5.735 |
||
|
256K<Token≤1M |
$2.868 |
$28.671 |
||
|
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 context caching discount |
Global |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
||
|
qwen3-coder-flash-2025-07-28 |
Global |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
256K<Token≤1M |
$0.717 |
$3.584 |
Qwen Translation
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-mt-plus |
International |
$2.46 |
$7.37 |
1 million tokens |
|
qwen-mt-flash |
International |
$0.16 |
$0.49 |
1 million tokens |
|
qwen-mt-lite |
International |
$0.12 |
$0.36 |
1 million tokens |
|
qwen-mt-turbo |
International |
$0.16 |
$0.49 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-mt-plus |
Chinese mainland |
$0.259 |
$0.775 |
|
qwen-mt-flash |
Chinese mainland |
$0.101 |
$0.280 |
|
qwen-mt-lite |
Chinese mainland |
$0.086 |
$0.229 |
|
qwen-mt-turbo |
Chinese mainland |
$0.101 |
$0.280 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-mt-plus |
Global |
$0.259 |
$0.775 |
|
qwen-mt-flash |
Global |
$0.101 |
$0.280 |
|
qwen-mt-lite |
Global |
$0.086 |
$0.229 |
US (Virginia)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-mt-flash |
Global |
$0.101 |
$0.280 |
|
qwen-mt-lite |
Global |
$0.086 |
$0.229 |
|
qwen-mt-lite-us |
US |
$0.12 |
$0.36 |
|
qwen-mt-plus |
Global |
$0.259 |
$0.775 |
Qwen Data Mining
You are charged for input tokens and output tokens.
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) |
|
qwen-doc-turbo |
Chinese mainland |
$0.087 |
$0.144 |
No free quota |
Qwen Deep Research
You are charged for input tokens and output tokens.
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) |
|
qwen-deep-research |
Chinese mainland |
$7.742 |
$23.367 |
No free quota |
Text generation - Qwen (open source)
Qwen3.6
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
|||||
|
qwen3.6-35b-a3b |
International |
0<Token≤256K |
$0.375 |
$2.25 |
$2.25 |
1 million tokens |
|
qwen3.6-27b |
International |
0<Token≤256K |
$0.6 |
$3.6 |
$3.6 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.6-35b-a3b |
Chinese mainland |
0<Token≤256K |
$0.248 |
$1.485 |
$1.485 |
|
qwen3.6-27b |
Chinese mainland |
0<Token≤256K |
$0.412564 |
$2.475384 |
$2.475384 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.6-35b-a3b |
Global |
0<Token≤256K |
$0.248 |
$1.485 |
$1.485 |
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.6-35b-a3b |
Global |
0<Token≤256K |
$0.248 |
$1.485 |
$1.485 |
Qwen3.5
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
|||||
|
qwen3.5-397b-a17b |
International |
0<Token≤256K |
$0.6 |
$3.6 |
$3.6 |
1 million tokens |
|
qwen3.5-122b-a10b |
International |
0<Token≤256K |
$0.4 |
$3.2 |
$3.2 |
1 million tokens |
|
qwen3.5-27b |
International |
0<Token≤256K |
$0.3 |
$2.4 |
$2.4 |
1 million tokens |
|
qwen3.5-35b-a3b |
International |
0<Token≤256K |
$0.25 |
$2 |
$2 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.5-397b-a17b |
Chinese mainland |
0<Token≤128K |
$0.172 |
$1.032 |
$1.032 |
|
128K<Token≤256K |
$0.43 |
$2.58 |
$2.58 |
||
|
qwen3.5-122b-a10b |
Chinese mainland |
0<Token≤128K |
$0.115 |
$0.917 |
$0.917 |
|
128K<Token≤256K |
$0.287 |
$2.294 |
$2.294 |
||
|
qwen3.5-27b |
Chinese mainland |
0<Token≤128K |
$0.086 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.258 |
$2.064 |
$2.064 |
||
|
qwen3.5-35b-a3b |
Chinese mainland |
0<Token≤128K |
$0.057 |
$0.459 |
$0.459 |
|
128K<Token≤256K |
$0.229 |
$1.835 |
$1.835 |
||
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.5-397b-a17b |
Global |
0<Token≤128K |
$0.172 |
$1.032 |
$1.032 |
|
128K<Token≤256K |
$0.43 |
$2.58 |
$2.58 |
||
|
qwen3.5-122b-a10b |
Global |
0<Token≤128K |
$0.115 |
$0.917 |
$0.917 |
|
128K<Token≤256K |
$0.287 |
$2.294 |
$2.294 |
||
|
qwen3.5-27b |
Global |
0<Token≤128K |
$0.086 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.258 |
$2.064 |
$2.064 |
||
|
qwen3.5-35b-a3b |
Global |
0<Token≤128K |
$0.057 |
$0.459 |
$0.459 |
|
128K<Token≤256K |
$0.229 |
$1.835 |
$1.835 |
||
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3.5-397b-a17b |
Global |
0<Token≤128K |
$0.172 |
$1.032 |
$1.032 |
|
128K<Token≤256K |
$0.43 |
$2.58 |
$2.58 |
||
|
qwen3.5-122b-a10b |
Global |
0<Token≤128K |
$0.115 |
$0.917 |
$0.917 |
|
128K<Token≤256K |
$0.287 |
$2.294 |
$2.294 |
||
|
qwen3.5-27b |
Global |
0<Token≤128K |
$0.086 |
$0.688 |
$0.688 |
|
128K<Token≤256K |
$0.258 |
$2.064 |
$2.064 |
||
|
qwen3.5-35b-a3b |
Global |
0<Token≤128K |
$0.057 |
$0.459 |
$0.459 |
|
128K<Token≤256K |
$0.229 |
$1.835 |
$1.835 |
||
Qwen3
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
|
Non-Thinking mode |
Thinking mode |
|||||
|
qwen3-next-80b-a3b-thinking |
International |
Thinking mode only |
$0.15 |
- |
$1.2 |
1 million tokens |
|
qwen3-next-80b-a3b-instruct |
International |
Non-Thinking mode only |
$0.15 |
$1.2 |
- |
1 million tokens |
|
qwen3-235b-a22b-thinking-2507 |
International |
Thinking mode only |
$0.23 |
- |
$2.3 |
1 million tokens |
|
qwen3-235b-a22b-instruct-2507 |
International |
Non-Thinking mode only |
$0.23 |
$0.92 |
- |
1 million tokens |
|
qwen3-30b-a3b-thinking-2507 |
International |
Thinking mode only |
$0.2 |
- |
$2.4 |
1 million tokens |
|
qwen3-30b-a3b-instruct-2507 |
International |
Non-Thinking mode only |
$0.2 |
$0.8 |
- |
1 million tokens |
|
qwen3-235b-a22b |
International |
Non-Thinking and Thinking modes |
$0.7 |
$2.8 |
$8.4 |
1 million tokens |
|
qwen3-32b |
International |
Non-Thinking and Thinking modes |
$0.16 |
$0.64 |
$0.64 |
1 million tokens |
|
qwen3-30b-a3b |
International |
Non-Thinking and Thinking modes |
$0.2 |
$0.8 |
$2.4 |
1 million tokens |
|
qwen3-14b |
International |
Non-Thinking and Thinking modes |
$0.35 |
$1.4 |
$4.2 |
1 million tokens |
|
qwen3-8b |
International |
Non-Thinking and Thinking modes |
$0.18 |
$0.7 |
$2.1 |
1 million tokens |
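For models that support both modes, the output price depends on whether the request runs in thinking mode. The sketch below shows one way to pick the right output price, using three rows copied from the International table above. The helper name `qwen3_cost` is hypothetical, and it assumes that in thinking mode the output token count already includes the chain of thought as well as the answer.

```python
# (input, non-thinking output, thinking output) in USD per 1M tokens,
# International deployment scope. None mirrors a "-" cell in the table:
# the model does not offer that mode.
QWEN3_PRICES = {
    "qwen3-235b-a22b": (0.7, 2.8, 8.4),
    "qwen3-32b": (0.16, 0.64, 0.64),
    "qwen3-next-80b-a3b-instruct": (0.15, 1.2, None),
}


def qwen3_cost(model, input_tokens, output_tokens, thinking):
    """Estimate the cost in USD of one request (hypothetical helper)."""
    input_price, non_thinking_price, thinking_price = QWEN3_PRICES[model]
    output_price = thinking_price if thinking else non_thinking_price
    if output_price is None:
        raise ValueError(f"{model} has no price listed for this mode")
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


# Same request, both modes: only the output price changes.
for thinking in (False, True):
    cost = qwen3_cost("qwen3-235b-a22b", 10_000, 2_000, thinking)
    print(f"thinking={thinking}: ${cost:.4f}")
```

Thinking mode usually also produces more output tokens, so the real difference between the two modes tends to be larger than the price ratio alone suggests.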
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3-next-80b-a3b-thinking |
Chinese mainland |
Thinking mode only |
$0.144 |
- |
$1.434 |
|
qwen3-next-80b-a3b-instruct |
Chinese mainland |
Non-Thinking mode only |
$0.144 |
$0.574 |
- |
|
qwen3-235b-a22b-thinking-2507 |
Chinese mainland |
Thinking mode only |
$0.287 |
- |
$2.868 |
|
qwen3-235b-a22b-instruct-2507 |
Chinese mainland |
Non-Thinking mode only |
$0.287 |
$1.147 |
- |
|
qwen3-30b-a3b-thinking-2507 |
Chinese mainland |
Thinking mode only |
$0.108 |
- |
$1.076 |
|
qwen3-30b-a3b-instruct-2507 |
Chinese mainland |
Non-Thinking mode only |
$0.108 |
$0.431 |
- |
|
qwen3-235b-a22b |
Chinese mainland |
Non-Thinking and Thinking modes |
$0.287 |
$1.147 |
$2.868 |
|
qwen3-32b |
Chinese mainland |
Non-Thinking and Thinking modes |
$0.287 |
$1.147 |
$2.868 |
|
qwen3-30b-a3b |
Chinese mainland |
Non-Thinking and Thinking modes |
$0.108 |
$0.431 |
$1.076 |
|
qwen3-14b |
Chinese mainland |
Non-Thinking and Thinking modes |
$0.144 |
$0.574 |
$1.434 |
|
qwen3-8b |
Chinese mainland |
Non-Thinking and Thinking modes |
$0.072 |
$0.287 |
$0.717 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3-next-80b-a3b-thinking |
Global |
Thinking mode only |
$0.144 |
- |
$1.434 |
|
qwen3-next-80b-a3b-instruct |
Global |
Non-Thinking mode only |
$0.144 |
$0.574 |
- |
|
qwen3-235b-a22b-thinking-2507 |
Global |
Thinking mode only |
$0.23 |
- |
$2.3 |
|
qwen3-235b-a22b-instruct-2507 |
Global |
Non-Thinking mode only |
$0.23 |
$0.92 |
- |
|
qwen3-30b-a3b-thinking-2507 |
Global |
Thinking mode only |
$0.108 |
- |
$1.076 |
|
qwen3-30b-a3b-instruct-2507 |
Global |
Non-Thinking mode only |
$0.108 |
$0.431 |
- |
|
qwen3-235b-a22b |
Global |
Non-Thinking and Thinking modes |
$0.287 |
$1.147 |
$2.868 |
|
qwen3-32b |
Global |
Non-Thinking and Thinking modes |
$0.16 |
$0.64 |
$0.64 |
|
qwen3-30b-a3b |
Global |
Non-Thinking and Thinking modes |
$0.108 |
$0.431 |
$1.076 |
|
qwen3-14b |
Global |
Non-Thinking and Thinking modes |
$0.144 |
$0.574 |
$1.434 |
|
qwen3-8b |
Global |
Non-Thinking and Thinking modes |
$0.072 |
$0.287 |
$0.717 |
US (Virginia)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
|
Non-Thinking mode |
Thinking mode (chain of thought + answer) |
||||
|
qwen3-next-80b-a3b-thinking |
Global |
Thinking mode only |
$0.144 |
- |
$1.434 |
|
qwen3-next-80b-a3b-instruct |
Global |
Non-Thinking mode only |
$0.144 |
$0.574 |
- |
|
qwen3-235b-a22b-thinking-2507 |
Global |
Thinking mode only |
$0.23 |
- |
$2.3 |
|
qwen3-235b-a22b-instruct-2507 |
Global |
Non-Thinking mode only |
$0.23 |
$0.92 |
- |
|
qwen3-30b-a3b-thinking-2507 |
Global |
Thinking mode only |
$0.108 |
- |
$1.076 |
|
qwen3-30b-a3b-instruct-2507 |
Global |
Non-Thinking mode only |
$0.108 |
$0.431 |
- |
|
qwen3-235b-a22b |
Global |
Non-Thinking and Thinking modes |
$0.287 |
$1.147 |
$2.868 |
|
qwen3-32b |
Global |
Non-Thinking and Thinking modes |
$0.16 |
$0.64 |
$0.64 |
|
qwen3-30b-a3b |
Global |
Non-Thinking and Thinking modes |
$0.108 |
$0.431 |
$1.076 |
|
qwen3-14b |
Global |
Non-Thinking and Thinking modes |
$0.144 |
$0.574 |
$1.434 |
|
qwen3-8b |
Global |
Non-Thinking and Thinking modes |
$0.072 |
$0.287 |
$0.717 |
Qwen-Omni
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
||||
|
Input: text |
Input: audio |
Input: image/video |
Output: text (text-only input) |
Output: text (multimodal input) |
Output: text + audio (audio only billed) |
|||
|
qwen2.5-omni-7b |
International |
$0.10 |
$6.76 |
$0.28 |
$0.40 |
$0.84 |
$13.51 |
1 million tokens (regardless of modality) |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
||||
|
Input: text |
Input: audio |
Input: image/video |
Output: text Text-only input |
Output: text Multimodal input |
Output: text + audio Audio only billed |
||
|
qwen2.5-omni-7b |
Chinese mainland |
$0.087 |
$5.448 |
$0.287 |
$0.345 |
$0.861 |
$10.895 |
Qwen3-Omni-Captioner
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-omni-30b-a3b-captioner |
International |
$3.81 |
$3.06 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-omni-30b-a3b-captioner |
Chinese mainland |
$2.265 |
$1.821 |
Qwen-VL
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-vl-235b-a22b-thinking |
International |
Thinking mode only |
$0.4 |
$4 |
1 million tokens |
|
qwen3-vl-235b-a22b-instruct |
International |
Non-Thinking mode only |
$0.4 |
$1.6 |
1 million tokens |
|
qwen3-vl-32b-thinking |
International |
Thinking mode only |
$0.16 |
$0.64 |
1 million tokens |
|
qwen3-vl-32b-instruct |
International |
Non-Thinking mode only |
$0.16 |
$0.64 |
1 million tokens |
|
qwen3-vl-30b-a3b-thinking |
International |
Thinking mode only |
$0.2 |
$2.4 |
1 million tokens |
|
qwen3-vl-30b-a3b-instruct |
International |
Non-Thinking mode only |
$0.2 |
$0.8 |
1 million tokens |
|
qwen3-vl-8b-thinking |
International |
Thinking mode only |
$0.18 |
$2.1 |
1 million tokens |
|
qwen3-vl-8b-instruct |
International |
Non-Thinking mode only |
$0.18 |
$0.7 |
1 million tokens |
More models
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-235b-a22b-thinking |
Chinese mainland |
Thinking mode only |
$0.287 |
$2.867 |
|
qwen3-vl-235b-a22b-instruct |
Chinese mainland |
Non-Thinking mode only |
$0.287 |
$1.147 |
|
qwen3-vl-32b-thinking |
Chinese mainland |
Thinking mode only |
$0.287 |
$2.868 |
|
qwen3-vl-32b-instruct |
Chinese mainland |
Non-Thinking mode only |
$0.287 |
$1.147 |
|
qwen3-vl-30b-a3b-thinking |
Chinese mainland |
Thinking mode only |
$0.108 |
$1.076 |
|
qwen3-vl-30b-a3b-instruct |
Chinese mainland |
Non-Thinking mode only |
$0.108 |
$0.431 |
|
qwen3-vl-8b-thinking |
Chinese mainland |
Thinking mode only |
$0.072 |
$0.717 |
|
qwen3-vl-8b-instruct |
Chinese mainland |
Non-Thinking mode only |
$0.072 |
$0.287 |
More models
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen2-vl-72b-instruct |
Chinese mainland |
$2.294 |
$6.881 |
|
qwen2-vl-7b-instruct |
Chinese mainland |
Limited-time free |
|
|
qwen2-vl-2b-instruct |
Chinese mainland |
||
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-235b-a22b-thinking |
Global |
Thinking mode only |
$0.287 |
$2.867 |
|
qwen3-vl-235b-a22b-instruct |
Global |
Non-Thinking mode only |
$0.287 |
$1.147 |
|
qwen3-vl-32b-thinking |
Global |
Thinking mode only |
$0.16 |
$0.64 |
|
qwen3-vl-32b-instruct |
Global |
Non-Thinking mode only |
$0.16 |
$0.64 |
|
qwen3-vl-30b-a3b-thinking |
Global |
Thinking mode only |
$0.108 |
$1.076 |
|
qwen3-vl-30b-a3b-instruct |
Global |
Non-Thinking mode only |
$0.108 |
$0.431 |
|
qwen3-vl-8b-thinking |
Global |
Thinking mode only |
$0.072 |
$0.717 |
|
qwen3-vl-8b-instruct |
Global |
Non-Thinking mode only |
$0.072 |
$0.287 |
US (Virginia)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
qwen3-vl-235b-a22b-thinking |
Global |
Thinking mode only |
$0.287 |
$2.867 |
|
qwen3-vl-235b-a22b-instruct |
Global |
Non-Thinking mode only |
$0.287 |
$1.147 |
|
qwen3-vl-32b-thinking |
Global |
Thinking mode only |
$0.16 |
$0.64 |
|
qwen3-vl-32b-instruct |
Global |
Non-Thinking mode only |
$0.16 |
$0.64 |
|
qwen3-vl-30b-a3b-thinking |
Global |
Thinking mode only |
$0.108 |
$1.076 |
|
qwen3-vl-30b-a3b-instruct |
Global |
Non-Thinking mode only |
$0.108 |
$0.431 |
|
qwen3-vl-8b-thinking |
Global |
Thinking mode only |
$0.072 |
$0.717 |
|
qwen3-vl-8b-instruct |
Global |
Non-Thinking mode only |
$0.072 |
$0.287 |
Qwen-Coder
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-coder-next |
International |
0<Token≤32K |
$0.3 |
$1.5 |
1 million tokens |
|
32K<Token≤128K |
$0.5 |
$2.5 |
|||
|
128K<Token≤256K |
$0.8 |
$4 |
|||
|
qwen3-coder-480b-a35b-instruct |
International |
0<Token≤32K |
$1.5 |
$7.5 |
1 million tokens |
|
32K<Token≤128K |
$2.7 |
$13.5 |
|||
|
128K<Token≤200K |
$4.5 |
$22.5 |
|||
|
qwen3-coder-30b-a3b-instruct |
International |
0<Token≤32K |
$0.45 |
$2.25 |
1 million tokens |
|
32K<Token≤128K |
$0.75 |
$3.75 |
|||
|
128K<Token≤200K |
$1.2 |
$6 |
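The tiers above can be applied as in the following sketch, which uses the qwen3-coder-next prices for the International scope. It follows the tiered pricing rules stated at the top of this page: the tier is chosen by the number of input tokens in the request, and all tokens in that request are billed at that tier's prices. Treating 32K as 32,000 tokens is an assumption, and `tiered_cost` is a hypothetical helper name.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# (upper bound on input tokens, input price, output price), prices per 1M tokens.
# Assumption: "K" means 1,000 tokens; the page does not define it.
TIERS = [
    (32_000, Decimal("0.3"), Decimal("1.5")),
    (128_000, Decimal("0.5"), Decimal("2.5")),
    (256_000, Decimal("0.8"), Decimal("4")),
]


def tiered_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request under tiered pricing."""
    if input_tokens <= 0:
        raise ValueError("input_tokens must be positive")
    for upper_bound, input_price, output_price in TIERS:
        if input_tokens <= upper_bound:
            # Every token in the request is billed at this tier's prices.
            return (input_price * input_tokens
                    + output_price * output_tokens) / MILLION
    raise ValueError("input_tokens exceeds the largest tier")


print(tiered_cost(100_000, 2_000))
```

A request with 100,000 input tokens falls into the second tier, so the 2,000 output tokens are also billed at the second-tier output price, for a total of $0.055.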
China (Beijing)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-next |
Chinese mainland |
0<Token≤32K |
$0.144 |
$0.574 |
|
32K<Token≤128K |
$0.216 |
$0.861 |
||
|
128K<Token≤256K |
$0.359 |
$1.434 |
||
|
qwen3-coder-480b-a35b-instruct |
Chinese mainland |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.291 |
$5.161 |
||
|
128K<Token≤200K |
$2.151 |
$8.602 |
||
|
qwen3-coder-30b-a3b-instruct |
Chinese mainland |
0<Token≤32K |
$0.216 |
$0.861 |
|
32K<Token≤128K |
$0.323 |
$1.291 |
||
|
128K<Token≤200K |
$0.538 |
$2.151 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-30b-a3b-instruct |
Global |
0<Token≤32K |
$0.216 |
$0.861 |
|
32K<Token≤128K |
$0.323 |
$1.291 |
||
|
128K<Token≤200K |
$0.538 |
$2.151 |
||
|
qwen3-coder-480b-a35b-instruct |
Global |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.291 |
$5.161 |
||
|
128K<Token≤200K |
$2.151 |
$8.602 |
||
|
qwen3-coder-next |
EU |
0<Token≤32K |
$0.3 |
$1.5 |
|
32K<Token≤128K |
$0.5 |
$2.5 |
||
|
128K<Token≤256K |
$0.8 |
$4 |
US (Virginia)
|
Model ID |
Deployment scope |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen3-coder-480b-a35b-instruct |
Global |
0<Token≤32K |
$0.861 |
$3.441 |
|
32K<Token≤128K |
$1.291 |
$5.161 |
||
|
128K<Token≤200K |
$2.151 |
$8.602 |
||
|
qwen3-coder-30b-a3b-instruct |
Global |
0<Token≤32K |
$0.216 |
$0.861 |
|
32K<Token≤128K |
$0.323 |
$1.291 |
||
|
128K<Token≤200K |
$0.538 |
$2.151 |
Text generation - third-party models
DeepSeek
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
deepseek-v4-pro context caching discount |
International |
$2.400 |
$4.800 |
1 million tokens |
|
deepseek-v4-flash context caching discount |
International |
$0.200 |
$0.400 |
1 million tokens |
|
deepseek-v3.2 context caching discount |
International |
$0.57 |
$1.71 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) |
|
deepseek-v4-pro context caching discount |
Chinese mainland |
$1.65 |
$3.301 |
No free quota |
|
deepseek-v4-flash context caching discount |
Chinese mainland |
$0.138 |
$0.275 |
No free quota |
|
deepseek-v3.2 context caching discount |
Chinese mainland |
$0.287 |
$0.431 |
No free quota |
|
deepseek-v3.2-exp |
Chinese mainland |
$0.287 |
$0.431 |
No free quota |
|
deepseek-v3.1 |
Chinese mainland |
$0.574 |
$1.721 |
No free quota |
|
deepseek-r1 |
Chinese mainland |
$0.574 |
$2.294 |
No free quota |
|
deepseek-r1-0528 |
Chinese mainland |
$0.574 |
$2.294 |
No free quota |
|
deepseek-v3 |
Chinese mainland |
$0.287 |
$1.147 |
No free quota |
|
deepseek-r1-distill-qwen-1.5b |
Chinese mainland |
Limited-time free |
||
|
deepseek-r1-distill-qwen-7b |
Chinese mainland |
$0.072 |
$0.144 |
No free quota |
|
deepseek-r1-distill-qwen-14b |
Chinese mainland |
$0.144 |
$0.431 |
No free quota |
|
deepseek-r1-distill-qwen-32b |
Chinese mainland |
$0.287 |
$0.861 |
No free quota |
|
deepseek-r1-distill-llama-8b |
Chinese mainland |
Limited-time free |
||
|
deepseek-r1-distill-llama-70b |
Chinese mainland |
Limited-time free |
||
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
deepseek-v4-pro context caching discount |
Global |
$1.65 |
$3.3 |
|
deepseek-v4-flash context caching discount |
Global |
$0.14 |
$0.28 |
US (Virginia)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
deepseek-v4-pro context caching discount |
Global |
$1.65 |
$3.3 |
|
deepseek-v4-flash context caching discount |
Global |
$0.14 |
$0.28 |
Japan (Tokyo)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
deepseek-v4-pro context caching discount |
Global |
$1.65 |
$3.3 |
|
deepseek-v4-flash context caching discount |
Global |
$0.14 |
$0.28 |
|
deepseek-v4-pro context caching discount |
Japan |
$2.400 |
$4.800 |
|
deepseek-v4-flash context caching discount |
Japan |
$0.200 |
$0.400 |
Kimi
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
kimi-k2.7-code |
Chinese mainland |
$0.894 |
$3.713 |
$3.7131 |
|
kimi-k2.6 |
Chinese mainland |
$0.8939 |
$3.7131 |
No free quota |
|
kimi-k2.5 |
Chinese mainland |
$0.574 |
$3.011 |
No free quota |
|
kimi-k2-thinking |
Chinese mainland |
$0.574 |
$2.294 |
No free quota |
|
Moonshot-Kimi-K2-Instruct |
Chinese mainland |
$0.574 |
$2.294 |
No free quota |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
kimi-k2.7-code |
Global |
$0.894 |
$3.713 |
|
kimi-k2.5 |
Global |
$0.574 |
$3.011 |
US (Virginia)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
kimi-k2.7-code |
Global |
$0.894 |
$3.713 |
|
kimi-k2.5 |
Global |
$0.574 |
$3.011 |
Japan (Tokyo)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
kimi-k2.5 context caching discount |
Global |
$0.574 |
$3.011 |
China (Hong Kong)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
kimi-k2.7-code |
Global |
$0.894 |
$3.713 |
MiniMax
You are charged for input tokens and output tokens.
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
|
MiniMax-M2.5 |
Chinese mainland |
Thinking mode only |
$0.304 |
$1.213 |
GLM
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
glm-5.1 |
International |
Non-Thinking and Thinking modes |
0<Token≤200K |
$1.4 |
$4.4 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
|
glm-5.2 |
Chinese mainland |
Non-Thinking and Thinking modes |
flat-rate pricing |
$1.100 |
$3.851 |
|
glm-5.1 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.825 |
$3.301 |
|
32K<Token≤200K |
$1.100 |
$3.851 |
|||
|
glm-5 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.573 |
$2.58 |
|
32K<Token≤166K |
$0.86 |
$3.154 |
|||
|
glm-4.7 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.431 |
$2.007 |
|
32K<Token≤166K |
$0.574 |
$2.294 |
|||
|
glm-4.6 |
Chinese mainland |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.431 |
$2.007 |
|
32K<Token≤166K |
$0.574 |
$2.294 |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
|
glm-5.2 |
Global |
Non-Thinking and Thinking modes |
flat-rate pricing |
$1.100 |
$3.851 |
|
glm-5.1 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.825 |
$3.301 |
|
32K<Token≤200K |
$1.100 |
$3.851 |
US (Virginia)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
|
glm-5.2 |
Global |
Non-Thinking and Thinking modes |
flat-rate pricing |
$1.100 |
$3.851 |
|
glm-5.1 |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.825 |
$3.301 |
|
32K<Token≤200K |
$1.100 |
$3.851 |
Japan (Tokyo)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought + answer |
|
glm-5.1 context caching discount |
Global |
Non-Thinking and Thinking modes |
0<Token≤32K |
$0.825 |
$3.301 |
|
32K<Token≤200K |
$1.100 |
$3.851 |
China (Hong Kong)
|
Model ID |
Deployment scope |
Mode |
Input tokens per request |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) Chain of thought and answer |
|
glm-5.2 |
Global |
Non-Thinking and Thinking modes |
flat-rate pricing |
$1.100 |
$3.851 |
Image generation
You are not charged for input. You are charged for output based on the number of successfully generated images.
Formula: Cost = Image unit price × Number of images generated.
Notes:
- Cost does not depend on image resolution or aspect ratio.
- Failed requests incur no cost and do not consume your free quota.
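The formula and notes above can be sketched as follows. The example uses the $0.075/image price listed for qwen-image-2.0-pro in the International scope; the helper name `image_generation_cost` and the list of success flags are hypothetical and only stand in for however your application tracks request outcomes.

```python
from decimal import Decimal
from typing import Iterable


def image_generation_cost(unit_price: Decimal, results: Iterable[bool]) -> Decimal:
    """Cost = image unit price x number of successfully generated images.

    `results` holds one flag per requested image: True if it was generated,
    False if the request failed. Failed requests are not billed.
    """
    succeeded = sum(1 for ok in results if ok)
    return unit_price * succeeded


# Four images requested, one failed: three images are billed.
print(image_generation_cost(Decimal("0.075"), [True, True, False, True]))
```

Three successful images at $0.075 each come to $0.225. The sketch does not model the free quota.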
Qwen Text-to-Image
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-image-2.0-pro |
International |
$0.075/image |
100 images |
|
qwen-image-2.0-pro-2026-04-22 |
International |
$0.075/image |
100 images |
|
qwen-image-2.0-pro-2026-03-03 |
International |
$0.075/image |
100 images |
|
qwen-image-2.0 |
International |
$0.035/image |
100 images |
|
qwen-image-2.0-2026-03-03 |
International |
$0.035/image |
100 images |
|
qwen-image-max Currently equivalent to qwen-image-max-2025-12-30 |
International |
$0.075/image |
100 images |
|
qwen-image-max-2025-12-30 |
International |
$0.075/image |
100 images |
|
qwen-image-plus Currently equivalent to qwen-image |
International |
$0.03/image |
100 images |
|
qwen-image-plus-2026-01-09 |
International |
$0.03/image |
100 images |
|
qwen-image |
International |
$0.035/image |
100 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
qwen-image-2.0-pro |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0-pro-2026-04-22 |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0-pro-2026-03-03 |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0 |
Chinese mainland |
$0.028671/image |
|
qwen-image-2.0-2026-03-03 |
Chinese mainland |
$0.028671/image |
|
qwen-image-max Currently equivalent to qwen-image-max-2025-12-30 |
Chinese mainland |
$0.071677/image |
|
qwen-image-max-2025-12-30 |
Chinese mainland |
$0.071677/image |
|
qwen-image-plus Currently equivalent to qwen-image |
Chinese mainland |
$0.028671/image |
|
qwen-image-plus-2026-01-09 |
Chinese mainland |
$0.028671/image |
|
qwen-image |
Chinese mainland |
$0.035/image |
Qwen Image Editing
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-image-2.0-pro |
International |
$0.075/image |
100 images |
|
qwen-image-2.0-pro-2026-04-22 |
International |
$0.075/image |
100 images |
|
qwen-image-2.0-pro-2026-03-03 |
International |
$0.075/image |
100 images |
|
qwen-image-2.0 |
International |
$0.035/image |
100 images |
|
qwen-image-2.0-2026-03-03 |
International |
$0.035/image |
100 images |
|
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16 |
International |
$0.075/image |
100 images |
|
qwen-image-edit-max-2026-01-16 |
International |
$0.075/image |
100 images |
|
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30 |
International |
$0.03/image |
100 images |
|
qwen-image-edit-plus-2025-12-15 |
International |
$0.03/image |
100 images |
|
qwen-image-edit-plus-2025-10-30 |
International |
$0.03/image |
100 images |
|
qwen-image-edit |
International |
$0.045/image |
100 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
qwen-image-2.0-pro |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0-pro-2026-04-22 |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0-pro-2026-03-03 |
Chinese mainland |
$0.071676/image |
|
qwen-image-2.0 |
Chinese mainland |
$0.028671/image |
|
qwen-image-2.0-2026-03-03 |
Chinese mainland |
$0.028671/image |
|
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16 |
Chinese mainland |
$0.071677/image |
|
qwen-image-edit-max-2026-01-16 |
Chinese mainland |
$0.071677/image |
|
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30 |
Chinese mainland |
$0.028671/image |
|
qwen-image-edit-plus-2025-12-15 |
Chinese mainland |
$0.028671/image |
|
qwen-image-edit-plus-2025-10-30 |
Chinese mainland |
$0.028671/image |
|
qwen-image-edit |
Chinese mainland |
$0.043/image |
Qwen Image Translation
Only output is billed. For pricing rules, see Image generation.
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) |
|
qwen-mt-image |
Chinese mainland |
$0.000431/image |
No free quota |
Qwen-Text-to-Image-Z-Image
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
z-image-turbo |
International |
Prompt rewriting disabled ( Prompt rewriting enabled ( |
100 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
z-image-turbo |
Chinese mainland |
Prompt rewriting disabled ( Prompt rewriting enabled ( |
Wanx Text-to-Image
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.6-t2i |
International |
$0.03/image |
50 images |
|
wan2.5-t2i-preview |
International |
$0.03/image |
50 images |
|
wan2.2-t2i-plus |
International |
$0.05/image |
100 images |
|
wan2.2-t2i-flash |
International |
$0.025/image |
100 images |
|
wan2.1-t2i-plus |
International |
$0.05/image |
200 images |
|
wan2.1-t2i-turbo |
International |
$0.025/image |
200 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
wan2.6-t2i |
Chinese mainland |
$0.028671/image |
|
wan2.5-t2i-preview |
Chinese mainland |
$0.028671/image |
|
wan2.2-t2i-plus |
Chinese mainland |
$0.020070/image |
|
wan2.2-t2i-flash |
Chinese mainland |
$0.028671/image |
|
wanx2.1-t2i-plus |
Chinese mainland |
$0.028671/image |
|
wanx2.1-t2i-turbo |
Chinese mainland |
$0.020070/image |
|
wanx2.0-t2i-turbo |
Chinese mainland |
$0.005735/image |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output price |
|
wan2.6-t2i |
Global |
$0.028671/image |
US (Virginia)
|
Model ID |
Deployment scope |
Output price |
|
wan2.6-t2i |
Global |
$0.028671/image |
Wanx Image Generation and Editing
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.7-image-pro |
International |
$0.075/image |
50 images |
|
wan2.7-image |
International |
$0.03/image |
50 images |
|
wan2.6-image |
International |
$0.03/image |
50 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
wan2.7-image-pro |
Chinese mainland |
$0.068761/image |
|
wan2.7-image |
Chinese mainland |
$0.028671/image |
|
wan2.6-image |
Chinese mainland |
$0.028671/image |
US (Virginia)
|
Model ID |
Deployment scope |
Output price |
|
wan2.6-image |
Global |
$0.028671/image |
Wanx General Image Editing
Only output is billed. For pricing rules, see Image generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.5-i2i-preview |
International |
$0.03/image |
50 images |
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
|
wan2.5-i2i-preview |
Chinese mainland |
$0.028671/image |
|
wanx2.1-imageedit |
Chinese mainland |
$0.020070/image |
AI Virtual Try-on - OutfitAnyone
- aitryon-plus: Input is free while output is billed. For pricing rules, see Image generation.
- aitryon-parsing-v1: Input is billed while output is free. Billed by the number of input images. Failed requests are not billed.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
aitryon-plus |
Chinese mainland |
$0.071677/image |
No free quota |
|
aitryon-parsing-v1 |
Chinese mainland |
$0.000574/image |
Video generation
You are not charged for input. You are charged for output based on the total duration of successfully generated videos (in seconds).
Formula: Cost = Video unit price × Video duration (seconds).
Notes:
- Some models charge by output video resolution. Prices differ for resolutions such as 480P, 720P, and 1080P.
- Some models charge by output video edition. Prices differ for editions such as Standard Edition and Professional Edition.
- Some models charge by output video aspect ratio. Prices differ for aspect ratios such as 1:1 and 3:4.
- Some models use a flat rate, regardless of resolution, edition, or aspect ratio.
- Failed requests incur no cost and do not consume your free quota.
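As a minimal sketch of the formula above for a model priced by output resolution, the example below uses the happyhorse-1.1-t2v prices listed for the International scope. The helper name `video_cost` and the price table layout are hypothetical; models priced by edition or aspect ratio would need a different lookup key.

```python
from decimal import Decimal

# USD per second of generated video, copied from the happyhorse-1.1-t2v rows
# for the International scope.
PRICE_PER_SECOND = {
    "720P": Decimal("0.14"),
    "1080P": Decimal("0.18"),
}


def video_cost(resolution: str, duration_seconds: int) -> Decimal:
    """Cost = video unit price x video duration (seconds)."""
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"no price listed for resolution {resolution!r}")
    if duration_seconds < 0:
        raise ValueError("duration_seconds must not be negative")
    return PRICE_PER_SECOND[resolution] * duration_seconds


print(video_cost("720P", 5))
print(video_cost("1080P", 10))
```

A 5-second 720P video comes to $0.70 and a 10-second 1080P video to $1.80. Only successfully generated videos should be passed in, because failed requests are not billed.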
HappyHorse-Text-to-video
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
happyhorse-1.1-t2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.18/second |
|||
|
happyhorse-1.0-t2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.24/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-t2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-t2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-t2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-t2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-t2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-t2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
HappyHorse-Image-to-video - first frame
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
happyhorse-1.1-i2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.18/second |
|||
|
happyhorse-1.0-i2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.24/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-i2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-i2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-i2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-i2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-i2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-i2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
HappyHorse-Reference-to-video
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
happyhorse-1.1-r2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.18/second |
|||
|
happyhorse-1.0-r2v |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.24/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-r2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-r2v |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-r2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-r2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
happyhorse-1.1-r2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.165026/second |
||
|
happyhorse-1.0-r2v |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
HappyHorse-Video editing
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
happyhorse-1.0-video-edit |
International |
720P |
$0.14/second |
10 seconds |
|
1080P |
$0.24/second |
China (Beijing)
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
|
happyhorse-1.0-video-edit |
Chinese mainland |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
Germany (Frankfurt)
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
|
happyhorse-1.0-video-edit |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
US (Virginia)
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
|
happyhorse-1.0-video-edit |
Global |
720P |
$0.123769/second |
|
1080P |
$0.220034/second |
Wanx-Text-to-Video
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.7-t2v-2026-04-25 |
International |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
|||
|
wan2.7-t2v |
International |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
|||
|
wan2.6-t2v |
International |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
|||
|
wan2.5-t2v-preview |
International |
480P |
$0.05/second |
50 seconds |
|
720P |
$0.10/second |
|||
|
1080P |
$0.15/second |
|||
|
wan2.2-t2v-plus |
International |
480P |
$0.02/second |
50 seconds |
|
1080P |
$0.10/second |
|||
|
wan2.1-t2v-turbo |
International |
480P |
$0.036/second |
50 seconds |
|
720P |
$0.036/second |
|||
|
wan2.1-t2v-plus |
International |
720P |
$0.10/second |
50 seconds |
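As a rough illustration of output-only billing, the sketch below multiplies the output duration by the per-second price for the chosen resolution. The `PRICES` dictionary and the `t2v_cost` helper are hypothetical names, not part of any SDK. The prices are copied from the Singapore table above for `wan2.5-t2v-preview`, and the free quota is ignored.

```python
from decimal import Decimal

# Per-second output prices copied from the Singapore table (wan2.5-t2v-preview).
PRICES = {
    ("wan2.5-t2v-preview", "480P"): Decimal("0.05"),
    ("wan2.5-t2v-preview", "720P"): Decimal("0.10"),
    ("wan2.5-t2v-preview", "1080P"): Decimal("0.15"),
}

def t2v_cost(model: str, resolution: str, output_seconds: int) -> Decimal:
    """Estimate the charge for one successfully generated video (output only)."""
    return PRICES[(model, resolution)] * output_seconds

# A 5-second 1080P clip.
print(t2v_cost("wan2.5-t2v-preview", "1080P", 5))  # 0.75
```

`Decimal` is used instead of `float` so that small per-second prices add up without binary rounding noise.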
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
wan2.7-t2v-2026-04-25 |
Chinese mainland |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
||
|
wan2.7-t2v |
Chinese mainland |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
||
|
wan2.6-t2v |
Chinese mainland |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
||
|
wan2.5-t2v-preview |
Chinese mainland |
480P |
$0.043006/second |
|
720P |
$0.086012/second |
||
|
1080P |
$0.143353/second |
||
|
wan2.2-t2v-plus |
Chinese mainland |
480P |
$0.02007/second |
|
1080P |
$0.100347/second |
||
|
wanx2.1-t2v-turbo |
Chinese mainland |
480P |
$0.034405/second |
|
720P |
$0.034405/second |
||
|
wanx2.1-t2v-plus |
Chinese mainland |
720P |
$0.100347/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
wan2.6-t2v |
Global |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
wan2.6-t2v |
Global |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
||
|
wan2.6-t2v-us |
US |
720P |
$0.1/second |
|
1080P |
$0.15/second |
Wanx-Image-to-Video
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.7-i2v-2026-04-25 |
International |
Audio video |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
||||
|
wan2.7-i2v |
International |
Audio video |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
|
wan2.7-i2v-2026-04-25 |
Chinese mainland |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
|||
|
wan2.7-i2v |
Chinese mainland |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
Wanx-Image-to-Video-First-Frame
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.6-i2v-flash |
International |
Audio video
|
720P |
$0.05/second |
50 seconds |
|
1080P |
$0.075/second |
||||
|
Silent video
|
720P |
$0.025/second |
|||
|
1080P |
$0.0375/second |
||||
|
wan2.6-i2v |
International |
Audio video |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
||||
|
wan2.5-i2v-preview |
International |
Audio video |
480P |
$0.05/second |
50 seconds |
|
720P |
$0.10/second |
||||
|
1080P |
$0.15/second |
||||
|
wan2.2-i2v-flash |
International |
Silent video |
480P |
$0.015/second |
50 seconds |
|
720P |
$0.036/second |
||||
|
wan2.2-i2v-plus |
International |
Silent video |
480P |
$0.02/second |
50 seconds |
|
1080P |
$0.10/second |
||||
|
wan2.1-t2v-turbo |
International |
Silent video |
480P |
$0.036/second |
50 seconds |
|
720P |
$0.036/second |
||||
|
wan2.1-t2v-plus |
International |
Silent video |
720P |
$0.10/second |
50 seconds |
China (Beijing)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
|
wan2.6-i2v-flash |
Chinese mainland |
Audio video
|
720P |
$0.043006/second |
|
1080P |
$0.071676/second |
|||
|
Silent video
|
720P |
$0.021503/second |
||
|
1080P |
$0.035838/second |
|||
|
wan2.6-i2v |
Chinese mainland |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
|||
|
wan2.5-i2v-preview |
Chinese mainland |
Audio video |
480P |
$0.043006/second |
|
720P |
$0.086012/second |
|||
|
1080P |
$0.143353/second |
|||
|
wan2.2-i2v-plus |
Chinese mainland |
Silent video |
480P |
$0.02007/second |
|
1080P |
$0.100347/second |
|||
|
wanx2.1-t2v-turbo |
Chinese mainland |
Silent video |
480P |
$0.034405/second |
|
720P |
$0.034405/second |
|||
|
wanx2.1-t2v-plus |
Chinese mainland |
Silent video |
720P |
$0.100347/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
|
wan2.6-i2v |
Global |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Output price |
|
wan2.6-i2v |
Global |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
|||
|
wan2.6-i2v-us |
US |
Audio video |
720P |
$0.1/second |
|
1080P |
$0.15/second |
Wanx-Image-to-Video-First-Last-Frame
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.2-kf2v-flash |
International |
480P |
$0.015/second |
50 seconds |
|
720P |
$0.036/second |
|||
|
1080P |
$0.07/second |
|||
|
wan2.1-kf2v-plus |
International |
720P |
$0.10/second |
50 seconds |
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
wan2.2-kf2v-flash |
Chinese mainland |
480P |
$0.014335/second |
|
720P |
$0.028671/second |
||
|
1080P |
$0.068809/second |
||
|
wanx2.1-kf2v-plus |
Chinese mainland |
720P |
$0.100347/second |
Wanx-Reference-to-Video
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
Billing formula: billable duration = input video duration (up to 5 seconds) + output video duration.
- The billable duration of the input video does not exceed 5 seconds. For calculation rules, see Billing and rate limiting.
- The billable duration of the output video is the duration (in seconds) of successfully generated videos.
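The billing formula above can be sketched as follows. This is a minimal illustration, assuming the input video duration has already been determined according to the rules in Billing and rate limiting. The `r2v_cost` helper is a hypothetical name, and the example price is the 720P rate for `wan2.7-r2v` in Singapore.

```python
from decimal import Decimal

# Cap on the billable input duration, as stated in the billing formula.
MAX_BILLABLE_INPUT_SECONDS = Decimal("5")

def r2v_cost(input_seconds, output_seconds, price_per_second) -> Decimal:
    """Estimate the charge for one successful reference-to-video request."""
    billable_input = min(Decimal(str(input_seconds)), MAX_BILLABLE_INPUT_SECONDS)
    billable_total = billable_input + Decimal(str(output_seconds))
    return billable_total * Decimal(str(price_per_second))

# 8-second reference video, 10-second output, $0.10/second:
# billable duration = min(8, 5) + 10 = 15 seconds.
print(r2v_cost(8, 10, "0.10"))  # 1.50
```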
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Input and output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.7-r2v |
International |
Audio video |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
||||
|
wan2.6-r2v-flash |
International |
Audio video
|
720P |
$0.05/second |
50 seconds |
|
1080P |
$0.075/second |
||||
|
Silent video
|
720P |
$0.025/second |
|||
|
1080P |
$0.0375/second |
||||
|
wan2.6-r2v |
International |
Audio video |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Input and output price |
|
wan2.7-r2v |
Chinese mainland |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
|||
|
wan2.6-r2v-flash |
Chinese mainland |
Audio video
|
720P |
$0.043006/second |
|
1080P |
$0.071676/second |
|||
|
Silent video
|
720P |
$0.021503/second |
||
|
1080P |
$0.035838/second |
|||
|
wan2.6-r2v |
Chinese mainland |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
Germany (Frankfurt)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Input and output price |
|
wan2.6-r2v |
Global |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
US (Virginia)
|
Model ID |
Deployment scope |
Output video type |
Output video resolution |
Input and output price |
|
wan2.6-r2v |
Global |
Audio video |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
Wanx-Video-Editing
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.7-videoedit |
International |
720P |
$0.10/second |
50 seconds |
|
1080P |
$0.15/second |
Pricing rule: input is free. Output video is billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.1-vace-plus |
International |
720P |
$0.10/second |
50 seconds |
China (Beijing)
Pricing rule: both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Input and output price |
|
wan2.7-videoedit |
Chinese mainland |
720P |
$0.086012/second |
|
1080P |
$0.143353/second |
Pricing rule: input is free. Output video is billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
|
wanx2.1-vace-plus |
Chinese mainland |
720P |
$0.100347/second |
Wanx-Digital Human
- wan2.2-s2v-detect: Input is billed and output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
- wan2.2-s2v: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
wan2.2-s2v-detect |
Chinese mainland |
Input image: $0.000574/image |
No free quota |
|
wan2.2-s2v |
Chinese mainland |
Output video:
|
No free quota |
Wanx-Image-to-Motion
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video mode |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.2-animate-move |
International |
Standard mode |
$0.12/second |
50 seconds |
|
Professional mode |
$0.18/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video mode |
Output price |
|
wan2.2-animate-move |
Chinese mainland |
Standard mode |
$0.06/second |
|
Professional mode |
$0.09/second |
Wanx-Video-Face-Swap
Only output is billed. For pricing rules, see Video generation.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Output video mode |
Output price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
wan2.2-animate-mix |
International |
Standard mode |
$0.18/second |
50 seconds |
|
Professional mode |
$0.26/second |
China (Beijing)
|
Model ID |
Deployment scope |
Output video mode |
Output price |
|
wan2.2-animate-mix |
Chinese mainland |
Standard mode |
$0.09/second |
|
Professional mode |
$0.13/second |
AnimateAnyone
- animate-anyone-detect-gen2: Input is billed and output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
- animate-anyone-template-gen2: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
- animate-anyone-gen2: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
animate-anyone-detect-gen2 |
Chinese mainland |
Input image: $0.000574/image |
No free quota |
|
animate-anyone-template-gen2 |
Chinese mainland |
Output video: $0.011469/second |
No free quota |
|
animate-anyone-gen2 |
Chinese mainland |
Output video: $0.011469/second |
No free quota |
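The split between image-based and duration-based billing can be sketched as below. This is only an illustration, assuming a workflow that calls the detection model and then the generation model. The `animate_anyone_cost` helper is a hypothetical name, and the unit prices are copied from the China (Beijing) table above.

```python
from decimal import Decimal

DETECT_PRICE_PER_IMAGE = Decimal("0.000574")  # animate-anyone-detect-gen2, per input image
VIDEO_PRICE_PER_SECOND = Decimal("0.011469")  # animate-anyone-gen2, per output second

def animate_anyone_cost(images_detected: int, output_seconds: int) -> Decimal:
    """Estimate the combined charge for detection calls plus video generation."""
    detect = DETECT_PRICE_PER_IMAGE * images_detected    # billed per successful image
    generate = VIDEO_PRICE_PER_SECOND * output_seconds   # billed per generated second
    return detect + generate

# One detected image followed by a 10-second generated video.
print(animate_anyone_cost(1, 10))  # 0.115264
```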
EMO
- emo-detect-v1: Input is billed and output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
- emo-v1: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
emo-detect-v1 |
Chinese mainland |
Input image: $0.000574/image |
No free quota |
|
emo-v1 |
Chinese mainland |
Output video:
|
LivePortrait
- liveportrait-detect: Input is billed and output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
- liveportrait: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
liveportrait-detect |
Chinese mainland |
Input image: $0.000574/image |
No free quota |
|
liveportrait |
Chinese mainland |
Output video: $0.002868/second |
Emoji Sticker
- emoji-detect-v1: Input is billed and output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
- emoji-v1: Input is free and output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Unit price |
Free quota(Note) |
|
emoji-detect-v1 |
Chinese mainland |
Input image: $0.000574/image |
No free quota |
|
emoji-v1 |
Chinese mainland |
Output video: $0.011469/second |
VideoRetalk
Only output is billed. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Output price |
Free quota(Note) |
|
videoretalk |
Chinese mainland |
$0.011469/second |
No free quota |
Video Style Repaint
Only output is billed. For pricing rules, see Video generation.
China (Beijing)
|
Model ID |
Deployment scope |
Output video resolution |
Output price |
Free quota(Note) |
|
video-style-transform |
Chinese mainland |
540P |
$0.028671/second |
No free quota |
|
720P |
$0.071677/second |
Music generation
Pricing rule: billed by the duration (in seconds) of output audio. Input is free.
China (Beijing)
|
Model ID |
Deployment scope |
Output price (per second) |
Free quota(Note) |
|
fun-music-preview |
Chinese mainland |
$0.000695 |
No free quota |
|
fun-music-v1 |
Chinese mainland |
$0.000275 |
Speech synthesis (text-to-speech)
Qwen-TTS
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
Qwen3-TTS-Instruct-Flash
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-instruct-flash |
International |
$0.115 |
110,000 characters |
|
qwen3-tts-instruct-flash-2026-01-26 |
International |
$0.115 |
110,000 characters |
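Because the price is quoted per 10,000 characters, a per-request estimate needs a unit conversion. The sketch below shows one way to do it. The `tts_cost` helper is a hypothetical name, and it takes a character count as input because the exact character-counting rules are not stated here. The free quota is ignored.

```python
from decimal import Decimal

CHARS_PER_PRICING_UNIT = Decimal(10_000)  # prices are listed per 10,000 characters

def tts_cost(input_characters: int, price_per_10k: str) -> Decimal:
    """Estimate the charge for synthesizing the given number of input characters."""
    return Decimal(input_characters) / CHARS_PER_PRICING_UNIT * Decimal(price_per_10k)

# 25,000 characters at $0.115 per 10,000 characters (qwen3-tts-instruct-flash).
print(tts_cost(25_000, "0.115"))  # 0.2875
```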
Qwen3-TTS-VD
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-vd-2026-01-26 |
International |
$0.115 |
110,000 characters |
Qwen3-TTS-VC
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-vc-2026-01-22 |
International |
$0.115 |
110,000 characters |
Qwen3-TTS-Flash
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-flash Currently equivalent to qwen3-tts-flash-2025-11-27 |
International |
$0.1 |
110,000 characters |
|
qwen3-tts-flash-2025-11-27 |
International |
$0.1 |
110,000 characters |
|
qwen3-tts-flash-2025-09-18 |
International |
$0.1 |
2025 (after November 13, 0:00 UTC+8): 10,000 characters |
China (Beijing)
Qwen3-TTS-Instruct-Flash
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price (per 10,000 characters) |
|
qwen3-tts-instruct-flash |
Chinese mainland |
$0.115 |
Free |
|
qwen3-tts-instruct-flash-2026-01-26 |
Chinese mainland |
$0.115 |
Free |
Qwen3-TTS-VD
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price (per 10,000 characters) |
|
qwen3-tts-vd-2026-01-26 |
Chinese mainland |
$0.115 |
Free |
Qwen3-TTS-VC
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price (per 10,000 characters) |
|
qwen3-tts-vc-2026-01-22 |
Chinese mainland |
$0.115 |
Free |
Qwen3-TTS-Flash
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price (per 10,000 characters) |
|
qwen3-tts-flash Currently equivalent to qwen3-tts-flash-2025-11-27 |
Chinese mainland |
$0.114682 |
Free |
|
qwen3-tts-flash-2025-11-27 |
Chinese mainland |
$0.114682 |
Free |
|
qwen3-tts-flash-2025-09-18 |
Chinese mainland |
$0.114682 |
Free |
Qwen-TTS
Pricing rule: billed by input tokens and output tokens.
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-tts-flash |
Chinese mainland |
$0.23 |
$1.434 |
|
qwen-tts-latest |
Chinese mainland |
$0.23 |
$1.434 |
|
qwen-tts-2025-05-22 |
Chinese mainland |
$0.23 |
$1.434 |
|
qwen-tts-2025-04-10 |
Chinese mainland |
$0.23 |
$1.434 |
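For the token-billed Qwen-TTS models, input and output tokens are priced separately. The following is a minimal sketch of that calculation, using the prices from the table above. The `qwen_tts_cost` helper is a hypothetical name, and the token counts are assumed to come from the usage information of a completed request.

```python
from decimal import Decimal

PER_MILLION = Decimal(1_000_000)       # prices are listed per 1 million tokens
INPUT_PRICE = Decimal("0.23")          # qwen-tts-flash, Chinese mainland
OUTPUT_PRICE = Decimal("1.434")        # qwen-tts-flash, Chinese mainland

def qwen_tts_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the charge for one request billed by input and output tokens."""
    input_cost = INPUT_PRICE * input_tokens / PER_MILLION
    output_cost = OUTPUT_PRICE * output_tokens / PER_MILLION
    return input_cost + output_cost

# 2,000 input tokens and 50,000 output tokens.
estimate = qwen_tts_cost(2_000, 50_000)
print(f"${estimate}")
```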
Qwen-TTS-Realtime
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
Qwen3-TTS-Instruct-Flash-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-instruct-flash-realtime |
International |
$0.143 |
110,000 characters |
|
qwen3-tts-instruct-flash-realtime-2026-01-22 |
International |
$0.143 |
110,000 characters |
Qwen3-TTS-VD-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-vd-realtime-2026-01-15 |
International |
$0.143353 |
110,000 characters |
|
qwen3-tts-vd-realtime-2025-12-16 |
International |
$0.143353 |
110,000 characters |
Qwen3-TTS-VC-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-vc-realtime-2026-01-15 |
International |
$0.13 |
110,000 characters |
|
qwen3-tts-vc-realtime-2025-11-27 |
International |
110,000 characters |
Qwen3-TTS-Flash-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-tts-flash-realtime |
International |
$0.13 |
2025 (after November 13, 0:00 UTC+8): 10,000 characters |
|
qwen3-tts-flash-realtime-2025-11-27 |
International |
$0.13 |
110,000 characters |
|
qwen3-tts-flash-realtime-2025-09-18 |
International |
$0.13 |
2025 (after November 13, 0:00 UTC+8): 10,000 characters |
China (Beijing)
Qwen3-TTS-Instruct-Flash-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price |
|
qwen3-tts-instruct-flash-realtime |
Chinese mainland |
$0.143 |
Free |
|
qwen3-tts-instruct-flash-realtime-2026-01-22 |
Chinese mainland |
$0.143 |
Free |
Qwen3-TTS-VD-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price |
|
qwen3-tts-vd-realtime-2026-01-15 |
Chinese mainland |
$0.143353 |
Free |
|
qwen3-tts-vd-realtime-2025-12-16 |
Chinese mainland |
$0.143353 |
Free |
Qwen3-TTS-VC-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price |
|
qwen3-tts-vc-realtime-2026-01-15 |
Chinese mainland |
$0.143353 |
Free |
|
qwen3-tts-vc-realtime-2025-11-27 |
Chinese mainland |
Qwen3-TTS-Flash-Realtime
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Output price |
|
qwen3-tts-flash-realtime |
Chinese mainland |
$0.143353 |
Free |
|
qwen3-tts-flash-realtime-2025-11-27 |
Chinese mainland |
$0.143353 |
Free |
|
qwen3-tts-flash-realtime-2025-09-18 |
Chinese mainland |
$0.143353 |
Free |
Qwen-TTS-Realtime
Pricing rule: billed by input tokens and output tokens.
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
|
qwen-tts-realtime |
Chinese mainland |
$0.345 |
$1.721 |
|
qwen-tts-realtime-latest |
Chinese mainland |
$0.345 |
$1.721 |
|
qwen-tts-realtime-2025-07-15 |
Chinese mainland |
$0.345 |
$1.721 |
Qwen-TTS Voice cloning
Pricing rule: billed by the number of new voice clones created.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Price (per voice clone) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-voice-enrollment |
International |
$0.01 |
1,000 voices/account |
China (Beijing)
|
Model ID |
Deployment scope |
Price (per voice clone) |
|
qwen-voice-enrollment |
Chinese mainland |
$0.01 |
Qwen-TTS Voice design
Pricing rule: billed by the number of new voice clones created.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Price (per voice clone) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen-voice-design |
International |
$0.2 |
10 voices/account |
China (Beijing)
|
Model ID |
Deployment scope |
Price (per voice clone) |
|
qwen-voice-design |
Chinese mainland |
$0.2 |
CosyVoice
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
cosyvoice-v3-plus |
International |
$0.26 |
110,000 characters |
|
cosyvoice-v3-flash |
International |
$0.13 |
1 million tokens |
China (Beijing)
Pricing rule: billed by the number of input text characters. Output is free.
|
Model ID |
Deployment scope |
Input price (per 10,000 characters) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
cosyvoice-v3.5-plus |
Chinese mainland |
$0.22 |
No free quota |
|
cosyvoice-v3.5-flash |
Chinese mainland |
$0.116 |
No free quota |
|
cosyvoice-v3-plus |
Chinese mainland |
$0.286706 |
No free quota |
|
cosyvoice-v3-flash |
Chinese mainland |
$0.14335 |
No free quota |
|
cosyvoice-v2 |
Chinese mainland |
$0.286706 |
No free quota |
Speech recognition (speech-to-text) and translation (speech-to-text in a specified language)
Qwen-LiveTranslate-Flash-Realtime
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
||
|
Input: audio |
Input: image |
Output: text |
Output: audio |
|||
|
qwen3.5-livetranslate-flash-realtime |
International |
$7.5 |
$0.55 |
$20 |
$30 |
1 million tokens |
|
qwen3.5-livetranslate-flash-realtime-2026-05-19 |
International |
$7.5 |
$0.55 |
$20 |
$30 |
1 million tokens |
|
qwen3-livetranslate-flash-realtime |
International |
$10 |
$1.3 |
$10 |
$38 |
1 million tokens |
|
qwen3-livetranslate-flash-realtime-2025-09-22 |
International |
$10 |
$1.3 |
$10 |
$38 |
1 million tokens |
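Since each modality has its own price, a request estimate is the sum of the per-modality charges. The sketch below illustrates this with the Singapore prices for `qwen3.5-livetranslate-flash-realtime`. The `PRICES` keys and the `livetranslate_cost` helper are hypothetical names, and the token counts per modality are assumed to be already known (see Billing for how tokens are counted).

```python
from decimal import Decimal

PER_MILLION = Decimal(1_000_000)  # prices are listed per 1 million tokens

# Singapore prices for qwen3.5-livetranslate-flash-realtime, keyed by modality.
PRICES = {
    "input_audio": Decimal("7.5"),
    "input_image": Decimal("0.55"),
    "output_text": Decimal("20"),
    "output_audio": Decimal("30"),
}

def livetranslate_cost(tokens: dict[str, int]) -> Decimal:
    """Sum the charge over every modality present in the usage dictionary."""
    return sum(
        (PRICES[kind] * count / PER_MILLION for kind, count in tokens.items()),
        Decimal(0),
    )

usage = {"input_audio": 100_000, "output_text": 5_000, "output_audio": 80_000}
print(livetranslate_cost(usage))
```

With these counts the audio output dominates the total (0.75 + 0.1 + 2.4 = 3.25 USD), which is typical when most of the billed tokens are generated audio.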
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
||
|
Input: audio |
Input: image |
Output: text |
Output: audio |
||
|
qwen3.5-livetranslate-flash-realtime |
Chinese mainland |
$5.501 |
$0.454 |
$13.752 |
$22.003 |
|
qwen3.5-livetranslate-flash-realtime-2026-05-19 |
Chinese mainland |
$5.501 |
$0.454 |
$13.752 |
$22.003 |
|
qwen3-livetranslate-flash-realtime |
Chinese mainland |
$9.175 |
$1.147 |
$9.175 |
$34.405 |
|
qwen3-livetranslate-flash-realtime-2025-09-22 |
Chinese mainland |
$9.175 |
$1.147 |
$9.175 |
$34.405 |
Qwen-LiveTranslate-Flash
Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
||
|
Input: audio |
Input: image |
Output: text |
Output: audio |
|||
|
qwen3-livetranslate-flash |
International |
$1.577 |
$0.631 |
$1.577 |
$6.308 |
1 million tokens |
|
qwen3-livetranslate-flash-2025-12-01 |
International |
$1.577 |
$0.631 |
$1.577 |
$6.308 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
||
|
Input: audio |
Input: image |
Output: text |
Output: audio |
||
|
qwen3-livetranslate-flash |
Chinese mainland |
$1.434 |
$0.573 |
$1.434 |
$5.734 |
|
qwen3-livetranslate-flash-2025-12-01 |
Chinese mainland |
$1.434 |
$0.573 |
$1.434 |
$5.734 |
Qwen-ASR
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-asr-flash-filetrans |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash-filetrans-2025-11-17 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash-2026-02-10 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash-2025-09-08 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
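To see how duration-based billing and the free quota might interact, consider the sketch below. It assumes that any remaining free quota is consumed before paid usage begins, which is an assumption for illustration rather than a rule stated in this section. The `asr_cost` helper is a hypothetical name.

```python
from decimal import Decimal

def asr_cost(audio_seconds: int, price_per_second: str, free_seconds_left: int = 0) -> Decimal:
    """Estimate the charge for input audio after subtracting any remaining free quota."""
    # Assumption: free quota is used up first; only the remainder is billed.
    billable_seconds = max(audio_seconds - free_seconds_left, 0)
    return Decimal(billable_seconds) * Decimal(price_per_second)

# 40,000 seconds of audio with the full 36,000-second quota still available:
# 4,000 billable seconds at $0.000035/second.
print(asr_cost(40_000, "0.000035", free_seconds_left=36_000))  # 0.140000
```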
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
|
qwen3-asr-flash-filetrans |
Chinese mainland |
$0.000032/second |
|
qwen3-asr-flash-filetrans-2025-11-17 |
Chinese mainland |
$0.000032/second |
|
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08 |
Chinese mainland |
$0.000032/second |
|
qwen3-asr-flash-2026-02-10 |
Chinese mainland |
$0.000032/second |
|
qwen3-asr-flash-2025-09-08 |
Chinese mainland |
$0.000032/second |
US (Virginia)
|
Model ID |
Deployment scope |
Input price |
|
qwen3-asr-flash-us |
US |
$0.000035/second |
|
qwen3-asr-flash-2025-09-08-us |
US |
$0.000035/second |
Qwen-ASR-Realtime
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
qwen3-asr-flash-realtime |
International |
$0.000090/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash-realtime-2026-02-10 |
International |
$0.000090/second |
36,000 seconds (10 hours) |
|
qwen3-asr-flash-realtime-2025-10-27 |
International |
$0.000090/second |
36,000 seconds (10 hours) |
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
|
qwen3-asr-flash-realtime |
Chinese mainland |
$0.000047/second |
|
qwen3-asr-flash-realtime-2026-02-10 |
Chinese mainland |
$0.000047/second |
|
qwen3-asr-flash-realtime-2025-10-27 |
Chinese mainland |
$0.000047/second |
Fun-ASR
Audio file recognition
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
fun-asr |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
fun-asr-2025-11-07 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
fun-asr-2025-08-25 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
fun-asr-mtl |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
fun-asr-mtl-2025-08-25 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
|
fun-asr-flash-2026-06-15 |
International |
$0.000035/second |
36,000 seconds (10 hours) |
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
|
fun-asr |
Chinese mainland |
$0.000032/second |
|
fun-asr-2025-11-07 |
Chinese mainland |
$0.000032/second |
|
fun-asr-2025-08-25 |
Chinese mainland |
$0.000032/second |
|
fun-asr-mtl |
Chinese mainland |
$0.000032/second |
|
fun-asr-mtl-2025-08-25 |
Chinese mainland |
$0.000032/second |
|
fun-asr-flash-2026-06-15 |
Chinese mainland |
$0.00003/second |
Real-time speech recognition
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
Singapore
|
Model ID |
Deployment scope |
Input price |
Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
|
fun-asr-realtime |
International |
$0.00009/second |
36,000 seconds (10 hours) |
|
fun-asr-realtime-2025-11-07 |
International |
$0.00009/second |
36,000 seconds (10 hours) |
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
|
fun-asr-realtime |
Chinese mainland |
$0.000047/second |
|
fun-asr-realtime-2026-02-28 |
Chinese mainland |
$0.000047/second |
|
fun-asr-realtime-2025-11-07 |
Chinese mainland |
$0.000047/second |
|
fun-asr-realtime-2025-09-15 |
Chinese mainland |
$0.000047/second |
|
fun-asr-mtl-realtime |
Chinese mainland |
$0.000047/second |
|
fun-asr-mtl-realtime-2025-12-10 |
Chinese mainland |
$0.000047/second |
|
fun-asr-flash-8k-realtime |
Chinese mainland |
$0.000032/second |
|
fun-asr-flash-8k-realtime-2026-01-28 |
Chinese mainland |
$0.000032/second |
Paraformer
Audio file recognition
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
|
paraformer-v2 |
Chinese mainland |
$0.000012/second |
|
paraformer-8k-v2 |
Chinese mainland |
$0.000012/second |
Real-time speech recognition
Pricing rule: billed by the duration (in seconds) of input audio. Output is free.
China (Beijing)
|
Model ID |
Deployment scope |
Input price |
Free quota |
|
paraformer-realtime-v2 |
Chinese mainland |
$0.000035/second |
No free quota |
|
paraformer-realtime-8k-v2 |
Chinese mainland |
$0.000035/second |
Text embedding
Pricing rule: billed by input tokens. Output is free.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
text-embedding-v4 |
International |
$0.07 |
1 million tokens |
|
text-embedding-v3 |
International |
$0.07 |
500,000 tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
|
text-embedding-v4 |
Chinese mainland |
$0.072 |
Hong Kong (China)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
|
text-embedding-v4 |
Hong Kong (China) |
$0.07 |
Multimodal embedding
Pricing rule: billed by input tokens. Output is free.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
tongyi-embedding-vision-plus |
International |
$0.09 |
1 million tokens |
|
tongyi-embedding-vision-flash |
International |
Image/video: $0.03 Text: $0.09 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen3-vl-embedding |
Chinese mainland |
Image/video: $0.258 Text: $0.1 |
1 million tokens |
|
multimodal-embedding-v1 |
Chinese mainland |
Free trial |
No token quota limit |
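Where a multimodal embedding model lists separate image/video and text prices, the charge is the sum of the per-modality charges. The sketch below illustrates that arithmetic with prices copied from the tables above. The names `PRICES` and `multimodal_embedding_cost` are hypothetical, and the sketch assumes you already know how many input tokens each modality contributed; how images and video are converted to tokens is not described here.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# USD per 1 million input tokens, keyed by model and modality.
PRICES = {
    # Singapore (International)
    "tongyi-embedding-vision-flash": {
        "image_video": Decimal("0.03"),
        "text": Decimal("0.09"),
    },
    # China (Beijing) (Chinese mainland)
    "qwen3-vl-embedding": {
        "image_video": Decimal("0.258"),
        "text": Decimal("0.1"),
    },
}


def multimodal_embedding_cost(model, tokens_by_modality):
    """Sum the per-modality charges for one model. Output is free."""
    unit_prices = PRICES[model]
    total = Decimal(0)
    for modality, tokens in tokens_by_modality.items():
        total += unit_prices[modality] * Decimal(tokens) / MILLION
    return total


usage = {"image_video": 2_000_000, "text": 500_000}
flash = multimodal_embedding_cost("tongyi-embedding-vision-flash", usage)
qwen3_vl = multimodal_embedding_cost("qwen3-vl-embedding", usage)

print(f"tongyi-embedding-vision-flash: ${flash:.3f}")  # 0.06 + 0.045 = 0.105
print(f"qwen3-vl-embedding:            ${qwen3_vl:.3f}")  # 0.516 + 0.05 = 0.566
```

Free quotas are deliberately left out of this sketch, because the page does not say how a shared token quota is split across modalities.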
Text reranking
Pricing rule: billed by input tokens. Output is free.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen3-rerank |
International |
$0.1 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
|
qwen3-vl-rerank |
Chinese mainland |
Text input: $0.1 Image input: $0.258 |
|
gte-rerank-v2 |
Chinese mainland |
Text input: $0.115 |
Industry models
Intent understanding
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota |
|
tongyi-intent-detect-v3 |
Chinese mainland |
$0.058 |
$0.144 |
No free quota |
Role play
You are charged for input tokens and output tokens.
The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.
Singapore
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen-plus-character Session Cache discount |
International |
$0.5 |
$1.4 |
1 million tokens |
|
qwen-flash-character Session Cache discount |
International |
$0.05 |
$0.4 |
1 million tokens |
|
qwen-plus-character-ja |
International |
$0.5 |
$1.4 |
1 million tokens |
China (Beijing)
|
Model ID |
Deployment scope |
Input price (per 1 million tokens) |
Output price (per 1 million tokens) |
Free quota (Note: valid for 90 days after you activate Alibaba Cloud Model Studio) |
|
qwen-plus-character Session Cache discount |
Chinese mainland |
$0.115 |
$0.287 |
1 million tokens |
|
qwen-flash-character Session Cache discount |
Chinese mainland |
$0.034 |
$0.203 |
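Because the role play models charge for both input and output tokens, a request's cost is the input charge plus the output charge, each priced per 1 million tokens. The following sketch shows the arithmetic using the qwen-plus-character prices from the tables above. The names `ROLE_PLAY_PRICES` and `role_play_cost` are hypothetical, and the Session Cache discount and free quota are not modeled.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# (input price, output price) in USD per 1 million tokens,
# keyed by (model ID, deployment scope).
ROLE_PLAY_PRICES = {
    ("qwen-plus-character", "International"): (Decimal("0.5"), Decimal("1.4")),
    ("qwen-plus-character", "Chinese mainland"): (Decimal("0.115"), Decimal("0.287")),
}


def role_play_cost(model, scope, input_tokens, output_tokens):
    """Estimate the undiscounted charge in USD for one request."""
    input_price, output_price = ROLE_PLAY_PRICES[(model, scope)]
    input_charge = input_price * Decimal(input_tokens) / MILLION
    output_charge = output_price * Decimal(output_tokens) / MILLION
    return input_charge + output_charge


# A request with 12,000 input tokens and 3,000 output tokens.
intl = role_play_cost("qwen-plus-character", "International", 12_000, 3_000)
cn = role_play_cost("qwen-plus-character", "Chinese mainland", 12_000, 3_000)

print(f"International:    ${intl:.6f}")  # 0.006 + 0.0042 = 0.010200
print(f"Chinese mainland: ${cn:.6f}")    # 0.00138 + 0.000861 = 0.002241
```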
Error codes
If a model call fails and returns an error message, see Error codes for resolution.