Alibaba Cloud Model Studio model pricing - Alibaba Cloud Model Studio

Model API calls are billed on a pay-as-you-go basis by default.

Note

This document only lists standard prices. For the latest promotions, visit the Model Studio console.

Note

Some models support context caching (explicit cache and implicit cache). Cache-hit input tokens and the tokens used to create an explicit cache are billed at unit prices different from the standard input price (for example, explicit cache creation is billed at 125% of the standard input price, and cache hits at 10%). The input prices in the tables below do not include cache prices. For cache billing rules, discount rates, and supported models, see Context Cache.

Tiered pricing rules

Some Model Studio models use tiered pricing. The unit price is determined by the total number of input tokens in a single request. All tokens in the request are billed at the unit price of the corresponding tier.

In the pricing tiers, K means 1,000 and M means 1,000,000. For example, 128K equals 128,000 tokens, 256K equals 256,000 tokens, and 1M equals 1,000,000 tokens.

For example, a model has two pricing tiers: 0 < tokens ≤ 32K and 32K < tokens ≤ 128K. If a request contains 100K input tokens, it falls into the second tier (32K < 100K ≤ 128K), and all tokens are billed at the unit price of the second tier.

Text generation - Qwen

Qwen-Max

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤1M	List price $2.5 Limited-time 50% off	List price $7.5 Limited-time 50% off	1 million tokens
qwen3.7-max-2026-06-08 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤1M	$2.5	$7.5	1 million tokens
qwen3.7-max-2026-05-20 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤1M	$2.5	$7.5	1 million tokens
qwen3.7-max-preview Currently equivalent to qwen3.7-max-2026-05-17	International	Thinking mode only	0<Token≤1M	$2.5	$7.5	1 million tokens
qwen3.7-max-2026-05-17	International	Thinking mode only	0<Token≤1M	$2.5	$7.5	1 million tokens
qwen3.6-max-preview context caching discount	International	Non-Thinking and Thinking modes	0<Token≤128K	$1.3	$7.8	1 million tokens
qwen3.6-max-preview context caching discount	International	Non-Thinking and Thinking modes	128K<Token≤256K	$2	$12	1 million tokens
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6	1 million tokens
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-2026-01-23	International	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6	1 million tokens
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-2025-09-23	International	Non-Thinking mode only	0<Token≤32K	$1.2	$6	1 million tokens
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-preview context caching discount	International	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6	1 million tokens
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15

More models

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-max

50% batch inference discount

International

Non-Thinking mode only

No tiered pricing

$1.6

$6.4

1 million tokens

China (Beijing)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 50% batch inference discount context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤1M	List price $1.65 Limited-time 50% off	List price $4.951 Limited-time 50% off
qwen3.7-max-2026-06-08 context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3.7-max-2026-05-20 context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3.6-max-preview context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤128K	$1.238	$7.426
qwen3.6-max-preview context caching discount	Chinese mainland	Non-Thinking and Thinking modes	128K<Token≤256K	$2.063	$12.377
qwen3-max Currently equivalent to qwen3-max-2026-01-23 50% batch inference discount context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.359	$1.434
			32K<Token≤128K	$0.574	$2.294
			128K<Token≤256K	$1.004	$4.014
qwen3-max-2026-01-23	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.359	$1.434
			32K<Token≤128K	$0.574	$2.294
			128K<Token≤256K	$1.004	$4.014
qwen3-max-2025-09-23	Chinese mainland	Non-Thinking mode only	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602
qwen3-max-preview context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602

More models

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-max	Chinese mainland	Non-Thinking mode only	No tiered pricing	$0.345	$1.377

Hong Kong (China)

Note

The following table shows list prices. Some models offer limited-time night/daytime discounts (see labels next to prices). Night hours: 22:00 to 08:00 (UTC+8), based on billing time; other hours are daytime hours.

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	List price $1.65 Limited-time night 80% off, daytime 50% off	List price $4.951 Limited-time night 80% off, daytime 50% off
qwen3.7-max-2026-06-08 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount	Hong Kong (China)	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-2026-01-23	Hong Kong (China)	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15

Germany (Frankfurt)

Note

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	List price $1.65 Limited-time night 80% off, daytime 50% off	List price $4.951 Limited-time night 80% off, daytime 50% off
qwen3.7-max-2026-06-08 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount	Global	Non-Thinking mode only	0<Token≤32K	$0.359	$1.434
			32K<Token≤128K	$0.574	$2.294
			128K<Token≤256K	$1.004	$4.014
qwen3-max Currently equivalent to qwen3-max-2026-01-23 50% batch inference discount context caching discount	EU	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-2026-01-23	EU	Non-Thinking and Thinking modes	0<Token≤32K	$1.2	$6
			32K<Token≤128K	$2.4	$12
			128K<Token≤256K	$3	$15
qwen3-max-2025-09-23	Global	Non-Thinking mode only	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602
qwen3-max-preview context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602

US (Virginia)

Note

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3.7-max Currently equivalent to qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	List price $1.65 Limited-time night 80% off, daytime 50% off	List price $4.951 Limited-time night 80% off, daytime 50% off
qwen3.7-max-us context caching discount	US	Non-Thinking and Thinking modes	0<Token≤1M	List price $2.5 Limited-time 50% off	List price $7.5 Limited-time 50% off
qwen3.7-max-2026-06-08 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3.7-max-2026-05-20 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤1M	$1.65	$4.951
qwen3-max Currently equivalent to qwen3-max-2026-01-23 context caching discount	Global	Non-Thinking mode only	0<Token≤32K	$0.359	$1.434
			32K<Token≤128K	$0.574	$2.294
			128K<Token≤256K	$1.004	$4.014
qwen3-max-2025-09-23	Global	Non-Thinking mode only	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602
qwen3-max-preview context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.861	$3.441
			32K<Token≤128K	$1.434	$5.735
			128K<Token≤256K	$2.151	$8.602

Japan (Tokyo)

Note

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20

Context Cache context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

List price $1.65 Limited-time night 80% off, daytime 50% off

List price $4.951 Limited-time night 80% off, daytime 50% off

qwen3.7-max-2026-05-20

Context Cache context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

Qwen-Plus

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount	International	0<Token≤256K	List price $0.4 Limited-time 20% off	List price $1.6 Limited-time 20% off	List price $1.6 Limited-time 20% off	1 million tokens
	International	256K<Token≤1M	List price $1.2 Limited-time 20% off	List price $4.8 Limited-time 20% off	List price $4.8 Limited-time 20% off	1 million tokens
qwen3.7-plus-2026-05-26 context caching discount	International	0<Token≤256K	$0.4	$1.6	$1.6	1 million tokens
qwen3.7-plus-2026-05-26 context caching discount	International	256K<Token≤1M	$1.2	$4.8	$4.8	1 million tokens
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02	International	0<Token≤256K	$0.5	$3	$3	1 million tokens
	International	256K<Token≤1M	$2	$6	$6	1 million tokens
qwen3.6-plus-2026-04-02	International	0<Token≤256K	$0.5	$3	$3	1 million tokens
qwen3.6-plus-2026-04-02	International	256K<Token≤1M	$2	$6	$6	1 million tokens
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15	International	0<Token≤256K	$0.4	$2.4	$2.4	1 million tokens
	International	256K<Token≤1M	$0.5	$3	$3	1 million tokens
qwen3.5-plus-2026-04-20	International	0<Token≤256K	$0.4	$2.4	$2.4	1 million tokens
qwen3.5-plus-2026-04-20	International	256K<Token≤1M	$0.5	$3	$3	1 million tokens
qwen3.5-plus-2026-02-15	International	0<Token≤256K	$0.4	$2.4	$2.4	1 million tokens
qwen3.5-plus-2026-02-15	International	256K<Token≤1M	$0.5	$3	$3	1 million tokens
qwen-plus Currently equivalent to qwen-plus-2025-12-01	International	0<Token≤256K	$0.4	$1.2	$4	1 million tokens
qwen-plus Currently equivalent to qwen-plus-2025-12-01	International	256K<Token≤1M	$1.2	$3.6	$12	1 million tokens
qwen-plus-latest	International	0<Token≤256K	$0.4	$1.2	$4	1 million tokens
qwen-plus-latest	International	256K<Token≤1M	$1.2	$3.6	$12	1 million tokens
qwen-plus-2025-12-01	International	0<Token≤256K	$0.4	$1.2	$4	1 million tokens
qwen-plus-2025-12-01	International	256K<Token≤1M	$1.2	$3.6	$12	1 million tokens
qwen-plus-2025-09-11	International	0<Token≤256K	$0.4	$1.2	$4	1 million tokens
qwen-plus-2025-09-11	International	256K<Token≤1M	$1.2	$3.6	$12	1 million tokens
qwen-plus-2025-07-28	International	0<Token≤256K	$0.4	$1.2	$4	1 million tokens
qwen-plus-2025-07-28	International	256K<Token≤1M	$1.2	$3.6	$12	1 million tokens
qwen-plus-2025-07-14	International	No tiered pricing	$0.4	$1.2	$4	1 million tokens
qwen-plus-2025-04-28	International	No tiered pricing	$0.4	$1.2	$4	1 million tokens
qwen-plus-2025-01-25	International	No tiered pricing	$0.4	$1.2	-	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount	Chinese mainland	0<Token≤256K	List price $0.276 Limited-time 20% off	List price $1.101 Limited-time 20% off	List price $1.101 Limited-time 20% off
	Chinese mainland	256K<Token≤1M	List price $0.826 Limited-time 20% off	List price $3.301 Limited-time 20% off	List price $3.301 Limited-time 20% off
qwen3.7-plus-2026-05-26 context caching discount	Chinese mainland	0<Token≤256K	$0.276	$1.101	$1.101
qwen3.7-plus-2026-05-26 context caching discount	Chinese mainland	256K<Token≤1M	$0.826	$3.301	$3.301
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02	Chinese mainland	0<Token≤256K	$0.276	$1.651	$1.651
	Chinese mainland	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.6-plus-2026-04-02	Chinese mainland	0<Token≤256K	$0.276	$1.651	$1.651
qwen3.6-plus-2026-04-02	Chinese mainland	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15	Chinese mainland	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen3.5-plus-2026-04-20	Chinese mainland	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen3.5-plus-2026-02-15	Chinese mainland	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen-plus Currently equivalent to qwen-plus-2025-12-01	Chinese mainland	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-latest	Chinese mainland	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-12-01	Chinese mainland	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-09-11	Chinese mainland	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-07-28	Chinese mainland	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-07-14	Chinese mainland	No tiered pricing	$0.115	$0.287	$1.147
qwen-plus-2025-04-28	Chinese mainland	No tiered pricing	$0.115	$0.287	$1.147

More models

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-plus-2025-01-25	Chinese mainland	No tiered pricing	$0.115	$0.287
qwen-plus-2025-01-12	Chinese mainland	No tiered pricing	$0.115	$0.287
qwen-plus-2024-12-20	Chinese mainland	No tiered pricing	$0.115	$0.287

Hong Kong (China)

Note

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	List price $0.276 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off
	Global	256K<Token≤1M	List price $0.826 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off
qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	$0.276	$1.101	$1.101
qwen3.7-plus-2026-05-26 context caching discount	Global	256K<Token≤1M	$0.826	$3.301	$3.301
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen-plus Currently equivalent to qwen-plus-2025-12-01	Hong Kong (China)	0<Token≤256K	$0.4	$1.2	$4
qwen-plus Currently equivalent to qwen-plus-2025-12-01	Hong Kong (China)	256K<Token≤1M	$1.2	$3.6	$12
qwen-plus-2025-12-01	Hong Kong (China)	0<Token≤256K	$0.4	$1.2	$4
qwen-plus-2025-12-01	Hong Kong (China)	256K<Token≤1M	$1.2	$3.6	$12

Germany (Frankfurt)

Note

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	List price $0.276 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off
	Global	256K<Token≤1M	List price $0.826 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off
qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	$0.276	$1.101	$1.101
qwen3.7-plus-2026-05-26 context caching discount	Global	256K<Token≤1M	$0.826	$3.301	$3.301
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
qwen3.6-plus-2026-04-02	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15	Global	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen3.5-plus-2026-02-15	Global	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen-plus Currently equivalent to qwen-plus-2025-12-01	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus Currently equivalent to qwen-plus-2025-12-01	EU	0<Token≤256K	$0.4	$1.2	$4
qwen-plus Currently equivalent to qwen-plus-2025-12-01	EU	256K<Token≤1M	$1.2	$3.6	$12
qwen-plus-2025-12-01	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-12-01	EU	0<Token≤256K	$0.4	$1.2	$4
qwen-plus-2025-12-01	EU	256K<Token≤1M	$1.2	$3.6	$12
qwen-plus-2025-09-11	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-07-28	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175

US (Virginia)

Note

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	List price $0.276 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off
	Global	256K<Token≤1M	List price $0.826 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off
qwen3.7-plus-us context caching discount	US	0<Token≤256K	List price $0.4 Limited-time 20% off	List price $1.6 Limited-time 20% off	List price $1.6 Limited-time 20% off
qwen3.7-plus-us context caching discount	US	256K<Token≤1M	List price $1.2 Limited-time 20% off	List price $4.8 Limited-time 20% off	List price $4.8 Limited-time 20% off
qwen3.7-plus-2026-05-26 context caching discount	Global	0<Token≤256K	$0.276	$1.101	$1.101
qwen3.7-plus-2026-05-26 context caching discount	Global	256K<Token≤1M	$0.826	$3.301	$3.301
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
qwen3.6-plus-2026-04-02	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15	Global	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen3.5-plus-2026-02-15	Global	0<Token≤128K	$0.115	$0.688	$0.688
		128K<Token≤256K	$0.287	$1.72	$1.72
		256K<Token≤1M	$0.573	$3.44	$3.44
qwen-plus Currently equivalent to qwen-plus-2025-12-01	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-us	US	0<Token≤256K	$0.4	$1.2	$4
qwen-plus-us	US	256K<Token≤1M	$1.2	$3.6	$12
qwen-plus-2025-12-01	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-12-01-us	US	0<Token≤256K	$0.4	$1.2	$4
qwen-plus-2025-12-01-us	US	256K<Token≤1M	$1.2	$3.6	$12
qwen-plus-2025-09-11	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175
qwen-plus-2025-07-28	Global	0<Token≤128K	$0.115	$0.287	$1.147
		128K<Token≤256K	$0.345	$2.868	$3.441
		256K<Token≤1M	$0.689	$6.881	$9.175

Japan (Tokyo)

Model ID	Deployment scope	Input token range per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input token range per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (Chain of thought + answer)
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 Context Cache context caching discount	Japan	0<Token≤256K	$0.4	$1.6	$1.6
	Japan	256K<Token≤1M	$1.2	$4.8	$4.8
qwen3.7-plus-2026-05-26 Context Cache context caching discount	Japan	0<Token≤256K	$0.4	$1.6	$1.6
	Japan	256K<Token≤1M	$1.2	$4.8	$4.8
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 Context Cache context caching discount	Global	0<Token≤256K	List price $0.276 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off	List price $1.101 Limited-time night 60% off, daytime 20% off
	Global	256K<Token≤1M	List price $0.826 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off	List price $3.301 Limited-time night 60% off, daytime 20% off
qwen3.7-plus-2026-05-26 Context Cache context caching discount	Global	0<Token≤256K	$0.276	$1.101	$1.101
	Global	256K<Token≤1M	$0.826	$3.301	$3.301
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 Context Cache context caching discount	Global	0<Token≤256K	$0.276	$1.651	$1.651
	Global	256K<Token≤1M	$1.101	$6.602	$6.602
qwen3.6-plus-2026-04-02	Global	0<Token≤256K	$0.276	$1.651	$1.651
qwen3.6-plus-2026-04-02	Global	256K<Token≤1M	$1.101	$6.602	$6.602

Qwen-Flash

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 50% batch inference discount context caching discount	International	0<Token≤256K	$0.25	$1.5	1 million tokens
	International	256K<Token≤1M	$1	$4	1 million tokens
qwen3.6-flash-2026-04-16	International	0<Token≤256K	$0.25	$1.5	1 million tokens
qwen3.6-flash-2026-04-16	International	256K<Token≤1M	$1	$4	1 million tokens
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 50% batch inference discount context caching discount	International	0<Token≤1M	$0.1	$0.4	1 million tokens
qwen3.5-flash-2026-02-23	International	0<Token≤1M	$0.1	$0.4	1 million tokens
qwen-flash Currently equivalent to qwen-flash-2025-07-28 50% batch inference discount context caching discount	International	0<Token≤256K	$0.05	$0.4	1 million tokens
	International	256K<Token≤1M	$0.25	$2	1 million tokens
qwen-flash-2025-07-28	International	0<Token≤256K	$0.05	$0.4	1 million tokens
qwen-flash-2025-07-28	International	256K<Token≤1M	$0.25	$2	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 50% batch inference discount context caching discount	Chinese mainland	0<Token≤256K	$0.165	$0.99
	Chinese mainland	256K<Token≤1M	$0.66	$3.961
qwen3.6-flash-2026-04-16	Chinese mainland	0<Token≤256K	$0.165	$0.99
qwen3.6-flash-2026-04-16	Chinese mainland	256K<Token≤1M	$0.66	$3.961
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23	Chinese mainland	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen3.5-flash-2026-02-23	Chinese mainland	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount	Chinese mainland	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721
qwen-flash-2025-07-28	Chinese mainland	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721

Hong Kong (China)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
	Global	256K<Token≤1M	$0.66	$3.961
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 context caching discount	Hong Kong (China)	0<Token≤1M	$0.1	$0.4
qwen3.5-flash-2026-02-23	Hong Kong (China)	0<Token≤1M	$0.1	$0.4

Germany (Frankfurt)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
	Global	256K<Token≤1M	$0.66	$3.961
qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
qwen3.6-flash-2026-04-16	Global	256K<Token≤1M	$0.66	$3.961
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23	Global	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 context caching discount	EU	0<Token≤1M	$0.1	$0.4
qwen3.5-flash-2026-02-23	Global	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen3.5-flash-2026-02-23	EU	0<Token≤1M	$0.1	$0.4
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount	Global	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721
qwen-flash-2025-07-28	Global	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721

US (Virginia)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
	Global	256K<Token≤1M	$0.66	$3.961
qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
qwen3.6-flash-2026-04-16	Global	256K<Token≤1M	$0.66	$3.961
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23	Global	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen3.5-flash-2026-02-23	Global	0<Token≤128K	$0.029	$0.287
		128K<Token≤256K	$0.115	$1.147
		256K<Token≤1M	$0.172	$1.72
qwen-flash Currently equivalent to qwen-flash-2025-07-28 context caching discount	Global	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721
qwen-flash-us	US	0<Token≤256K	$0.05	$0.4
qwen-flash-us	US	256K<Token≤1M	$0.25	$2
qwen-flash-2025-07-28	Global	0<Token≤128K	$0.022	$0.216
		128K<Token≤256K	$0.087	$0.861
		256K<Token≤1M	$0.173	$1.721
qwen-flash-2025-07-28-us	US	0<Token≤256K	$0.05	$0.4
qwen-flash-2025-07-28-us	US	256K<Token≤1M	$0.25	$2

Japan (Tokyo)

Model ID	Deployment scope	Input token range per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 Context Cache context caching discount	Global	0<Token≤256K	$0.165	$0.99
	Global	256K<Token≤1M	$0.66	$3.961
qwen3.6-flash-2026-04-16	Global	0<Token≤256K	$0.165	$0.99
qwen3.6-flash-2026-04-16	Global	256K<Token≤1M	$0.66	$3.961

Qwen-Turbo

Note

Qwen-Turbo will no longer be updated. We recommend switching to Qwen-Flash.

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen-turbo

50% batch inference discount

International

$0.05

$0.2

$0.5

1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
			Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen-turbo	Chinese mainland	$0.044	$0.087	$0.431

QwQ

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwq-plus

International

$0.8

$2.4

1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwq-plus	Chinese mainland	$0.230	$0.574

Qwen-Long

You are charged for input tokens and output tokens.

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
qwen-long-latest	International	$0.072	$0.287	No free quota
qwen-long-2025-01-25	International	$0.072	$0.287	No free quota

Qwen-Omni

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Text/Image/video	Audio	Text Multimodal input	Text + audio Audio only billed
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15	International	$1.4	$11	$8.3	$44	1 million tokens
qwen3.5-omni-plus-2026-03-15	International	$1.4	$11	$8.3	$44	1 million tokens
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15	International	$0.4	$3	$2.2	$11.9	1 million tokens
qwen3.5-omni-flash-2026-03-15	International	$0.4	$3	$2.2	$11.9	1 million tokens

More models

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)			Output price (per 1 million tokens)			Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Mode	Text	Audio	Image/video	Text Text-only input	Text Multimodal input	Text + audio Audio only billed
qwen3-omni-flash Currently equivalent to qwen3-omni-flash-2025-12-01	International	Non-Thinking and Thinking modes	$0.43	$3.81	$0.78	$1.66	$3.06	$15.11	1 million tokens (regardless of modality)
qwen3-omni-flash-2025-12-01	International	Non-Thinking and Thinking modes	$0.43	$3.81	$0.78	$1.66	$3.06	$15.11	1 million tokens (regardless of modality)
qwen3-omni-flash-2025-09-15	International	Non-Thinking and Thinking modes	$0.43	$3.81	$0.78	$1.66	$3.06	$15.11	1 million tokens (regardless of modality)
qwen-omni-turbo Currently equivalent to qwen-omni-turbo-2025-03-26	International	Non-Thinking mode	$0.07	$4.44	$0.21	$0.27	$0.63	$8.89	1 million tokens (regardless of modality)
qwen-omni-turbo-latest	International	Non-Thinking mode	$0.07	$4.44	$0.21	$0.27	$0.63	$8.89	1 million tokens (regardless of modality)
qwen-omni-turbo-2025-03-26	International	Non-Thinking mode	$0.07	$4.44	$0.21	$0.27	$0.63	$8.89	1 million tokens (regardless of modality)

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)
Model ID	Deployment scope	Text/Image/video	Audio	Text Multimodal input	Text + audio Audio only billed
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15	Chinese mainland	$0.96	$7.29	$5.5	$29.29
qwen3.5-omni-plus-2026-03-15	Chinese mainland	$0.96	$7.29	$5.5	$29.29
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15	Chinese mainland	$0.3	$2.48	$1.83	$9.9
qwen3.5-omni-flash-2026-03-15	Chinese mainland	$0.3	$2.48	$1.83	$9.9

More models

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)			Output price (per 1 million tokens)
Model ID	Deployment scope	Mode	Text	Audio	Image/video	Text Text-only input	Text Multimodal input	Text + audio Audio only billed
qwen3-omni-flash Currently equivalent to qwen3-omni-flash-2025-12-01	Chinese mainland	Non-Thinking and Thinking modes	$0.258	$2.265	$0.473	$0.989	$1.821	$8.974
qwen3-omni-flash-2025-12-01	Chinese mainland	Non-Thinking and Thinking modes	$0.258	$2.265	$0.473	$0.989	$1.821	$8.974
qwen3-omni-flash-2025-09-15	Chinese mainland	Non-Thinking and Thinking modes	$0.258	$2.265	$0.473	$0.989	$1.821	$8.974
qwen-omni-turbo Currently equivalent to qwen-omni-turbo-2025-03-26	Chinese mainland	Non-Thinking mode	$0.058	$3.584	$0.216	$0.230	$0.646	$7.168
qwen-omni-turbo-latest	Chinese mainland	Non-Thinking mode	$0.058	$3.584	$0.216	$0.230	$0.646	$7.168
qwen-omni-turbo-2025-03-26	Chinese mainland	Non-Thinking mode	$0.058	$3.584	$0.216	$0.230	$0.646	$7.168
qwen-omni-turbo-2025-01-19	Chinese mainland	Non-Thinking mode	$0.058	$3.584	$0.216	$0.230	$0.646	$7.168

Qwen-Omni-Realtime

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Text/image	Audio	Text Multimodal input	Text + audio Audio only billed
qwen3.5-omni-plus-realtime Currently equivalent to qwen3.5-omni-plus-realtime-2026-03-15	International	$2.1	$16.5	$12.4	$62	1 million tokens
qwen3.5-omni-plus-realtime-2026-03-15	International	$2.1	$16.5	$12.4	$62	1 million tokens
qwen3.5-omni-flash-realtime Currently equivalent to qwen3.5-omni-flash-realtime-2026-03-15	International	$0.55	$4.5	$3.3	$17.7	1 million tokens
qwen3.5-omni-flash-realtime-2026-03-15	International	$0.55	$4.5	$3.3	$17.7	1 million tokens

More models

Model ID	Deployment scope	Input price (per 1 million tokens)			Output price (per 1 million tokens)			Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Text	Audio	Image	Text Text-only input	Text Multimodal input	Text + audio Audio only billed
qwen3-omni-flash-realtime Currently equivalent to qwen3-omni-flash-realtime-2025-12-01	International	$0.52	$4.57	$0.94	$1.99	$3.67	$18.13	1 million tokens (regardless of modality)
qwen3-omni-flash-realtime-2025-12-01	International	$0.52	$4.57	$0.94	$1.99	$3.67	$18.13	1 million tokens (regardless of modality)
qwen3-omni-flash-realtime-2025-09-15	International	$0.52	$4.57	$0.94	$1.99	$3.67	$18.13	1 million tokens (regardless of modality)
qwen-omni-turbo-realtime Currently equivalent to qwen-omni-turbo-realtime-2025-05-08	International	$0.270	$4.440	$0.840	$1.070	$2.520	$8.890	1 million tokens (regardless of modality)
qwen-omni-turbo-realtime-latest	International	$0.270	$4.440	$0.840	$1.070	$2.520	$8.890	1 million tokens (regardless of modality)
qwen-omni-turbo-realtime-2025-05-08	International	$0.270	$4.440	$0.840	$1.070	$2.520	$8.890	1 million tokens (regardless of modality)

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)
Model ID	Deployment scope	Text/image	Audio	Text Multimodal input	Text + audio Audio only billed
qwen3.5-omni-plus-realtime Currently equivalent to qwen3.5-omni-plus-realtime-2026-03-15	Chinese mainland	$1.38	$11	$8.25	$41.26
qwen3.5-omni-plus-realtime-2026-03-15	Chinese mainland	$1.38	$11	$8.25	$41.26
qwen3.5-omni-flash-realtime Currently equivalent to qwen3.5-omni-flash-realtime-2026-03-15	Chinese mainland	$0.45	$3.71	$2.75	$14.71
qwen3.5-omni-flash-realtime-2026-03-15	Chinese mainland	$0.45	$3.71	$2.75	$14.71

More models

Model ID	Deployment scope	Input price (per 1 million tokens)			Output price (per 1 million tokens)
Model ID	Deployment scope	Text	Audio	Image	Text Text-only input	Text Multimodal input	Text + audio Audio only billed
qwen3-omni-flash-realtime Currently equivalent to qwen3-omni-flash-realtime-2025-12-01	Chinese mainland	$0.315	$2.709	$0.559	$1.19	$2.179	$10.766
qwen3-omni-flash-realtime-2025-12-01	Chinese mainland	$0.315	$2.709	$0.559	$1.19	$2.179	$10.766
qwen3-omni-flash-realtime-2025-09-15	Chinese mainland	$0.315	$2.709	$0.559	$1.19	$2.179	$10.766
qwen-omni-turbo-realtime Currently equivalent to qwen-omni-turbo-realtime-2025-05-08	Chinese mainland	$0.230	$3.584	$0.861	$0.918	$2.581	$7.168
qwen-omni-turbo-realtime-latest	Chinese mainland	$0.230	$3.584	$0.861	$0.918	$2.581	$7.168
qwen-omni-turbo-realtime-2025-05-08	Chinese mainland	$0.230	$3.584	$0.861	$0.918	$2.581	$7.168

QVQ

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qvq-max

International

$1.2

$4.8

1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qvq-max	Chinese mainland	$1.147	$4.588
qvq-plus	Chinese mainland	$0.287	$0.717

Qwen-VL

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6	1 million tokens
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8
qwen3-vl-plus-2025-12-19	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6	1 million tokens
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8
qwen3-vl-plus-2025-09-23	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6	1 million tokens
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4	1 million tokens
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2026-01-22	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4	1 million tokens
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2025-10-15	International	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4	1 million tokens
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-vl-max

context caching discount

International

No tiered pricing

$0.8

$3.2

1 million tokens

qwen-vl-plus

context caching discount

International

No tiered pricing

$0.21

$0.63

1 million tokens

China (Beijing)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301
qwen3-vl-plus-2025-12-19	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301
qwen3-vl-plus-2025-09-23	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash-2026-01-22	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash-2025-10-15	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-max

context caching discount

Chinese mainland

No tiered pricing

$0.23

$0.574

qwen-vl-plus

context caching discount

Chinese mainland

No tiered pricing

$0.115

$0.287

Hong Kong (China)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount	Hong Kong (China)	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8
qwen3-vl-plus-2025-12-19	Hong Kong (China)	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8

Germany (Frankfurt)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2025-10-15 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 context caching discount	EU	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2026-01-22	EU	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2025-10-15	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash-2025-10-15	EU	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301
qwen3-vl-plus context caching discount	EU	Non-Thinking and Thinking modes	0<Token≤32K	$0.2	$1.6
			32K<Token≤128K	$0.3	$2.4
			128K<Token≤256K	$0.6	$4.8
qwen3-vl-plus-2025-09-23	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301

US (Virginia)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2025-10-15 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash-us context caching discount	US	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2026-01-22-us	US	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-flash-2025-10-15	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.022	$0.215
			32K<Token≤128K	$0.043	$0.43
			128K<Token≤256K	$0.086	$0.859
qwen3-vl-flash-2025-10-15-us	US	Non-Thinking and Thinking modes	0<Token≤32K	$0.05	$0.4
			32K<Token≤128K	$0.075	$0.6
			128K<Token≤256K	$0.12	$0.96
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 context caching discount	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301
qwen3-vl-plus-2025-09-23	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.143	$1.434
			32K<Token≤128K	$0.215	$2.15
			128K<Token≤256K	$0.43	$4.301

Qwen-OCR

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-vl-ocr

Currently equivalent to qwen-vl-ocr-2025-11-20

International

$0.07

$0.16

1 million tokens

qwen-vl-ocr-2025-11-20

International

1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3.5-ocr	Chinese mainland	$0.069	$0.275
qwen-vl-ocr Currently equivalent to qwen-vl-ocr-2025-11-20	Chinese mainland	$0.043	$0.072
qwen-vl-ocr-latest	Chinese mainland	$0.043	$0.072
qwen-vl-ocr-2025-11-20	Chinese mainland	$0.043	$0.072
qwen-vl-ocr-2025-08-28	Chinese mainland	$0.717	$0.717
qwen-vl-ocr-2025-04-13	Chinese mainland	$0.717	$0.717
qwen-vl-ocr-2024-10-28	Chinese mainland	$0.717	$0.717

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-ocr

Currently equivalent to qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-ocr

Currently equivalent to qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

Qwen Math

You are charged for input tokens and output tokens.

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
qwen-math-plus	Chinese mainland	$0.574	$1.721	No free quota
qwen-math-plus-latest	Chinese mainland	$0.574	$1.721	No free quota
qwen-math-plus-2024-09-19	Chinese mainland	$0.574	$1.721	No free quota
qwen-math-plus-2024-08-16	Chinese mainland	$0.574	$1.721	No free quota
qwen-math-turbo	Chinese mainland	$0.287	$0.861	No free quota

Qwen-Coder

You are charged for input tokens and output tokens.

If the model supports context cache, only input tokens receive a discount.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount	International	0<Token≤32K	$1	$5	1 million tokens
		32K<Token≤128K	$1.8	$9
		128K<Token≤256K	$3	$15
		256K<Token≤1M	$6	$60
qwen3-coder-plus-2025-09-23	International	0<Token≤32K	$1	$5	1 million tokens
		32K<Token≤128K	$1.8	$9
		128K<Token≤256K	$3	$15
		256K<Token≤1M	$6	$60
qwen3-coder-plus-2025-07-22	International	0<Token≤32K	$1	$5	1 million tokens
		32K<Token≤128K	$1.8	$9
		128K<Token≤256K	$3	$15
		256K<Token≤1M	$6	$60
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28	International	0<Token≤32K	$0.3	$1.5	1 million tokens
		32K<Token≤128K	$0.5	$2.5
		128K<Token≤256K	$0.8	$4
		256K<Token≤1M	$1.6	$9.6
qwen3-coder-flash-2025-07-28	International	0<Token≤32K	$0.3	$1.5	1 million tokens
		32K<Token≤128K	$0.5	$2.5
		128K<Token≤256K	$0.8	$4
		256K<Token≤1M	$1.6	$9.6

China (Beijing)

qwen3-coderseries models

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount	Chinese mainland	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-09-23	Chinese mainland	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-07-22	Chinese mainland	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28	Chinese mainland	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584
qwen3-coder-flash-2025-07-28	Chinese mainland	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584

Legacy qwen-coder series models

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-coder-plus	Chinese mainland	No tiered pricing	$0.502	$1.004
qwen-coder-turbo	Chinese mainland	No tiered pricing	$0.287	$0.861

Germany (Frankfurt)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-09-23	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-07-22	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 context caching discount	Global	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584
qwen3-coder-flash-2025-07-28	Global	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584

US (Virginia)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 context caching discount	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-09-23	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-plus-2025-07-22	Global	0<Token≤32K	$0.574	$2.294
		32K<Token≤128K	$0.861	$3.441
		128K<Token≤256K	$1.434	$5.735
		256K<Token≤1M	$2.868	$28.671
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 context caching discount	Global	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584
qwen3-coder-flash-2025-07-28	Global	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
		256K<Token≤1M	$0.717	$3.584

Qwen Translation

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen-mt-plus	International	$2.46	$7.37	1 million tokens
qwen-mt-flash	International	$0.16	$0.49	1 million tokens
qwen-mt-lite	International	$0.12	$0.36	1 million tokens
qwen-mt-turbo	International	$0.16	$0.49	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-mt-plus	Chinese mainland	$0.259	$0.775
qwen-mt-flash	Chinese mainland	$0.101	$0.280
qwen-mt-lite	Chinese mainland	$0.086	$0.229
qwen-mt-turbo	Chinese mainland	$0.101	$0.280

Germany (Frankfurt)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-mt-plus	Global	$0.259	$0.775
qwen-mt-flash	Global	$0.101	$0.280
qwen-mt-lite	Global	$0.086	$0.229

US (Virginia)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-mt-flash	Global	$0.101	$0.280
qwen-mt-lite	Global	$0.086	$0.229
qwen-mt-lite-us	US	$0.12	$0.36
qwen-mt-plus	Global	$0.259	$0.775

Qwen Data Mining

You are charged for input tokens and output tokens.

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
qwen-doc-turbo	Chinese mainland	$0.087	$0.144	No free quota

Qwen Deep Research

You are charged for input tokens and output tokens.

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
qwen-deep-research	Chinese mainland	$7.742	$23.367	None

Text generation - Qwen (open source)

Qwen3.6

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.6-35b-a3b	International	0<Token≤256K	$0.375	$2.25	$2.25	1 million tokens
qwen3.6-27b	International	0<Token≤256K	$0.6	$3.6	$3.6	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.6-35b-a3b	Chinese mainland	0<Token≤256K	$0.248	$1.485	$1.485
qwen3.6-27b	Chinese mainland	0<Token≤256K	$0.412564	$2.475384	$2.475384

Germany (Frankfurt)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
				Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.6-35b-a3b	Global	0<Token≤256K	$0.248	$1.485	$1.485

US (Virginia)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
				Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.6-35b-a3b	Global	0<Token≤256K	$0.248	$1.485	$1.485

Qwen3.5

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.5-397b-a17b	International	0<Token≤256K	$0.6	$3.6	$3.6	1 million tokens
qwen3.5-122b-a10b	International	0<Token≤256K	$0.4	$3.2	$3.2	1 million tokens
qwen3.5-27b	International	0<Token≤256K	$0.3	$2.4	$2.4	1 million tokens
qwen3.5-35b-a3b	International	0<Token≤256K	$0.25	$2	$2	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.5-397b-a17b	Chinese mainland	0<Token≤128K	$0.172	$1.032	$1.032
qwen3.5-397b-a17b	Chinese mainland	128K<Token≤256K	$0.43	$2.58	$2.58
qwen3.5-122b-a10b	Chinese mainland	0<Token≤128K	$0.115	$0.917	$0.917
qwen3.5-122b-a10b	Chinese mainland	128K<Token≤256K	$0.287	$2.294	$2.294
qwen3.5-27b	Chinese mainland	0<Token≤128K	$0.086	$0.688	$0.688
qwen3.5-27b	Chinese mainland	128K<Token≤256K	$0.258	$2.064	$2.064
qwen3.5-35b-a3b	Chinese mainland	0<Token≤128K	$0.057	$0.459	$0.459
qwen3.5-35b-a3b	Chinese mainland	128K<Token≤256K	$0.229	$1.835	$1.835

Germany (Frankfurt)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.5-397b-a17b	Global	0<Token≤128K	$0.172	$1.032	$1.032
qwen3.5-397b-a17b	Global	128K<Token≤256K	$0.43	$2.58	$2.58
qwen3.5-122b-a10b	Global	0<Token≤128K	$0.115	$0.917	$0.917
qwen3.5-122b-a10b	Global	128K<Token≤256K	$0.287	$2.294	$2.294
qwen3.5-27b	Global	0<Token≤128K	$0.086	$0.688	$0.688
qwen3.5-27b	Global	128K<Token≤256K	$0.258	$2.064	$2.064
qwen3.5-35b-a3b	Global	0<Token≤128K	$0.057	$0.459	$0.459
qwen3.5-35b-a3b	Global	128K<Token≤256K	$0.229	$1.835	$1.835

US (Virginia)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3.5-397b-a17b	Global	0<Token≤128K	$0.172	$1.032	$1.032
qwen3.5-397b-a17b	Global	128K<Token≤256K	$0.43	$2.58	$2.58
qwen3.5-122b-a10b	Global	0<Token≤128K	$0.115	$0.917	$0.917
qwen3.5-122b-a10b	Global	128K<Token≤256K	$0.287	$2.294	$2.294
qwen3.5-27b	Global	0<Token≤128K	$0.086	$0.688	$0.688
qwen3.5-27b	Global	128K<Token≤256K	$0.258	$2.064	$2.064
qwen3.5-35b-a3b	Global	0<Token≤128K	$0.057	$0.459	$0.459
qwen3.5-35b-a3b	Global	128K<Token≤256K	$0.229	$1.835	$1.835

Qwen3

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode
qwen3-next-80b-a3b-thinking	International	Thinking mode only	$0.15	-	$1.2	1 million tokens
qwen3-next-80b-a3b-instruct	International	Non-Thinking mode only	$0.15	$1.2	-	1 million tokens
qwen3-235b-a22b-thinking-2507	International	Thinking mode only	$0.23	-	$2.3	1 million tokens
qwen3-235b-a22b-instruct-2507	International	Non-Thinking mode only	$0.23	$0.92	-	1 million tokens
qwen3-30b-a3b-thinking-2507	International	Thinking mode only	$0.2	-	$2.4	1 million tokens
qwen3-30b-a3b-instruct-2507	International	Non-Thinking mode only	$0.2	$0.8	-	1 million tokens
qwen3-235b-a22b	International	Non-Thinking and Thinking modes	$0.7	$2.8	$8.4	1 million tokens
qwen3-32b	International	Non-Thinking and Thinking modes	$0.16	$0.64	$0.64	1 million tokens
qwen3-30b-a3b	International	Non-Thinking and Thinking modes	$0.2	$0.8	$2.4	1 million tokens
qwen3-14b	International	Non-Thinking and Thinking modes	$0.35	$1.4	$4.2	1 million tokens
qwen3-8b	International	Non-Thinking and Thinking modes	$0.18	$0.7	$2.1	1 million tokens

China (Beijing)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3-next-80b-a3b-thinking	Chinese mainland	Thinking mode only	$0.144	-	$1.434
qwen3-next-80b-a3b-instruct	Chinese mainland	Non-Thinking mode only	$0.144	$0.574	-
qwen3-235b-a22b-thinking-2507	Chinese mainland	Thinking mode only	$0.287	-	$2.868
qwen3-235b-a22b-instruct-2507	Chinese mainland	Non-Thinking mode only	$0.287	$1.147	-
qwen3-30b-a3b-thinking-2507	Chinese mainland	Thinking mode only	$0.108	-	$1.076
qwen3-30b-a3b-instruct-2507	Chinese mainland	Non-Thinking mode only	$0.108	$0.431	-
qwen3-235b-a22b	Chinese mainland	Non-Thinking and Thinking modes	$0.287	$1.147	$2.868
qwen3-32b	Chinese mainland	Non-Thinking and Thinking modes	$0.287	$1.147	$2.868
qwen3-30b-a3b	Chinese mainland	Non-Thinking and Thinking modes	$0.108	$0.431	$1.076
qwen3-14b	Chinese mainland	Non-Thinking and Thinking modes	$0.144	$0.574	$1.434
qwen3-8b	Chinese mainland	Non-Thinking and Thinking modes	$0.072	$0.287	$0.717

Germany (Frankfurt)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3-next-80b-a3b-thinking	Global	Thinking mode only	$0.144	-	$1.434
qwen3-next-80b-a3b-instruct	Global	Non-Thinking mode only	$0.144	$0.574	-
qwen3-235b-a22b-thinking-2507	Global	Thinking mode only	$0.23	-	$2.3
qwen3-235b-a22b-instruct-2507	Global	Non-Thinking mode only	$0.23	$0.92	-
qwen3-30b-a3b-thinking-2507	Global	Thinking mode only	$0.108	-	$1.076
qwen3-30b-a3b-instruct-2507	Global	Non-Thinking mode only	$0.108	$0.431	-
qwen3-235b-a22b	Global	Non-Thinking and Thinking modes	$0.287	$1.147	$2.868
qwen3-32b	Global	Non-Thinking and Thinking modes	$0.16	$0.64	$0.64
qwen3-30b-a3b	Global	Non-Thinking and Thinking modes	$0.108	$0.431	$1.076
qwen3-14b	Global	Non-Thinking and Thinking modes	$0.144	$0.574	$1.434
qwen3-8b	Global	Non-Thinking and Thinking modes	$0.072	$0.287	$0.717

US (Virginia)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens)
Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Non-Thinking mode	Thinking mode (chain of thought + answer)
qwen3-next-80b-a3b-thinking	Global	Thinking mode only	$0.144	-	$1.434
qwen3-next-80b-a3b-instruct	Global	Non-Thinking mode only	$0.144	$0.574	-
qwen3-235b-a22b-thinking-2507	Global	Thinking mode only	$0.23	-	$2.3
qwen3-235b-a22b-instruct-2507	Global	Non-Thinking mode only	$0.23	$0.92	-
qwen3-30b-a3b-thinking-2507	Global	Thinking mode only	$0.108	-	$1.076
qwen3-30b-a3b-instruct-2507	Global	Non-Thinking mode only	$0.108	$0.431	-
qwen3-235b-a22b	Global	Non-Thinking and Thinking modes	$0.287	$1.147	$2.868
qwen3-32b	Global	Non-Thinking and Thinking modes	$0.16	$0.64	$0.64
qwen3-30b-a3b	Global	Non-Thinking and Thinking modes	$0.108	$0.431	$1.076
qwen3-14b	Global	Non-Thinking and Thinking modes	$0.144	$0.574	$1.434
qwen3-8b	Global	Non-Thinking and Thinking modes	$0.072	$0.287	$0.717

Qwen-Omni

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen2.5-omni-7b

International

$0.10

$6.76

$0.28

$0.40

$0.84

$13.51

1 million tokens (regardless of modality)

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Input: text

Input: audio

Input: image/video

Output: text

Text-only input

Output: text

Multimodal input

Output: text + audio

Audio only billed

qwen2.5-omni-7b

Chinese mainland

$0.087

$5.448

$0.287

$0.345

$0.861

$10.895

Qwen3-Omni-Captioner

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-omni-30b-a3b-captioner

International

$3.81

$3.06

1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-omni-30b-a3b-captioner	Chinese mainland	$2.265	$1.821

Qwen-VL

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-vl-235b-a22b-thinking	International	Thinking mode only	$0.4	$4	1 million tokens
qwen3-vl-235b-a22b-instruct	International	Non-Thinking mode only	$0.4	$1.6	1 million tokens
qwen3-vl-32b-thinking	International	Thinking mode only	$0.16	$0.64	1 million tokens
qwen3-vl-32b-instruct	International	Non-Thinking mode only	$0.16	$0.64	1 million tokens
qwen3-vl-30b-a3b-thinking	International	Thinking mode only	$0.2	$2.4	1 million tokens
qwen3-vl-30b-a3b-instruct	International	Non-Thinking mode only	$0.2	$0.8	1 million tokens
qwen3-vl-8b-thinking	International	Thinking mode only	$0.18	$2.1	1 million tokens
qwen3-vl-8b-instruct	International	Non-Thinking mode only	$0.18	$0.7	1 million tokens

More models

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

China (Beijing)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-235b-a22b-thinking	Chinese mainland	Thinking mode only	$0.287	$2.867
qwen3-vl-235b-a22b-instruct	Chinese mainland	Non-Thinking mode only	$0.287	$1.147
qwen3-vl-32b-thinking	Chinese mainland	Thinking mode only	$0.287	$2.868
qwen3-vl-32b-instruct	Chinese mainland	Non-Thinking mode only	$0.287	$1.147
qwen3-vl-30b-a3b-thinking	Chinese mainland	Thinking mode only	$0.108	$1.076
qwen3-vl-30b-a3b-instruct	Chinese mainland	Non-Thinking mode only	$0.108	$0.431
qwen3-vl-8b-thinking	Chinese mainland	Thinking mode only	$0.072	$0.717
qwen3-vl-8b-instruct	Chinese mainland	Non-Thinking mode only	$0.072	$0.287

More models

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen2-vl-72b-instruct	Chinese mainland	$2.294	$6.881
qwen2-vl-7b-instruct	Chinese mainland	Limited-time free
qwen2-vl-2b-instruct	Chinese mainland	Limited-time free

Germany (Frankfurt)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-235b-a22b-thinking	Global	Thinking mode only	$0.287	$2.867
qwen3-vl-235b-a22b-instruct	Global	Non-Thinking mode only	$0.287	$1.147
qwen3-vl-32b-thinking	Global	Thinking mode only	$0.16	$0.64
qwen3-vl-32b-instruct	Global	Non-Thinking mode only	$0.16	$0.64
qwen3-vl-30b-a3b-thinking	Global	Thinking mode only	$0.108	$1.076
qwen3-vl-30b-a3b-instruct	Global	Non-Thinking mode only	$0.108	$0.431
qwen3-vl-8b-thinking	Global	Thinking mode only	$0.072	$0.717
qwen3-vl-8b-instruct	Global	Non-Thinking mode only	$0.072	$0.287

US (Virginia)

Model ID	Deployment scope	Mode	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought + answer
qwen3-vl-235b-a22b-thinking	Global	Thinking mode only	$0.287	$2.867
qwen3-vl-235b-a22b-instruct	Global	Non-Thinking mode only	$0.287	$1.147
qwen3-vl-32b-thinking	Global	Thinking mode only	$0.16	$0.64
qwen3-vl-32b-instruct	Global	Non-Thinking mode only	$0.16	$0.64
qwen3-vl-30b-a3b-thinking	Global	Thinking mode only	$0.108	$1.076
qwen3-vl-30b-a3b-instruct	Global	Non-Thinking mode only	$0.108	$0.431
qwen3-vl-8b-thinking	Global	Thinking mode only	$0.072	$0.717
qwen3-vl-8b-instruct	Global	Non-Thinking mode only	$0.072	$0.287

Qwen-Coder

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-coder-next	International	0<Token≤32K	$0.3	$1.5	1 million tokens
		32K<Token≤128K	$0.5	$2.5
		128K<Token≤256K	$0.8	$4
qwen3-coder-480b-a35b-instruct	International	0<Token≤32K	$1.5	$7.5	1 million tokens
		32K<Token≤128K	$2.7	$13.5
		128K<Token≤200K	$4.5	$22.5
qwen3-coder-30b-a3b-instruct	International	0<Token≤32K	$0.45	$2.25	1 million tokens
		32K<Token≤128K	$0.75	$3.75
		128K<Token≤200K	$1.2	$6

China (Beijing)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-next	Chinese mainland	0<Token≤32K	$0.144	$0.574
		32K<Token≤128K	$0.216	$0.861
		128K<Token≤256K	$0.359	$1.434
qwen3-coder-480b-a35b-instruct	Chinese mainland	0<Token≤32K	$0.861	$3.441
		32K<Token≤128K	$1.291	$5.161
		128K<Token≤200K	$2.151	$8.602
qwen3-coder-30b-a3b-instruct	Chinese mainland	0<Token≤32K	$0.216	$0.861
		32K<Token≤128K	$0.323	$1.291
		128K<Token≤200K	$0.538	$2.151

Germany (Frankfurt)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-30b-a3b-instruct	Global	0<Token≤32K	$0.216	$0.861
		32K<Token≤128K	$0.323	$1.291
		128K<Token≤200K	$0.538	$2.151
qwen3-coder-480b-a35b-instruct	Global	0<Token≤32K	$0.861	$3.441
		32K<Token≤128K	$1.291	$5.161
		128K<Token≤200K	$2.151	$8.602
qwen3-coder-next	EU	0<Token≤32K	$0.3	$1.5
		32K<Token≤128K	$0.5	$2.5
		128K<Token≤256K	$0.8	$4

US (Virginia)

Model ID	Deployment scope	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen3-coder-480b-a35b-instruct	Global	0<Token≤32K	$0.861	$3.441
		32K<Token≤128K	$1.291	$5.161
		128K<Token≤200K	$2.151	$8.602
qwen3-coder-30b-a3b-instruct	Global	0<Token≤32K	$0.216	$0.861
		32K<Token≤128K	$0.323	$1.291
		128K<Token≤200K	$0.538	$2.151

Text generation - third-party models

DeepSeek

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
deepseek-v4-pro context caching discount	International	$2.400	$4.800	1 million tokens
deepseek-v4-flash context caching discount	International	$0.200	$0.400	1 million tokens
deepseek-v3.2 context caching discount	International	$0.57	$1.71	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
deepseek-v4-pro context caching discount	Chinese mainland	$1.65	$3.301	No free quota
deepseek-v4-flash context caching discount	Chinese mainland	$0.138	$0.275	No free quota
deepseek-v3.2 context caching discount	Chinese mainland	$0.287	$0.431	No free quota
deepseek-v3.2-exp	Chinese mainland	$0.287	$0.431	No free quota
deepseek-v3.1	Chinese mainland	$0.574	$1.721	No free quota
deepseek-r1	Chinese mainland	$0.574	$2.294	No free quota
deepseek-r1-0528	Chinese mainland	$0.574	$2.294	No free quota
deepseek-v3	Chinese mainland	$0.287	$1.147	No free quota
deepseek-r1-distill-qwen-1.5b	Chinese mainland	Limited-time free
deepseek-r1-distill-qwen-7b	Chinese mainland	$0.072	$0.144	No free quota
deepseek-r1-distill-qwen-14b	Chinese mainland	$0.144	$0.431	No free quota
deepseek-r1-distill-qwen-32b	Chinese mainland	$0.287	$0.861	No free quota
deepseek-r1-distill-llama-8b	Chinese mainland	Discontinued This model has been discontinued. We recommend using Deep thinking, DeepSeek, Kimi - Alibaba Cloud as alternative models.
deepseek-r1-distill-llama-70b	Chinese mainland	Limited-time free

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

deepseek-v4-pro

context caching discount

Global

$1.65

$3.3

deepseek-v4-flash

context caching discount

Global

$0.14

$0.28

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

deepseek-v4-pro

context caching discount

Global

$1.65

$3.3

deepseek-v4-flash

context caching discount

Global

$0.14

$0.28

Japan (Tokyo)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
deepseek-v4-pro Context Cache context caching discount	Global	$1.65	$3.3
deepseek-v4-flash Context Cache context caching discount	Global	$0.14	$0.28
deepseek-v4-pro Context Cache context caching discount	Japan	$2.400	$4.800
deepseek-v4-flash Context Cache context caching discount	Japan	$0.200	$0.400

Kimi

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
kimi-k2.7-code	Chinese mainland	$0.894	$3.713	$3.7131
kimi-k2.6	Chinese mainland	$0.8939	$3.7131	No free quota
kimi-k2.5	Chinese mainland	$0.574	$3.011	No free quota
kimi-k2-thinking	Chinese mainland	$0.574	$2.294	No free quota
Moonshot-Kimi-K2-Instruct	Chinese mainland	$0.574	$2.294	No free quota

Germany (Frankfurt)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
kimi-k2.7-code	Global	$0.894	$3.713
kimi-k2.5	Global	$0.574	$3.011

US (Virginia)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
kimi-k2.7-code	Global	$0.894	$3.713
kimi-k2.5	Global	$0.574	$3.011

Japan (Tokyo)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

kimi-k2.5

Context Cache context caching discount

Global

$0.574

$3.011

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
kimi-k2.7-code	International	$0.95	$4

China(Hong Kong)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
kimi-k2.7-code	Global	$0.894	$3.713

MiniMax

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

MiniMax-M2.5

Chinese mainland

Thinking mode only

$0.304

$1.213

GLM

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought and answer	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
glm-5.2	International	Non-Thinking and Thinking modes	flat-rate pricing	$1.400	$4.400	None
glm-5.2-fast-preview	International	Non-Thinking and Thinking modes	flat-rate pricing	$2.800	$8.800	None
glm-5.1	International	Non-Thinking and Thinking modes	0<Token≤200K	$1.400	$4.400	1 million tokens

China (Beijing)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought and answer
glm-5.2	Chinese mainland	Non-Thinking and Thinking modes	flat-rate pricing	$1.100	$3.851
glm-5.2-fast-preview	Chinese mainland	Non-Thinking and Thinking modes	flat-rate pricing	$2.200	$7.702
glm-5.1	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.825	$3.301
glm-5.1	Chinese mainland	Non-Thinking and Thinking modes	32K<Token≤200K	$1.100	$3.851
glm-5	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.573	$2.58
glm-5	Chinese mainland	Non-Thinking and Thinking modes	32K<Token≤166K	$0.860	$3.154
glm-4.7	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.431	$2.007
glm-4.7	Chinese mainland	Non-Thinking and Thinking modes	32K<Token≤166K	$0.574	$2.294
glm-4.6	Chinese mainland	Non-Thinking and Thinking modes	0<Token≤32K	$0.431	$2.007
glm-4.6	Chinese mainland	Non-Thinking and Thinking modes	32K<Token≤166K	$0.574	$2.294

Germany (Frankfurt)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought and answer
glm-5.2	Global	Non-Thinking and Thinking modes	flat-rate pricing	$1.100	$3.851
glm-5.1	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.825	$3.301
glm-5.1	Global	Non-Thinking and Thinking modes	32K<Token≤200K	$1.100	$3.851

US (Virginia)

Model ID	Deployment scope	Mode	Input tokens per request	Input price (per 1 million tokens)	Output price (per 1 million tokens) Chain of thought and answer
glm-5.2	Global	Non-Thinking and Thinking modes	flat-rate pricing	$1.100	$3.851
glm-5.2-us	US	Non-Thinking and Thinking modes	flat-rate pricing	$1.400	$4.4
glm-5.1	Global	Non-Thinking and Thinking modes	0<Token≤32K	$0.825	$3.301
glm-5.1	Global	Non-Thinking and Thinking modes	32K<Token≤200K	$1.100	$3.851

Japan (Tokyo)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

glm-5.1

Context Cache context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.825

$3.301

32K<Token≤200K

$1.100

$3.851

China (Hong Kong)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

glm-5.2

Global

Non-Thinking and Thinking modes

flat-rate pricing

$1.100

$3.851

Image generation

You are not charged for input. You are charged for output based on the number of successfully generated images.

Formula: Cost = Image unit price × Number of images generated.

Notes:

Cost does not depend on image resolution or aspect ratio.
Failed requests incur no cost and do not consume your free quota.

Billing example: Some images fail to generate

Assume the image unit price is $0.10 per image. If you call the API to generate four images but only three image URLs return successfully, the system charges only for the three successfully generated images.

Number billed: 3 images.
Cost calculation: 0.1 × 3 = $0.3.

Qwen Text-to-Image

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen-image-2.0-pro Currently equivalent to qwen-image-2.0-pro-2026-04-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-06-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-04-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-03-03	International	$0.075/image	100 images
qwen-image-2.0 Currently equivalent to qwen-image-2.0-2026-03-03	International	$0.035/image	100 images
qwen-image-2.0-2026-03-03	International	$0.035/image	100 images
qwen-image-max Currently equivalent to qwen-image-max-2025-12-30	International	$0.075/image	100 images
qwen-image-max-2025-12-30	International	$0.075/image	100 images
qwen-image-plus Currently equivalent to qwen-image	International	$0.03/image	100 images
qwen-image-plus-2026-01-09	International	$0.03/image	100 images
qwen-image	International	$0.035/image	100 images

China (Beijing)

Model ID	Deployment scope	Output price
qwen-image-2.0-pro Currently equivalent to qwen-image-2.0-pro-2026-04-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-06-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-04-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-03-03	Chinese mainland	$0.071676/image
qwen-image-2.0 Currently equivalent to qwen-image-2.0-2026-03-03	Chinese mainland	$0.028671/image
qwen-image-2.0-2026-03-03	Chinese mainland	$0.028671/image
qwen-image-max Currently equivalent to qwen-image-max-2025-12-30	Chinese mainland	$0.071677/image
qwen-image-max-2025-12-30	Chinese mainland	$0.071677/image
qwen-image-plus Currently equivalent to qwen-image	Chinese mainland	$0.028671/image
qwen-image-plus-2026-01-09	Chinese mainland	$0.028671/image
qwen-image	Chinese mainland	$0.035/image

Qwen Image Editing

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen-image-2.0-pro Currently equivalent to qwen-image-2.0-pro-2026-04-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-06-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-04-22	International	$0.075/image	100 images
qwen-image-2.0-pro-2026-03-03	International	$0.075/image	100 images
qwen-image-2.0 Currently equivalent to qwen-image-2.0-2026-03-03	International	$0.035/image	100 images
qwen-image-2.0-2026-03-03	International	$0.035/image	100 images
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16	International	$0.075/image	100 images
qwen-image-edit-max-2026-01-16	International	$0.075/image	100 images
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30	International	$0.03/image	100 images
qwen-image-edit-plus-2025-12-15	International	$0.03/image	100 images
qwen-image-edit-plus-2025-10-30	International	$0.03/image	100 images
qwen-image-edit	International	$0.045/image	100 images

China (Beijing)

Model ID	Deployment scope	Output price
qwen-image-2.0-pro Currently equivalent to qwen-image-2.0-pro-2026-04-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-06-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-04-22	Chinese mainland	$0.071676/image
qwen-image-2.0-pro-2026-03-03	Chinese mainland	$0.071676/image
qwen-image-2.0 Currently equivalent to qwen-image-2.0-2026-03-03	Chinese mainland	$0.028671/image
qwen-image-2.0-2026-03-03	Chinese mainland	$0.028671/image
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16	Chinese mainland	$0.071677/image
qwen-image-edit-max-2026-01-16	Chinese mainland	$0.071677/image
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30	Chinese mainland	$0.028671/image
qwen-image-edit-plus-2025-12-15	Chinese mainland	$0.028671/image
qwen-image-edit-plus-2025-10-30	Chinese mainland	$0.028671/image
qwen-image-edit	Chinese mainland	$0.043/image

Qwen Image Translation

Only output is billed. For pricing rules, see Image generation.

China (Beijing)

Model ID	Deployment scope	Output price	Free quota(Note)
qwen-mt-image	Chinese mainland	$0.000431/image	No free quota

Qwen-Text-to-Image-Z-Image

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

z-image-turbo

International

Prompt rewriting disabled (prompt_extend=false): $0.015/image

Prompt rewriting enabled (prompt_extend=true): $0.03/image

100 images

China (Beijing)

Model ID

Deployment scope

Output price

z-image-turbo

Chinese mainland

Prompt rewriting disabled (prompt_extend=false): $0.01434/image

Prompt rewriting enabled (prompt_extend=true): $0.02868/image

Wanx Text-to-Image

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.6-t2i	International	$0.03/image	50 images
wan2.5-t2i-preview	International	$0.03/image	50 images
wan2.2-t2i-plus	International	$0.05/image	100 images
wan2.2-t2i-flash	International	$0.025/image	100 images
wan2.1-t2i-plus	International	$0.05/image	200 images
wan2.1-t2i-turbo	International	$0.025/image	200 images

China (Beijing)

Model ID	Deployment scope	Output price
wan2.6-t2i	Chinese mainland	$0.028671/image
wan2.5-t2i-preview	Chinese mainland	$0.028671/image
wan2.2-t2i-plus	Chinese mainland	$0.020070/image
wan2.2-t2i-flash	Chinese mainland	$0.028671/image
wanx2.1-t2i-plus	Chinese mainland	$0.028671/image
wanx2.1-t2i-turbo	Chinese mainland	$0.020070/image
wanx2.0-t2i-turbo	Chinese mainland	$0.005735/image

Germany (Frankfurt)

Model ID	Deployment scope	Output price
wan2.6-t2i	Global	$0.028671/image

US (Virginia)

Model ID	Deployment scope	Output price
wan2.6-t2i	Global	$0.028671/image

Wanx Image Generation and Editing

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.7-image-pro	International	$0.075/image	50 images
wan2.7-image	International	$0.03/image	50 images
wan2.6-image	International	$0.03/image	50 images

China (Beijing)

Model ID	Deployment scope	Output price
wan2.7-image-pro	Chinese mainland	$0.068761/image
wan2.7-image	Chinese mainland	$0.028671/image
wan2.6-image	Chinese mainland	$0.028671/image

US (Virginia)

Model ID	Deployment scope	Output price
wan2.6-image	Global	$0.028671/image

Wanx General Image Editing

Only output is billed. For pricing rules, see Image generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

wan2.5-i2i-preview

International

$0.03/image

50 images

China (Beijing)

Model ID	Deployment scope	Output price
wan2.5-i2i-preview	Chinese mainland	$0.028671/image
wanx2.1-imageedit	Chinese mainland	$0.020070/image

AIVirtual Try-on - OutfitAnyone

aitryon-plus: Input is free while output is billed. For pricing rules, see Image generation.
aitryon-parsing-v1: Input is billed while output is free. Billed by the number of input images. Failed requests are not billed.

China (Beijing)

Model ID	Deployment scope	Unit price	Free quota(Note)
aitryon-plus	Chinese mainland	$0.071677/image	No free quota
aitryon-parsing-v1	Chinese mainland	$0.000574/image	No free quota

Video generation

You are not charged for input. You are charged for output based on the total duration of successfully generated videos (in seconds).

Formula: Cost = Video unit price × Video duration (seconds).

Notes:

Some models charge by output video resolution. Prices differ for resolutions such as 480P, 720P, and 1080P.
Some models charge by output video edition. Prices differ for editions such as Standard Edition and Professional Edition.
Some models charge by output video aspect ratio. Prices differ for aspect ratios such as 1:1 and 3:4.
Some models use a flat rate, regardless of resolution, edition, or aspect ratio.
Failed requests incur no cost and do not consume your free quota.

HappyHorse-Text-to-video

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
happyhorse-1.1-t2v	International	720P	List price $0.14/second Limited-time 40% off	10 seconds
		1080P	List price $0.18/second Limited-time 40% off
happyhorse-1.0-t2v	International	720P	List price $0.14/second Limited-time 20% off	10 seconds
		1080P	List price $0.24/second Limited-time 20% off

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-t2v	Chinese mainland	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-t2v	Chinese mainland	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-t2v	Chinese mainland	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-t2v	Chinese mainland	1080P	List price $0.220034/second Limited-time 20% off

Germany (Frankfurt)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-t2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-t2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-t2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-t2v	Global	1080P	List price $0.220034/second Limited-time 20% off

US (Virginia)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-t2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-t2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-t2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-t2v	Global	1080P	List price $0.220034/second Limited-time 20% off

Hong Kong (China)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-t2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-t2v	Global	1080P	List price $0.165026/second Limited-time 40% off

HappyHorse-Image-to-video - first frame

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
happyhorse-1.1-i2v	International	720P	List price $0.14/second Limited-time 40% off	10 seconds
		1080P	List price $0.18/second Limited-time 40% off
happyhorse-1.0-i2v	International	720P	List price $0.14/second Limited-time 20% off	10 seconds
		1080P	List price $0.24/second Limited-time 20% off

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-i2v	Chinese mainland	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-i2v	Chinese mainland	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-i2v	Chinese mainland	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-i2v	Chinese mainland	1080P	List price $0.220034/second Limited-time 20% off

Germany (Frankfurt)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-i2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-i2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-i2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-i2v	Global	1080P	List price $0.220034/second Limited-time 20% off

US (Virginia)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-i2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-i2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-i2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-i2v	Global	1080P	List price $0.220034/second Limited-time 20% off

Hong Kong (China)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-i2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-i2v	Global	1080P	List price $0.165026/second Limited-time 40% off

HappyHorse-Reference-to-video

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
happyhorse-1.1-r2v	International	720P	List price $0.14/second Limited-time 40% off	10 seconds
		1080P	List price $0.18/second Limited-time 40% off
happyhorse-1.0-r2v	International	720P	List price $0.14/second Limited-time 20% off	10 seconds
		1080P	List price $0.24/second Limited-time 20% off

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-r2v	Chinese mainland	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-r2v	Chinese mainland	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-r2v	Chinese mainland	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-r2v	Chinese mainland	1080P	List price $0.220034/second Limited-time 20% off

Germany (Frankfurt)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-r2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-r2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-r2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-r2v	Global	1080P	List price $0.220034/second Limited-time 20% off

US (Virginia)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-r2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-r2v	Global	1080P	List price $0.165026/second Limited-time 40% off
happyhorse-1.0-r2v	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-r2v	Global	1080P	List price $0.220034/second Limited-time 20% off

Hong Kong (China)

Model ID	Deployment scope	Output video resolution	Output price
happyhorse-1.1-r2v	Global	720P	List price $0.123769/second Limited-time 40% off
happyhorse-1.1-r2v	Global	1080P	List price $0.165026/second Limited-time 40% off

HappyHorse-Video editing

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

happyhorse-1.0-video-edit

International

720P

List price $0.14/second Limited-time 20% off

10 seconds

1080P

List price $0.24/second Limited-time 20% off

China (Beijing)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID	Deployment scope	Output video resolution	Input and output price
happyhorse-1.0-video-edit	Chinese mainland	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-video-edit	Chinese mainland	1080P	List price $0.220034/second Limited-time 20% off

Germany (Frankfurt)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID	Deployment scope	Output video resolution	Input and output price
happyhorse-1.0-video-edit	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-video-edit	Global	1080P	List price $0.220034/second Limited-time 20% off

US (Virginia)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID	Deployment scope	Output video resolution	Input and output price
happyhorse-1.0-video-edit	Global	720P	List price $0.123769/second Limited-time 20% off
happyhorse-1.0-video-edit	Global	1080P	List price $0.220034/second Limited-time 20% off

Wanx-Text-to-Video

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.7-t2v-2026-06-12	International	720P	$0.10/second	50 seconds
		1080P	$0.15/second
wan2.7-t2v-2026-04-25	International	720P	$0.10/second	50 seconds
		1080P	$0.15/second
wan2.7-t2v	International	720P	$0.10/second	50 seconds
		1080P	$0.15/second
wan2.6-t2v	International	720P	$0.10/second	50 seconds
		1080P	$0.15/second
wan2.5-t2v-preview	International	480P	$0.05/second	50 seconds
		720P	$0.10/second
		1080P	$0.15/second
wan2.2-t2v-plus	International	480P	$0.02/second	50 seconds
		1080P	$0.10/second
wan2.1-t2v-turbo	International	480P	$0.036/second	50 seconds
		720P	$0.036/second
wan2.1-t2v-plus	International	720P	$0.10/second	50 seconds

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price
wan2.7-t2v-2026-06-12	Chinese mainland	720P	$0.086012/second
wan2.7-t2v-2026-06-12	Chinese mainland	1080P	$0.143353/second
wan2.7-t2v-2026-04-25	Chinese mainland	720P	$0.086012/second
wan2.7-t2v-2026-04-25	Chinese mainland	1080P	$0.143353/second
wan2.7-t2v	Chinese mainland	720P	$0.086012/second
wan2.7-t2v	Chinese mainland	1080P	$0.143353/second
wan2.6-t2v	Chinese mainland	720P	$0.086012/second
wan2.6-t2v	Chinese mainland	1080P	$0.143353/second
wan2.5-t2v-preview	Chinese mainland	480P	$0.043006/second
		720P	$0.086012/second
		1080P	$0.143353/second
wan2.2-t2v-plus	Chinese mainland	480P	$0.02007/second
wan2.2-t2v-plus	Chinese mainland	1080P	$0.100347/second
wanx2.1-t2v-turbo	Chinese mainland	480P	$0.034405/second
wanx2.1-t2v-turbo	Chinese mainland	720P	$0.034405/second
wanx2.1-t2v-plus	Chinese mainland	720P	$0.100347/second

Germany (Frankfurt)

Model ID	Deployment scope	Output video resolution	Output price
wan2.6-t2v	Global	720P	$0.086012/second
wan2.6-t2v	Global	1080P	$0.143353/second

US (Virginia)

Model ID	Deployment scope	Output video resolution	Output price
wan2.6-t2v	Global	720P	$0.086012/second
wan2.6-t2v	Global	1080P	$0.143353/second
wan2.6-t2v-us	US	720P	$0.1/second
wan2.6-t2v-us	US	1080P	$0.15/second

Wanx-Image-to-Video

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video type	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.7-i2v-2026-04-25	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second
wan2.7-i2v	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second

China (Beijing)

Model ID	Deployment scope	Output video type	Output video resolution	Output price
wan2.7-i2v-2026-04-25	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second
wan2.7-i2v	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second

Wanx-Image-to-Video-First-Frame

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video type	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.6-i2v-flash	International	Audio video `audio=true`	720P	$0.05/second	50 seconds
			1080P	$0.075/second
		Silent video `audio=false`	720P	$0.025/second
			1080P	$0.0375/second
wan2.6-i2v	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second
wan2.5-i2v-preview	International	Audio video	480P	$0.05/second	50 seconds
			720P	$0.10/second
			1080P	$0.15/second
wan2.2-i2v-flash	International	Silent video	480P	$0.015/second	50 seconds
			720P	$0.036/second
wan2.2-i2v-plus	International	Silent video	480P	$0.02/second	50 seconds
			1080P	$0.10/second
wan2.1-t2v-turbo	International	Silent video	480P	$0.036/second	50 seconds
			720P	$0.036/second
wan2.1-t2v-plus	International	Silent video	720P	$0.10/second	50 seconds

China (Beijing)

Model ID	Deployment scope	Output video type	Output video resolution	Output price
wan2.6-i2v-flash	Chinese mainland	Audio video `audio=true`	720P	$0.043006/second
			1080P	$0.071676/second
		Silent video `audio=false`	720P	$0.021503/second
			1080P	$0.035838/second
wan2.6-i2v	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second
wan2.5-i2v-preview	Chinese mainland	Audio video	480P	$0.043006/second
			720P	$0.086012/second
			1080P	$0.143353/second
wan2.2-i2v-plus	Chinese mainland	Silent video	480P	$0.02007/second
			1080P	$0.100347/second
wanx2.1-t2v-turbo	Chinese mainland	Silent video	480P	$0.034405/second
			720P	$0.034405/second
wanx2.1-t2v-plus	Chinese mainland	Silent video	720P	$0.100347/second

Germany (Frankfurt)

Model ID	Deployment scope	Output video type	Output video resolution	Output price
wan2.6-i2v	Global	Audio video	720P	$0.086012/second
wan2.6-i2v	Global	Audio video	1080P	$0.143353/second

US (Virginia)

Model ID	Deployment scope	Output video type	Output video resolution	Output price
wan2.6-i2v	Global	Audio video	720P	$0.086012/second
			1080P	$0.143353/second
wan2.6-i2v-us	US	Audio video	720P	$0.1/second
			1080P	$0.15/second

Wanx-Image-to-Video-First-Last-Frame

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.2-kf2v-flash	International	480P	$0.015/second	50 seconds
		720P	$0.036/second
		1080P	$0.07/second
wan2.1-kf2v-plus	International	720P	$0.10/second	50 seconds

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price
wan2.2-kf2v-flash	Chinese mainland	480P	$0.014335/second
		720P	$0.028671/second
		1080P	$0.068809/second
wanx2.1-kf2v-plus	Chinese mainland	720P	$0.100347/second

Wanx-Reference-to-Video

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Billing formula: billable duration = input video duration (up to 5 seconds) + output video duration.

The billable duration of the input video does not exceed 5 seconds. For calculation rules, see Billing and rate limiting.
The billable duration of the output video isduration (in seconds) of successfully generated videos.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Output video type	Output video resolution	Input and output price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
wan2.7-r2v-2026-06-12	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second
wan2.7-r2v	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second
wan2.6-r2v-flash	International	Audio video `audio=true`	720P	$0.05/second	50 seconds
			1080P	$0.075/second
		Silent video `audio=false`	720P	$0.025/second
			1080P	$0.0375/second
wan2.6-r2v	International	Audio video	720P	$0.10/second	50 seconds
			1080P	$0.15/second

China (Beijing)

Model ID	Deployment scope	Output video type	Output video resolution	Input and output price
wan2.7-r2v-2026-06-12	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second
wan2.7-r2v	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second
wan2.6-r2v-flash	Chinese mainland	Audio video `audio=true`	720P	$0.043006/second
			1080P	$0.071676/second
		Silent video `audio=false`	720P	$0.021503/second
			1080P	$0.035838/second
wan2.6-r2v	Chinese mainland	Audio video	720P	$0.086012/second
			1080P	$0.143353/second

Germany (Frankfurt)

Model ID	Deployment scope	Output video type	Output video resolution	Input and output price
wan2.6-r2v	Global	Audio video	720P	$0.086012/second
wan2.6-r2v	Global	Audio video	1080P	$0.143353/second

US (Virginia)

Model ID	Deployment scope	Output video type	Output video resolution	Input and output price
wan2.6-r2v	Global	Audio video	720P	$0.086012/second
wan2.6-r2v	Global	Audio video	1080P	$0.143353/second

Wanx-Video-Editing

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

wan2.7-videoedit

International

720P

$0.10/second

50 seconds

1080P

$0.15/second

Pricing rule: input is free. Output video is billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

wan2.1-vace-plus

International

720P

$0.10/second

50 seconds

China (Beijing)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID	Deployment scope	Output video resolution	Input and output price
wan2.7-videoedit	Chinese mainland	720P	$0.086012/second
wan2.7-videoedit	Chinese mainland	1080P	$0.143353/second

Pricing rule: input is free. Output video is billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID	Deployment scope	Output video resolution	Output price
wanx2.1-vace-plus	Chinese mainland	720P	$0.100347/second

Wanx-Digital Human

wan2.2-s2v-detect: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
wan2.2-s2v: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

wan2.2-s2v-detect

Chinese mainland

Input image: $0.000574/image

No free quota

wan2.2-s2v

Chinese mainland

Output video:

480P: $0.071677/second
720P: $0.129018/second

No free quota

Wanx-Image-to-Motion

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video mode

Output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

wan2.2-animate-move

International

Standard modewan-std

$0.12/second

50 seconds

Professional modewan-pro

$0.18/second

China (Beijing)

Model ID	Deployment scope	Output video mode	Output price
wan2.2-animate-move	Chinese mainland	Standard mode`wan-std`	$0.06/second
wan2.2-animate-move	Chinese mainland	Professional mode`wan-pro`	$0.09/second

Wanx-Video-Face-Swap

Only output is billed. For pricing rules, see Video generation.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video mode

Output price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

wan2.2-animate-mix

International

Standard modewan-std

$0.18/second

50 seconds

Professional modewan-pro

$0.26/second

China (Beijing)

Model ID	Deployment scope	Output video mode	Output price
wan2.2-animate-mix	Chinese mainland	Standard mode`wan-std`	$0.09/second
wan2.2-animate-mix	Chinese mainland	Professional mode`wan-pro`	$0.13/second

AnimateAnyone

animate-anyone-detect-gen2: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
animate-anyone-template-gen2: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.
animate-anyone-gen2: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.

China (Beijing)

Model ID	Deployment scope	Unit price	Free quota(Note)
animate-anyone-detect-gen2	Chinese mainland	Input image: $0.000574/image	No free quota
animate-anyone-template-gen2	Chinese mainland	Output video: $0.011469/second	No free quota
animate-anyone-gen2	Chinese mainland	Output video: $0.011469/second	No free quota

EMO

emo-detect-v1: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
emo-v1: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

emo-detect-v1

Chinese mainland

Input image: $0.000574/image

No free quota

emo-v1

Chinese mainland

Output video:

1:1landscape video: $0.011469/second
3:4landscape video: $0.022937/second

LivePortrait

liveportrait-detect: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
liveportrait: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.

China (Beijing)

Model ID	Deployment scope	Unit price	Free quota(Note)
liveportrait-detect	Chinese mainland	Input image: $0.000574/image	No free quota
liveportrait	Chinese mainland	Output video: $0.002868/second	No free quota

Emoji Sticker

emoji-detect-v1: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.
emoji-v1: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, see Video generation.

China (Beijing)

Model ID	Deployment scope	Unit price	Free quota(Note)
emoji-detect-v1	Chinese mainland	Input image: $0.000574/image	No free quota
emoji-v1	Chinese mainland	Output video: $0.011469/second	No free quota

VideoRetalk

Only output is billed. For pricing rules, see Video generation.

China (Beijing)

Model ID	Deployment scope	Output price	Free quota(Note)
videoretalk	Chinese mainland	$0.011469/second	No free quota

Video Style Repaint

Only output is billed. For pricing rules, see Video generation.

China (Beijing)

Model ID	Deployment scope	Output video resolution	Output price	Free quota(Note)
video-style-transform	Chinese mainland	540P	$0.028671/second	No free quota
video-style-transform	Chinese mainland	720P	$0.071677/second	No free quota

Music generation

Pricing rule: billed by the duration (in seconds) of output audio. Input is free.

China (Beijing)

Model ID	Deployment scope	Output price (per second)	Free quota(Note)
fun-music-preview	Chinese mainland	$0.000695	No free quota
fun-music-v1	Chinese mainland	$0.000275	No free quota

Speech synthesis (text-to-speech)

Qwen-Audio-TTS

Billing rules: Fees are charged based on the number of characters in the input text. Output is not billed.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Service deployment scope

Input unit price (per 10,000 characters)

Free quota (note)

^{Valid for 90 days after Alibaba Cloud Model Studio is activated}

qwen-audio-3.0-tts-plus

International

$0.2

10,000 characters

qwen-audio-3.0-tts-flash

International

$0.15

10,000 characters

China (Beijing)

Model ID	Service deployment scope	Input unit price (per 10,000 characters)
qwen-audio-3.0-tts-plus	Mainland China	$0.19253
qwen-audio-3.0-tts-flash	Mainland China	$0.137521

Qwen-TTS

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Qwen3-TTS-Instruct-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-instruct-flash

Currently equivalent to qwen3-tts-instruct-flash-2026-01-26

International

$0.115

110,000 characters

qwen3-tts-instruct-flash-2026-01-26

International

$0.115

110,000 characters

Qwen3-TTS-VD

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-vd-2026-01-26

International

$0.115

110,000 characters

Qwen3-TTS-VC

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-vc-2026-01-22

International

$0.115

110,000 characters

Qwen3-TTS-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-tts-flash Currently equivalent to qwen3-tts-flash-2025-11-27	International	$0.1	110,000 characters
qwen3-tts-flash-2025-11-27	International	$0.1	110,000 characters
qwen3-tts-flash-2025-09-18	International	$0.1	2025 (after November 13, 0:00 UTC+8): 10,000 characters

China (Beijing)

Qwen3-TTS-Instruct-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price (per 10,000 characters)

qwen3-tts-instruct-flash

Currently equivalent to qwen3-tts-instruct-flash-2026-01-26

Chinese mainland

$0.115

Free

qwen3-tts-instruct-flash-2026-01-26

Chinese mainland

$0.115

Free

Qwen3-TTS-VD

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price (per 10,000 characters)
qwen3-tts-vd-2026-01-26	Chinese mainland	$0.115	Free

Qwen3-TTS-VC

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price (per 10,000 characters)
qwen3-tts-vc-2026-01-22	Chinese mainland	$0.115	Free

Qwen3-TTS-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price (per 10,000 characters)
qwen3-tts-flash Currently equivalent to qwen3-tts-flash-2025-11-27	Chinese mainland	$0.114682	Free
qwen3-tts-flash-2025-11-27	Chinese mainland	$0.114682	Free
qwen3-tts-flash-2025-09-18	Chinese mainland	$0.114682	Free

Qwen-TTS

Pricing rule: billed by input tokens and output tokens.

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)
qwen-tts-flash	Chinese mainland	$0.23	$1.434
qwen-tts-latest	Chinese mainland	$0.23	$1.434
qwen-tts-2025-05-22	Chinese mainland	$0.23	$1.434
qwen-tts-2025-04-10	Chinese mainland	$0.23	$1.434

Qwen-TTS-Realtime

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-instruct-flash-realtime

Currently equivalent to qwen3-tts-instruct-flash-realtime-2026-01-22

International

$0.143

110,000 characters

qwen3-tts-instruct-flash-realtime-2026-01-22

International

$0.143

110,000 characters

Qwen3-TTS-VD-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-vd-realtime-2026-01-15

International

$0.143353

110,000 characters

qwen3-tts-vd-realtime-2025-12-16

International

$0.143353

110,000 characters

Qwen3-TTS-VC-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-tts-vc-realtime-2026-01-15

International

$0.13

110,000 characters

qwen3-tts-vc-realtime-2025-11-27

International

110,000 characters

Qwen3-TTS-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-tts-flash-realtime Currently equivalent to qwen3-tts-flash-realtime-2025-11-27	International	$0.13	2025 (after November 13, 0:00 UTC+8): 10,000 characters
qwen3-tts-flash-realtime-2025-11-27	International	$0.13	110,000 characters
qwen3-tts-flash-realtime-2025-09-18	International	$0.13	2025 (after November 13, 0:00 UTC+8): 10,000 characters

China (Beijing)

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

qwen3-tts-instruct-flash-realtime

Currently equivalent to qwen3-tts-instruct-flash-realtime-2026-01-22

Chinese mainland

$0.143

Free

qwen3-tts-instruct-flash-realtime-2026-01-22

Chinese mainland

$0.143

Free

Qwen3-TTS-VD-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price
qwen3-tts-vd-realtime-2026-01-15	Chinese mainland	$0.143353	Free
qwen3-tts-vd-realtime-2025-12-16	Chinese mainland	$0.143353	Free

Qwen3-TTS-VC-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price
qwen3-tts-vc-realtime-2026-01-15	Chinese mainland	$0.143353	Free
qwen3-tts-vc-realtime-2025-11-27	Chinese mainland	$0.143353	Free

Qwen3-TTS-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)	Output price
qwen3-tts-flash-realtime	Chinese mainland	$0.143353	Free
qwen3-tts-flash-realtime-2025-11-27	Chinese mainland	$0.143353	Free
qwen3-tts-flash-realtime-2025-09-18	Chinese mainland	$0.143353	Free

Qwen-TTS-Realtime

Pricing rule: billed by input tokens and output tokens.

Model ID	Deployment scope	Input price (per 1 million tokens)	Input price (per 1 million tokens)
qwen-tts-realtime	Chinese mainland	$0.345	$1.721
qwen-tts-realtime-latest	Chinese mainland	$0.345	$1.721
qwen-tts-realtime-2025-07-15	Chinese mainland	$0.345	$1.721

Qwen-TTS Voice cloning

Pricing rule: billed by the number of new voice clones created.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Price (per voice clone)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-voice-enrollment

International

$0.01

1,000 voices/account

China (Beijing)

Model ID	Deployment scope	Price (per voice clone)
qwen-voice-enrollment	Chinese mainland	$0.01

Qwen-TTS Voice design

Pricing rule: billed by the number of new voice clones created.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Price (per voice clone)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-voice-design

International

$0.2

10 voices/account

China (Beijing)

Model ID	Deployment scope	Price (per voice clone)
qwen-voice-design	Chinese mainland	$0.2

CosyVoice

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

cosyvoice-v3-plus

International

$0.26

10,000 characters

cosyvoice-v3-flash

International

$0.13

10,000 characters

China (Beijing)

Pricing rule: billed by the number of input text characters. Output is free.

Model ID	Deployment scope	Input price (per 10,000 characters)
cosyvoice-v3.5-plus	Chinese mainland	$0.22
cosyvoice-v3.5-flash	Chinese mainland	$0.116
cosyvoice-v3-plus	Chinese mainland	$0.286706
cosyvoice-v3-flash	Chinese mainland	$0.14335
cosyvoice-v2	Chinese mainland	$0.286706

Speech recognition (speech-to-text) and translation (speech-to-text in a specified language)

Qwen-LiveTranslate-Flash-Realtime

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Input: audio	Input: image	Output: text	Output: audio
qwen3.5-livetranslate-flash-realtime	International	$7.5	$0.55	$20	$30	1 million tokens
qwen3.5-livetranslate-flash-realtime-2026-05-19	International	$7.5	$0.55	$20	$30	1 million tokens
qwen3-livetranslate-flash-realtime Currently equivalent to qwen3-livetranslate-flash-realtime-2025-09-22	International	$10	$1.3	$10	$38	1 million tokens
qwen3-livetranslate-flash-realtime-2025-09-22	International	$10	$1.3	$10	$38	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)
Model ID	Deployment scope	Input: audio	Input: image	Output: text	Output: audio
qwen3.5-livetranslate-flash-realtime	Chinese mainland	$5.501	$0.454	$13.752	$22.003
qwen3.5-livetranslate-flash-realtime-2026-05-19	Chinese mainland	$5.501	$0.454	$13.752	$22.003
qwen3-livetranslate-flash-realtime Currently equivalent to qwen3-livetranslate-flash-realtime-2025-09-22	Chinese mainland	$9.175	$1.147	$9.175	$34.405
qwen3-livetranslate-flash-realtime-2025-09-22	Chinese mainland	$9.175	$1.147	$9.175	$34.405

Qwen-LiveTranslate-Flash

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, see Billing.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)		Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
Model ID	Deployment scope	Input: audio	Input: image	Output: text	Output: audio
qwen3-livetranslate-flash	International	$1.577	$0.631	$1.577	$6.308	1 million tokens
qwen3-livetranslate-flash-2025-12-01	International	$1.577	$0.631	$1.577	$6.308	1 million tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)		Output price (per 1 million tokens)
Model ID	Deployment scope	Input: audio	Input: image	Output: text	Output: audio
qwen3-livetranslate-flash	Chinese mainland	$1.434	$0.573	$1.434	$5.734
qwen3-livetranslate-flash-2025-12-01	Chinese mainland	$1.434	$0.573	$1.434	$5.734

Qwen-ASR

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-asr-flash-filetrans	International	$0.000035/second	36,000 seconds (10 hours)
qwen3-asr-flash-filetrans-2025-11-17	International	$0.000035/second	36,000 seconds (10 hours)
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08	International	$0.000035/second	36,000 seconds (10 hours)
qwen3-asr-flash-2026-02-10	International	$0.000035/second	36,000 seconds (10 hours)
qwen3-asr-flash-2025-09-08	International	$0.000035/second	36,000 seconds (10 hours)

China (Beijing)

Model ID	Deployment scope	Input price
qwen3-asr-flash-filetrans	Chinese mainland	$0.000032/second
qwen3-asr-flash-filetrans-2025-11-17	Chinese mainland	$0.000032/second
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08	Chinese mainland	$0.000032/second
qwen3-asr-flash-2026-02-10	Chinese mainland	$0.000032/second
qwen3-asr-flash-2025-09-08	Chinese mainland	$0.000032/second

US (Virginia)

Model ID	Deployment scope	Input price
qwen3-asr-flash-us	US	$0.000035/second
qwen3-asr-flash-2025-09-08-us	US	$0.000035/second

Qwen-ASR-Realtime

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen3-asr-flash-realtime Currently equivalent to qwen3-asr-flash-realtime-2025-10-27	International	$0.000090/second	36,000 seconds (10 hours)
qwen3-asr-flash-realtime-2026-02-10	International	$0.000090/second	36,000 seconds (10 hours)
qwen3-asr-flash-realtime-2025-10-27	International	$0.000090/second	36,000 seconds (10 hours)

China (Beijing)

Model ID	Deployment scope	Input price
qwen3-asr-flash-realtime Currently equivalent to qwen3-asr-flash-realtime-2025-10-27	Chinese mainland	$0.000047/second
qwen3-asr-flash-realtime-2026-02-10	Chinese mainland	$0.000047/second
qwen3-asr-flash-realtime-2025-10-27	Chinese mainland	$0.000047/second

Fun-ASR

Audio file recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
fun-asr Currently equivalent to fun-asr-2025-11-07	International	$0.000035/second	36,000 seconds (10 hours)
fun-asr-2025-11-07	International	$0.000035/second	36,000 seconds (10 hours)
fun-asr-2025-08-25	International	$0.000035/second	36,000 seconds (10 hours)
fun-asr-mtl	International	$0.000035/second	36,000 seconds (10 hours)
fun-asr-mtl-2025-08-25	International	$0.000035/second	36,000 seconds (10 hours)
fun-asr-flash-2026-06-15	International	$0.000035/second	36,000 seconds (10 hours)

China (Beijing)

Model ID	Deployment scope	Input price
fun-asr Currently equivalent to fun-asr-2025-11-07	Chinese mainland	$0.000032/second
fun-asr-2025-11-07	Chinese mainland	$0.000032/second
fun-asr-2025-08-25	Chinese mainland	$0.000032/second
fun-asr-mtl	Chinese mainland	$0.000032/second
fun-asr-mtl-2025-08-25	Chinese mainland	$0.000032/second
fun-asr-flash-2026-06-15	Chinese mainland	$0.00003/second

Real-time speech recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Singapore

Model ID

Deployment scope

Input price

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

fun-asr-realtime

International

$0.00009/second

36,000 seconds (10 hours)

fun-asr-realtime-2025-11-07

International

$0.00009/second

36,000 seconds (10 hours)

China (Beijing)

Model ID	Deployment scope	Input price
fun-asr-realtime	Chinese mainland	$0.000047/second
fun-asr-realtime-2026-02-28	Chinese mainland	$0.000047/second
fun-asr-realtime-2025-11-07	Chinese mainland	$0.000047/second
fun-asr-realtime-2025-09-15	Chinese mainland	$0.000047/second
fun-asr-mtl-realtime	Chinese mainland	$0.000047/second
fun-asr-mtl-realtime-2025-12-10	Chinese mainland	$0.000047/second
fun-asr-flash-8k-realtime Currently equivalent to fun-asr-flash-8k-realtime-2026-01-28	Chinese mainland	$0.000032/second
fun-asr-flash-8k-realtime-2026-01-28	Chinese mainland	$0.000032/second

Paraformer

Audio file recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

China (Beijing)

Model ID	Deployment scope	Input price
paraformer-v2	Chinese mainland	$0.000012/second
paraformer-8k-v2	Chinese mainland	$0.000012/second

Real-time speech recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

China (Beijing)

Model ID	Deployment scope	Input price	Free quota(Note)
paraformer-realtime-v2	Chinese mainland	$0.000035/second	No free quota
paraformer-realtime-8k-v2	Chinese mainland	$0.000035/second	No free quota

Voice Chat

Real-time Voice Chat

Real-time voice chat models support both text and audio input and output, billed separately by input tokens and output tokens. Audio tokens are calculated based on duration: Total tokens = Audio duration (seconds) × 12.5. Durations less than 1 second are rounded up to 1 second.

In multi-turn conversations, similar to text-based LLMs, the model maintains a complete conversation context to ensure coherent dialogue. Historical conversation content is processed and billed as input for subsequent turns, so the input token count increases progressively with each turn. The specific billing rules for each content type are as follows:

User input audio and text: Counted as context and billed as input in each subsequent turn. Audio is billed as audio tokens, and text is billed as text tokens.
User-configured instructions: Billed as text tokens once per turn.
Model output text: Counted as context using text tokens and billed as input in each subsequent turn.
Model output audio: Billed as audio tokens only once at output time and not counted as context.

As the number of conversation turns increases, the accumulated context tokens grow progressively. We recommend controlling the number of turns in a single session or starting a new session at appropriate times to optimize costs.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID	Deployment region	Input price (per million tokens)		Output price (per million tokens)
Model ID	Deployment region	Text	Audio	Text	Audio
qwen-audio-3.0-realtime-plus	China (mainland)	$0.688	$5.501	$5.501	$20.628
qwen-audio-3.0-realtime-flash	China (mainland)	$0.413	$4.126	$4.126	$13.752

Text embedding

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

text-embedding-v4

International

$0.07

1 million tokens

text-embedding-v3

International

$0.07

500,000 tokens

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)
text-embedding-v4	Chinese mainland	$0.072

Hong Kong (China)

Model ID	Deployment scope	Input price (per 1 million tokens)
text-embedding-v4	Hong Kong (China)	$0.07

Multimodal embedding

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per million input tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

tongyi-embedding-vision-plus

International

$0.09

1 million tokens

tongyi-embedding-vision-flash

International

Image/video: $0.03

Text: $0.09

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-vl-embedding

Chinese mainland

Image/video: $0.258

Text: $0.1

1 million tokens

multimodal-embedding-v1

Chinese mainland

Free trial

No token quota limit

Text reranking

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen3-rerank

International

$0.1

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

qwen3-vl-rerank

Chinese mainland

Text input: $0.1

Image input: $0.258

gte-rerank-v2

Chinese mainland

Text input: $0.115

Industry models

Intent understanding

China (Beijing)

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note)
tongyi-intent-detect-v3	Chinese mainland	$0.058	$0.144	No free quota

Role play

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID	Deployment scope	Input price (per 1 million tokens)	Output price (per 1 million tokens)	Free quota(Note) ^{Valid for 90 days after you activate Alibaba Cloud Model Studio}
qwen-plus-character Session Cache discount	International	$0.5	$1.4	1 million tokens
qwen-flash-character Session Cache discount	International	$0.05	$0.4	1 million tokens
qwen-plus-character-ja	International	$0.5	$1.4	1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

^{Valid for 90 days after you activate Alibaba Cloud Model Studio}

qwen-plus-character

Session Cache discount

Chinese mainland

$0.115

$0.287

1 million tokens

qwen-flash-character

Session Cache discount

Chinese mainland

$0.034

$0.203

Error codes

If a model call fails and returns an error message, see Error codes for resolution.