All Products
Search
Document Center

Alibaba Cloud Model Studio:Model inference pricing

Last Updated:Jun 22, 2026

Model API calls are billed on a pay-as-you-go basis by default.

Tiered pricing rules

Some Model Studio models use tiered pricing. The unit price is determined by the total number of input tokens in a single request. All tokens in the request are billed at the unit price of the corresponding tier.

For example, a model has two pricing tiers: 0 < tokens ≤ 32K and 32K < tokens ≤ 128K. If a request contains 100K input tokens, it falls into the second tier (32K < 100K ≤ 128K), and all tokens are billed at the unit price of the second tier.

Text generation - Qwen

Qwen-Max

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
context caching discount

International

Non-Thinking and Thinking modes

0<Token≤1M

$2.5

$7.5

1 million tokens

qwen3.7-max-2026-06-08

context caching discount

International

Non-Thinking and Thinking modes

0<Token≤1M

$2.5

$7.5

1 million tokens

qwen3.7-max-2026-05-20

context caching discount

International

Non-Thinking and Thinking modes

0<Token≤1M

$2.5

$7.5

1 million tokens

qwen3.7-max-preview

Currently equivalent to qwen3.7-max-2026-05-17

International

Thinking mode only

0<Token≤1M

$2.5

$7.5

1 million tokens

qwen3.7-max-2026-05-17

International

Thinking mode only

0<Token≤1M

$2.5

$7.5

1 million tokens

qwen3.6-max-preview

context caching discount

International

Non-Thinking and Thinking modes

0<Token≤128K

$1.3

$7.8

1 million tokens

128K<Token≤256K

$2

$12

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
context caching discount

International

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

1 million tokens

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-2026-01-23

International

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

1 million tokens

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-2025-09-23

International

Non-Thinking mode only

0<Token≤32K

$1.2

$6

1 million tokens

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-preview

context caching discount

International

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

1 million tokens

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

More models

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-max

Currently equivalent to qwen-max-2025-01-25
50% batch inference discount

International

Non-Thinking mode only

No tiered pricing

$1.6

$6.4

1 million tokens

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
50% batch inference discount
context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-06-08

context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-05-20

context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.6-max-preview

context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤128K

$1.238

$7.426

128K<Token≤256K

$2.063

$12.377

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
50% batch inference discount
context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.359

$1.434

32K<Token≤128K

$0.574

$2.294

128K<Token≤256K

$1.004

$4.014

qwen3-max-2026-01-23

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.359

$1.434

32K<Token≤128K

$0.574

$2.294

128K<Token≤256K

$1.004

$4.014

qwen3-max-2025-09-23

Chinese mainland

Non-Thinking mode only

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

qwen3-max-preview

context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

More models

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-max

Currently equivalent to qwen-max-2024-09-19

Chinese mainland

Non-Thinking mode only

No tiered pricing

$0.345

$1.377

Hong Kong (China)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-06-08

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-05-20

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
context caching discount

Hong Kong (China)

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-2026-01-23

Hong Kong (China)

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-06-08

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-05-20

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
context caching discount

Global

Non-Thinking mode only

0<Token≤32K

$0.359

$1.434

32K<Token≤128K

$0.574

$2.294

128K<Token≤256K

$1.004

$4.014

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
50% batch inference discount
context caching discount

EU

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-2026-01-23

EU

Non-Thinking and Thinking modes

0<Token≤32K

$1.2

$6

32K<Token≤128K

$2.4

$12

128K<Token≤256K

$3

$15

qwen3-max-2025-09-23

Global

Non-Thinking mode only

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

qwen3-max-preview

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-06-08

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-05-20

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3-max

Currently equivalent to qwen3-max-2026-01-23
context caching discount

Global

Non-Thinking mode only

0<Token≤32K

$0.359

$1.434

32K<Token≤128K

$0.574

$2.294

128K<Token≤256K

$1.004

$4.014

qwen3-max-2025-09-23

Global

Non-Thinking mode only

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

qwen3-max-preview

context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.434

$5.735

128K<Token≤256K

$2.151

$8.602

Japan (Tokyo)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3.7-max

Currently equivalent to qwen3.7-max-2026-05-20
Context cachecontext caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

qwen3.7-max-2026-05-20

Context cachecontext caching discount

Global

Non-Thinking and Thinking modes

0<Token≤1M

$1.65

$4.951

Qwen-Plus

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discount

International

0<Token≤256K

$0.4

$1.6

$1.6

1 million tokens

256K<Token≤1M

$1.2

$4.8

$4.8

qwen3.7-plus-2026-05-26

context caching discount

International

0<Token≤256K

$0.4

$1.6

$1.6

1 million tokens

256K<Token≤1M

$1.2

$4.8

$4.8

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

International

0<Token≤256K

$0.5

$3

$3

1 million tokens

256K<Token≤1M

$2

$6

$6

qwen3.6-plus-2026-04-02

International

0<Token≤256K

$0.5

$3

$3

1 million tokens

256K<Token≤1M

$2

$6

$6

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

International

0<Token≤256K

$0.4

$2.4

$2.4

1 million tokens

256K<Token≤1M

$0.5

$3

$3

qwen3.5-plus-2026-04-20

International

0<Token≤256K

$0.4

$2.4

$2.4

1 million tokens

256K<Token≤1M

$0.5

$3

$3

qwen3.5-plus-2026-02-15

International

0<Token≤256K

$0.4

$2.4

$2.4

1 million tokens

256K<Token≤1M

$0.5

$3

$3

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

International

0<Token≤256K

$0.4

$1.2

$4

1 million tokens

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-latest

International

0<Token≤256K

$0.4

$1.2

$4

1 million tokens

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-12-01

International

0<Token≤256K

$0.4

$1.2

$4

1 million tokens

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-09-11

International

0<Token≤256K

$0.4

$1.2

$4

1 million tokens

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-07-28

International

0<Token≤256K

$0.4

$1.2

$4

1 million tokens

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-07-14

International

No tiered pricing

$0.4

$1.2

$4

1 million tokens

qwen-plus-2025-04-28

International

No tiered pricing

$0.4

$1.2

$4

1 million tokens

qwen-plus-2025-01-25

International

No tiered pricing

$0.4

$1.2

-

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discount

Chinese mainland

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.7-plus-2026-05-26

context caching discount

Chinese mainland

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Chinese mainland

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.6-plus-2026-04-02

Chinese mainland

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

Chinese mainland

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen3.5-plus-2026-04-20

Chinese mainland

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen3.5-plus-2026-02-15

Chinese mainland

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

Chinese mainland

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-latest

Chinese mainland

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-12-01

Chinese mainland

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-09-11

Chinese mainland

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-07-28

Chinese mainland

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-07-14

Chinese mainland

No tiered pricing

$0.115

$0.287

$1.147

qwen-plus-2025-04-28

Chinese mainland

No tiered pricing

$0.115

$0.287

$1.147

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-plus-2025-01-25

Chinese mainland

No tiered pricing

$0.115

$0.287

qwen-plus-2025-01-12

Chinese mainland

No tiered pricing

$0.115

$0.287

qwen-plus-2024-12-20

Chinese mainland

No tiered pricing

$0.115

$0.287

Hong Kong (China)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.7-plus-2026-05-26

context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

Hong Kong (China)

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-12-01

Hong Kong (China)

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.7-plus-2026-05-26

context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

Global

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen3.5-plus-2026-02-15

Global

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

EU

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-12-01

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-12-01

EU

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-09-11

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-07-28

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.7-plus-2026-05-26

context caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

Global

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen3.5-plus-2026-02-15

Global

0<Token≤128K

$0.115

$0.688

$0.688

128K<Token≤256K

$0.287

$1.72

$1.72

256K<Token≤1M

$0.573

$3.44

$3.44

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-us

US

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-12-01

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-12-01-us

US

0<Token≤256K

$0.4

$1.2

$4

256K<Token≤1M

$1.2

$3.6

$12

qwen-plus-2025-09-11

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

qwen-plus-2025-07-28

Global

0<Token≤128K

$0.115

$0.287

$1.147

128K<Token≤256K

$0.345

$2.868

$3.441

256K<Token≤1M

$0.689

$6.881

$9.175

Japan (Tokyo)

Model ID

Deployment scope

Input token range per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode(Chain of thought + answer)

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
Context cachecontext caching discount

Japan

0<Token≤256K

$0.4

$1.6

$1.6

256K<Token≤1M

$1.2

$4.8

$4.8

qwen3.7-plus-2026-05-26

Context cachecontext caching discount

Japan

0<Token≤256K

$0.4

$1.6

$1.6

256K<Token≤1M

$1.2

$4.8

$4.8

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
Context cachecontext caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.7-plus-2026-05-26

Context cachecontext caching discount

Global

0<Token≤256K

$0.276

$1.101

$1.101

256K<Token≤1M

$0.826

$3.301

$3.301

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02
Context cachecontext caching discount

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

qwen3.6-plus-2026-04-02

Global

0<Token≤256K

$0.276

$1.651

$1.651

256K<Token≤1M

$1.101

$6.602

$6.602

Qwen-Flash

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16
50% batch inference discount
context caching discount

International

0<Token≤256K

$0.25

$1.5

1 million tokens

256K<Token≤1M

$1

$4

qwen3.6-flash-2026-04-16

International

0<Token≤256K

$0.25

$1.5

1 million tokens

256K<Token≤1M

$1

$4

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23
50% batch inference discount
context caching discount

International

0<Token≤1M

$0.1

$0.4

1 million tokens

qwen3.5-flash-2026-02-23

International

0<Token≤1M

$0.1

$0.4

1 million tokens

qwen-flash

Currently equivalent to qwen-flash-2025-07-28
50% batch inference discount
context caching discount

International

0<Token≤256K

$0.05

$0.4

1 million tokens

256K<Token≤1M

$0.25

$2

qwen-flash-2025-07-28

International

0<Token≤256K

$0.05

$0.4

1 million tokens

256K<Token≤1M

$0.25

$2

China (Beijing)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16
50% batch inference discount
context caching discount

Chinese mainland

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.6-flash-2026-04-16

Chinese mainland

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23

Chinese mainland

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen3.5-flash-2026-02-23

Chinese mainland

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen-flash

Currently equivalent to qwen-flash-2025-07-28
context caching discount

Chinese mainland

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

qwen-flash-2025-07-28

Chinese mainland

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

Hong Kong (China)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23
context caching discount

Hong Kong (China)

0<Token≤1M

$0.1

$0.4

qwen3.5-flash-2026-02-23

Hong Kong (China)

0<Token≤1M

$0.1

$0.4

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23

Global

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23
context caching discount

EU

0<Token≤1M

$0.1

$0.4

qwen3.5-flash-2026-02-23

Global

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen3.5-flash-2026-02-23

EU

0<Token≤1M

$0.1

$0.4

qwen-flash

Currently equivalent to qwen-flash-2025-07-28
context caching discount

Global

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

qwen-flash-2025-07-28

Global

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23

Global

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen3.5-flash-2026-02-23

Global

0<Token≤128K

$0.029

$0.287

128K<Token≤256K

$0.115

$1.147

256K<Token≤1M

$0.172

$1.72

qwen-flash

Currently equivalent to qwen-flash-2025-07-28
context caching discount

Global

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

qwen-flash-us

US

0<Token≤256K

$0.05

$0.4

256K<Token≤1M

$0.25

$2

qwen-flash-2025-07-28

Global

0<Token≤128K

$0.022

$0.216

128K<Token≤256K

$0.087

$0.861

256K<Token≤1M

$0.173

$1.721

qwen-flash-2025-07-28-us

US

0<Token≤256K

$0.05

$0.4

256K<Token≤1M

$0.25

$2

Japan (Tokyo)

Model ID

Deployment scope

Input token range per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16
Context cachecontext caching discount

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

qwen3.6-flash-2026-04-16

Global

0<Token≤256K

$0.165

$0.99

256K<Token≤1M

$0.66

$3.961

Qwen-Turbo

Note

Qwen-Turbo will no longer be updated. We recommend switching to Qwen-Flash.

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen-turbo

Currently equivalent to qwen-turbo-2025-04-28
50% batch inference discount

International

$0.05

$0.2

$0.5

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen-turbo

Currently equivalent to qwen-turbo-2025-04-28

Chinese mainland

$0.044

$0.087

$0.431

QwQ

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwq-plus

Currently equivalent to qwq-plus-2025-03-05

International

$0.8

$2.4

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwq-plus

Currently equivalent to qwq-plus-2025-03-05

Chinese mainland

$0.230

$0.574

Qwen-Long

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

qwen-long-latest

International

$0.072

$0.287

No free quota

qwen-long-2025-01-25

International

$0.072

$0.287

No free quota

Qwen-Omni

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text/Image/video

Audio

Text

Multimodal input

Text + audio

Audio only billed

qwen3.5-omni-plus

Currently equivalent to qwen3.5-omni-plus-2026-03-15

International

$1.4

$11

$8.3

$44

1 million tokens

qwen3.5-omni-plus-2026-03-15

International

$1.4

$11

$8.3

$44

1 million tokens

qwen3.5-omni-flash

Currently equivalent to qwen3.5-omni-flash-2026-03-15

International

$0.4

$3

$2.2

$11.9

1 million tokens

qwen3.5-omni-flash-2026-03-15

International

$0.4

$3

$2.2

$11.9

1 million tokens

More models

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen3-omni-flash

Currently equivalent to qwen3-omni-flash-2025-12-01

International

Non-Thinking and Thinking modes

$0.43

$3.81

$0.78

$1.66

$3.06

$15.11

1 million tokens (regardless of modality)

qwen3-omni-flash-2025-12-01

International

Non-Thinking and Thinking modes

$0.43

$3.81

$0.78

$1.66

$3.06

$15.11

1 million tokens (regardless of modality)

qwen3-omni-flash-2025-09-15

International

Non-Thinking and Thinking modes

$0.43

$3.81

$0.78

$1.66

$3.06

$15.11

1 million tokens (regardless of modality)

qwen-omni-turbo

Currently equivalent to qwen-omni-turbo-2025-03-26

International

Non-Thinking mode

$0.07

$4.44

$0.21

$0.27

$0.63

$8.89

1 million tokens (regardless of modality)

qwen-omni-turbo-latest

International

Non-Thinking mode

$0.07

$4.44

$0.21

$0.27

$0.63

$8.89

1 million tokens (regardless of modality)

qwen-omni-turbo-2025-03-26

International

Non-Thinking mode

$0.07

$4.44

$0.21

$0.27

$0.63

$8.89

1 million tokens (regardless of modality)

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text/Image/video

Audio

Text

Multimodal input

Text + audio

Audio only billed

qwen3.5-omni-plus

Currently equivalent to qwen3.5-omni-plus-2026-03-15

Chinese mainland

$0.96

$7.29

$5.5

$29.29

qwen3.5-omni-plus-2026-03-15

Chinese mainland

$0.96

$7.29

$5.5

$29.29

qwen3.5-omni-flash

Currently equivalent to qwen3.5-omni-flash-2026-03-15

Chinese mainland

$0.3

$2.48

$1.83

$9.9

qwen3.5-omni-flash-2026-03-15

Chinese mainland

$0.3

$2.48

$1.83

$9.9

More models

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen3-omni-flash

Currently equivalent to qwen3-omni-flash-2025-12-01

Chinese mainland

Non-Thinking and Thinking modes

$0.258

$2.265

$0.473

$0.989

$1.821

$8.974

qwen3-omni-flash-2025-12-01

Chinese mainland

Non-Thinking and Thinking modes

$0.258

$2.265

$0.473

$0.989

$1.821

$8.974

qwen3-omni-flash-2025-09-15

Chinese mainland

Non-Thinking and Thinking modes

$0.258

$2.265

$0.473

$0.989

$1.821

$8.974

qwen-omni-turbo

Currently equivalent to qwen-omni-turbo-2025-03-26

Chinese mainland

Non-Thinking mode

$0.058

$3.584

$0.216

$0.230

$0.646

$7.168

qwen-omni-turbo-latest

Chinese mainland

Non-Thinking mode

$0.058

$3.584

$0.216

$0.230

$0.646

$7.168

qwen-omni-turbo-2025-03-26

Chinese mainland

Non-Thinking mode

$0.058

$3.584

$0.216

$0.230

$0.646

$7.168

qwen-omni-turbo-2025-01-19

Chinese mainland

Non-Thinking mode

$0.058

$3.584

$0.216

$0.230

$0.646

$7.168

Qwen-Omni-Realtime

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text/image

Audio

Text

Multimodal input

Text + audio

Audio only billed

qwen3.5-omni-plus-realtime

International

$2.1

$16.5

$12.4

$62

1 million tokens

qwen3.5-omni-plus-realtime-2026-03-15

International

$2.1

$16.5

$12.4

$62

1 million tokens

qwen3.5-omni-flash-realtime

International

$0.55

$4.5

$3.3

$17.7

1 million tokens

qwen3.5-omni-flash-realtime-2026-03-15

International

$0.55

$4.5

$3.3

$17.7

1 million tokens

More models

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen3-omni-flash-realtime

International

$0.52

$4.57

$0.94

$1.99

$3.67

$18.13

1 million tokens (regardless of modality)

qwen3-omni-flash-realtime-2025-12-01

International

$0.52

$4.57

$0.94

$1.99

$3.67

$18.13

1 million tokens (regardless of modality)

qwen3-omni-flash-realtime-2025-09-15

International

$0.52

$4.57

$0.94

$1.99

$3.67

$18.13

1 million tokens (regardless of modality)

qwen-omni-turbo-realtime

International

$0.270

$4.440

$0.840

$1.070

$2.520

$8.890

1 million tokens (regardless of modality)

qwen-omni-turbo-realtime-latest

International

$0.270

$4.440

$0.840

$1.070

$2.520

$8.890

1 million tokens (regardless of modality)

qwen-omni-turbo-realtime-2025-05-08

International

$0.270

$4.440

$0.840

$1.070

$2.520

$8.890

1 million tokens (regardless of modality)

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text/image

Audio

Text

Multimodal input

Text + audio

Audio only billed

qwen3.5-omni-plus-realtime

Chinese mainland

$1.38

$11

$8.25

$41.26

qwen3.5-omni-plus-realtime-2026-03-15

Chinese mainland

$1.38

$11

$8.25

$41.26

qwen3.5-omni-flash-realtime

Chinese mainland

$0.45

$3.71

$2.75

$14.71

qwen3.5-omni-flash-realtime-2026-03-15

Chinese mainland

$0.45

$3.71

$2.75

$14.71

More models

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text

Audio

Image

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen3-omni-flash-realtime

Chinese mainland

$0.315

$2.709

$0.559

$1.19

$2.179

$10.766

qwen3-omni-flash-realtime-2025-12-01

Chinese mainland

$0.315

$2.709

$0.559

$1.19

$2.179

$10.766

qwen3-omni-flash-realtime-2025-09-15

Chinese mainland

$0.315

$2.709

$0.559

$1.19

$2.179

$10.766

qwen-omni-turbo-realtime

Chinese mainland

$0.230

$3.584

$0.861

$0.918

$2.581

$7.168

qwen-omni-turbo-realtime-latest

Chinese mainland

$0.230

$3.584

$0.861

$0.918

$2.581

$7.168

qwen-omni-turbo-realtime-2025-05-08

Chinese mainland

$0.230

$3.584

$0.861

$0.918

$2.581

$7.168

QVQ

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qvq-max

Currently equivalent to qvq-max-2025-03-25

International

$1.2

$4.8

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qvq-max

Currently equivalent to qvq-max-2025-03-25

Chinese mainland

$1.147

$4.588

qvq-plus

Currently equivalent to qvq-plus-2025-05-15

Chinese mainland

$0.287

$0.717

Qwen-VL

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
context caching discount

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

1 million tokens

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

qwen3-vl-plus-2025-12-19

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

1 million tokens

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

qwen3-vl-plus-2025-09-23

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

1 million tokens

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2026-01-22
context caching discount

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

1 million tokens

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2026-01-22

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

1 million tokens

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2025-10-15

International

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

1 million tokens

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-vl-max

Currently equivalent to qwen-vl-max-2025-08-13
context caching discount

International

No tiered pricing

$0.8

$3.2

1 million tokens

qwen-vl-plus

Currently equivalent to qwen-vl-plus-2025-08-15
context caching discount

International

No tiered pricing

$0.21

$0.63

1 million tokens

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

qwen3-vl-plus-2025-12-19

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

qwen3-vl-plus-2025-09-23

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2026-01-22
context caching discount

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash-2026-01-22

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash-2025-10-15

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-max

Currently equivalent to qwen-vl-max-2025-08-13
context caching discount

Chinese mainland

No tiered pricing

$0.23

$0.574

qwen-vl-plus

Currently equivalent to qwen-vl-plus-2025-08-15
context caching discount

Chinese mainland

No tiered pricing

$0.115

$0.287

Hong Kong (China)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
context caching discount

Hong Kong (China)

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

qwen3-vl-plus-2025-12-19

Hong Kong (China)

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2025-10-15
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2026-01-22
context caching discount

EU

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2026-01-22

EU

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2025-10-15

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash-2025-10-15

EU

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

qwen3-vl-plus

context caching discount

EU

Non-Thinking and Thinking modes

0<Token≤32K

$0.2

$1.6

32K<Token≤128K

$0.3

$2.4

128K<Token≤256K

$0.6

$4.8

qwen3-vl-plus-2025-09-23

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2025-10-15
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash-us

context caching discount

US

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2026-01-22-us

US

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-flash-2025-10-15

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.022

$0.215

32K<Token≤128K

$0.043

$0.43

128K<Token≤256K

$0.086

$0.859

qwen3-vl-flash-2025-10-15-us

US

Non-Thinking and Thinking modes

0<Token≤32K

$0.05

$0.4

32K<Token≤128K

$0.075

$0.6

128K<Token≤256K

$0.12

$0.96

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
context caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

qwen3-vl-plus-2025-09-23

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.143

$1.434

32K<Token≤128K

$0.215

$2.15

128K<Token≤256K

$0.43

$4.301

Qwen-OCR

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-vl-ocr

International

$0.07

$0.16

1 million tokens

qwen-vl-ocr-2025-11-20

International

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3.5-ocr

Chinese mainland

$0.069

$0.275

qwen-vl-ocr

Chinese mainland

$0.043

$0.072

qwen-vl-ocr-latest

Chinese mainland

$0.043

$0.072

qwen-vl-ocr-2025-11-20

Chinese mainland

$0.043

$0.072

qwen-vl-ocr-2025-08-28

Chinese mainland

$0.717

$0.717

qwen-vl-ocr-2025-04-13

Chinese mainland

$0.717

$0.717

qwen-vl-ocr-2024-10-28

Chinese mainland

$0.717

$0.717

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-ocr

Global

$0.043

$0.072

qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-vl-ocr

Global

$0.043

$0.072

qwen-vl-ocr-2025-11-20

Global

$0.043

$0.072

Qwen Math

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

qwen-math-plus

Chinese mainland

$0.574

$1.721

No free quota

qwen-math-plus-latest

Chinese mainland

$0.574

$1.721

No free quota

qwen-math-plus-2024-09-19

Chinese mainland

$0.574

$1.721

No free quota

qwen-math-plus-2024-08-16

Chinese mainland

$0.574

$1.721

No free quota

qwen-math-turbo

Chinese mainland

$0.287

$0.861

No free quota

Qwen-Coder

You are charged for input tokens and output tokens.

If the model supports context cache, only input tokens receive a discount.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-coder-plus

Currently equivalent to qwen3-coder-plus-2025-09-23
context caching discount

International

0<Token≤32K

$1

$5

1 million tokens

32K<Token≤128K

$1.8

$9

128K<Token≤256K

$3

$15

256K<Token≤1M

$6

$60

qwen3-coder-plus-2025-09-23

International

0<Token≤32K

$1

$5

1 million tokens

32K<Token≤128K

$1.8

$9

128K<Token≤256K

$3

$15

256K<Token≤1M

$6

$60

qwen3-coder-plus-2025-07-22

International

0<Token≤32K

$1

$5

1 million tokens

32K<Token≤128K

$1.8

$9

128K<Token≤256K

$3

$15

256K<Token≤1M

$6

$60

qwen3-coder-flash

Currently equivalent to qwen3-coder-flash-2025-07-28

International

0<Token≤32K

$0.3

$1.5

1 million tokens

32K<Token≤128K

$0.5

$2.5

128K<Token≤256K

$0.8

$4

256K<Token≤1M

$1.6

$9.6

qwen3-coder-flash-2025-07-28

International

0<Token≤32K

$0.3

$1.5

1 million tokens

32K<Token≤128K

$0.5

$2.5

128K<Token≤256K

$0.8

$4

256K<Token≤1M

$1.6

$9.6

China (Beijing)

qwen3-coderseries models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-plus

Currently equivalent to qwen3-coder-plus-2025-09-23
context caching discount

Chinese mainland

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-09-23

Chinese mainland

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-07-22

Chinese mainland

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-flash

Currently equivalent to qwen3-coder-flash-2025-07-28

Chinese mainland

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

qwen3-coder-flash-2025-07-28

Chinese mainland

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

Legacy qwen-coder series models

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-coder-plus

Currently equivalent to qwen-coder-plus-2024-11-06

Chinese mainland

No tiered pricing

$0.502

$1.004

qwen-coder-turbo

Currently equivalent to qwen-coder-turbo-2024-09-19

Chinese mainland

No tiered pricing

$0.287

$0.861

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-plus

Currently equivalent to qwen3-coder-plus-2025-09-23
context caching discount

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-09-23

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-07-22

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-flash

Currently equivalent to qwen3-coder-flash-2025-07-28
context caching discount

Global

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

qwen3-coder-flash-2025-07-28

Global

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-plus

Currently equivalent to qwen3-coder-plus-2025-09-23
context caching discount

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-09-23

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-plus-2025-07-22

Global

0<Token≤32K

$0.574

$2.294

32K<Token≤128K

$0.861

$3.441

128K<Token≤256K

$1.434

$5.735

256K<Token≤1M

$2.868

$28.671

qwen3-coder-flash

Currently equivalent to qwen3-coder-flash-2025-07-28
context caching discount

Global

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

qwen3-coder-flash-2025-07-28

Global

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

256K<Token≤1M

$0.717

$3.584

Qwen Translation

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-mt-plus

International

$2.46

$7.37

1 million tokens

qwen-mt-flash

International

$0.16

$0.49

1 million tokens

qwen-mt-lite

International

$0.12

$0.36

1 million tokens

qwen-mt-turbo

International

$0.16

$0.49

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-mt-plus

Chinese mainland

$0.259

$0.775

qwen-mt-flash

Chinese mainland

$0.101

$0.280

qwen-mt-lite

Chinese mainland

$0.086

$0.229

qwen-mt-turbo

Chinese mainland

$0.101

$0.280

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-mt-plus

Global

$0.259

$0.775

qwen-mt-flash

Global

$0.101

$0.280

qwen-mt-lite

Global

$0.086

$0.229

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-mt-flash

Global

$0.101

$0.280

qwen-mt-lite

Global

$0.086

$0.229

qwen-mt-lite-us

US

$0.12

$0.36

qwen-mt-plus

Global

$0.259

$0.775

Qwen Data Mining

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

qwen-doc-turbo

Chinese mainland

$0.087

$0.144

No free quota

Qwen Deep Research

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

qwen-deep-research

Chinese mainland

$7.742

$23.367

None

Text generation - Qwen (open source)

Qwen3.6

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

International

0<Token≤256K

$0.375

$2.25

$2.25

1 million tokens

qwen3.6-27b

International

0<Token≤256K

$0.6

$3.6

$3.6

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Chinese mainland

0<Token≤256K

$0.248

$1.485

$1.485

qwen3.6-27b

Chinese mainland

0<Token≤256K

$0.412564

$2.475384

$2.475384

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Global

0<Token≤256K

$0.248

$1.485

$1.485

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Global

0<Token≤256K

$0.248

$1.485

$1.485

Qwen3.5

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.5-397b-a17b

International

0<Token≤256K

$0.6

$3.6

$3.6

1 million tokens

qwen3.5-122b-a10b

International

0<Token≤256K

$0.4

$3.2

$3.2

1 million tokens

qwen3.5-27b

International

0<Token≤256K

$0.3

$2.4

$2.4

1 million tokens

qwen3.5-35b-a3b

International

0<Token≤256K

$0.25

$2

$2

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.5-397b-a17b

Chinese mainland

0<Token≤128K

$0.172

$1.032

$1.032

128K<Token≤256K

$0.43

$2.58

$2.58

qwen3.5-122b-a10b

Chinese mainland

0<Token≤128K

$0.115

$0.917

$0.917

128K<Token≤256K

$0.287

$2.294

$2.294

qwen3.5-27b

Chinese mainland

0<Token≤128K

$0.086

$0.688

$0.688

128K<Token≤256K

$0.258

$2.064

$2.064

qwen3.5-35b-a3b

Chinese mainland

0<Token≤128K

$0.057

$0.459

$0.459

128K<Token≤256K

$0.229

$1.835

$1.835

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.5-397b-a17b

Global

0<Token≤128K

$0.172

$1.032

$1.032

128K<Token≤256K

$0.43

$2.58

$2.58

qwen3.5-122b-a10b

Global

0<Token≤128K

$0.115

$0.917

$0.917

128K<Token≤256K

$0.287

$2.294

$2.294

qwen3.5-27b

Global

0<Token≤128K

$0.086

$0.688

$0.688

128K<Token≤256K

$0.258

$2.064

$2.064

qwen3.5-35b-a3b

Global

0<Token≤128K

$0.057

$0.459

$0.459

128K<Token≤256K

$0.229

$1.835

$1.835

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3.5-397b-a17b

Global

0<Token≤128K

$0.172

$1.032

$1.032

128K<Token≤256K

$0.43

$2.58

$2.58

qwen3.5-122b-a10b

Global

0<Token≤128K

$0.115

$0.917

$0.917

128K<Token≤256K

$0.287

$2.294

$2.294

qwen3.5-27b

Global

0<Token≤128K

$0.086

$0.688

$0.688

128K<Token≤256K

$0.258

$2.064

$2.064

qwen3.5-35b-a3b

Global

0<Token≤128K

$0.057

$0.459

$0.459

128K<Token≤256K

$0.229

$1.835

$1.835

Qwen3

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-Thinking mode

Thinking mode

qwen3-next-80b-a3b-thinking

International

Thinking mode only

$0.15

-

$1.2

1 million tokens

qwen3-next-80b-a3b-instruct

International

Non-Thinking mode only

$0.15

$1.2

-

1 million tokens

qwen3-235b-a22b-thinking-2507

International

Thinking mode only

$0.23

-

$2.3

1 million tokens

qwen3-235b-a22b-instruct-2507

International

Non-Thinking mode only

$0.23

$0.92

-

1 million tokens

qwen3-30b-a3b-thinking-2507

International

Thinking mode only

$0.2

-

$2.4

1 million tokens

qwen3-30b-a3b-instruct-2507

International

Non-Thinking mode only

$0.2

$0.8

-

1 million tokens

qwen3-235b-a22b

International

Non-Thinking and Thinking modes

$0.7

$2.8

$8.4

1 million tokens

qwen3-32b

International

Non-Thinking and Thinking modes

$0.16

$0.64

$0.64

1 million tokens

qwen3-30b-a3b

International

Non-Thinking and Thinking modes

$0.2

$0.8

$2.4

1 million tokens

qwen3-14b

International

Non-Thinking and Thinking modes

$0.35

$1.4

$4.2

1 million tokens

qwen3-8b

International

Non-Thinking and Thinking modes

$0.18

$0.7

$2.1

1 million tokens

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3-next-80b-a3b-thinking

Chinese mainland

Thinking mode only

$0.144

-

$1.434

qwen3-next-80b-a3b-instruct

Chinese mainland

Non-Thinking mode only

$0.144

$0.574

-

qwen3-235b-a22b-thinking-2507

Chinese mainland

Thinking mode only

$0.287

-

$2.868

qwen3-235b-a22b-instruct-2507

Chinese mainland

Non-Thinking mode only

$0.287

$1.147

-

qwen3-30b-a3b-thinking-2507

Chinese mainland

Thinking mode only

$0.108

-

$1.076

qwen3-30b-a3b-instruct-2507

Chinese mainland

Non-Thinking mode only

$0.108

$0.431

-

qwen3-235b-a22b

Chinese mainland

Non-Thinking and Thinking modes

$0.287

$1.147

$2.868

qwen3-32b

Chinese mainland

Non-Thinking and Thinking modes

$0.287

$1.147

$2.868

qwen3-30b-a3b

Chinese mainland

Non-Thinking and Thinking modes

$0.108

$0.431

$1.076

qwen3-14b

Chinese mainland

Non-Thinking and Thinking modes

$0.144

$0.574

$1.434

qwen3-8b

Chinese mainland

Non-Thinking and Thinking modes

$0.072

$0.287

$0.717

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3-next-80b-a3b-thinking

Global

Thinking mode only

$0.144

-

$1.434

qwen3-next-80b-a3b-instruct

Global

Non-Thinking mode only

$0.144

$0.574

-

qwen3-235b-a22b-thinking-2507

Global

Thinking mode only

$0.23

-

$2.3

qwen3-235b-a22b-instruct-2507

Global

Non-Thinking mode only

$0.23

$0.92

-

qwen3-30b-a3b-thinking-2507

Global

Thinking mode only

$0.108

-

$1.076

qwen3-30b-a3b-instruct-2507

Global

Non-Thinking mode only

$0.108

$0.431

-

qwen3-235b-a22b

Global

Non-Thinking and Thinking modes

$0.287

$1.147

$2.868

qwen3-32b

Global

Non-Thinking and Thinking modes

$0.16

$0.64

$0.64

qwen3-30b-a3b

Global

Non-Thinking and Thinking modes

$0.108

$0.431

$1.076

qwen3-14b

Global

Non-Thinking and Thinking modes

$0.144

$0.574

$1.434

qwen3-8b

Global

Non-Thinking and Thinking modes

$0.072

$0.287

$0.717

US (Virginia)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-Thinking mode

Thinking mode (chain of thought + answer)

qwen3-next-80b-a3b-thinking

Global

Thinking mode only

$0.144

-

$1.434

qwen3-next-80b-a3b-instruct

Global

Non-Thinking mode only

$0.144

$0.574

-

qwen3-235b-a22b-thinking-2507

Global

Thinking mode only

$0.23

-

$2.3

qwen3-235b-a22b-instruct-2507

Global

Non-Thinking mode only

$0.23

$0.92

-

qwen3-30b-a3b-thinking-2507

Global

Thinking mode only

$0.108

-

$1.076

qwen3-30b-a3b-instruct-2507

Global

Non-Thinking mode only

$0.108

$0.431

-

qwen3-235b-a22b

Global

Non-Thinking and Thinking modes

$0.287

$1.147

$2.868

qwen3-32b

Global

Non-Thinking and Thinking modes

$0.16

$0.64

$0.64

qwen3-30b-a3b

Global

Non-Thinking and Thinking modes

$0.108

$0.431

$1.076

qwen3-14b

Global

Non-Thinking and Thinking modes

$0.144

$0.574

$1.434

qwen3-8b

Global

Non-Thinking and Thinking modes

$0.072

$0.287

$0.717

Qwen-Omni

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling and rate limits.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Audio only billed

qwen2.5-omni-7b

International

$0.10

$6.76

$0.28

$0.40

$0.84

$13.51

1 million tokens (regardless of modality)

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Input: text

Input: audio

Input: image/video

Output: text

Text-only input

Output: text

Multimodal input

Output: text + audio

Audio only billed

qwen2.5-omni-7b

Chinese mainland

$0.087

$5.448

$0.287

$0.345

$0.861

$10.895

Qwen3-Omni-Captioner

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-omni-30b-a3b-captioner

International

$3.81

$3.06

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-omni-30b-a3b-captioner

Chinese mainland

$2.265

$1.821

Qwen-VL

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-vl-235b-a22b-thinking

International

Thinking mode only

$0.4

$4

1 million tokens

qwen3-vl-235b-a22b-instruct

International

Non-Thinking mode only

$0.4

$1.6

1 million tokens

qwen3-vl-32b-thinking

International

Thinking mode only

$0.16

$0.64

1 million tokens

qwen3-vl-32b-instruct

International

Non-Thinking mode only

$0.16

$0.64

1 million tokens

qwen3-vl-30b-a3b-thinking

International

Thinking mode only

$0.2

$2.4

1 million tokens

qwen3-vl-30b-a3b-instruct

International

Non-Thinking mode only

$0.2

$0.8

1 million tokens

qwen3-vl-8b-thinking

International

Thinking mode only

$0.18

$2.1

1 million tokens

qwen3-vl-8b-instruct

International

Non-Thinking mode only

$0.18

$0.7

1 million tokens

More models

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-235b-a22b-thinking

Chinese mainland

Thinking mode only

$0.287

$2.867

qwen3-vl-235b-a22b-instruct

Chinese mainland

Non-Thinking mode only

$0.287

$1.147

qwen3-vl-32b-thinking

Chinese mainland

Thinking mode only

$0.287

$2.868

qwen3-vl-32b-instruct

Chinese mainland

Non-Thinking mode only

$0.287

$1.147

qwen3-vl-30b-a3b-thinking

Chinese mainland

Thinking mode only

$0.108

$1.076

qwen3-vl-30b-a3b-instruct

Chinese mainland

Non-Thinking mode only

$0.108

$0.431

qwen3-vl-8b-thinking

Chinese mainland

Thinking mode only

$0.072

$0.717

qwen3-vl-8b-instruct

Chinese mainland

Non-Thinking mode only

$0.072

$0.287

More models

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen2-vl-72b-instruct

Chinese mainland

$2.294

$6.881

qwen2-vl-7b-instruct

Chinese mainland

Limited-time free

qwen2-vl-2b-instruct

Chinese mainland

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-235b-a22b-thinking

Global

Thinking mode only

$0.287

$2.867

qwen3-vl-235b-a22b-instruct

Global

Non-Thinking mode only

$0.287

$1.147

qwen3-vl-32b-thinking

Global

Thinking mode only

$0.16

$0.64

qwen3-vl-32b-instruct

Global

Non-Thinking mode only

$0.16

$0.64

qwen3-vl-30b-a3b-thinking

Global

Thinking mode only

$0.108

$1.076

qwen3-vl-30b-a3b-instruct

Global

Non-Thinking mode only

$0.108

$0.431

qwen3-vl-8b-thinking

Global

Thinking mode only

$0.072

$0.717

qwen3-vl-8b-instruct

Global

Non-Thinking mode only

$0.072

$0.287

US (Virginia)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

qwen3-vl-235b-a22b-thinking

Global

Thinking mode only

$0.287

$2.867

qwen3-vl-235b-a22b-instruct

Global

Non-Thinking mode only

$0.287

$1.147

qwen3-vl-32b-thinking

Global

Thinking mode only

$0.16

$0.64

qwen3-vl-32b-instruct

Global

Non-Thinking mode only

$0.16

$0.64

qwen3-vl-30b-a3b-thinking

Global

Thinking mode only

$0.108

$1.076

qwen3-vl-30b-a3b-instruct

Global

Non-Thinking mode only

$0.108

$0.431

qwen3-vl-8b-thinking

Global

Thinking mode only

$0.072

$0.717

qwen3-vl-8b-instruct

Global

Non-Thinking mode only

$0.072

$0.287

Qwen-Coder

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-coder-next

International

0<Token≤32K

$0.3

$1.5

1 million tokens

32K<Token≤128K

$0.5

$2.5

128K<Token≤256K

$0.8

$4

qwen3-coder-480b-a35b-instruct

International

0<Token≤32K

$1.5

$7.5

1 million tokens

32K<Token≤128K

$2.7

$13.5

128K<Token≤200K

$4.5

$22.5

qwen3-coder-30b-a3b-instruct

International

0<Token≤32K

$0.45

$2.25

1 million tokens

32K<Token≤128K

$0.75

$3.75

128K<Token≤200K

$1.2

$6

China (Beijing)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-next

Chinese mainland

0<Token≤32K

$0.144

$0.574

32K<Token≤128K

$0.216

$0.861

128K<Token≤256K

$0.359

$1.434

qwen3-coder-480b-a35b-instruct

Chinese mainland

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.291

$5.161

128K<Token≤200K

$2.151

$8.602

qwen3-coder-30b-a3b-instruct

Chinese mainland

0<Token≤32K

$0.216

$0.861

32K<Token≤128K

$0.323

$1.291

128K<Token≤200K

$0.538

$2.151

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-30b-a3b-instruct

Global

0<Token≤32K

$0.216

$0.861

32K<Token≤128K

$0.323

$1.291

128K<Token≤200K

$0.538

$2.151

qwen3-coder-480b-a35b-instruct

Global

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.291

$5.161

128K<Token≤200K

$2.151

$8.602

qwen3-coder-next

EU

0<Token≤32K

$0.3

$1.5

32K<Token≤128K

$0.5

$2.5

128K<Token≤256K

$0.8

$4

US (Virginia)

Model ID

Deployment scope

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-480b-a35b-instruct

Global

0<Token≤32K

$0.861

$3.441

32K<Token≤128K

$1.291

$5.161

128K<Token≤200K

$2.151

$8.602

qwen3-coder-30b-a3b-instruct

Global

0<Token≤32K

$0.216

$0.861

32K<Token≤128K

$0.323

$1.291

128K<Token≤200K

$0.538

$2.151

Text generation - third-party models

DeepSeek

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

deepseek-v4-pro

context caching discount

International

$2.400

$4.800

1 million tokens

deepseek-v4-flash

context caching discount

International

$0.200

$0.400

1 million tokens

deepseek-v3.2

context caching discount

International

$0.57

$1.71

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

deepseek-v4-pro

context caching discount

Chinese mainland

$1.65

$3.301

No free quota

deepseek-v4-flash

context caching discount

Chinese mainland

$0.138

$0.275

No free quota

deepseek-v3.2

context caching discount

Chinese mainland

$0.287

$0.431

No free quota

deepseek-v3.2-exp

Chinese mainland

$0.287

$0.431

No free quota

deepseek-v3.1

Chinese mainland

$0.574

$1.721

No free quota

deepseek-r1

Chinese mainland

$0.574

$2.294

No free quota

deepseek-r1-0528

Chinese mainland

$0.574

$2.294

No free quota

deepseek-v3

Chinese mainland

$0.287

$1.147

No free quota

deepseek-r1-distill-qwen-1.5b

Chinese mainland

Limited-time free

deepseek-r1-distill-qwen-7b

Chinese mainland

$0.072

$0.144

No free quota

deepseek-r1-distill-qwen-14b

Chinese mainland

$0.144

$0.431

No free quota

deepseek-r1-distill-qwen-32b

Chinese mainland

$0.287

$0.861

No free quota

deepseek-r1-distill-llama-8b

Chinese mainland

Limited-time free

deepseek-r1-distill-llama-70b

Chinese mainland

Limited-time free

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

deepseek-v4-pro

context caching discount

Global

$1.65

$3.3

deepseek-v4-flash

context caching discount

Global

$0.14

$0.28

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

deepseek-v4-pro

context caching discount

Global

$1.65

$3.3

deepseek-v4-flash

context caching discount

Global

$0.14

$0.28

Japan (Tokyo)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

deepseek-v4-pro

Context cachecontext caching discount

Global

$1.65

$3.3

deepseek-v4-flash

Context cachecontext caching discount

Global

$0.14

$0.28

deepseek-v4-pro

Context cachecontext caching discount

Japan

$2.400

$4.800

deepseek-v4-flash

Context cachecontext caching discount

Japan

$0.200

$0.400

Kimi

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

kimi-k2.7-code

Chinese mainland

$0.894

$3.713

$3.7131

kimi-k2.6

Chinese mainland

$0.8939

$3.7131

No free quota

kimi-k2.5

Chinese mainland

$0.574

$3.011

No free quota

kimi-k2-thinking

Chinese mainland

$0.574

$2.294

No free quota

Moonshot-Kimi-K2-Instruct

Chinese mainland

$0.574

$2.294

No free quota

Germany (Frankfurt)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

kimi-k2.7-code

Global

$0.894

$3.713

kimi-k2.5

Global

$0.574

$3.011

US (Virginia)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

kimi-k2.7-code

Global

$0.894

$3.713

kimi-k2.5

Global

$0.574

$3.011

Japan (Tokyo)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

kimi-k2.5

Context cachecontext caching discount

Global

$0.574

$3.011

China(Hong Kong)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

kimi-k2.7-code

Global

$0.894

$3.713

MiniMax

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

MiniMax-M2.5

Chinese mainland

Thinking mode only

$0.304

$1.213

GLM

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

glm-5.1

International

Non-Thinking and Thinking modes

0<Token≤200K

$1.4

$4.4

1 million tokens

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

glm-5.2

Chinese mainland

Non-Thinking and Thinking modes

flat-rate pricing

$1.100

$3.851

glm-5.1

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.825

$3.301

32K<Token≤200K

$1.100

$3.851

glm-5

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.573

$2.58

32K<Token≤166K

$0.86

$3.154

glm-4.7

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.431

$2.007

32K<Token≤166K

$0.574

$2.294

glm-4.6

Chinese mainland

Non-Thinking and Thinking modes

0<Token≤32K

$0.431

$2.007

32K<Token≤166K

$0.574

$2.294

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

glm-5.2

Global

Non-Thinking and Thinking modes

flat-rate pricing

$1.100

$3.851

glm-5.1

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.825

$3.301

32K<Token≤200K

$1.100

$3.851

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

glm-5.2

Global

Non-Thinking and Thinking modes

flat-rate pricing

$1.100

$3.851

glm-5.1

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.825

$3.301

32K<Token≤200K

$1.100

$3.851

Japan (Tokyo)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought + answer

glm-5.1

Context cachecontext caching discount

Global

Non-Thinking and Thinking modes

0<Token≤32K

$0.825

$3.301

32K<Token≤200K

$1.100

$3.851

China(Hong Kong)

Tab 正文

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

glm-5.2

Global

Non-Thinking and Thinking modes

flat-rate pricing

$1.100

$3.851

Image generation

You are not charged for input. You are charged for output based on the number of successfully generated images.

Formula: Cost = Image unit price × Number of images generated.

Notes:

  • Cost does not depend on image resolution or aspect ratio.

  • Failed requests incur no cost and do not consume your free quota.

Billing example: Some images fail to generate

Assume the image unit price is $0.10 per image. If you call the API to generate four images but only three image URLs return successfully, the system charges only for the three successfully generated images.

  • Number billed: 3 images.

  • Cost calculation: 0.1 × 3 = $0.3.

Qwen Text-to-Image

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-image-2.0-pro

International

$0.075/image

100 images

qwen-image-2.0-pro-2026-04-22

International

$0.075/image

100 images

qwen-image-2.0-pro-2026-03-03

International

$0.075/image

100 images

qwen-image-2.0

International

$0.035/image

100 images

qwen-image-2.0-2026-03-03

International

$0.035/image

100 images

qwen-image-max

Currently equivalent to qwen-image-max-2025-12-30

International

$0.075/image

100 images

qwen-image-max-2025-12-30

International

$0.075/image

100 images

qwen-image-plus

Currently equivalent to qwen-image

International

$0.03/image

100 images

qwen-image-plus-2026-01-09

International

$0.03/image

100 images

qwen-image

International

$0.035/image

100 images

China (Beijing)

Model ID

Deployment scope

Output price

qwen-image-2.0-pro

Chinese mainland

$0.071676/image

qwen-image-2.0-pro-2026-04-22

Chinese mainland

$0.071676/image

qwen-image-2.0-pro-2026-03-03

Chinese mainland

$0.071676/image

qwen-image-2.0

Chinese mainland

$0.028671/image

qwen-image-2.0-2026-03-03

Chinese mainland

$0.028671/image

qwen-image-max

Currently equivalent to qwen-image-max-2025-12-30

Chinese mainland

$0.071677/image

qwen-image-max-2025-12-30

Chinese mainland

$0.071677/image

qwen-image-plus

Currently equivalent to qwen-image

Chinese mainland

$0.028671/image

qwen-image-plus-2026-01-09

Chinese mainland

$0.028671/image

qwen-image

Chinese mainland

$0.035/image

Qwen Image Editing

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-image-2.0-pro

International

$0.075/image

100 images

qwen-image-2.0-pro-2026-04-22

International

$0.075/image

100 images

qwen-image-2.0-pro-2026-03-03

International

$0.075/image

100 images

qwen-image-2.0

International

$0.035/image

100 images

qwen-image-2.0-2026-03-03

International

$0.035/image

100 images

qwen-image-edit-max

Currently equivalent to qwen-image-edit-max-2026-01-16

International

$0.075/image

100 images

qwen-image-edit-max-2026-01-16

International

$0.075/image

100 images

qwen-image-edit-plus

Currently equivalent to qwen-image-edit-plus-2025-10-30

International

$0.03/image

100 images

qwen-image-edit-plus-2025-12-15

International

$0.03/image

100 images

qwen-image-edit-plus-2025-10-30

International

$0.03/image

100 images

qwen-image-edit

International

$0.045/image

100 images

China (Beijing)

Model ID

Deployment scope

Output price

qwen-image-2.0-pro

Chinese mainland

$0.071676/image

qwen-image-2.0-pro-2026-04-22

Chinese mainland

$0.071676/image

qwen-image-2.0-pro-2026-03-03

Chinese mainland

$0.071676/image

qwen-image-2.0

Chinese mainland

$0.028671/image

qwen-image-2.0-2026-03-03

Chinese mainland

$0.028671/image

qwen-image-edit-max

Currently equivalent to qwen-image-edit-max-2026-01-16

Chinese mainland

$0.071677/image

qwen-image-edit-max-2026-01-16

Chinese mainland

$0.071677/image

qwen-image-edit-plus

Currently equivalent to qwen-image-edit-plus-2025-10-30

Chinese mainland

$0.028671/image

qwen-image-edit-plus-2025-12-15

Chinese mainland

$0.028671/image

qwen-image-edit-plus-2025-10-30

Chinese mainland

$0.028671/image

qwen-image-edit

Chinese mainland

$0.043/image

Qwen Image Translation

Only output is billed. For pricing rules, seeImage generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

qwen-mt-image

Chinese mainland

$0.000431/image

No free quota

Qwen-Text-to-Image-Z-Image

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

z-image-turbo

International

Prompt rewriting disabled (prompt_extend=false): $0.015/image

Prompt rewriting enabled (prompt_extend=true): $0.03/image

100 images

China (Beijing)

Model ID

Deployment scope

Output price

z-image-turbo

Chinese mainland

Prompt rewriting disabled (prompt_extend=false): $0.01434/image

Prompt rewriting enabled (prompt_extend=true): $0.02868/image

Wanx Text-to-Image

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.6-t2i

International

$0.03/image

50 images

wan2.5-t2i-preview

International

$0.03/image

50 images

wan2.2-t2i-plus

International

$0.05/image

100 images

wan2.2-t2i-flash

International

$0.025/image

100 images

wan2.1-t2i-plus

International

$0.05/image

200 images

wan2.1-t2i-turbo

International

$0.025/image

200 images

China (Beijing)

Model ID

Deployment scope

Output price

wan2.6-t2i

Chinese mainland

$0.028671/image

wan2.5-t2i-preview

Chinese mainland

$0.028671/image

wan2.2-t2i-plus

Chinese mainland

$0.020070/image

wan2.2-t2i-flash

Chinese mainland

$0.028671/image

wanx2.1-t2i-plus

Chinese mainland

$0.028671/image

wanx2.1-t2i-turbo

Chinese mainland

$0.020070/image

wanx2.0-t2i-turbo

Chinese mainland

$0.005735/image

Germany (Frankfurt)

Model ID

Deployment scope

Output price

wan2.6-t2i

Global

$0.028671/image

US (Virginia)

Model ID

Deployment scope

Output price

wan2.6-t2i

Global

$0.028671/image

Wanx Image Generation and Editing

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-image-pro

International

$0.075/image

50 images

wan2.7-image

International

$0.03/image

50 images

wan2.6-image

International

$0.03/image

50 images

China (Beijing)

Model ID

Deployment scope

Output price

wan2.7-image-pro

Chinese mainland

$0.068761/image

wan2.7-image

Chinese mainland

$0.028671/image

wan2.6-image

Chinese mainland

$0.028671/image

US (Virginia)

Model ID

Deployment scope

Output price

wan2.6-image

Global

$0.028671/image

Wanx General Image Editing

Only output is billed. For pricing rules, seeImage generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.5-i2i-preview

International

$0.03/image

50 images

China (Beijing)

Model ID

Deployment scope

Output price

wan2.5-i2i-preview

Chinese mainland

$0.028671/image

wanx2.1-imageedit

Chinese mainland

$0.020070/image

AIVirtual Try-on - OutfitAnyone

  • aitryon-plus: Input is free while output is billed. For pricing rules, seeImage generation.

  • aitryon-parsing-v1: Input is billed while output is free. Billed by the number of input images. Failed requests are not billed.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

aitryon-plus

Chinese mainland

$0.071677/image

No free quota

aitryon-parsing-v1

Chinese mainland

$0.000574/image

Video generation

You are not charged for input. You are charged for output based on the total duration of successfully generated videos (in seconds).

Formula: Cost = Video unit price × Video duration (seconds).

Notes:

  • Some models charge by output video resolution. Prices differ for resolutions such as 480P, 720P, and 1080P.

  • Some models charge by output video edition. Prices differ for editions such as Standard Edition and Professional Edition.

  • Some models charge by output video aspect ratio. Prices differ for aspect ratios such as 1:1 and 3:4.

  • Some models use a flat rate, regardless of resolution, edition, or aspect ratio.

  • Failed requests incur no cost and do not consume your free quota.

HappyHorse-Text-to-video

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

happyhorse-1.1-t2v

International

720P

$0.14/second

10 seconds

1080P

$0.18/second

happyhorse-1.0-t2v

International

720P

$0.14/second

10 seconds

1080P

$0.24/second

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-t2v

Chinese mainland

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-t2v

Chinese mainland

720P

$0.123769/second

1080P

$0.220034/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-t2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-t2v

Global

720P

$0.123769/second

1080P

$0.220034/second

US (Virginia)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-t2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-t2v

Global

720P

$0.123769/second

1080P

$0.220034/second

HappyHorse-Image-to-video - first frame

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

happyhorse-1.1-i2v

International

720P

$0.14/second

10 seconds

1080P

$0.18/second

happyhorse-1.0-i2v

International

720P

$0.14/second

10 seconds

1080P

$0.24/second

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-i2v

Chinese mainland

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-i2v

Chinese mainland

720P

$0.123769/second

1080P

$0.220034/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-i2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-i2v

Global

720P

$0.123769/second

1080P

$0.220034/second

US (Virginia)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-i2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-i2v

Global

720P

$0.123769/second

1080P

$0.220034/second

HappyHorse-Reference-to-video

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

happyhorse-1.1-r2v

International

720P

$0.14/second

10 seconds

1080P

$0.18/second

happyhorse-1.0-r2v

International

720P

$0.14/second

10 seconds

1080P

$0.24/second

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-r2v

Chinese mainland

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-r2v

Chinese mainland

720P

$0.123769/second

1080P

$0.220034/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-r2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-r2v

Global

720P

$0.123769/second

1080P

$0.220034/second

US (Virginia)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.1-r2v

Global

720P

$0.123769/second

1080P

$0.165026/second

happyhorse-1.0-r2v

Global

720P

$0.123769/second

1080P

$0.220034/second

HappyHorse-Video editing

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

happyhorse-1.0-video-edit

International

720P

$0.14/second

10 seconds

1080P

$0.24/second

China (Beijing)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

happyhorse-1.0-video-edit

Chinese mainland

720P

$0.123769/second

1080P

$0.220034/second

Germany (Frankfurt)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

happyhorse-1.0-video-edit

Global

720P

$0.123769/second

1080P

$0.220034/second

US (Virginia)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

happyhorse-1.0-video-edit

Global

720P

$0.123769/second

1080P

$0.220034/second

Wanx-Text-to-Video

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-t2v-2026-04-25

International

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.7-t2v

International

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.6-t2v

International

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.5-t2v-preview

International

480P

$0.05/second

50 seconds

720P

$0.10/second

1080P

$0.15/second

wan2.2-t2v-plus

International

480P

$0.02/second

50 seconds

1080P

$0.10/second

wan2.1-t2v-turbo

International

480P

$0.036/second

50 seconds

720P

$0.036/second

wan2.1-t2v-plus

International

720P

$0.10/second

50 seconds

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

wan2.7-t2v-2026-04-25

Chinese mainland

720P

$0.086012/second

1080P

$0.143353/second

wan2.7-t2v

Chinese mainland

720P

$0.086012/second

1080P

$0.143353/second

wan2.6-t2v

Chinese mainland

720P

$0.086012/second

1080P

$0.143353/second

wan2.5-t2v-preview

Chinese mainland

480P

$0.043006/second

720P

$0.086012/second

1080P

$0.143353/second

wan2.2-t2v-plus

Chinese mainland

480P

$0.02007/second

1080P

$0.100347/second

wanx2.1-t2v-turbo

Chinese mainland

480P

$0.034405/second

720P

$0.034405/second

wanx2.1-t2v-plus

Chinese mainland

720P

$0.100347/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video resolution

Output price

wan2.6-t2v

Global

720P

$0.086012/second

1080P

$0.143353/second

US (Virginia)

Model ID

Deployment scope

Output video resolution

Output price

wan2.6-t2v

Global

720P

$0.086012/second

1080P

$0.143353/second

wan2.6-t2v-us

US

720P

$0.1/second

1080P

$0.15/second

Wanx-Image-to-Video

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video type

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-i2v-2026-04-25

International

Audio video

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.7-i2v

International

Audio video

720P

$0.10/second

50 seconds

1080P

$0.15/second

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Output price

wan2.7-i2v-2026-04-25

Chinese mainland

Audio video

720P

$0.086012/second

1080P

$0.143353/second

wan2.7-i2v

Chinese mainland

Audio video

720P

$0.086012/second

1080P

$0.143353/second

Wanx-Image-to-Video-First-Frame

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video type

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.6-i2v-flash

International

Audio video

audio=true

720P

$0.05/second

50 seconds

1080P

$0.075/second

Silent video

audio=false

720P

$0.025/second

1080P

$0.0375/second

wan2.6-i2v

International

Audio video

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.5-i2v-preview

International

Audio video

480P

$0.05/second

50 seconds

720P

$0.10/second

1080P

$0.15/second

wan2.2-i2v-flash

International

Silent video

480P

$0.015/second

50 seconds

720P

$0.036/second

wan2.2-i2v-plus

International

Silent video

480P

$0.02/second

50 seconds

1080P

$0.10/second

wan2.1-t2v-turbo

International

Silent video

480P

$0.036/second

50 seconds

720P

$0.036/second

wan2.1-t2v-plus

International

Silent video

720P

$0.10/second

50 seconds

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Output price

wan2.6-i2v-flash

Chinese mainland

Audio video

audio=true

720P

$0.043006/second

1080P

$0.071676/second

Silent video

audio=false

720P

$0.021503/second

1080P

$0.035838/second

wan2.6-i2v

Chinese mainland

Audio video

720P

$0.086012/second

1080P

$0.143353/second

wan2.5-i2v-preview

Chinese mainland

Audio video

480P

$0.043006/second

720P

$0.086012/second

1080P

$0.143353/second

wan2.2-i2v-plus

Chinese mainland

Silent video

480P

$0.02007/second

1080P

$0.100347/second

wanx2.1-t2v-turbo

Chinese mainland

Silent video

480P

$0.034405/second

720P

$0.034405/second

wanx2.1-t2v-plus

Chinese mainland

Silent video

720P

$0.100347/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video type

Output video resolution

Output price

wan2.6-i2v

Global

Audio video

720P

$0.086012/second

1080P

$0.143353/second

US (Virginia)

Model ID

Deployment scope

Output video type

Output video resolution

Output price

wan2.6-i2v

Global

Audio video

720P

$0.086012/second

1080P

$0.143353/second

wan2.6-i2v-us

US

Audio video

720P

$0.1/second

1080P

$0.15/second

Wanx-Image-to-Video-First-Last-Frame

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.2-kf2v-flash

International

480P

$0.015/second

50 seconds

720P

$0.036/second

1080P

$0.07/second

wan2.1-kf2v-plus

International

720P

$0.10/second

50 seconds

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

wan2.2-kf2v-flash

Chinese mainland

480P

$0.014335/second

720P

$0.028671/second

1080P

$0.068809/second

wanx2.1-kf2v-plus

Chinese mainland

720P

$0.100347/second

Wanx-Reference-to-Video

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Billing formula: billable duration = input video duration (up to 5 seconds) + output video duration.

  • The billable duration of the input video does not exceed 5 seconds. For calculation rules, seeBilling and rate limiting.

  • The billable duration of the output video isduration (in seconds) of successfully generated videos.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video type

Output video resolution

Input and output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-r2v

International

Audio video

720P

$0.10/second

50 seconds

1080P

$0.15/second

wan2.6-r2v-flash

International

Audio video

audio=true

720P

$0.05/second

50 seconds

1080P

$0.075/second

Silent video

audio=false

720P

$0.025/second

1080P

$0.0375/second

wan2.6-r2v

International

Audio video

720P

$0.10/second

50 seconds

1080P

$0.15/second

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Input and output price

wan2.7-r2v

Chinese mainland

Audio video

720P

$0.086012/second

1080P

$0.143353/second

wan2.6-r2v-flash

Chinese mainland

Audio video

audio=true

720P

$0.043006/second

1080P

$0.071676/second

Silent video

audio=false

720P

$0.021503/second

1080P

$0.035838/second

wan2.6-r2v

Chinese mainland

Audio video

720P

$0.086012/second

1080P

$0.143353/second

Germany (Frankfurt)

Model ID

Deployment scope

Output video type

Output video resolution

Input and output price

wan2.6-r2v

Global

Audio video

720P

$0.086012/second

1080P

$0.143353/second

US (Virginia)

Model ID

Deployment scope

Output video type

Output video resolution

Input and output price

wan2.6-r2v

Global

Audio video

720P

$0.086012/second

1080P

$0.143353/second

Wanx-Video-Editing

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-videoedit

International

720P

$0.10/second

50 seconds

1080P

$0.15/second

Pricing rule: input is free. Output video is billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.1-vace-plus

International

720P

$0.10/second

50 seconds

China (Beijing)

Pricing rule: both input and output videos are billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

wan2.7-videoedit

Chinese mainland

720P

$0.086012/second

1080P

$0.143353/second

Pricing rule: input is free. Output video is billed byvideo duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Output price

wanx2.1-vace-plus

Chinese mainland

720P

$0.100347/second

Wanx-Digital Human

  • wan2.2-s2v-detect: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.

  • wan2.2-s2v: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

wan2.2-s2v-detect

Chinese mainland

Input image: $0.000574/image

No free quota

wan2.2-s2v

Chinese mainland

Output video:

  • 480P: $0.071677/second

  • 720P: $0.129018/second

No free quota

Wanx-Image-to-Motion

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video mode

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.2-animate-move

International

Standard modewan-std

$0.12/second

50 seconds

Professional modewan-pro

$0.18/second

China (Beijing)

Model ID

Deployment scope

Output video mode

Output price

wan2.2-animate-move

Chinese mainland

Standard modewan-std

$0.06/second

Professional modewan-pro

$0.09/second

Wanx-Video-Face-Swap

Only output is billed. For pricing rules, seeVideo generation.
Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Output video mode

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.2-animate-mix

International

Standard modewan-std

$0.18/second

50 seconds

Professional modewan-pro

$0.26/second

China (Beijing)

Model ID

Deployment scope

Output video mode

Output price

wan2.2-animate-mix

Chinese mainland

Standard modewan-std

$0.09/second

Professional modewan-pro

$0.13/second

AnimateAnyone

  • animate-anyone-detect-gen2: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.

  • animate-anyone-template-gen2: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

  • animate-anyone-gen2: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

animate-anyone-detect-gen2

Chinese mainland

Input image: $0.000574/image

No free quota

animate-anyone-template-gen2

Chinese mainland

Output video: $0.011469/second

No free quota

animate-anyone-gen2

Chinese mainland

Output video: $0.011469/second

No free quota

EMO

  • emo-detect-v1: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.

  • emo-v1: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

emo-detect-v1

Chinese mainland

Input image: $0.000574/image

No free quota

emo-v1

Chinese mainland

Output video:

  • 1:1landscape video: $0.011469/second

  • 3:4landscape video: $0.022937/second

LivePortrait

  • liveportrait-detect: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.

  • liveportrait: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

liveportrait-detect

Chinese mainland

Input image: $0.000574/image

No free quota

liveportrait

Chinese mainland

Output video: $0.002868/second

Emoji Sticker

  • emoji-detect-v1: Input is billed while output is free. Input is billed by the number of images processed. Each input image is billed once as long as the request succeeds, regardless of the detection result.

  • emoji-v1: Input is free while output is billed. Output is billed by the duration (in seconds) of successfully generated videos. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

emoji-detect-v1

Chinese mainland

Input image: $0.000574/image

No free quota

emoji-v1

Chinese mainland

Output video: $0.011469/second

VideoRetalk

Only output is billed. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

videoretalk

Chinese mainland

$0.011469/second

No free quota

Video Style Repaint

Only output is billed. For pricing rules, seeVideo generation.

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

Free quota(Note)

video-style-transform

Chinese mainland

540P

$0.028671/second

No free quota

720P

$0.071677/second

Music generation

Pricing rule: billed by the duration (in seconds) of output audio. Input is free.

China (Beijing)

Model ID

Deployment scope

Output price (per second)

Free quota(Note)

fun-music-preview

Chinese mainland

$0.000695

No free quota

fun-music-v1

Chinese mainland

$0.000275

Speech synthesis (text-to-speech)

Qwen-TTS

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Qwen3-TTS-Instruct-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-instruct-flash

International

$0.115

110,000 characters

qwen3-tts-instruct-flash-2026-01-26

International

$0.115

110,000 characters

Qwen3-TTS-VD

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vd-2026-01-26

International

$0.115

110,000 characters

Qwen3-TTS-VC

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vc-2026-01-22

International

$0.115

110,000 characters

Qwen3-TTS-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-flash

Currently equivalent to qwen3-tts-flash-2025-11-27

International

$0.1

110,000 characters

qwen3-tts-flash-2025-11-27

International

$0.1

110,000 characters

qwen3-tts-flash-2025-09-18

International

$0.1

2025 (after November 13, 0:00 UTC+8): 10,000 characters

China (Beijing)

Qwen3-TTS-Instruct-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price (per 10,000 characters)

qwen3-tts-instruct-flash

Chinese mainland

$0.115

Free

qwen3-tts-instruct-flash-2026-01-26

Chinese mainland

$0.115

Free

Qwen3-TTS-VD

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price (per 10,000 characters)

qwen3-tts-vd-2026-01-26

Chinese mainland

$0.115

Free

Qwen3-TTS-VC

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price (per 10,000 characters)

qwen3-tts-vc-2026-01-22

Chinese mainland

$0.115

Free

Qwen3-TTS-Flash

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price (per 10,000 characters)

qwen3-tts-flash

Currently equivalent to qwen3-tts-flash-2025-11-27

Chinese mainland

$0.114682

Free

qwen3-tts-flash-2025-11-27

Chinese mainland

$0.114682

Free

qwen3-tts-flash-2025-09-18

Chinese mainland

$0.114682

Free

Qwen-TTS

Pricing rule: billed by input tokens and output tokens.

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen-tts-flash

Chinese mainland

$0.23

$1.434

qwen-tts-latest

Chinese mainland

$0.23

$1.434

qwen-tts-2025-05-22

Chinese mainland

$0.23

$1.434

qwen-tts-2025-04-10

Chinese mainland

$0.23

$1.434

Qwen-TTS-Realtime

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-instruct-flash-realtime

International

$0.143

110,000 characters

qwen3-tts-instruct-flash-realtime-2026-01-22

International

$0.143

110,000 characters

Qwen3-TTS-VD-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vd-realtime-2026-01-15

International

$0.143353

110,000 characters

qwen3-tts-vd-realtime-2025-12-16

International

$0.143353

110,000 characters

Qwen3-TTS-VC-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vc-realtime-2026-01-15

International

$0.13

110,000 characters

qwen3-tts-vc-realtime-2025-11-27

International

110,000 characters

Qwen3-TTS-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-flash-realtime

International

$0.13

2025 (after November 13, 0:00 UTC+8): 10,000 characters

qwen3-tts-flash-realtime-2025-11-27

International

$0.13

110,000 characters

qwen3-tts-flash-realtime-2025-09-18

International

$0.13

2025 (after November 13, 0:00 UTC+8): 10,000 characters

China (Beijing)

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

qwen3-tts-instruct-flash-realtime

Chinese mainland

$0.143

Free

qwen3-tts-instruct-flash-realtime-2026-01-22

Chinese mainland

$0.143

Free

Qwen3-TTS-VD-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

qwen3-tts-vd-realtime-2026-01-15

Chinese mainland

$0.143353

Free

qwen3-tts-vd-realtime-2025-12-16

Chinese mainland

$0.143353

Free

Qwen3-TTS-VC-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

qwen3-tts-vc-realtime-2026-01-15

Chinese mainland

$0.143353

Free

qwen3-tts-vc-realtime-2025-11-27

Chinese mainland

Qwen3-TTS-Flash-Realtime

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

qwen3-tts-flash-realtime

Chinese mainland

$0.143353

Free

qwen3-tts-flash-realtime-2025-11-27

Chinese mainland

$0.143353

Free

qwen3-tts-flash-realtime-2025-09-18

Chinese mainland

$0.143353

Free

Qwen-TTS-Realtime

Pricing rule: billed by input tokens and output tokens.

Model ID

Deployment scope

Input price (per 1 million tokens)

Input price (per 1 million tokens)

qwen-tts-realtime

Chinese mainland

$0.345

$1.721

qwen-tts-realtime-latest

Chinese mainland

$0.345

$1.721

qwen-tts-realtime-2025-07-15

Chinese mainland

$0.345

$1.721

Qwen-TTS Voice cloning

Pricing rule: billed by the number of new voice clones created.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Price (per voice clone)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-voice-enrollment

International

$0.01

1,000 voices/account

China (Beijing)

Model ID

Deployment scope

Price (per voice clone)

qwen-voice-enrollment

Chinese mainland

$0.01

Qwen-TTS Voice design

Pricing rule: billed by the number of new voice clones created.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Price (per voice clone)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-voice-design

International

$0.2

10 voices/account

China (Beijing)

Model ID

Deployment scope

Price (per voice clone)

qwen-voice-design

Chinese mainland

$0.2

CosyVoice

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

cosyvoice-v3-plus

International

$0.26

110,000 characters

cosyvoice-v3-flash

International

$0.13

1 million tokens

China (Beijing)

Pricing rule: billed by the number of input text characters. Output is free.

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

cosyvoice-v3.5-plus

Chinese mainland

$0.22

No free quota

cosyvoice-v3.5-flash

Chinese mainland

$0.116

No free quota

cosyvoice-v3-plus

Chinese mainland

$0.286706

No free quota

cosyvoice-v3-flash

Chinese mainland

$0.14335

No free quota

cosyvoice-v2

Chinese mainland

$0.286706

No free quota

Speech recognition (speech-to-text) and translation (speech-to-text in a specified language)

Qwen-LiveTranslate-Flash-Realtime

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Input: audio

Input: image

Output: text

Output: audio

qwen3.5-livetranslate-flash-realtime

International

$7.5

$0.55

$20

$30

1 million tokens

qwen3.5-livetranslate-flash-realtime-2026-05-19

International

$7.5

$0.55

$20

$30

1 million tokens

qwen3-livetranslate-flash-realtime

International

$10

$1.3

$10

$38

1 million tokens

qwen3-livetranslate-flash-realtime-2025-09-22

International

$10

$1.3

$10

$38

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Input: audio

Input: image

Output: text

Output: audio

qwen3.5-livetranslate-flash-realtime

Chinese mainland

$5.501

$0.454

$13.752

$22.003

qwen3.5-livetranslate-flash-realtime-2026-05-19

Chinese mainland

$5.501

$0.454

$13.752

$22.003

qwen3-livetranslate-flash-realtime

Chinese mainland

$9.175

$1.147

$9.175

$34.405

qwen3-livetranslate-flash-realtime-2025-09-22

Chinese mainland

$9.175

$1.147

$9.175

$34.405

Qwen-LiveTranslate-Flash

Pricing rule: billed by input tokens and output tokens. For the token calculation rules of different modalities, seeBilling.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Input: audio

Input: image

Output: text

Output: audio

qwen3-livetranslate-flash

International

$1.577

$0.631

$1.577

$6.308

1 million tokens

qwen3-livetranslate-flash-2025-12-01

International

$1.577

$0.631

$1.577

$6.308

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Input: audio

Input: image

Output: text

Output: audio

qwen3-livetranslate-flash

Chinese mainland

$1.434

$0.573

$1.434

$5.734

qwen3-livetranslate-flash-2025-12-01

Chinese mainland

$1.434

$0.573

$1.434

$5.734

Qwen-ASR

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-asr-flash-filetrans

International

$0.000035/second

36,000 seconds (10 hours)

qwen3-asr-flash-filetrans-2025-11-17

International

$0.000035/second

36,000 seconds (10 hours)

qwen3-asr-flash

Currently equivalent to qwen3-asr-flash-2025-09-08

International

$0.000035/second

36,000 seconds (10 hours)

qwen3-asr-flash-2026-02-10

International

$0.000035/second

36,000 seconds (10 hours)

qwen3-asr-flash-2025-09-08

International

$0.000035/second

36,000 seconds (10 hours)

China (Beijing)

Model ID

Deployment scope

Input price

qwen3-asr-flash-filetrans

Chinese mainland

$0.000032/second

qwen3-asr-flash-filetrans-2025-11-17

Chinese mainland

$0.000032/second

qwen3-asr-flash

Currently equivalent to qwen3-asr-flash-2025-09-08

Chinese mainland

$0.000032/second

qwen3-asr-flash-2026-02-10

Chinese mainland

$0.000032/second

qwen3-asr-flash-2025-09-08

Chinese mainland

$0.000032/second

US (Virginia)

Model ID

Deployment scope

Input price

qwen3-asr-flash-us

US

$0.000035/second

qwen3-asr-flash-2025-09-08-us

US

$0.000035/second

Qwen-ASR-Realtime

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-asr-flash-realtime

International

$0.000090/second

36,000 seconds (10 hours)

qwen3-asr-flash-realtime-2026-02-10

International

$0.000090/second

36,000 seconds (10 hours)

qwen3-asr-flash-realtime-2025-10-27

International

$0.000090/second

36,000 seconds (10 hours)

China (Beijing)

Model ID

Deployment scope

Input price

qwen3-asr-flash-realtime

Chinese mainland

$0.000047/second

qwen3-asr-flash-realtime-2026-02-10

Chinese mainland

$0.000047/second

qwen3-asr-flash-realtime-2025-10-27

Chinese mainland

$0.000047/second

Fun-ASR

Audio file recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

fun-asr

International

$0.000035/second

36,000 seconds (10 hours)

fun-asr-2025-11-07

International

$0.000035/second

36,000 seconds (10 hours)

fun-asr-2025-08-25

International

$0.000035/second

36,000 seconds (10 hours)

fun-asr-mtl

International

$0.000035/second

36,000 seconds (10 hours)

fun-asr-mtl-2025-08-25

International

$0.000035/second

36,000 seconds (10 hours)

fun-asr-flash-2026-06-15

International

$0.000035/second

36,000 seconds (10 hours)

China (Beijing)

Model ID

Deployment scope

Input price

fun-asr

Chinese mainland

$0.000032/second

fun-asr-2025-11-07

Chinese mainland

$0.000032/second

fun-asr-2025-08-25

Chinese mainland

$0.000032/second

fun-asr-mtl

Chinese mainland

$0.000032/second

fun-asr-mtl-2025-08-25

Chinese mainland

$0.000032/second

fun-asr-flash-2026-06-15

Chinese mainland

$0.00003/second

Real-time speech recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

Singapore

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

fun-asr-realtime

International

$0.00009/second

36,000 seconds (10 hours)

fun-asr-realtime-2025-11-07

International

$0.00009/second

36,000 seconds (10 hours)

China (Beijing)

Model ID

Deployment scope

Input price

fun-asr-realtime

Chinese mainland

$0.000047/second

fun-asr-realtime-2026-02-28

Chinese mainland

$0.000047/second

fun-asr-realtime-2025-11-07

Chinese mainland

$0.000047/second

fun-asr-realtime-2025-09-15

Chinese mainland

$0.000047/second

fun-asr-mtl-realtime

Chinese mainland

$0.000047/second

fun-asr-mtl-realtime-2025-12-10

Chinese mainland

$0.000047/second

fun-asr-flash-8k-realtime

Chinese mainland

$0.000032/second

fun-asr-flash-8k-realtime-2026-01-28

Chinese mainland

$0.000032/second

Paraformer

Audio file recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

China (Beijing)

Model ID

Deployment scope

Input price

paraformer-v2

Chinese mainland

$0.000012/second

paraformer-8k-v2

Chinese mainland

$0.000012/second

Real-time speech recognition

Pricing rule: billed by the duration (in seconds) of input audio. Output is free.

China (Beijing)

Model ID

Deployment scope

Input price

Free quota(Note)

paraformer-realtime-v2

Chinese mainland

$0.000035/second

No free quota

paraformer-realtime-8k-v2

Chinese mainland

$0.000035/second

Text embedding

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

text-embedding-v4

International

$0.07

1 million tokens

text-embedding-v3

International

$0.07

500,000 tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

text-embedding-v4

Chinese mainland

$0.072

Hong Kong (China)

Model ID

Deployment scope

Input price (per 1 million tokens)

text-embedding-v4

Hong Kong (China)

$0.07

Multimodal embedding

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per million input tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

tongyi-embedding-vision-plus

International

$0.09

1 million tokens

tongyi-embedding-vision-flash

International

Image/video:$0.03

Text: $0.09

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-vl-embedding

Chinese mainland

Image/video:$0.258

Text: $0.1

1 million tokens

multimodal-embedding-v1

Chinese mainland

Free trial

No token quota limit

Text reranking

Pricing rule: billed by input tokens. Output is free.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-rerank

International

$0.1

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

qwen3-vl-rerank

Chinese mainland

Text input: $0.1

Image input: $0.258

gte-rerank-v2

Chinese mainland

Text input: $0.115

Industry models

Intent understanding

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

tongyi-intent-detect-v3

Chinese mainland

$0.058

$0.144

No free quota

Role play

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the international service deployment scope. No free quota is available in other service deployment scopes.

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-plus-character

Session Cache discount

International

$0.5

$1.4

1 million tokens

qwen-flash-character

Session Cache discount

International

$0.05

$0.4

1 million tokens

qwen-plus-character-ja

International

$0.5

$1.4

1 million tokens

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-plus-character

Session Cache discount

Chinese mainland

$0.115

$0.287

1 million tokens

qwen-flash-character

Session Cache discount

Chinese mainland

$0.034

$0.203

Error codes

If a model call fails and returns an error message, seeError codesfor resolution.