Model calls (pay-as-you-go)

Pay-as-you-go is a billing method where you pay for services after you use them. Fees are calculated based on the token usage of built-in models in the AI Center of your workspace.

Note

The AI Center is currently in a free trial period. Commercial billing is scheduled to start on April 8, 2026. For more information, see EMR Serverless Spark AI Center Commercialization Announcement.

Pay-as-you-go details

Feature

Description

Billing rules

Billing is based on the token usage of built-in model services in a workspace within a one-hour billing cycle. The billing rules vary for different built-in models:

qwen-plus: Billed based on input tokens and output tokens.
text-embedding-v4: Billed based on input tokens. Output tokens are not billed.

The cost for model calls is calculated as follows, with unit prices quoted per 1,000 tokens: (Input token usage ÷ 1,000) × Unit price for input + (Output token usage ÷ 1,000) × Unit price for output

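For example, suppose you call `ai_query()` 10,000 times in the China (Hangzhou) region, and each call uses 260 input tokens and 50 output tokens. With the unit prices used in this example (0.00096 CNY per 1,000 input tokens and 0.0024 CNY per 1,000 output tokens), the cost is: 0.00096 × 260 × 10,000 ÷ 1,000 + 0.0024 × 50 × 10,000 ÷ 1,000 = 3.696 CNY. Note that this example is expressed in CNY, whereas the regional unit price tables below are expressed in USD.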

Note

To estimate token usage, see the Model calls subsection in Billable items and billing methods.
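The cost formula in the billing rules can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the function name `model_call_cost` and its parameters are hypothetical, and the unit prices are the CNY values taken from the worked example above.

```python
from decimal import Decimal

def model_call_cost(input_tokens, output_tokens,
                    input_price_per_1k, output_price_per_1k):
    """Hypothetical helper: cost of model calls with per-1,000-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / 1000 * Decimal(input_price_per_1k)
    output_cost = Decimal(output_tokens) / 1000 * Decimal(output_price_per_1k)
    return input_cost + output_cost

# 10,000 calls, each with 260 input tokens and 50 output tokens,
# using the CNY unit prices from the worked example.
calls = 10_000
total = model_call_cost(260 * calls, 50 * calls, "0.00096", "0.0024")
print(total)  # numerically equal to 3.696 (CNY)
```

For text-embedding-v4, which is billed on input tokens only, the same helper can be called with an output price of `"0"`.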

Billing cycle

Fees are calculated on the hour, every hour (UTC+8). A new billing cycle begins after each calculation. At the end of each billing cycle, the system generates a bill and deducts the fees from your account. There may be a delay between the actual resource consumption and the data that appears on your bill. For more information, see Bill usage instructions.
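To see which billing cycle a given call would fall into, the hourly boundaries in UTC+8 can be computed as follows. This is a sketch under the assumption that usage is attributed to the clock hour in which the call occurred; the function name `billing_window` is hypothetical, and the actual bill may lag behind consumption as noted above.

```python
from datetime import datetime, timedelta, timezone

# Billing cycles are aligned to the hour in UTC+8.
BILLING_TZ = timezone(timedelta(hours=8))

def billing_window(ts):
    """Return (start, end) of the hourly cycle containing an aware datetime."""
    local = ts.astimezone(BILLING_TZ)
    start = local.replace(minute=0, second=0, microsecond=0)
    return start, start + timedelta(hours=1)

# A call made at 01:37:12 UTC is 09:37:12 in UTC+8,
# so it falls in the 09:00 to 10:00 (UTC+8) cycle.
call_time = datetime(2026, 4, 8, 1, 37, 12, tzinfo=timezone.utc)
start, end = billing_window(call_time)
print(start.isoformat(), end.isoformat())
```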

Regional unit prices

qwen-plus

Region Name	Input token range per request	Input unit price (USD/1,000 tokens)	Output unit price (USD/1,000 tokens)
China (Beijing), China (Shanghai), China (Hangzhou), China (Shenzhen)	0 < Token ≤ 128K	0.000138	0.000344
	128K < Token ≤ 256K	0.000414	0.003442
	256K < Token ≤ 1M	0.000827	0.008257
China (Hong Kong), Singapore, Germany (Frankfurt), US (Virginia), US (Silicon Valley), Japan (Tokyo), Indonesia (Jakarta), Mexico	0 < Token ≤ 256K	0.000480	0.001440
	256K < Token ≤ 1M	0.001440	0.004320
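Because the qwen-plus unit prices depend on the input token range of each request, a per-request cost needs a tier lookup first. The sketch below illustrates this for the China (Beijing), China (Shanghai), China (Hangzhou), and China (Shenzhen) rows of the table. The names `MAINLAND_TIERS` and `qwen_plus_request_cost` are hypothetical, and the code assumes that "K" means 1,000 tokens and "M" means 1,000,000 tokens, which the table does not state explicitly.

```python
from decimal import Decimal

# (upper bound of input tokens per request, input USD/1K, output USD/1K)
# Assumption: 128K = 128,000, 256K = 256,000, 1M = 1,000,000 tokens.
MAINLAND_TIERS = [
    (128_000, Decimal("0.000138"), Decimal("0.000344")),
    (256_000, Decimal("0.000414"), Decimal("0.003442")),
    (1_000_000, Decimal("0.000827"), Decimal("0.008257")),
]

def qwen_plus_request_cost(input_tokens, output_tokens, tiers=MAINLAND_TIERS):
    """Hypothetical helper: USD cost of one request under tiered pricing.

    The tier is selected by the request's input token count and sets
    both the input and the output unit price.
    """
    if input_tokens <= 0:
        raise ValueError("input_tokens must be positive")
    for upper_bound, input_price, output_price in tiers:
        if input_tokens <= upper_bound:
            total = input_tokens * input_price + output_tokens * output_price
            return total / 1000  # prices are per 1,000 tokens
    raise ValueError("input_tokens exceeds the largest listed range")

small = qwen_plus_request_cost(260, 50)        # first tier
large = qwen_plus_request_cost(200_000, 1000)  # second tier
print(small, large)
```

The other region group can be handled the same way by passing a different `tiers` list built from its two rows.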

text-embedding-v4

Region Name	Input unit price (USD/1,000 tokens)
China (Beijing), China (Shanghai), China (Hangzhou), China (Shenzhen)	0.000086
China (Hong Kong), Singapore, Germany (Frankfurt), US (Virginia), US (Silicon Valley), Japan (Tokyo), Indonesia (Jakarta), Mexico	0.000084