Pay-as-you-go is a billing method where you pay for services after you use them. Fees are calculated based on the token usage of built-in models in the AI Center of your workspace.
The AI Center is currently in a free trial period. Commercial billing is scheduled to start on April 8, 2026. For more information, see EMR Serverless Spark AI Center Commercialization Announcement.
Pay-as-you-go details
Feature | Description |
Billing rules | Billing is based on the token usage of built-in model services in a workspace within each one-hour billing cycle. The billing rules vary by built-in model; the unit prices for each model are listed under Regional unit prices.
The cost of model calls is calculated as follows: Cost = Input token usage × Input unit price + Output token usage × Output unit price. For example, if you call `ai_query()` 10,000 times in the China (Hangzhou) region, and each call uses 260 input tokens and 50 output tokens, the total usage is 2,600,000 input tokens and 500,000 output tokens. The cost is that usage multiplied by the corresponding unit prices, which are quoted per 1,000 tokens. Note: To estimate token usage, see the Model calls subsection in Billable items and billing methods. |
Billing cycle | Fees are calculated on the hour, every hour (UTC+8). A new billing cycle begins after each calculation. At the end of each billing cycle, the system generates a bill and deducts the fees from your account. There may be a delay between the actual resource consumption and the data that appears on your bill. For more information, see Bill usage instructions. |
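The cost formula in the billing rules can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the function name `model_call_cost` is hypothetical, and the prices plugged in are the first qwen-plus tier listed under Regional unit prices, used only as sample values because the region each price row applies to is not identified here.

```python
from decimal import Decimal

# Unit prices are quoted per 1,000 tokens.
TOKENS_PER_PRICE_UNIT = Decimal(1000)


def model_call_cost(input_tokens, output_tokens, input_price, output_price):
    """Return input usage x input unit price + output usage x output unit price, in USD."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_PRICE_UNIT * Decimal(input_price)
    output_cost = Decimal(output_tokens) / TOKENS_PER_PRICE_UNIT * Decimal(output_price)
    return input_cost + output_cost


# Usage from the example: 10,000 calls, 260 input and 50 output tokens per call.
calls = 10_000
total_input = calls * 260   # 2,600,000 input tokens
total_output = calls * 50   # 500,000 output tokens

# Sample prices only (assumption): first qwen-plus tier from the price table.
cost = model_call_cost(total_input, total_output, "0.000138", "0.000344")
print(f"{total_input} input tokens, {total_output} output tokens -> {cost:.4f} USD")
```

`Decimal` is used instead of `float` so that very small per-token prices do not pick up binary rounding error when summed.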
Regional unit prices
qwen-plus
Region Name | Input token range per request | Input unit price (USD/1,000 tokens) | Output unit price (USD/1,000 tokens) |
| 0 < Token ≤ 128K | 0.000138 | 0.000344 |
| 128K < Token ≤ 256K | 0.000414 | 0.003442 |
| 256K < Token ≤ 1M | 0.000827 | 0.008257 |
| 0 < Token ≤ 256K | 0.000480 | 0.001440 |
| 256K < Token ≤ 1M | 0.001440 | 0.004320 |
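Because qwen-plus prices depend on the input token range of each request, a cost estimate first has to pick the matching tier. The sketch below shows one way to do that using the first three rows of the table above. It rests on several assumptions that the table does not state: that K means 1,000 and M means 1,000,000, that the tier chosen by a request's input tokens sets both the input and output price for that whole request, and that requests above the largest range are rejected. The names `QWEN_PLUS_TIERS`, `select_tier`, and `request_cost` are hypothetical.

```python
from decimal import Decimal

# (upper bound of input tokens per request, input price, output price).
# Prices are USD per 1,000 tokens. Assumption: K = 1,000 and M = 1,000,000.
QWEN_PLUS_TIERS = [
    (128_000, Decimal("0.000138"), Decimal("0.000344")),
    (256_000, Decimal("0.000414"), Decimal("0.003442")),
    (1_000_000, Decimal("0.000827"), Decimal("0.008257")),
]


def select_tier(input_tokens, tiers):
    """Return (input_price, output_price) for the first tier whose upper bound fits."""
    if input_tokens <= 0:
        raise ValueError("input_tokens must be positive")
    for upper_bound, input_price, output_price in tiers:
        if input_tokens <= upper_bound:
            return input_price, output_price
    raise ValueError("input_tokens exceeds the largest listed range")


def request_cost(input_tokens, output_tokens, tiers):
    """Cost of one request in USD, priced at the tier chosen by its input tokens."""
    input_price, output_price = select_tier(input_tokens, tiers)
    return (Decimal(input_tokens) * input_price
            + Decimal(output_tokens) * output_price) / 1000


print(request_cost(260, 50, QWEN_PLUS_TIERS))      # lowest tier
print(request_cost(200_000, 50, QWEN_PLUS_TIERS))  # middle tier
```

The range boundaries are inclusive at the top, matching the `≤` in the table: a request with exactly 128,000 input tokens falls in the first tier under the assumptions above.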
text-embedding-v4
Region Name | Input unit price (USD/1,000 tokens) |
| 0.000086 |
| 0.000084 |