Service Upgrade

[Model Studio] Qwen-VL-Plus and Qwen-VL-Max: Implicit cache support in the Singapore region

Affected Time

2025-09-12 00:00:00 (UTC+08) (Subject to actual release time)

Changes

Qwen-VL-Plus and Qwen-VL-Max will support implicit caching. Implicit cache is the default automatic mode with no configuration required, suitable for general scenarios. The system automatically detects common prefixes in request content and caches them; cached tokens are billed at 20% of the input price.

Impacts

After the update:

Cached price for Qwen-VL-Plus and Qwen-VL-Max in the Singapore region:

p

Thank you for your continued support in Model Studio.