[Model Studio] Qwen-VL-Plus and Qwen-VL-Max: Implicit cache support in the Singapore region
Sep 11, 2025
Alibaba Cloud Model StudioAffected Time
2025-09-12 00:00:00 (UTC+08) (Subject to actual release time)
Changes
Qwen-VL-Plus and Qwen-VL-Max will support implicit caching. Implicit cache is the default automatic mode with no configuration required, suitable for general scenarios. The system automatically detects common prefixes in request content and caches them; cached tokens are billed at 20% of the input price.
Impacts
After the update:
Cached price for Qwen-VL-Plus and Qwen-VL-Max in the Singapore region:

Thank you for your continued support in Model Studio.















