Alibaba Cloud Model Studio periodically deprecates older models to optimize resource use and ensure you have access to the latest and most effective versions. This topic describes the model deprecation policy.
Notifications
Timeline
Snapshot models: A deprecation notice is published 30 days before the official deprecation date. These model names contain a specific date, such as qwen-max-2025-01-25, and are common for Qwen series models.
Stable models: A deprecation notice is published 3 months before the official deprecation date. These are the core versions of a model series.
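The two notice periods can be told apart from the model name alone. The following is a minimal sketch, not an official tool. It assumes the date in a snapshot name always has the form YYYY-MM-DD, as in qwen-max-2025-01-25, and that any name without such a date is treated as a stable model. The function names are hypothetical.

```python
import re
from datetime import date, timedelta

# Assumption: the date in a snapshot model name is written as YYYY-MM-DD.
_DATE_IN_NAME = re.compile(r"\d{4}-\d{2}-\d{2}")


def is_snapshot_model(name: str) -> bool:
    """Return True if the model name contains a YYYY-MM-DD date."""
    return bool(_DATE_IN_NAME.search(name))


def notice_period(name: str) -> str:
    """Return the notice period from the policy for the given model name.

    Assumption: every name without a date is treated as a stable model.
    """
    return "30 days" if is_snapshot_model(name) else "3 months"


def snapshot_notice_date(deprecation_date: date) -> date:
    """Return the day that falls 30 days before a snapshot model's deprecation date."""
    return deprecation_date - timedelta(days=30)
```

For example, notice_period("qwen-max-2025-01-25") returns "30 days", and snapshot_notice_date(date(2025, 8, 20)) returns July 21, 2025.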
Notification methods
Notifications are sent through emails, internal messages, and official website announcements.
Emails and internal messages are sent only to users who have called a model scheduled for deprecation within the last three months.
Impacts of deprecation
From the date the deprecation notice is published, the requests per minute (RPM) and tokens per minute (TPM) limits for the model are gradually reduced. If you previously obtained a quota increase for the model, the quota is first reset to the default limit and then reduced. During this process, the model's APIs and related console features remain available.
From the official deprecation date:
Model inference: Model inference services are discontinued. Applications and services that call the deprecated model no longer receive results.
Console features and documentation: Related console features, such as the Models page and Playground, and the official documentation are retired at the same time.
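Because calls to a deprecated model stop returning results, an application may want a temporary safety net while it migrates. The following is a minimal sketch of trying a list of models in order. The call_model parameter is a hypothetical function that you supply to wrap your own client, and the sketch assumes that a failed call raises an exception. This document does not specify how a call to a retired model fails.

```python
from typing import Callable, Optional, Sequence


def call_with_fallback(
    call_model: Callable[[str, str], str],
    prompt: str,
    models: Sequence[str],
) -> str:
    """Try each model in order and return the first successful result.

    call_model is a hypothetical wrapper around your own client that takes
    (model_name, prompt) and returns the generated text.
    """
    last_error: Optional[Exception] = None
    for model in models:
        try:
            return call_model(model, prompt)
        except Exception as exc:  # Assumption: a failed call raises an exception.
            last_error = exc
    raise RuntimeError("no model in the list returned a result") from last_error
```

A fallback of this kind only reduces the impact of a missed migration. Testing the replacement model and switching to it before the deprecation date remains the recommended path.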
Recommended actions
Go to the Model Observation (Singapore) page and check whether you are using any models that are scheduled for deprecation.
If you want to use models in the China (Beijing) region, go to the Model Observation page for the China (Beijing) region.
If you are using any of these models, test the performance of the recommended replacement models and then switch to them.
List of deprecated models (Singapore region)
Retired on August 20, 2025
Type | Model | Retire date | Recommended replacement |
Text generation - Qwen | qwen2-72b-instruct | August 20, 2025, 00:00:00 (UTC+08:00) | Qwen large language models: QwQ, Qwen-Max, Qwen-Plus, Qwen-Turbo, Qwen3 |
qwen2-57b-a14b-instruct | |||
qwen2-7b-instruct | |||
qwen1.5-110b-chat | |||
qwen1.5-72b-chat | |||
qwen1.5-32b-chat | |||
qwen1.5-14b-chat | |||
qwen1.5-7b-chat |
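The model names in the table above can also be checked against your own configuration, as a complement to the Model Observation page. The following is a minimal sketch. The model names and the retirement time are copied from the table, while the function names and the idea of keeping a list of models in use are assumptions for illustration.

```python
from datetime import datetime, timedelta, timezone
from typing import Iterable, List

# Model names copied from the table above.
RETIRED_MODELS = frozenset({
    "qwen2-72b-instruct",
    "qwen2-57b-a14b-instruct",
    "qwen2-7b-instruct",
    "qwen1.5-110b-chat",
    "qwen1.5-72b-chat",
    "qwen1.5-32b-chat",
    "qwen1.5-14b-chat",
    "qwen1.5-7b-chat",
})

# August 20, 2025, 00:00:00 (UTC+08:00), as listed in the table.
RETIRE_AT = datetime(2025, 8, 20, 0, 0, 0, tzinfo=timezone(timedelta(hours=8)))


def find_retired(models_in_use: Iterable[str]) -> List[str]:
    """Return the names in models_in_use that appear in the table, sorted."""
    return sorted(set(models_in_use) & RETIRED_MODELS)


def is_past_retirement(now: datetime) -> bool:
    """Return True once a timezone-aware time has reached the retirement time."""
    return now >= RETIRE_AT
```

For example, find_retired(["qwen-plus", "qwen2-7b-instruct"]) returns ["qwen2-7b-instruct"]. The time passed to is_past_retirement must be timezone-aware, for example datetime.now(timezone.utc), because Python raises a TypeError when a naive datetime is compared with an aware one.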