Alibaba Cloud Model Studio is a one-stop model service platform. It provides the full Qwen series and mainstream third-party LLMs through official Qwen APIs and OpenAI-compatible APIs, with multimodal support across text, image, and audio/video. Call models on demand — no infrastructure to manage.
|
Generate content and summaries with a few lines of code. Model Studio is OpenAI-compatible. Update the API key, base URL, and model name to migrate existing OpenAI code.
|
Model service
Model Studio provides ready-to-use model services, including the proprietary Qwen series and third-party models such as DeepSeek, Kimi, and GLM. See Recommended models.
-
Qwen flagship models:
-
Qwen-Max: The highest-performing model in the Qwen series, suited for complex, multi-step tasks.
The latest qwen3.7-max delivers significant reasoning improvements over its predecessor. Recommended.
-
Qwen-Plus: Balances performance, speed, and cost — recommended for most scenarios.
-
Qwen-Flash: Low-cost and low-latency — suited for simple tasks that require fast responses.
-
-
Multimodal coverage: Includes text generation, visual understanding, image generation, video generation, speech recognition and synthesis, and embedding.
-
Domain-specific models: Models for long-text processing, translation, data mining, intent recognition, role-playing, and deep research.
Billing
Activating Model Studio is free. Costs apply only when you invoke models. See Billable items.
Free quota for new users
New users receive a free quota in the Singapore region to try model invocation.
-
Users who have not completed their profile cannot continue using the service after the free quota is depleted. They must complete their profile to switch to pay-as-you-go billing.
-
Users who have completed their profile are automatically switched to pay-as-you-go billing after the free quota is depleted. To avoid unexpected charges, enable the Free quota only feature — the service stops when the quota is depleted.
For more information, see Free quota for new users.
Payment methods
Model calls are billed per minute. For supported payment methods, see Payment methods.
View bills and usage
-
Billing details: Go to the Billing Details and Cost Analysis pages.
-
Call statistics: About one hour after making a model call, go to the Model Studio console, select your region from the top-right corner, go to the Model Monitoring page, set your query conditions, click Monitor in the Actions column for the target model, and view call volume, token consumption, success rate, and other statistics. See Model monitoring.
-
Coding Plan usage: If you are subscribed to Coding Plan, view quota consumption on the Coding Plan page. Coding Plan uses a fixed monthly fee with a monthly request quota for AI coding tools. See Coding Plan overview.
Getting started
-
Try models online:
-
Open the Model Studio console and select your region from the top-right corner.
-
Go to the Playground and select a model.
-
-
Make your first API call: Make the first call to a Qwen API