Alibaba Cloud Model Studio is a one-stop model service platform. It integrates the full series of Qwen and mainstream third-party LLMs, provides official Qwen APIs and OpenAI-compatible APIs, and supports multimodal capabilities such as text, image, and audio/video. It enables scenarios including code generation, translation, data mining, and intention recognition. Developers can call models on demand without managing infrastructure.
Generate content and summaries with a few lines of code. Model Studio is OpenAI-compatible. Update the API key, base URL, and model name to migrate existing OpenAI code. |
Model service
Model Studio provides model services. Call Qwen models and third-party models such as DeepSeek, Kimi without deployment or operational management. See Model list.
Qwen flagship models:
Qwen-Max: Top Qwen3 model for complex, multi-step tasks.
Qwen-Plus: Balances performance, speed, and cost — recommended for most scenarios.
Qwen3.5-Plus excels at language understanding, logical reasoning, code generation, agent tasks, and multimodal understanding.
Qwen-Flash: Cost-effective and low-latency — ideal for simple tasks with fast responses.
Qwen-Coder: Specialized for code generation and tool calling.
Multimodal coverage: Includes capabilities such as text generation, visual understanding, image generation, video generation, speech recognition and synthesis, and embedding.
Domain-specific models: For specific industries and tasks, provides domain models for long-text processing, translation, data mining, intent recognition, role-playing, and deep research.
Billing
Activating Model Studio is free. You incur costs only when you invoke models. See Billable items and the Model List.
Free quota for new users
New users receive a free quota in the Singapore region to experience model invocation. After depletion, billing switches to pay-as-you-go. To avoid unexpected charges, enable the Free quota only feature, and the service stops when the quota runs out. See Free quota for new users.
Payment methods
Model calls are billed per minute. For supported payment methods, see Payment methods.
View bills and usage
Billing details: Go to the Billing Details and Cost Analysis pages.
Call statistics: About one hour after making a model call, go to the Model Studio console, select your region from the top-right corner of the page, go to the Model Monitoring page, set your query conditions, click Monitor in the Actions column for the target model, and view call volume, token consumption, success rate, and other statistics. See Model monitoring.
Coding Plan usage: If you are subscribed to Coding Plan, view quota consumption on the Coding Plan page. Coding Plan uses a fixed monthly fee with a monthly request quota for AI coding tools. See Coding Plan overview.
Getting started
Try models online:
Access the Model Studio console, and select your region from the top-right corner of the page.
Go to the Playground and select a model.
Make your first API request: Make the first call to a Qwen API