OpenAI-Compatible Qwen & Multimodal Model Access - Model Studio - Alibaba Cloud - Alibaba Cloud Model Studio

Alibaba Cloud Model Studio is a one-stop model service platform. It provides the full Qwen series and mainstream third-party LLMs through official Qwen APIs and OpenAI-compatible APIs, with multimodal support across text, image, and audio/video. Call models on demand — no infrastructure to manage.

api Call APIs

Generate content and summaries with a few lines of code.

Model Studio is OpenAI-compatible. Update the API key, base URL, and model name to migrate existing OpenAI code.

import os
from openai import OpenAI

# Note: The base_url varies by region. The following example uses the base_url for the Singapore region.
# - Singapore: https://{WorkspaceId}.ap-southeast-1.maas.aliyuncs.com/compatible-mode/v1. Replace {WorkspaceId} with your workspace ID.
# - China (Beijing): https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1
# - China (Hong Kong): https://{WorkspaceId}.cn-hongkong.maas.aliyuncs.com/compatible-mode/v1
# - Japan (Tokyo): https://{WorkspaceId}.ap-northeast-1.maas.aliyuncs.com/compatible-mode/v1
# - Germany (Frankfurt): https://{WorkspaceId}.eu-central-1.maas.aliyuncs.com/compatible-mode/v1
# - US (Virginia): https://dashscope-us.aliyuncs.com/compatible-mode/v1
client = OpenAI(
    api_key=os.getenv("DASHSCOPE_API_KEY"), 
    base_url="https://{WorkspaceId}.ap-southeast-1.maas.aliyuncs.com/compatible-mode/v1"  # Replace {WorkspaceId} with your workspace ID
)
completion = client.chat.completions.create(
    model="qwen3.7-plus",
    messages=[{"role": "user", "content": "Who are you?"}]
)
print(completion.choices[0].message.content)

Model service

Model Studio provides ready-to-use model services, including the proprietary Qwen series and third-party models such as DeepSeek, Kimi, and GLM. See Recommended models.

Qwen flagship models:
- Qwen-Max: The highest-performing model in the Qwen series, suited for complex, multi-step tasks.
  
  The latest qwen3.7-max delivers significant reasoning improvements over its predecessor. Recommended.
- Qwen-Plus: Balances performance, speed, and cost — recommended for most scenarios.
- Qwen-Flash: Low-cost and low-latency — suited for simple tasks that require fast responses.
Multimodal coverage: Includes text generation, visual understanding, image generation, video generation, speech recognition and synthesis, and embedding.
Domain-specific models: Models for long-text processing, translation, data mining, intent recognition, role-playing, and deep research.

Billing

Activating Model Studio is free. Costs apply only when you invoke models. See Billable items.

Free quota for new users

New users receive a free quota in the Singapore region to try model invocation.

Users who have not completed their profile cannot continue using the service after the free quota is depleted. They must complete their profile to switch to pay-as-you-go billing.
Users who have completed their profile are automatically switched to pay-as-you-go billing after the free quota is depleted. To avoid unexpected charges, enable the Free quota only feature — the service stops when the quota is depleted.

For more information, see Free quota for new users.

Payment methods

Model calls are billed per minute. For supported payment methods, see Payment methods.

View bills and usage

Billing details: Go to the Billing Details and Cost Analysis pages.
Call statistics: About one hour after making a model call, go to the Model Studio console, select your region from the top-right corner, go to the Model Monitoring page, set your query conditions, click Monitor in the Actions column for the target model, and view call volume, token consumption, success rate, and other statistics. See Model monitoring.
Coding Plan usage: If you are subscribed to Coding Plan, view quota consumption on the Coding Plan page. Coding Plan uses a fixed monthly fee with a monthly request quota for AI coding tools. See Coding Plan overview.

Getting started

Try models online:
- Open the Model Studio console and select your region from the top-right corner.
- Go to the Playground and select a model.
Make your first API call: Make the first call to a Qwen API

FAQ

Q: Is my data secure? Will Alibaba Cloud Model Studio use my data for training?

A: Alibaba Cloud protects data privacy and will never use your data for model training. All data transmitted when building applications or training models is encrypted. See Security certifications and privacy.

Q: How many regions is Model Studio available in? What are the differences between regions?

A: Model Studio is available in the following regions:

Singapore, US (Virginia), China (Beijing), China (Hong Kong), Japan (Tokyo), Germany (Frankfurt)

Select a nearby region to reduce network latency. Regions differ in endpoints and base URLs, API keys (not interchangeable across regions), supported models, platform features, and pricing. See the Model list.

Q: How can I avoid automatic charges?

A: Model Studio uses pay-as-you-go billing with no automatic deduction. To avoid charges:

Delete API keys: Go to the Model Studio console, select your region, go to the API-KEY page, and delete all API keys.
Stop all model calls in applications, agents, and workflows — including scheduled tasks and background processes.
Turn on Free quota only (new users only): On the model details page, enable this switch. Service stops when the quota is depleted, preventing paid usage. Applies only to models in the Singapore region (international deployment scope) within the validity period.
Set up cost monitoring and alerts: View Billing Details and Monitoring, and set up high-spending alerts to catch abnormal spending.
Coding Plan: This plan offers a fixed monthly fee and monthly request quota, replacing pay-as-you-go charges. Use the dedicated base URL and API key for the Coding Plan when making calls — otherwise, model calls are charged pay-as-you-go. See Coding Plan overview.

Q: How to use Qwen3 or DeepSeek?

Try it online: Open the Model Studio console, select your region, go to the Models page, and click a model. (DeepSeek is available only in the China (Beijing) region.)
Call models via API: See Make the first call to a Qwen API.
Use developer tools (such as Claude Code): See Connect to clients or developer tools.