
Alibaba Cloud Model Studio: DashScope API reference

Last Updated: Jan 30, 2026

This topic describes the input and output parameters of the Qwen DashScope API and provides code samples for typical scenarios.

Singapore

HTTP endpoint:

  • Qwen large language model: POST https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation

  • Qwen-VL model: POST https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

base_url for SDK:

Python code

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

Java code

  • Method 1:

    import com.alibaba.dashscope.protocol.Protocol;
    Generation gen = new Generation(Protocol.HTTP.getValue(), "https://dashscope-intl.aliyuncs.com/api/v1");
  • Method 2:

    import com.alibaba.dashscope.utils.Constants;
    Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1";

US (Virginia)

HTTP endpoint:

  • Qwen large language model: POST https://dashscope-us.aliyuncs.com/api/v1/services/aigc/text-generation/generation

  • Qwen-VL model: POST https://dashscope-us.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

base_url for SDK:

Python code

dashscope.base_http_api_url = 'https://dashscope-us.aliyuncs.com/api/v1'

Java code

  • Method 1:

    import com.alibaba.dashscope.protocol.Protocol;
    Generation gen = new Generation(Protocol.HTTP.getValue(), "https://dashscope-us.aliyuncs.com/api/v1");
  • Method 2:

    import com.alibaba.dashscope.utils.Constants;
    Constants.baseHttpApiUrl="https://dashscope-us.aliyuncs.com/api/v1";

China (Beijing)

HTTP endpoint:

  • Qwen large language model: POST https://dashscope.aliyuncs.com/api/v1/services/aigc/text-generation/generation

  • Qwen-VL model: POST https://dashscope.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

No base_url is required for SDK calls.

You must create an API key and set it as an environment variable. If you use the DashScope SDK, you must also install it.
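
As a quick check before you run the samples in this topic, you can confirm that the API key is visible to your process. The following sketch assumes the DASHSCOPE_API_KEY variable name that the samples use:

import os

# Sanity check (sketch): confirm the DASHSCOPE_API_KEY environment variable is
# set before making any calls, for example after: export DASHSCOPE_API_KEY="sk-xxx"
api_key = os.getenv("DASHSCOPE_API_KEY")
if not api_key:
    raise RuntimeError("DASHSCOPE_API_KEY is not set. See 'Obtain an API key'.")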

Request body

Text input

Python

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
# The preceding base_url is for the Singapore region.
messages = [
    {'role': 'system', 'content': 'You are a helpful assistant.'},
    {'role': 'user', 'content': 'Who are you?'}
]
response = dashscope.Generation.call(
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model="qwen-plus", # This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    messages=messages,
    result_format='message'
    )
print(response)

Java

// Use DashScope SDK V2.12.0 or later.
import java.util.Arrays;
import java.lang.System;
import com.alibaba.dashscope.aigc.generation.Generation;
import com.alibaba.dashscope.aigc.generation.GenerationParam;
import com.alibaba.dashscope.aigc.generation.GenerationResult;
import com.alibaba.dashscope.common.Message;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.InputRequiredException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.utils.JsonUtils;
import com.alibaba.dashscope.protocol.Protocol;

public class Main {
    public static GenerationResult callWithMessage() throws ApiException, NoApiKeyException, InputRequiredException {
        Generation gen = new Generation(Protocol.HTTP.getValue(), "https://dashscope-intl.aliyuncs.com/api/v1");
        // The preceding base_url is for the Singapore region.
        Message systemMsg = Message.builder()
                .role(Role.SYSTEM.getValue())
                .content("You are a helpful assistant.")
                .build();
        Message userMsg = Message.builder()
                .role(Role.USER.getValue())
                .content("Who are you?")
                .build();
        GenerationParam param = GenerationParam.builder()
                // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
                .model("qwen-plus")
                .messages(Arrays.asList(systemMsg, userMsg))
                .resultFormat(GenerationParam.ResultFormat.MESSAGE)
                .build();
        return gen.call(param);
    }
    public static void main(String[] args) {
        try {
            GenerationResult result = callWithMessage();
            System.out.println(JsonUtils.toJson(result));
        } catch (ApiException | NoApiKeyException | InputRequiredException e) {
            // Use a logging framework to record the exception.
            System.err.println("An error occurred while calling the generation service: " + e.getMessage());
        }
        System.exit(0);
    }
}

PHP (HTTP)

<?php
$url = "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation";
// The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
$apiKey = getenv('DASHSCOPE_API_KEY');

$data = [
    // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    "model" => "qwen-plus",
    "input" => [
        "messages" => [
            [
                "role" => "system",
                "content" => "You are a helpful assistant."
            ],
            [
                "role" => "user",
                "content" => "Who are you?"
            ]
        ]
    ],
    "parameters" => [
        "result_format" => "message"
    ]
];

$jsonData = json_encode($data);

$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, $jsonData);
curl_setopt($ch, CURLOPT_HTTPHEADER, [
    "Authorization: Bearer $apiKey",
    "Content-Type: application/json"
]);

$response = curl_exec($ch);
$httpCode = curl_getinfo($ch, CURLINFO_HTTP_CODE);

if ($httpCode == 200) {
    echo "Response: " . $response;
} else {
    echo "Error: " . $httpCode . " - " . $response;
}

curl_close($ch);
?>

Node.js (HTTP)

DashScope does not provide an SDK for Node.js. To make calls using the OpenAI Node.js SDK, see the OpenAI section in this topic.

import fetch from 'node-fetch';
// The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
const apiKey = process.env.DASHSCOPE_API_KEY;

const data = {
    model: "qwen-plus", // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    input: {
        messages: [
            {
                role: "system",
                content: "You are a helpful assistant."
            },
            {
                role: "user",
                content: "Who are you?"
            }
        ]
    },
    parameters: {
        result_format: "message"
    }
};

fetch('https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation', {
    method: 'POST',
    headers: {
        'Authorization': `Bearer ${apiKey}`,
        'Content-Type': 'application/json'
    },
    body: JSON.stringify(data)
})
.then(response => response.json())
.then(data => {
    console.log(JSON.stringify(data));
})
.catch(error => {
    console.error('Error:', error);
});

C# (HTTP)

using System.Net.Http.Headers;
using System.Text;

class Program
{
    private static readonly HttpClient httpClient = new HttpClient();

    static async Task Main(string[] args)
    {
        // If you have not configured an environment variable, replace the following line with your Model Studio API key: string? apiKey = "sk-xxx";
        // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
        string? apiKey = Environment.GetEnvironmentVariable("DASHSCOPE_API_KEY");

        if (string.IsNullOrEmpty(apiKey))
        {
            Console.WriteLine("The API key is not set. Make sure that the 'DASHSCOPE_API_KEY' environment variable is set.");
            return;
        }

        // Set the request URL and content.
        string url = "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation";
        // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
        string jsonContent = @"{
            ""model"": ""qwen-plus"", 
            ""input"": {
                ""messages"": [
                    {
                        ""role"": ""system"",
                        ""content"": ""You are a helpful assistant.""
                    },
                    {
                        ""role"": ""user"",
                        ""content"": ""Who are you?""
                    }
                ]
            },
            ""parameters"": {
                ""result_format"": ""message""
            }
        }";

        // Send the request and get the response.
        string result = await SendPostRequestAsync(url, jsonContent, apiKey);

        // Print the result.
        Console.WriteLine(result);
    }

    private static async Task<string> SendPostRequestAsync(string url, string jsonContent, string apiKey)
    {
        using (var content = new StringContent(jsonContent, Encoding.UTF8, "application/json"))
        {
            // Set the request headers.
            httpClient.DefaultRequestHeaders.Authorization = new AuthenticationHeaderValue("Bearer", apiKey);
            httpClient.DefaultRequestHeaders.Accept.Add(new MediaTypeWithQualityHeaderValue("application/json"));

            // Send the request and get the response.
            HttpResponseMessage response = await httpClient.PostAsync(url, content);

            // Handle the response.
            if (response.IsSuccessStatusCode)
            {
                return await response.Content.ReadAsStringAsync();
            }
            else
            {
                return $"Request failed: {response.StatusCode}";
            }
        }
    }
}

Go (HTTP)

DashScope does not provide an SDK for Go. To make calls using the OpenAI Go SDK, see the OpenAI-Go section in this topic.

package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"log"
	"net/http"
	"os"
)

type Message struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type Input struct {
	Messages []Message `json:"messages"`
}

type Parameters struct {
	ResultFormat string `json:"result_format"`
}

type RequestBody struct {
	Model      string     `json:"model"`
	Input      Input      `json:"input"`
	Parameters Parameters `json:"parameters"`
}

func main() {
	// Create an HTTP client.
	client := &http.Client{}

	// Build the request body.
	requestBody := RequestBody{
		// This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
		Model: "qwen-plus",
		Input: Input{
			Messages: []Message{
				{
					Role:    "system",
					Content: "You are a helpful assistant.",
				},
				{
					Role:    "user",
					Content: "Who are you?",
				},
			},
		},
		Parameters: Parameters{
			ResultFormat: "message",
		},
	}

	jsonData, err := json.Marshal(requestBody)
	if err != nil {
		log.Fatal(err)
	}

	// Create a POST request.
	req, err := http.NewRequest("POST", "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation", bytes.NewBuffer(jsonData))
	if err != nil {
		log.Fatal(err)
	}

	// Set the request headers.
	// If you have not configured an environment variable, replace the following line with your Model Studio API key: apiKey := "sk-xxx"
	// The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
	apiKey := os.Getenv("DASHSCOPE_API_KEY")
	req.Header.Set("Authorization", "Bearer "+apiKey)
	req.Header.Set("Content-Type", "application/json")

	// Send the request.
	resp, err := client.Do(req)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	// Read the response body.
	bodyText, err := io.ReadAll(resp.Body)
	if err != nil {
		log.Fatal(err)
	}

	// Print the response content.
	fmt.Printf("%s\n", bodyText)
}

curl

The API keys for the Singapore, Virginia, and Beijing regions are different. For more information, see Obtain an API key.
curl --location "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation" \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--header "Content-Type: application/json" \
--data '{
    "model": "qwen-plus",
    "input":{
        "messages":[      
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Who are you?"
            }
        ]
    },
    "parameters": {
        "result_format": "message"
    }
}'

Streaming output

For more information, see Streaming output.

Text generation models

Python

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
messages = [
    {'role': 'system', 'content': 'You are a helpful assistant.'},
    {'role': 'user', 'content': 'Who are you?'}
]
responses = dashscope.Generation.call(
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    # This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    model="qwen-plus",
    messages=messages,
    result_format='message',
    stream=True,
    incremental_output=True
    )
for response in responses:
    print(response)  

Java

import java.util.Arrays;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import com.alibaba.dashscope.aigc.generation.Generation;
import com.alibaba.dashscope.aigc.generation.GenerationParam;
import com.alibaba.dashscope.aigc.generation.GenerationResult;
import com.alibaba.dashscope.common.Message;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.InputRequiredException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.utils.JsonUtils;
import io.reactivex.Flowable;
import java.lang.System;
import com.alibaba.dashscope.protocol.Protocol;

public class Main {
    private static final Logger logger = LoggerFactory.getLogger(Main.class);
    private static void handleGenerationResult(GenerationResult message) {
        System.out.println(JsonUtils.toJson(message));
    }
    public static void streamCallWithMessage(Generation gen, Message userMsg)
            throws NoApiKeyException, ApiException, InputRequiredException {
        GenerationParam param = buildGenerationParam(userMsg);
        Flowable<GenerationResult> result = gen.streamCall(param);
        result.blockingForEach(message -> handleGenerationResult(message));
    }
    private static GenerationParam buildGenerationParam(Message userMsg) {
        return GenerationParam.builder()
                // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
                .model("qwen-plus")
                .messages(Arrays.asList(userMsg))
                .resultFormat(GenerationParam.ResultFormat.MESSAGE)
                .incrementalOutput(true)
                .build();
    }
    public static void main(String[] args) {
        try {
            Generation gen = new Generation(Protocol.HTTP.getValue(), "https://dashscope-intl.aliyuncs.com/api/v1");
            Message userMsg = Message.builder().role(Role.USER.getValue()).content("Who are you?").build();
            streamCallWithMessage(gen, userMsg);
        } catch (ApiException | NoApiKeyException | InputRequiredException  e) {
            logger.error("An exception occurred: {}", e.getMessage());
        }
        System.exit(0);
    }
}

curl

curl --location "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation" \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--header "Content-Type: application/json" \
--header "X-DashScope-SSE: enable" \
--data '{
    "model": "qwen-plus",
    "input":{
        "messages":[      
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Who are you?"
            }
        ]
    },
    "parameters": {
        "result_format": "message",
        "incremental_output":true
    }
}'

Multimodal models

Python

import os
from dashscope import MultiModalConversation
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

messages = [
    {
        "role": "user",
        "content": [
            {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"},
            {"text": "What is depicted in the image?"}
        ]
    }
]

responses = MultiModalConversation.call(
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model='qwen3-vl-plus',  #  You can replace the model with another multimodal model and modify the messages accordingly.
    messages=messages,
    stream=True,
    incremental_output=True)
    
full_content = ""
print("Streaming output:")
for response in responses:
    if response.output.choices[0].message.content:
        print(response.output.choices[0].message.content[0]['text'])
        full_content += response.output.choices[0].message.content[0]['text']
print(f"Full content: {full_content}")

Java

import java.util.Arrays;
import java.util.Collections;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversation;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationParam;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationResult;
import com.alibaba.dashscope.common.MultiModalMessage;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.exception.UploadFileException;
import io.reactivex.Flowable;
import com.alibaba.dashscope.utils.Constants;

public class Main {
    static {
        Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1";
    }
    public static void streamCall()
            throws ApiException, NoApiKeyException, UploadFileException {
        MultiModalConversation conv = new MultiModalConversation();
        // Build the message content as a list of maps.
        MultiModalMessage userMessage = MultiModalMessage.builder().role(Role.USER.getValue())
                .content(Arrays.asList(Collections.singletonMap("image", "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"),
                        Collections.singletonMap("text", "What is depicted in the image?"))).build();
        MultiModalConversationParam param = MultiModalConversationParam.builder()
                // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
                // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                .model("qwen3-vl-plus")  //  You can replace the model with another multimodal model and modify the messages accordingly.
                .messages(Arrays.asList(userMessage))
                .incrementalOutput(true)
                .build();
        Flowable<MultiModalConversationResult> result = conv.streamCall(param);
        result.blockingForEach(item -> {
            try {
                var content = item.getOutput().getChoices().get(0).getMessage().getContent();
                // Check if the content exists and is not empty.
                if (content != null && !content.isEmpty()) {
                    System.out.println(content.get(0).get("text"));
                }
            } catch (Exception e) {
                System.exit(0);
            }
        });
    }

    public static void main(String[] args) {
        try {
            streamCall();
        } catch (ApiException | NoApiKeyException | UploadFileException e) {
            System.out.println(e.getMessage());
        }
        System.exit(0);
    }
}

curl

curl -X POST https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation \
-H "Authorization: Bearer $DASHSCOPE_API_KEY" \
-H 'Content-Type: application/json' \
-H 'X-DashScope-SSE: enable' \
-d '{
    "model": "qwen3-vl-plus",
    "input":{
        "messages":[
            {
                "role": "user",
                "content": [
                    {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"},
                    {"text": "What is depicted in the image?"}
                ]
            }
        ]
    },
    "parameters": {
        "incremental_output": true
    }
}'

Image input

For more information about how to use large models to analyze images, see Visual understanding.

Python

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'  
messages = [
    {
        "role": "user",
        "content": [
            {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"},
            {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/tiger.png"},
            {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/rabbit.png"},
            {"text": "What are these?"}
        ]
    }
]
response = dashscope.MultiModalConversation.call(
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    # This example uses qwen-vl-max. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    model='qwen-vl-max',
    messages=messages
    )
print(response)

Java

// Copyright (c) Alibaba, Inc. and its affiliates.

import java.util.Arrays;
import java.util.Collections;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversation;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationParam;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationResult;
import com.alibaba.dashscope.common.MultiModalMessage;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.exception.UploadFileException;
import com.alibaba.dashscope.utils.JsonUtils;
import com.alibaba.dashscope.utils.Constants;
public class Main {
    static {
        Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1";
    }
    public static void simpleMultiModalConversationCall()
            throws ApiException, NoApiKeyException, UploadFileException {
        MultiModalConversation conv = new MultiModalConversation();
        MultiModalMessage userMessage = MultiModalMessage.builder().role(Role.USER.getValue())
                .content(Arrays.asList(
                        Collections.singletonMap("image", "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"),
                        Collections.singletonMap("image", "https://dashscope.oss-cn-beijing.aliyuncs.com/images/tiger.png"),
                        Collections.singletonMap("image", "https://dashscope.oss-cn-beijing.aliyuncs.com/images/rabbit.png"),
                        Collections.singletonMap("text", "What are these?"))).build();
        MultiModalConversationParam param = MultiModalConversationParam.builder()
                // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                // This example uses qwen-vl-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
                .model("qwen-vl-plus")
                .message(userMessage)
                .build();
        MultiModalConversationResult result = conv.call(param);
        System.out.println(JsonUtils.toJson(result));
    }

    public static void main(String[] args) {
        try {
            simpleMultiModalConversationCall();
        } catch (ApiException | NoApiKeyException | UploadFileException e) {
            System.out.println(e.getMessage());
        }
        System.exit(0);
    }
}

curl

The API keys for the Singapore/Virginia and Beijing regions are different. For more information, see Obtain and configure an API key.
curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation' \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--header 'Content-Type: application/json' \
--data '{
    "model": "qwen-vl-plus",
    "input":{
        "messages":[
            {
                "role": "user",
                "content": [
                    {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"},
                    {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/tiger.png"},
                    {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/rabbit.png"},
                    {"text": "What are these?"}
                ]
            }
        ]
    }
}'

Video input

The following code is an example of how to pass video frames. For more information about usage, such as passing a video file, see Visual understanding.

Python

import os
# Your DashScope SDK for Python must be V1.20.10 or later.
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
messages = [{"role": "user",
             "content": [
                  # If the model is in the Qwen2.5-VL series and an image list is passed, you can set the fps parameter. This parameter indicates that the image list is extracted from the original video at an interval of 1/fps seconds.
                 {"video":["https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/xzsgiz/football1.jpg",
                           "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/tdescd/football2.jpg",
                           "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/zefdja/football3.jpg",
                           "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/aedbqh/football4.jpg"],
                   "fps":2},
                 {"text": "Describe the process shown in this video"}]}]
response = dashscope.MultiModalConversation.call(
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model='qwen2.5-vl-72b-instruct',  # This example uses qwen2.5-vl-72b-instruct. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/models
    messages=messages
)
print(response["output"]["choices"][0]["message"].content[0]["text"])

Java

// Your DashScope SDK for Java must be V2.18.3 or later.
import java.util.Arrays;
import java.util.Collections;
import java.util.Map;

import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversation;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationParam;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationResult;
import com.alibaba.dashscope.common.MultiModalMessage;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.exception.UploadFileException;
import com.alibaba.dashscope.utils.Constants;

public class Main {
    static {
        Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1";
    }
    private static final String MODEL_NAME = "qwen2.5-vl-72b-instruct"; // This example uses qwen2.5-vl-72b-instruct. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/models
    public static void videoImageListSample() throws ApiException, NoApiKeyException, UploadFileException {
        MultiModalConversation conv = new MultiModalConversation();
        MultiModalMessage systemMessage = MultiModalMessage.builder()
                .role(Role.SYSTEM.getValue())
                .content(Arrays.asList(Collections.singletonMap("text", "You are a helpful assistant.")))
                .build();
        //  If the model is in the Qwen2.5-VL series and an image list is passed, you can set the fps parameter. This parameter indicates that the image list is extracted from the original video at an interval of 1/fps seconds.
        Map<String, Object> params = Map.of(
                "video", Arrays.asList("https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/xzsgiz/football1.jpg",
                        "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/tdescd/football2.jpg",
                        "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/zefdja/football3.jpg",
                        "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/aedbqh/football4.jpg"),
                "fps",2);
        MultiModalMessage userMessage = MultiModalMessage.builder()
                .role(Role.USER.getValue())
                .content(Arrays.asList(
                        params,
                        Collections.singletonMap("text", "Describe the process shown in this video")))
                .build();
        MultiModalConversationParam param = MultiModalConversationParam.builder()
                // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                .model(MODEL_NAME)
                .messages(Arrays.asList(systemMessage, userMessage)).build();
        MultiModalConversationResult result = conv.call(param);
        System.out.print(result.getOutput().getChoices().get(0).getMessage().getContent().get(0).get("text"));
    }
    public static void main(String[] args) {
        try {
            videoImageListSample();
        } catch (ApiException | NoApiKeyException | UploadFileException e) {
            System.out.println(e.getMessage());
        }
        System.exit(0);
    }
}

curl

The API keys for the Singapore, Virginia, and Beijing regions are different. For more information, see Obtain and configure an API key.
curl -X POST https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation \
-H "Authorization: Bearer $DASHSCOPE_API_KEY" \
-H 'Content-Type: application/json' \
-d '{
  "model": "qwen2.5-vl-72b-instruct",
  "input": {
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "video": [
              "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/xzsgiz/football1.jpg",
              "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/tdescd/football2.jpg",
              "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/zefdja/football3.jpg",
              "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20241108/aedbqh/football4.jpg"
            ],
            "fps":2
                 
          },
          {
            "text": "Describe the process shown in this video"
          }
        ]
      }
    ]
  }
}'

Tool calling

For the complete code for the Function calling flow, see Text generation overview.

Python

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_time",
            "description": "This is useful when you want to know the current time.",
            "parameters": {}
        }
    },
    {
        "type": "function",
        "function": {
            "name": "get_current_weather",
            "description": "This is useful when you want to query the weather of a specified city.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "A city or a district, such as Beijing, Hangzhou, or Yuhang District."
                    }
                }
            },
            "required": [
                "location"
            ]
        }
    }
]
messages = [{"role": "user", "content": "What is the weather in Hangzhou?"}]
response = dashscope.Generation.call(
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
    # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    # This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
    model='qwen-plus',
    messages=messages,
    tools=tools,
    result_format='message'
)
print(response)

Java

import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import com.alibaba.dashscope.aigc.conversation.ConversationParam.ResultFormat;
import com.alibaba.dashscope.aigc.generation.Generation;
import com.alibaba.dashscope.aigc.generation.GenerationParam;
import com.alibaba.dashscope.aigc.generation.GenerationResult;
import com.alibaba.dashscope.common.Message;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.InputRequiredException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.tools.FunctionDefinition;
import com.alibaba.dashscope.tools.ToolFunction;
import com.alibaba.dashscope.utils.JsonUtils;
import com.fasterxml.jackson.databind.node.ObjectNode;
import com.github.victools.jsonschema.generator.Option;
import com.github.victools.jsonschema.generator.OptionPreset;
import com.github.victools.jsonschema.generator.SchemaGenerator;
import com.github.victools.jsonschema.generator.SchemaGeneratorConfig;
import com.github.victools.jsonschema.generator.SchemaGeneratorConfigBuilder;
import com.github.victools.jsonschema.generator.SchemaVersion;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import com.alibaba.dashscope.protocol.Protocol;

public class Main {
    public class GetWeatherTool {
        private String location;
        public GetWeatherTool(String location) {
            this.location = location;
        }
        public String call() {
            return location + " is sunny today";
        }
    }
    public class GetTimeTool {
        public GetTimeTool() {
        }
        public String call() {
            LocalDateTime now = LocalDateTime.now();
            DateTimeFormatter formatter = DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");
            String currentTime = "Current time: " + now.format(formatter) + ".";
            return currentTime;
        }
    }
    public static void SelectTool()
            throws NoApiKeyException, ApiException, InputRequiredException {
        SchemaGeneratorConfigBuilder configBuilder =
                new SchemaGeneratorConfigBuilder(SchemaVersion.DRAFT_2020_12, OptionPreset.PLAIN_JSON);
        SchemaGeneratorConfig config = configBuilder.with(Option.EXTRA_OPEN_API_FORMAT_VALUES)
                .without(Option.FLATTENED_ENUMS_FROM_TOSTRING).build();
        SchemaGenerator generator = new SchemaGenerator(config);
        ObjectNode jsonSchema_weather = generator.generateSchema(GetWeatherTool.class);
        ObjectNode jsonSchema_time = generator.generateSchema(GetTimeTool.class);
        FunctionDefinition fdWeather = FunctionDefinition.builder().name("get_current_weather").description("Get the weather of a specified region")
                .parameters(JsonUtils.parseString(jsonSchema_weather.toString()).getAsJsonObject()).build();
        FunctionDefinition fdTime = FunctionDefinition.builder().name("get_current_time").description("Get the current time")
                .parameters(JsonUtils.parseString(jsonSchema_time.toString()).getAsJsonObject()).build();
        Message systemMsg = Message.builder().role(Role.SYSTEM.getValue())
                .content("You are a helpful assistant. When asked a question, use tools wherever possible.")
                .build();
        Message userMsg = Message.builder().role(Role.USER.getValue()).content("Weather in Hangzhou").build();
        List<Message> messages = new ArrayList<>();
        messages.addAll(Arrays.asList(systemMsg, userMsg));
        GenerationParam param = GenerationParam.builder()
                // The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
                .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                // This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
                .model("qwen-plus")
                .messages(messages)
                .resultFormat(ResultFormat.MESSAGE)
                .tools(Arrays.asList(
                        ToolFunction.builder().function(fdWeather).build(),
                        ToolFunction.builder().function(fdTime).build()))
                .build();
        Generation gen = new Generation(Protocol.HTTP.getValue(), "https://dashscope-intl.aliyuncs.com/api/v1");
        // The preceding base_url is for the Singapore region.
        GenerationResult result = gen.call(param);
        System.out.println(JsonUtils.toJson(result));
    }
    public static void main(String[] args) {
        try {
            SelectTool();
        } catch (ApiException | NoApiKeyException | InputRequiredException e) {
            System.out.println(String.format("Exception %s", e.getMessage()));
        }
        System.exit(0);
    }
}

curl

The API keys for the Singapore/Virginia and Beijing regions are different. For more information, see Obtain and configure an API key.
The following URL is for the Singapore region.
curl --location "https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/text-generation/generation" \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--header "Content-Type: application/json" \
--data '{
    "model": "qwen-plus",
    "input": {
        "messages": [{
            "role": "user",
            "content": "What is the weather in Hangzhou?"
        }]
    },
    "parameters": {
        "result_format": "message",
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_current_time",
                "description": "This is useful when you want to know the current time.",
                "parameters": {}
            }
        },{
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "description": "This is useful when you want to query the weather of a specified city.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "location": {
                            "type": "string",
                            "description": "A city or a district, such as Beijing, Hangzhou, or Yuhang District."
                        }
                    }
                },
                "required": ["location"]
            }
        }]
    }
}'

Asynchronous invocation

# Your DashScope SDK for Python must be V1.19.0 or later.
import asyncio
import platform
import os
import dashscope
from dashscope.aigc.generation import AioGeneration

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
# The preceding base_url is for the Singapore region.
async def main():
    response = await AioGeneration.call(
        # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
        # The API keys for the Singapore/Virginia and Beijing regions are different. To get an API key, see https://www.alibabacloud.com/help/en/model-studio/get-api-key
        api_key=os.getenv('DASHSCOPE_API_KEY'),
        # This example uses qwen-plus. You can change the model name as needed. For a list of models, see https://www.alibabacloud.com/help/en/model-studio/getting-started/models
        model="qwen-plus",
        messages=[{"role": "user", "content": "Who are you"}],
        result_format="message",
    )
    print(response)

if platform.system() == "Windows":
    asyncio.set_event_loop_policy(asyncio.WindowsSelectorEventLoopPolicy())
asyncio.run(main())

Document understanding

Python

import os
import dashscope

# Currently, you can call the qwen-long-latest model only in the China (Beijing) region.
dashscope.base_http_api_url = 'https://dashscope.aliyuncs.com/api/v1'
messages = [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        # Replace '{FILE_ID}' with the file ID that you use in the actual conversation scenario.
        {'role': 'system', 'content': f'fileid://{FILE_ID}'},
        {'role': 'user', 'content': 'What is this article about?'}]
response = dashscope.Generation.call(
    # If you have not configured an environment variable, replace the following line with your Model Studio API key: api_key="sk-xxx"
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model="qwen-long-latest",
    messages=messages,
    result_format='message'
)
print(response)

Java

// Currently, you can call the qwen-long-latest model only in the China (Beijing) region, so no base_url is required.
import java.util.Arrays;
import com.alibaba.dashscope.aigc.generation.Generation;
import com.alibaba.dashscope.aigc.generation.GenerationParam;
import com.alibaba.dashscope.aigc.generation.GenerationResult;
import com.alibaba.dashscope.common.Message;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.InputRequiredException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.utils.JsonUtils;

public class Main {
    public static void main(String[] args) {
        try {
            Message systemMsg = Message.builder().role(Role.SYSTEM.getValue()).content("You are a helpful assistant.").build();
            // Replace '{FILE_ID}' with the file ID that you use in the actual conversation scenario.
            Message fileMsg = Message.builder().role(Role.SYSTEM.getValue()).content("fileid://{FILE_ID}").build();
            Message userMsg = Message.builder().role(Role.USER.getValue()).content("What is this article about?").build();
            GenerationParam param = GenerationParam.builder()
                    // If you have not configured an environment variable, replace the following line with your Model Studio API key: .apiKey("sk-xxx")
                    .apiKey(System.getenv("DASHSCOPE_API_KEY"))
                    .model("qwen-long-latest")
                    .messages(Arrays.asList(systemMsg, fileMsg, userMsg))
                    .resultFormat(GenerationParam.ResultFormat.MESSAGE)
                    .build();
            GenerationResult result = new Generation().call(param);
            System.out.println(JsonUtils.toJson(result));
        } catch (ApiException | NoApiKeyException | InputRequiredException e) {
            System.out.println(e.getMessage());
        }
        System.exit(0);
    }
}

curl

Currently, you can call the document understanding model only in the China (Beijing) region.
Replace {FILE_ID} with the file ID that you use in the actual conversation scenario.
curl --location "https://dashscope.aliyuncs.com/api/v1/services/aigc/text-generation/generation" \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--header "Content-Type: application/json" \
--data '{
    "model": "qwen-long-latest",
    "input":{
        "messages":[      
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "system",
                "content": "fileid://{FILE_ID}"
            },
            {
                "role": "user",
                "content": "What is this article about?"
            }
        ]
    },
    "parameters": {
        "result_format": "message"
    }
}'

model string (Required)

The name of the model.

Supported models include Qwen large language models (commercial and open source), Qwen-VL, Qwen-Coder, and Qwen-Math models.

For specific model names and billing information, see Models.

messages array (Required)

Conversation context for the model, in chronological order.

When you make an HTTP call, place messages in the input object.
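
For reference, the following sketch shows the request-body skeleton that the curl samples in this topic follow; the parameter list is not exhaustive:

# Skeleton of an HTTP request body (sketch). messages sit inside "input", and
# generation settings such as result_format sit inside "parameters".
body = {
    "model": "qwen-plus",
    "input": {
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Who are you?"}
        ]
    },
    "parameters": {"result_format": "message"}
}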

Message types

System Message object (Optional)

A system message that sets the role, tone, task objectives, or constraints for the large language model. This message is usually placed at the beginning of the messages array.

Do not set a System Message for QwQ models. Setting a System Message for QVQ models has no effect.

Properties

content string (Required)

The message content.

role string (Required)

The role for a system message. The value is fixed as system.

User Message object (Required)

A user message that passes questions, instructions, or context to the model.

Properties

content string or array (Required)

The message content. The value is a string for text-only input. The value is an array for multimodal input, such as images, or if explicit caching is enabled.

Properties

text string (Required)

The input text.

image string (Optional)

The image file for image understanding. You can pass the image in one of the following three ways:

  • A public URL of the image.

  • The Base64 encoding of the image, in the format data:image/<format>;base64,<data>.

  • The absolute path of a local file.

Applicable models: Qwen-VL, QVQ

Example value: {"image":"https://xxxx.jpeg"}

video array or string (Optional)

The video to pass to the Qwen-VL model or QVQ model.

  • If you pass a list of images, the value is an array.

  • If you pass a video file, the value is a string.

To pass a local file, see Local files (Qwen-VL) or Local files (QVQ).

Example values:

  • List of images: {"video":["https://xx1.jpg",...,"https://xxn.jpg"]}

  • Video file: {"video":"https://xxx.mp4"}

fps float (Optional)

The number of frames to extract per second. The value must be in the range of [0.1, 10]. The default value is 2.0.

Two features are available:

  • When you input a video file, this parameter controls the frame extraction frequency. One frame is extracted every 1/fps seconds.

    Applicable to the Qwen-VL model and QVQ model.
  • This parameter informs the model of the time interval between adjacent frames. This helps the model better understand the temporal dynamics of the video. This function applies to both video file and image list inputs. It is suitable for scenarios such as event time localization or segment content summarization.

    Supports the Qwen2.5-VL and Qwen3-VL models, and the QVQ model.

Example values:

  • List of images input: {"video":["https://xx1.jpg",...,"https://xxn.jpg"], "fps":2}

  • Video file input: {"video": "https://xx1.mp4", "fps":2}

A larger fps value is suitable for high-speed motion scenarios, such as sports events and action movies. A smaller fps value is suitable for long videos or content with static scenes.

max_frames integer (Optional)

The maximum number of frames to extract from a video. If the number of frames calculated based on fps exceeds max_frames, the system automatically and evenly samples frames to stay within max_frames.

  • qwen3-vl-plus series, qwen3-vl-flash series, qwen3-vl-235b-a22b-thinking, and qwen3-vl-235b-a22b-instruct: The maximum and default value is 2000.

  • qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, and qwen-vl-plus-0815: The maximum and default value is 512.

The OpenAI-compatible API does not support specifying max_frames. Default values of specific models will be used.
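
The interplay between fps and max_frames reduces to simple arithmetic. The following sketch only illustrates the rule described above; it is not the service's exact sampling code:

# Illustration of how fps and max_frames jointly bound the frame count,
# using a hypothetical 10-minute video and a 512-frame cap.
duration_sec = 600
fps = 2.0
max_frames = 512

requested = int(duration_sec * fps)         # 1200 frames at 2 fps
extracted = min(requested, max_frames)      # capped at 512, sampled evenly
print(extracted, duration_sec / extracted)  # 512 frames, about 1.17 s apart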

min_pixels integer (Optional)

Sets the minimum pixel threshold for an input image or video frame. If the pixel count of an input image or video frame is less than min_pixels, the image or frame is scaled up until its total pixel count is greater than min_pixels.

Valid range

  • Image input:

    • Qwen3-VL: The default and minimum value is 65536.

    • qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: The default and minimum value is 4096.

    • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and QVQ series models: The default and minimum value is 3136.

  • Video file or image list input:

    • Qwen3-VL (including commercial and open source versions), qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: The default value is 65536, and the minimum value is 4096.

    • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and QVQ series models: The default value is 50176, and the minimum value is 3136.

Example values

  • Image input: {"type": "image_url","image_url": {"url":"https://xxxx.jpg"},"min_pixels": 65536}

  • Video file input: {"video":"https://xxxx.mp4","min_pixels": 65536}

  • List of images input: {"video":["https://xx1.jpg",...,"https://xxn.jpg"], "min_pixels": 65536}

max_pixels integer (Optional)

Sets the maximum pixel threshold for an input image or video frame. If the pixel count of an input image or video is within the [min_pixels, max_pixels] range, the model processes the original image. If the input image's pixel count is greater than max_pixels, the image is scaled down until its total pixel count is less than max_pixels.

  • Image input:

    • Applicable models: Supported by QVQ and Qwen-VL models.

    • Value range:

      max_pixels value range

      The value of max_pixels depends on whether the vl_high_resolution_images parameter is enabled.

      • If vl_high_resolution_images is set to False:

        • Qwen3-VL: The default value is 2621440, and the maximum value is 16777216.

        • qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: The default value is 1310720, and the maximum value is 16777216.

        • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and the QVQ model series: The default value is 1003520, and the maximum value is 12845056.

      • If vl_high_resolution_images is set to True:

        • Qwen3-VL, qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: max_pixels is invalid. The maximum pixel count for the input image is fixed at 16777216.

        • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and QVQ series models: max_pixels is invalid. The maximum pixel count for the input image is fixed at 12845056.

    • Example value: {"image":"https://xxxx.jpg", "max_pixels": 8388608}

  • Video file or image list input:

    • Applicable models: Qwen-VL, QVQ

    • Value range:

      max_pixels value range

      • qwen3-vl-plus series, qwen3-vl-flash series, qwen3-vl-235b-a22b-thinking, and qwen3-vl-235b-a22b-instruct: The default value is 655360, and the maximum value is 2048000.

      • Other Qwen3-VL open source models, qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: The default value is 655360, and the maximum value is 786432.

      • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and QVQ series models: The default value is 501760, and the maximum value is 602112.

    • Example values:

      • Video file input: {"video":"https://xxxx.mp4","max_pixels": 655360}

      • List of images input: {"video":["https://xx1.jpg",...,"https://xxn.jpg"], "max_pixels": 655360}

total_pixels integer (Optional)

Limits the total number of pixels for all frames extracted from a video (pixels per frame × total number of frames).

If the total pixel count of the video exceeds this limit, the system scales down the video frames. However, it still ensures that the pixel value of a single frame remains within the [min_pixels, max_pixels] range.

  • Applicable models: Qwen-VL, QVQ

  • Value range:

    total_pixels value range

    • qwen3-vl-plus series, qwen3-vl-flash series, qwen3-vl-235b-a22b-thinking, and qwen3-vl-235b-a22b-instruct: The default and minimum value is 134217728. This value corresponds to 131072 image tokens (1 image token per 32×32 pixels).

    • Other Qwen3-VL open source models, qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: The default and minimum value is 67108864. This value corresponds to 65536 image tokens (1 image token per 32×32 pixels).

    • Other qwen-vl-plus models, other qwen-vl-max models, the Qwen2.5-VL open source series, and QVQ series models: The default and minimum value is 51380224. This value corresponds to 65536 image tokens (1 image token per 28×28 pixels).

  • Example values:

    • Video file input: {"video":"https://xxxx.mp4","total_pixels": 134217728}

    • List of images input: {"video":["https://xx1.jpg",...,"https://xxn.jpg"], "total_pixels": 134217728}

For long videos with a high number of extracted frames, you can lower this value to reduce token consumption and processing time. However, this may cause a loss of image detail.
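
The token figures quoted above follow directly from the pixel budgets. A quick arithmetic check, for illustration only:

# Verify the image-token figures quoted above.
print(134217728 // (32 * 32))  # 131072 tokens at 1 token per 32x32 pixels
print(67108864 // (32 * 32))   # 65536 tokens at 1 token per 32x32 pixels
print(51380224 // (28 * 28))   # 65536 tokens at 1 token per 28x28 pixels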

cache_control object (Optional)

Enables explicit caching. This parameter is supported only by models that support explicit cache.

Properties

type string (Required)

The value is fixed as ephemeral.
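
This section gives no example value, so the following shape is an assumption based on the property layout above: cache_control sits alongside text in a content item of a user message. Confirm the exact shape against the explicit cache documentation before you rely on it.

# Assumed shape (not confirmed by this section): cache_control attached to a
# content item of a user message, with type fixed as "ephemeral".
message = {
    "role": "user",
    "content": [
        {
            "text": "A long, reusable context that you want cached...",
            "cache_control": {"type": "ephemeral"}
        }
    ]
}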

role string (Required)

The role for a user message. The value is fixed as user.

Assistant Message object (Optional)

The model's reply to the user's message.

Properties

content string (Optional)

The message content. This parameter is optional only when the tool_calls parameter is specified in the assistant message.

role string (Required)

The value is fixed as assistant.

partial boolean (Optional)

Specifies whether to enable partial mode. For more information, see Partial mode.

Supported models

  • Qwen-Max series

    qwen3-max, qwen3-max-2025-09-23, qwen3-max-preview (non-thinking mode), qwen-max, qwen-max-latest, and snapshot models from qwen-max-2025-01-25 or later

  • Qwen-Plus series (non-thinking mode)

    qwen-plus, qwen-plus-latest, and snapshot models from qwen-plus-2025-01-25 or later

  • Qwen-Flash series (non-thinking mode)

    qwen-flash, and snapshot models from qwen-flash-2025-07-28 or later

  • Qwen-Coder series

    qwen3-coder-plus, qwen3-coder-flash, qwen3-coder-480b-a35b-instruct, qwen3-coder-30b-a3b-instruct

  • Qwen-VL series

    • qwen3-vl-plus series (non-thinking mode)

      qwen3-vl-plus, and snapshot models from qwen3-vl-plus-2025-09-23 or later

    • qwen3-vl-flash series (non-thinking mode)

      qwen3-vl-flash, and snapshot models from qwen3-vl-flash-2025-10-15 or later

    • qwen-vl-max series

      qwen-vl-max, qwen-vl-max-latest, and snapshot models from qwen-vl-max-2025-04-08 or later

    • qwen-vl-plus series

      qwen-vl-plus, qwen-vl-plus-latest, and snapshot models from qwen-vl-plus-2025-01-25 or later

  • Qwen-Turbo series (non-thinking mode)

    qwen-turbo, qwen-turbo-latest, and snapshot models from qwen-turbo-2024-11-01 or later

  • Qwen open-source series

    Qwen3 open-source models (non-thinking mode), Qwen2.5 series text models, Qwen3-VL open-source models (non-thinking mode)

tool_calls array (Optional)

The tool and input parameter information that is returned after you initiate a function call. This parameter contains one or more objects and is obtained from the tool_calls field of the previous model response.

Properties

id string

The ID of the tool response.

type string

The tool type. Currently, only function is supported.

function object

The tool and input parameter information.

Properties

name string

The tool name.

arguments string

The input parameter information, in a JSON string format.

index integer

The index of the current tool information in the tool_calls array.

Tool Message object (Optional)

The output information of the tool.

Properties

content string (Required)

The output content of the tool function. The value must be a string.

role string (Required)

The value is fixed as tool.

tool_call_id string (Optional)

The ID that is returned after you initiate a function call. You can obtain the ID from response.output.choices[0].message.tool_calls[$index]["id"]. This parameter associates the Tool Message with the corresponding tool call.
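
As a reference for assembling these messages, here is a minimal sketch of the message sequence for one function-calling round trip. The get_weather tool, its arguments, and the call ID are hypothetical.

# Hypothetical function-calling round trip; get_weather and call_abc123 are illustrative.
messages = [
    {'role': 'user', 'content': 'What is the weather in Singapore?'},
    {
        # Assistant Message: copy tool_calls from the previous model response
        'role': 'assistant',
        'content': '',
        'tool_calls': [{
            'id': 'call_abc123',  # from response.output.choices[0].message.tool_calls[0]["id"]
            'type': 'function',
            'function': {
                'name': 'get_weather',
                'arguments': '{"location": "Singapore"}'
            },
            'index': 0
        }]
    },
    {
        # Tool Message: the tool's output, linked by tool_call_id
        'role': 'tool',
        'content': '24°C, light rain',
        'tool_call_id': 'call_abc123'
    }
]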

temperature float (Optional)

The sampling temperature. This value controls the diversity of text that the model generates.

Higher values increase diversity; lower values make output more deterministic.

The value must be in the range of [0, 2).

When making an HTTP call, place temperature in the parameters object.
Do not modify the default temperature value for QVQ models.

top_p float (Optional)

The probability threshold for nucleus sampling. It controls the diversity of the text that the model generates.

A higher top_p value results in more diverse text. A lower top_p value produces more deterministic text.

The value must be in the range of (0, 1.0].

Default top_p values

Qwen3 (non-thinking mode), Qwen3-Instruct series, Qwen3-Coder series, qwen-max series, qwen-plus series (non-thinking mode), qwen-flash series (non-thinking mode), qwen-turbo series (non-thinking mode), Qwen open source series, qwen-vl-max-2025-08-13, and Qwen3-VL (non-thinking mode): 0.8

qwen-vl-plus series, qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-2025-04-08, qwen2.5-vl-3b-instruct, qwen2.5-vl-7b-instruct, qwen2.5-vl-32b-instruct, and qwen2.5-vl-72b-instruct: 0.001

QVQ series, qwen-vl-plus-2025-07-10, qwen-vl-plus-2025-08-15: 0.5

qwen3-max-preview (thinking mode), Qwen3-Omni-Flash series: 1.0

Qwen3 (thinking mode), Qwen3-VL (thinking mode), Qwen3-Thinking, QwQ series, Qwen3-Omni-Captioner: 0.95

In the Java SDK, the parameter is topP. When you call the API over HTTP, add top_p to the parameters object.
Do not modify the default top_p value for QVQ models.

top_k integer (Optional)

The size of the candidate set for sampling during generation. For example, if you set the value to 50, only the 50 highest-scoring tokens in a single generation are used as the candidate set for random sampling. A larger value results in more random output, and a smaller value results in more deterministic output. A value of `null` or a value greater than 100 disables the `top_k` policy. In this case, only the `top_p` policy is effective.

The value must be greater than or equal to 0.

Default top_k values

QVQ series, qwen-vl-plus-2025-07-10, and qwen-vl-plus-2025-08-15: 10

QwQ series: 40

Other qwen-vl-plus series models, qwen-vl-max models earlier than qwen-vl-max-2025-08-13, and qwen2.5-omni-7b: 1

Qwen3-Omni-Flash series: 50

All other models: 20

In the Java SDK, this parameter is named topK. For HTTP calls, add top_k to the parameters object.
Do not modify the default top_k value for QVQ models.
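
The following is a minimal Python sketch that passes the three sampling parameters through the SDK. The values shown are illustrative, not recommendations.

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'  # Singapore region
response = dashscope.Generation.call(
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model='qwen-plus',
    messages=[{'role': 'user', 'content': 'Write a tagline for a coffee shop.'}],
    result_format='message',
    temperature=1.2,  # higher values increase diversity
    top_p=0.9,        # nucleus sampling threshold
    top_k=50          # size of the sampling candidate set
)
print(response.output.choices[0].message.content)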

enable_thinking boolean (Optional)

Specifies whether to enable thinking mode when you use a hybrid thinking model. This parameter applies to the Qwen3 and Qwen3-VL models. For more information, see Deep thinking.

Valid values:

  • true: The thinking content is returned in the reasoning_content field.

  • false: Thinking mode is disabled.

For the default value of each model, see Supported models.

In the Java SDK, this parameter is named enableThinking. When making an HTTP call, place enable_thinking in the parameters object.

thinking_budget integer (Optional)

The maximum number of tokens for the thinking process. This parameter applies to Qwen3-VL and to the commercial and open-source versions of the Qwen3 models. For more information, see Limit thinking length.

The default value is the model's maximum chain-of-thought length. For more information, see Models.

In the Java SDK, this parameter is named thinkingBudget. When you make a call using HTTP, place thinking_budget in the parameters object.
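
A minimal sketch, assuming a hybrid thinking model such as qwen-plus-latest. Thinking mode supports only streaming output, and the thinking content arrives in reasoning_content.

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
responses = dashscope.Generation.call(
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model='qwen-plus-latest',  # assumption: a model that supports thinking mode
    messages=[{'role': 'user', 'content': 'Why is the sky blue?'}],
    result_format='message',
    enable_thinking=True,
    thinking_budget=2048,      # cap the chain-of-thought at 2048 tokens
    stream=True,               # thinking mode supports only streaming output
    incremental_output=True
)
for chunk in responses:
    msg = chunk.output.choices[0].message
    # reasoning_content carries the thinking; content carries the final reply
    print(msg.reasoning_content or msg.content, end='', flush=True)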

enable_code_interpreter boolean (Optional) Defaults to: false

Specifies whether to enable the code interpreter feature. This parameter applies only to qwen3-max-2026-01-23 and to qwen3-max-preview in thinking mode. For more information, see Code interpreter.

Valid values:

  • true

  • false

This parameter is not supported by the Java SDK. When calling using HTTP, place enable_code_interpreter in the parameters object.

repetition_penalty float (Optional)

The penalty for repeating consecutive sequences during model generation. A higher repetition_penalty value reduces repetition in the generated text. A value of 1.0 means no penalty. The value must be greater than 0.

In the Java SDK, this parameter is called repetitionPenalty. When making an HTTP call, place repetition_penalty in the parameters object.
If you use the qwen-vl-plus-2025-01-25 model for text extraction, set repetition_penalty to 1.0.
Do not modify the default repetition_penalty value for QVQ models.

presence_penalty float (Optional)

Controls how much the model avoids repeating content.

Value range: [-2.0, 2.0]. Positive values reduce repetition, and negative values increase it.

For scenarios that require diversity and creativity, such as creative writing or brainstorming, increase this value. For scenarios that require consistency and terminological accuracy, such as technical documents or formal text, decrease this value.

Default presence_penalty values

qwen3-max-preview (thinking mode), Qwen3 (non-thinking mode), Qwen3-Instruct series, qwen3-0.6b/1.7b/4b (thinking mode), QVQ series, qwen-max, qwen-max-latest, qwen2.5-vl series, qwen-vl-max series, qwen-vl-plus, Qwen3-VL (non-thinking): 1.5.

qwen-vl-plus-latest, qwen-vl-plus-2025-08-15: 1.2.

qwen-vl-plus-2025-01-25: 1.0.

qwen3-8b/14b/32b/30b-a3b/235b-a22b (thinking mode), qwen-plus/qwen-plus-latest/qwen-plus-2025-04-28 (thinking mode), qwen-turbo/qwen-turbo-latest/qwen-turbo-2025-04-28 (thinking mode): 0.5.

All other models: 0.0.

How it works

If the parameter value is positive, the model penalizes tokens that already exist in the text. The penalty amount does not depend on how many times the token appears. This reduces the chance of these tokens reappearing. As a result, content repetition decreases and word diversity increases.

Example

Prompt: Translate this sentence into English: "Esta película es buena. La trama es buena, la actuación es buena, la música es buena, y en general, toda la película es simplemente buena. Es realmente buena, de hecho. La trama es tan buena, y la actuación es tan buena, y la música es tan buena."

Parameter value 2.0: This movie is very good. The plot is great, the acting is great, the music is also very good, and overall, the whole movie is incredibly good. In fact, it is truly excellent. The plot is very exciting, the acting is outstanding, and the music is so beautiful.

Parameter value 0.0: This movie is good. The plot is good, the acting is good, the music is also good, and overall, the whole movie is very good. In fact, it is really great. The plot is very good, the acting is also very outstanding, and the music is also excellent.

Parameter value -2.0: This movie is very good. The plot is very good, the acting is very good, the music is also very good, and overall, the whole movie is very good. In fact, it is really great. The plot is very good, the acting is also very good, and the music is also very good.

When using the qwen-vl-plus-2025-01-25 model for text extraction, set presence_penalty to 1.5.
Do not modify the default presence_penalty value for QVQ models.
The Java SDK does not support this parameter. When you make a call using HTTP, place presence_penalty in the parameters object.

vl_high_resolution_images boolean (Optional) Defaults to: false

Increases the maximum pixel limit for input images when enabled. The limit is set to the pixel value that corresponds to 16384 tokens. For more information, see Process high-resolution images.

  • vl_high_resolution_images: true: A fixed-resolution strategy is used, and the max_pixels setting is ignored. If an image exceeds this resolution, its total pixels are downscaled to this limit.

    When vl_high_resolution_images is true, different models have different pixel limits:

    • Qwen3-VL series, qwen-vl-max, qwen-vl-max-latest, qwen-vl-max-0813, qwen-vl-plus, qwen-vl-plus-latest, qwen-vl-plus-0815: 16777216 (each token corresponds to 32×32 pixels, that is, 16384×32×32)

    • QVQ series and other Qwen2.5-VL series models: 12845056 (each token corresponds to 28×28 pixels, that is, 16384×28×28)

  • If vl_high_resolution_images is false, the effective resolution is determined by both max_pixels and the default limit: if the image exceeds max_pixels, it is downscaled to max_pixels. The default pixel limit for each model is the default value of max_pixels.

In the Java SDK, the parameter is vlHighResolutionImages. The minimum required version is 2.20.8. When making an HTTP call, place vl_high_resolution_images in the parameters object.
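
A minimal sketch using the multimodal interface; the image URL is a placeholder.

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
messages = [{
    'role': 'user',
    'content': [
        {'image': 'https://example.com/high_res.jpg'},  # placeholder URL
        {'text': 'Describe this image in detail.'}
    ]
}]
response = dashscope.MultiModalConversation.call(
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model='qwen-vl-max',
    messages=messages,
    vl_high_resolution_images=True  # max_pixels is ignored in this mode
)
print(response)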

vl_enable_image_hw_output boolean (Optional) Defaults to: false

Specifies whether to return the dimensions of the scaled image. The model scales the input image. If this parameter is set to true, the model returns the height and width of the scaled image. If streaming output is enabled, this information is returned in the last chunk. This parameter is supported by the Qwen-VL model.

In the Java SDK, this parameter is named vlEnableImageHwOutput. The minimum Java SDK version is 2.20.8. When making HTTP calls, place vl_enable_image_hw_output in the parameters object.

max_input_tokens integer (Optional)

The maximum allowed token length for the input. This parameter is currently supported only by the qwen-plus-2025-07-28 and qwen-plus-latest models.

  • qwen-plus-latest default value: 129,024

    The default value may be adjusted to 1,000,000 in the future.
  • qwen-plus-2025-07-28 default value: 1,000,000

The Java SDK does not currently support this parameter. When you make a call using HTTP, place max_input_tokens in the parameters object.

max_tokens integer (Optional)

Maximum tokens in the response. Generation stops when this limit is reached, and the returned finish_reason is length.

The default and maximum values are the model's maximum output length. For more information, see Models.

This parameter is useful for controlling the output length in scenarios such as generating summaries or keywords, or for reducing costs and shortening response times.

max_tokens does not limit the length of the chain-of-thought.
In the Java SDK, this parameter is maxTokens. For Qwen-VL models, it is maxLength in the Java SDK, but versions after 2.18.4 also support maxTokens. When you make a call using HTTP, place max_tokens in the parameters object.

seed integer (Optional)

A random number seed. This parameter ensures that results are reproducible for the same input and parameters. If you pass the same seed value in a call and other parameters remain unchanged, the model returns the same result as much as possible.

Value range: [0, 2³¹-1].

When you make a call using HTTP, place seed in the parameters object.

stream boolean (Optional) Defaults to: false

Specifies whether to return the response in streaming output mode. Parameter values:

  • false: The model returns the complete result at once after the generation is complete.

  • true: The output is generated and sent incrementally. This means that a chunk is returned as soon as a part of the content is generated.

This parameter is supported only by the Python SDK. To implement streaming output using the Java SDK, call the streamCall interface. To implement streaming output using HTTP, specify X-DashScope-SSE as enable in the request header.
The commercial edition of Qwen3 (thinking mode), the open source edition of Qwen3, QwQ, and QVQ support only streaming output.

incremental_output boolean (Optional) Default: false. The default for Qwen3-Max, Qwen3-VL, Qwen3 open source models, QwQ, and QVQ models is true.

Enables incremental streaming output. Recommended: true.

Parameter values:

  • false: Each output is the entire sequence that is generated. The last output is the complete result.

    I
    I like
    I like apple
    I like apple.
  • true (recommended): The output is incremental. This means that subsequent output does not include previously generated content. Read these fragments in real time to get the complete result.

    I
    like
    apple
    .
In the Java SDK, this corresponds to incrementalOutput. When calling via HTTP, add incremental_output to the parameters object.
QwQ models and Qwen3 models in thinking mode support only the true value. Because the default value for Qwen3 commercial models is false, manually set this parameter to true when using thinking mode.
Qwen3 open source models do not support the false value.
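
A minimal Python sketch of incremental streaming: each chunk carries only newly generated text, so the fragments must be concatenated to rebuild the full reply.

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
responses = dashscope.Generation.call(
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model='qwen-plus',
    messages=[{'role': 'user', 'content': 'Tell me a short story.'}],
    result_format='message',
    stream=True,
    incremental_output=True  # False would resend the full sequence in every chunk
)
full_text = ''
for chunk in responses:
    delta = chunk.output.choices[0].message.content
    full_text += delta  # rebuild the complete reply from the fragments
    print(delta, end='', flush=True)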

response_format object (Optional) Defaults to: {"type": "text"}.

The format of the returned content. Valid values:

  • {"type": "text"}: Outputs a text response.

  • {"type": "json_object"}: Outputs a standard JSON string.

  • {"type": "json_schema","json_schema": {...} }: Outputs a JSON string in a specified format.

For more information, see Structured output.
For information about supported models, see Supported models.
If you set this parameter to {"type": "json_object"}, you must explicitly instruct the model to output JSON in the prompt. For example, use a prompt such as "Please output in JSON format." Otherwise, an error is reported.
In the Java SDK, this parameter is named responseFormat. When making an HTTP call, place response_format in the parameters object.

Properties

type string (Required)

The format of the returned content. Valid values:

  • text: Outputs a text response.

  • json_object: Outputs a standard JSON string.

  • json_schema: Outputs a JSON string in a specified format.

json_schema object

This field is required when type is set to json_schema. It defines the configuration for structured output.

Properties

name string (Required)

The unique name of the schema. It can contain only letters (uppercase or lowercase), digits, underscores (_), and hyphens (-). The name can be up to 64 characters long.

description string (Optional)

A description of the schema's purpose. This helps the model understand the semantic context of the output.

schema object (Optional)

An object that complies with the JSON Schema standard. It defines the data structure of the model's output.

To learn how to build a JSON Schema, see JSON Schema.

strict boolean (Optional) Defaults to: false.

Controls whether the model must strictly follow all constraints of the schema.

  • true (Recommended)

    The model strictly follows all constraints, such as field types, required fields, and formats. This ensures 100% compliance of the output.

  • false (Not recommended)

    The model only loosely follows the schema. This may generate non-compliant output and cause validation to fail.
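
Putting the properties together, a minimal sketch of a json_schema configuration; the person_info schema is illustrative only.

# Illustrative response_format; the schema content is hypothetical.
response_format = {
    'type': 'json_schema',
    'json_schema': {
        'name': 'person_info',
        'description': 'Basic information about a person.',
        'schema': {
            'type': 'object',
            'properties': {
                'name': {'type': 'string'},
                'age': {'type': 'integer'}
            },
            'required': ['name', 'age']
        },
        'strict': True  # enforce all schema constraints
    }
}
# Pass it to the call, for example:
# dashscope.Generation.call(..., response_format=response_format)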

result_format string (Optional) Defaults to: text (For Qwen3-Max, Qwen3-VL, QwQ models, and Qwen3 open source models (except qwen3-next-80b-a3b-instruct), the default is message)

The format of the returned data. Set this parameter to message to simplify multi-turn conversations.

The platform will update the default value to message in a future release.
In the Java SDK, this parameter is named resultFormat. When making calls over HTTP, place result_format in the parameters object.
For Qwen-VL and QVQ models, setting this parameter to text has no effect.
The Qwen3-Max, Qwen3-VL, and Qwen3 models in thinking mode support only the message value. Because the default value for Qwen3 commercial models is text, set this parameter to message.
If you use the Java SDK to call a Qwen3 open source model and pass text, the response is still returned in the message format.

logprobs boolean (Optional) Defaults to: false

Specifies whether to return the log probabilities of the output tokens. Valid values:

  • true

  • false

The following models are supported:

  • Snapshot models of the qwen-plus series (excluding the main model)

  • Snapshot models of the qwen-turbo series (excluding the main model)

  • Qwen3 open source models

When you make a call using HTTP, place logprobs in the parameters object.

top_logprobs integer (Optional) Default: 0

Specifies the number of most likely candidate tokens to return at each generation step.

Value range: [0, 5]

This parameter takes effect only when logprobs is set to true.

In the Java SDK, the parameter is topLogprobs. For HTTP calls, place top_logprobs in the parameters object.
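
A minimal sketch, assuming a snapshot model that supports log probabilities:

import os
import dashscope

dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
response = dashscope.Generation.call(
    api_key=os.getenv('DASHSCOPE_API_KEY'),
    model='qwen-plus-2025-04-28',  # assumption: a snapshot model that supports logprobs
    messages=[{'role': 'user', 'content': 'Say hello.'}],
    result_format='message',
    logprobs=True,
    top_logprobs=3  # return up to 3 candidate tokens per position
)
print(response.output.choices[0].logprobs)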

n integer (Optional) Default: 1

The number of responses to generate. The value range is 1-4. For scenarios that require you to generate multiple responses, such as creative writing or ad copy, you can set a larger n value.

This parameter is supported only by Qwen3 models in non-thinking mode. The value is fixed to 1 when the tools parameter is passed.
Setting a larger value for n does not increase input token consumption, but it increases output token consumption.
When making HTTP calls, place n in the parameters object.

stop string or array (Optional)

Specifies stop words. When a string or token_id that is specified in stop appears in the text generated by the model, generation stops immediately.

For example, you can pass sensitive words to control the model's output.

When stop is an array, do not use a token_id and a string as elements at the same time. For example, ["Hello",104307] is not a valid value.
When you make a call using HTTP, place stop in the parameters object.

tools array (Optional)

An array of one or more tool objects for the model to call in Function calling. For more information, see Function Calling.

When using the tools parameter, set the result_format parameter to message.

Set the tools parameter when you initiate Function calling or submit tool execution results.

Properties

type string (Required)

The tool type. Currently, only function is supported.

function object (Required)

Properties

name string (Required)

The name of the tool function. The name must contain only letters, numbers, underscores (_), and hyphens (-). The maximum length is 64 characters.

description string (Required)

A description of the tool function. This helps the model decide when and how to call the tool function.

parameters object (Optional) Defaults to: {}

A description of the tool's parameters. The value must be a valid JSON Schema. For information about how to build one, see JSON Schema. If the parameters parameter is empty, the function has no input parameters.

Specify parameters for more accurate tool calling.
When making HTTP calls, place tools in the parameters object. This parameter is not currently supported by the qwen-vl series models.
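
A minimal sketch of a tools array that follows the schema above; the get_weather function is hypothetical.

# Hypothetical tool definition; get_weather is illustrative only.
tools = [{
    'type': 'function',
    'function': {
        'name': 'get_weather',
        'description': 'Returns the current weather for a given city.',
        'parameters': {
            'type': 'object',
            'properties': {
                'location': {'type': 'string', 'description': 'The city name'}
            },
            'required': ['location']
        }
    }
}]
# Pass together with result_format='message', for example:
# dashscope.Generation.call(..., tools=tools, result_format='message')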

tool_choice string or object (Optional) Defaults to: auto

The tool selection policy. Set this parameter to force a tool call for a specific type of question, such as always using a specific tool or disabling all tools.

  • auto

    The model automatically selects a tool.

  • none

    To temporarily disable tool calling in a specific request, set the tool_choice parameter to none.

  • {"type": "function", "function": {"name": "the_function_to_call"}}

    To force a call to a specific tool, set the tool_choice parameter to {"type": "function", "function": {"name": "the_function_to_call"}}, where the_function_to_call is the name of the specified tool function.

    Models in thinking mode do not support forcing a call to a specific tool.
In the Java SDK, this parameter is named toolChoice. When making an HTTP call, place tool_choice in the parameters object.
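
For illustration, the three policies expressed as Python values, reusing the hypothetical get_weather tool from the preceding sketch:

tool_choice = 'auto'  # default: the model decides whether to call a tool
tool_choice = 'none'  # temporarily disable tool calling for this request
tool_choice = {'type': 'function', 'function': {'name': 'get_weather'}}  # force one tool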

parallel_tool_calls boolean (Optional) Defaults to: false

Specifies whether to enable parallel tool calling.

Valid values:

  • true

  • false

For more information about parallel tool calling, see Parallel tool calling.

For the Java SDK, you can use parallelToolCalls. For HTTP calls, you can add parallel_tool_calls to the parameters object.

Chat response object (same format for streaming and non-streaming output)

{
  "status_code": 200,
  "request_id": "902fee3b-f7f0-9a8c-96a1-6b4ea25af114",
  "code": "",
  "message": "",
  "output": {
    "text": null,
    "finish_reason": null,
    "choices": [
      {
        "finish_reason": "stop",
        "message": {
          "role": "assistant",
          "content": "I am a large-scale language model developed by Alibaba Cloud. My name is Qwen."
        }
      }
    ]
  },
  "usage": {
    "input_tokens": 22,
    "output_tokens": 17,
    "total_tokens": 39
  }
}

status_code integer

The status code of the request. A value of 200 indicates that the request was successful. Otherwise, the request failed.

The Java SDK does not return this parameter. If the call fails, an exception is thrown that contains the content of the status_code and message parameters.

request_id string

The unique ID for this call.

The Java SDK returns this parameter as requestId.

code string

The error code. This parameter is empty when the call is successful.

Only the Python SDK returns this parameter.

output object

The result of the call.

Properties

text string

The reply generated by the model. This field contains the reply content if the result_format input parameter is set to text.

finish_reason string

This parameter is not empty if the result_format input parameter is set to text.

Consider the following four scenarios:

  • The value is null during generation.

  • stop: The model's output ended naturally or triggered a stop condition from the input parameters.

  • length: The generation ended because the output exceeded the maximum length.

  • tool_calls: A tool call was triggered.

choices array

A list of output choices from the model. This parameter is returned if `result_format` is set to `message`.

Properties

finish_reason string

The four cases are as follows:

  • The value is null during generation.

  • stop: The model's output ended naturally or triggered a stop condition from the input parameters.

  • length: The generation ended because the output exceeded the maximum length.

  • tool_calls: A tool call was triggered.

message object

The message object that is output by the model.

Properties

role string

The role of the output message. The value is fixed to `assistant`.

content string or array

The content of the output message. The value is an array if you use a Qwen-VL or Qwen-Audio series model. Otherwise, the value is a string.

If Function calling is triggered, this value is empty.

Properties

text string

The content of the output message when you use a Qwen-VL or Qwen-Audio series model.

image_hw array

When the `vl_enable_image_hw_output` parameter is enabled for a Qwen-VL series model, the following cases apply:

  • Image input: Returns the height and width of the image in pixels.

  • Video input: Returns an empty array.

reasoning_content string

The deep thinking content from the model.

tool_calls array

The model generates the `tool_calls` parameter if it needs to call a tool.

Properties

function object

The name of the tool to call and its input parameters.

Properties

name string

The name of the tool to call.

arguments string

The input parameters for the tool, formatted as a JSON string.

Because model responses have a degree of randomness, the output JSON string might not be valid for your function. Validate the parameters before passing them to the function.
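
A minimal defensive-parsing sketch; response is assumed to be the result of a previous Generation.call that triggered a tool call.

import json

# Assumption: response comes from a previous Generation.call with tools set.
tool_call = response.output.choices[0].message.tool_calls[0]
try:
    args = json.loads(tool_call['function']['arguments'])
except json.JSONDecodeError:
    args = None  # invalid JSON: retry the request or fall back to a default
if args is not None:
    print(tool_call['function']['name'], args)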

index integer

The index of the current tool_calls object in the `tool_calls` array.

id string

The ID of this tool response.

type string

The type of the tool. The value is fixed to function.

logprobs object

The probability information for the current `choices` object.

Properties

content array

An array of tokens with log probability information.

Properties

token string

The current token.

bytes array

A list of the raw UTF-8 bytes for the current token. This is helpful for accurately restoring the output, especially when handling emojis and Chinese characters.

logprob float

The log probability of the current token. A `null` value indicates an extremely low probability.

top_logprobs array

The most likely tokens at the current token position and their log probabilities. The number of elements is the same as the value of the top_logprobs input parameter.

Properties

token string

The current token.

bytes array

A list of the raw UTF-8 bytes for the current token. This is helpful for accurately restoring the output, especially when handling emojis and Chinese characters.

logprob float

The log probability of the current token. A `null` value indicates an extremely low probability.

usage map

Information about the tokens that are used in this chat request.

Properties

input_tokens integer

The number of tokens in the user input.

output_tokens integer

The number of tokens in the model output.

input_tokens_details object

Details about the number of tokens in the input when you use the Qwen-VL model or QVQ model.

Properties

text_tokens integer

When you use the Qwen-VL model or QVQ model, this is the number of tokens into which the input text is converted.

image_tokens integer

The number of tokens in the input image.

video_tokens integer

The number of tokens in the input video file or image list.

total_tokens integer

This field is returned when the input is plain text. The value is the sum of input_tokens and output_tokens.

image_tokens integer

This field is returned when the input includes an image. The value is the number of tokens in the user's input image.

video_tokens integer

This field is returned when the input includes a video. The value is the number of tokens in the user's input video.

audio_tokens integer

This field is returned when the input includes audio. The value is the number of tokens in the user's input audio.

output_tokens_details object

Details about the number of tokens in the output.

Properties

text_tokens integer

The number of tokens in the output text.

reasoning_tokens integer

The number of tokens in the Qwen3 model's thinking process.

prompt_tokens_details object

A fine-grained breakdown of input tokens.

Properties

cached_tokens integer

The number of tokens that hit the cache. For more information, see Context cache.

cache_creation object

Information about the creation of an explicit cache.

Properties

ephemeral_5m_input_tokens integer

The number of tokens that are used to create an explicit cache with a 5-minute validity period.

cache_creation_input_tokens integer

The number of tokens that are used to create an explicit cache.

cache_type string

When you use an explicit cache, the value of this parameter is ephemeral. Otherwise, this parameter is not returned.

Error codes

If the model call fails and returns an error message, see Error messages for a solution.