All Products
Search
Document Center

Alibaba Cloud Model Studio:Qwen - Image Editing Qwen-Image-Edit

Last Updated:Dec 17, 2025

The Qwen-Image-Edit-Plus model supports multi-image input and output. You can use it to accurately modify text in images, add, delete, or move objects, change the actions of subjects, transfer image styles, and enhance details.

Getting started

This example shows how to use the qwen-image-edit-plus model to generate two edited images based on three input images and a prompt.

Input prompt: The girl in Image 1 wears the black dress from Image 2 and sits in the pose from Image 3.

Input image 1

Input image 2

Input image 3

Output images (multiple images)

image99

image98

image89

image100

imageout2

Before making a call, obtain an API key and set the API key as an environment variable.

To make calls using the SDK, install the DashScope SDK. The SDK is available for Python and Java.

The Qwen image editing models support one to three input images. The qwen-image-edit-plus series of models, including qwen-image-edit-plus and qwen-image-edit-plus-2025-10-30, can generate one to six images. The qwen-image-edit model can generate only one image. The URLs for the generated images are valid for 24 hours. You should download the images to a local device using their URLs promptly.

Python

import json
import os
import dashscope
from dashscope import MultiModalConversation

# The following is the URL for the Singapore region. If you use a model in the Beijing region, replace the URL with: https://dashscope.aliyuncs.com/api/v1
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

# The model supports one to three input images.
messages = [
    {
        "role": "user",
        "content": [
            {"image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/thtclx/input1.png"},
            {"image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/iclsnx/input2.png"},
            {"image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/gborgw/input3.png"},
            {"text": "Make the girl from Image 1 wear the black dress from Image 2 and sit in the pose from Image 3."}
        ]
    }
]

# The API Keys for the Singapore and Beijing regions are different. Get an API Key: https://www.alibabacloud.com/help/en/model-studio/get-api-key
# If you have not configured the environment variable, replace the following line with your Model Studio API Key: api_key="sk-xxx"
api_key = os.getenv("DASHSCOPE_API_KEY")

# qwen-image-edit-plus supports outputting 1 to 6 images. This example shows how to output 2 images.
response = MultiModalConversation.call(
    api_key=api_key,
    model="qwen-image-edit-plus",
    messages=messages,
    stream=False,
    n=2,
    watermark=False,
    negative_prompt="low quality",
    prompt_extend=True,
    # The size parameter is supported only when the number of output images n is 1. Otherwise, an error is reported.
    # size="1024*2048",
)

if response.status_code == 200:
    # To view the full response, uncomment the following line.
    # print(json.dumps(response, ensure_ascii=False))
    for i, content in enumerate(response.output.choices[0].message.content):
        print(f"URL of output image {i+1}: {content['image']}")
else:
    print(f"HTTP status code: {response.status_code}")
    print(f"Error code: {response.code}")
    print(f"Error message: {response.message}")
    print("For more information, see the documentation: https://www.alibabacloud.com/help/en/model-studio/error-code")

Response example

{
    "status_code": 200,
    "request_id": "121d8c7c-360b-4d22-a976-6dbb8bxxxxxx",
    "code": "",
    "message": "",
    "output": {
        "text": null,
        "finish_reason": null,
        "choices": [
            {
                "finish_reason": "stop",
                "message": {
                    "role": "assistant",
                    "content": [
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        },
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        }
                    ]
                }
            }
        ]
    },
    "usage": {
        "input_tokens": 0,
        "output_tokens": 0,
        "height": 1248,
        "image_count": 2,
        "width": 832
    }
}

Java

import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversation;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationParam;
import com.alibaba.dashscope.aigc.multimodalconversation.MultiModalConversationResult;
import com.alibaba.dashscope.common.MultiModalMessage;
import com.alibaba.dashscope.common.Role;
import com.alibaba.dashscope.exception.ApiException;
import com.alibaba.dashscope.exception.NoApiKeyException;
import com.alibaba.dashscope.exception.UploadFileException;
import com.alibaba.dashscope.utils.JsonUtils;
import com.alibaba.dashscope.utils.Constants;

import java.io.IOException;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.List;

public class QwenImageEdit {

    static {
        // The following URL is for the Singapore region. If you use a model in the Beijing region, replace the URL with https://dashscope.aliyuncs.com/api/v1.
        Constants.baseHttpApiUrl = "https://dashscope-intl.aliyuncs.com/api/v1";
    }
    
    // The API keys for the Singapore and Beijing regions are different. To obtain an API key, visit https://www.alibabacloud.com/help/en/model-studio/get-api-key.
    // If you have not configured the environment variable, replace the following line with your Model Studio API key: apiKey="sk-xxx"
    static String apiKey = System.getenv("DASHSCOPE_API_KEY");

    public static void call() throws ApiException, NoApiKeyException, UploadFileException, IOException {

        MultiModalConversation conv = new MultiModalConversation();

        // The model supports one to three input images.
        MultiModalMessage userMessage = MultiModalMessage.builder().role(Role.USER.getValue())
                .content(Arrays.asList(
                        Collections.singletonMap("image", "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/thtclx/input1.png"),
                        Collections.singletonMap("image", "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/iclsnx/input2.png"),
                        Collections.singletonMap("image", "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/gborgw/input3.png"),
                        Collections.singletonMap("text", "Make the girl from Image 1 wear the black dress from Image 2 and sit in the pose from Image 3.")
                )).build();
        // qwen-image-edit-plus supports outputting 1 to 6 images. This example shows how to output 2 images.
        Map<String, Object> parameters = new HashMap<>();
        parameters.put("watermark", false);
        parameters.put("negative_prompt", "low quality");
        parameters.put("n", 2);
        parameters.put("prompt_extend", true);
        // The size parameter is supported only when the number of output images n is 1. Otherwise, an error is reported.
        // parameters.put("size", "1024*2048");

        MultiModalConversationParam param = MultiModalConversationParam.builder()
                .apiKey(apiKey)
                .model("qwen-image-edit-plus")
                .messages(Collections.singletonList(userMessage))
                .parameters(parameters)
                .build();

        MultiModalConversationResult result = conv.call(param);
        // To view the complete response, uncomment the following line.
        // System.out.println(JsonUtils.toJson(result));
        List<Map<String, Object>> contentList = result.getOutput().getChoices().get(0).getMessage().getContent();
        int imageIndex = 1;
        for (Map<String, Object> content : contentList) {
            if (content.containsKey("image")) {
                System.out.println("URL of output image " + imageIndex + ": " + content.get("image"));
                imageIndex++;
            }
        }
    }

    public static void main(String[] args) {
        try {
            call();
        } catch (ApiException | NoApiKeyException | UploadFileException | IOException e) {
            System.out.println(e.getMessage());
        }
    }
}

Response example

{
    "requestId": "46281da9-9e02-941c-ac78-be88b8xxxxxx",
    "usage": {
        "image_count": 2,
        "width": 1216,
        "height": 864
    },
    "output": {
        "choices": [
            {
                "finish_reason": "stop",
                "message": {
                    "role": "assistant",
                    "content": [
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        },
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        }
                    ]
                }
            }
        ]
    }
}

curl

The following command uses the URL for the Singapore region. If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer $DASHSCOPE_API_KEY" \
--data '{
    "model": "qwen-image-edit-plus",
    "input": {
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/thtclx/input1.png"
                    },
                    {
                        "image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/iclsnx/input2.png"
                    },
                    {
                        "image": "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/gborgw/input3.png"
                    },
                    {
                        "text": "Make the girl from Image 1 wear the black dress from Image 2 and sit in the pose from Image 3."
                    }
                ]
            }
        ]
    },
    "parameters": {
        "n": 2,
        "negative_prompt": "low quality",
        "prompt_extend": true,
        "watermark": false
    }
}'

Response example

{
    "output": {
        "choices": [
            {
                "finish_reason": "stop",
                "message": {
                    "role": "assistant",
                    "content": [
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        },
                        {
                            "image": "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
                        }
                    ]
                }
            }
        ]
    },
    "usage": {
        "width": 1248,
        "image_count": 2,
        "height": 832
    },
    "request_id": "bf37ca26-0abe-98e4-8065-xxxxxx"
}

Download an image to a local device using its URL

# You need to install requests to download the image: pip install requests
import requests


def download_image(image_url, save_path='output.png'):
    try:
        response = requests.get(image_url, stream=True, timeout=300)  # Set timeout
        response.raise_for_status()  # Raise an exception if the HTTP status code is not 200.
        with open(save_path, 'wb') as f:
            for chunk in response.iter_content(chunk_size=8192):
                f.write(chunk)
        print(f"Image downloaded successfully to: {save_path}")

    except requests.exceptions.RequestException as e:
        print(f"Image download failed: {e}")


image_url = "https://dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com/xxx.png?Expires=xxx"
download_image(image_url, save_path='output.png')
import java.io.FileOutputStream;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
 
public class ImageDownloader {
    public static void downloadImage(String imageUrl, String savePath) {
        try {
            URL url = new URL(imageUrl);
            HttpURLConnection connection = (HttpURLConnection) url.openConnection();
            connection.setConnectTimeout(5000);
            connection.setReadTimeout(300000);
            connection.setRequestMethod("GET");
            InputStream inputStream = connection.getInputStream();
            FileOutputStream outputStream = new FileOutputStream(savePath);
            byte[] buffer = new byte[8192];
            int bytesRead;
            while ((bytesRead = inputStream.read(buffer)) != -1) {
                outputStream.write(buffer, 0, bytesRead);
            }
            inputStream.close();
            outputStream.close();
 
            System.out.println("Image downloaded successfully to: " + savePath);
        } catch (Exception e) {
            System.err.println("Image download failed: " + e.getMessage());
        }
    }
 
    public static void main(String[] args) {
        String imageUrl = "http://dashscope-result-bj.oss-cn-beijing.aliyuncs.com/xxx?Expires=xxx";
        String savePath = "output.png";
        downloadImage(imageUrl, savePath);
    }
}

Model selection

  • qwen-image-edit-plus series (Recommended): This series includes qwen-image-edit-plus and qwen-image-edit-plus-2025-10-30. It supports single-image editing, multi-image fusion, intelligent prompt optimization, and custom resolutions. It can generate one to six images.

  • qwen-image-edit: Supports single-image editing and multi-image fusion. It can generate only one image.

For more information, see Qwen image editing.

Input instructions

Input images (messages)

The messages parameter is an array that must contain a single object. This object must include the role and content properties. The role must be set to user, and the content must include both an image (one to three images) and text (one editing instruction).

The input images must meet the following requirements:

  • Image formats: JPG, JPEG, PNG, BMP, TIFF, WEBP, and GIF.

    The output image is in PNG format. For animated GIFs, only the first frame is processed.
  • Image resolution: For the best results, the width and height of the image should be between 384 and 3072 pixels. A low resolution may result in a blurry output, while a high resolution increases processing time.

  • File size: The size of a single image file cannot exceed 10 MB.

"messages": [
    {
        "role": "user",
        "content": [
            { "image": "Public URL or Base64 data of Image 1" },
            { "image": "Public URL or Base64 data of Image 2" },
            { "image": "Public URL or Base64 data of Image 3" },
            { "text": "Your editing instruction, for example: 'The girl in Image 1 wears the black dress from Image 2 and sits in the pose from Image 3'" }
        ]
    }
]

Image input order

When you provide multiple input images, the image order is defined by their sequence in the array. The editing instruction must correspond to the order of the images in the content field, such as 'Image 1' and 'Image 2'. Otherwise, the results may be unexpected.

Input image 1

Input image 2

Output image

image95

image96

5

Replace the clothes of the girl in Image 1 with the clothes of the girl in Image 2.

4

Replace the clothes of the girl in Image 2 with the clothes of the girl in Image 1.

Image input methods

Public URL

  • Provide a publicly accessible image URL that supports the HTTP or HTTPS protocol.

  • Example value: https://xxxx/img.png.

Base64 encoding

Convert the image file to a Base64-encoded string and concatenate it in the format: data:{mime_type};base64,{base64_data}.

  • {mime_type}: The media type of the image, which must correspond to the file format.

  • {base64_data}: The Base64-encoded string of the file.

  • Example value: ... (The example is truncated for demonstration purposes.)

For complete code examples, see Python SDK and Java SDK.

More parameters

You can adjust the generation results using the following optional parameters:

  • n: The number of images to generate. The default value is 1. The qwen-image-edit-plus series of models supports generating one to six images. The qwen-image-edit model supports generating only one image.

  • negative_prompt: Describes content to exclude from the image, such as "blur" or "extra fingers". This parameter helps optimize the quality of the generated image.

  • watermark: Specifies whether to add a "Qwen-Image" watermark to the bottom-right corner of the image. The default value is false. The following image shows the watermark style:

    1

  • seed: The random number seed. The value can be an integer from [0, 2147483647]. If this parameter is not specified, the algorithm generates a random number to use as the seed. Using the same seed value helps ensure that the generated content is relatively consistent.

The following optional parameters are available only for the qwen-image-edit-plus series of models:

  • size: The resolution of the output image. The format is a width*height string, such as "1024*2048". The width and height can range from 512 to 2048 pixels. This parameter is available only when the number of output images (n) is 1. Otherwise, an error is returned. If this parameter is not set, the output image retains an aspect ratio similar to the original image (the last image in a multi-image input), with a resolution close to 1024*1024.

  • prompt_extend: Specifies whether to enable the prompt rewriting feature. This feature is enabled by default. When enabled, the service uses a large model to optimize the prompt. This can significantly improve the results for simple or less descriptive prompts.

For a complete list of parameters, see Qwen-Image-Edit API reference.

Showcase

Multi-image fusion

Input image 1

Input image 2

Input image 3

Output image

image83

image103

1

2

The girl in Image 1 wears the necklace from Image 2 and carries the bag from Image 3 on her left shoulder.

Subject consistency

Input image

Output image 1

Output image 2

Output image 3

image5

image4

Change to a certificate photo with a blue background. The person wears a white shirt, a black suit, and a striped tie.

image6

The person wears a white shirt, a gray suit, and a striped tie, with one hand on the tie, against a light-colored background.

image7

The person wears a black hoodie with "Qwen Image" in a thick brush font, leans against a guardrail with sunlight on their hair, and a bridge and the sea are in the background.

image12

image13

Place this air conditioner in the living room, next to the sofa.

image14

Add mist coming from the air conditioner's vent, extending to the sofa, and add green leaves.

image15

Add the white handwritten text "Natural Fresh Air, Enjoy Breathing" at the top.

Sketch creation

Input image

Output image

image42

image43

Generate an image that matches the detailed shape outlined in Image 1 and follows this description: A young woman smiles on a sunny day. She wears round brown sunglasses with a leopard print frame. Her hair is neatly tied up, she wears pearl earrings, a dark blue scarf with white star patterns, and a black leather jacket.

image44

Generate an image that matches the detailed shape outlined in Image 1 and follows this description: An elderly man smiles at the camera. His face is wrinkled, his hair is messy in the wind, and he wears round-framed reading glasses. He has a worn-out red scarf with star patterns around his neck and is wearing a cotton-padded jacket.

Creative product generation

Input image

Output image

图片 1

image23

Make this bear sit under the moon (represented by a light gray crescent outline on a white background), holding a guitar, with small stars and speech bubbles with phrases such as "Be Kind" floating around.

image22

Print this design on a T-shirt and a paper tote bag. A female model is displaying these items. She is also wearing a baseball cap with "Be kind" written on it.

image21

A hyper-realistic 1/7 scale character model, designed as a commercial finished product, is placed on an iMac computer with a white keyboard. The model stands on a clean, round, transparent acrylic base with no labels or text. Professional studio lighting highlights the sculpted details. The ZBrush modeling process for the same model is displayed on the iMac screen in the background. Next to the model is a packaging box with a transparent window on the front, showing only the clear plastic shell inside. The box is slightly taller than the model and reasonably sized to hold it.

image

This bear is wearing an astronaut suit and pointing into the distance.

image

This bear is wearing a gorgeous ball gown, with its arms spread in an elegant dance pose.

image

This bear is wearing sportswear, holding a basketball, with one leg bent.

Generate image from depth map

Input image

Output image

image36

image37

Generate an image that matches the depth map outlined in Image 1 and follows this description: A blue bicycle is parked in a side alley, with a few weeds growing from cracks in the stone in the background.

image38

Generate an image that matches the depth map outlined in Image 1 and follows this description: A worn-out red bicycle is parked on a muddy path, with a dense primeval forest in the background.

Generate image from keypoints

Input image

Output image

image40

image41

Generate an image that matches the human pose outlined in Image 1 and follows this description: A Chinese woman in a Hanfu is holding an oil-paper umbrella in the rain, with a Suzhou garden in the background.

image39

Generate an image that matches the human pose outlined in Image 1 and follows this description: A young man stands on a subway platform. He wears a baseball cap, a T-shirt, and jeans. A train is speeding by behind him.

Text editing

Input image

Output image

Input image

Output image

image

image

Replace 'HEALTH INSURANCE' on the Scrabble tiles with 'Tomorrow will be better'.

image

image

Change the phrase "Take a Breather" on the note to "Relax and Recharge".

Input image

Output image

image53

image45

Change "Qwen-Image" to a black ink-drip font.

image46

Change "Qwen-Image" to a black handwriting font.

image49

Change "Qwen-Image" to a black pixel font.

image54

Change "Qwen-Image" to red.

image57

Change "Qwen-Image" to a blue-purple gradient.

image59

Change "Qwen-Image" to candy colors.

image63

Change the material of "Qwen-Image" to metal.

image64

Change the material of "Qwen-Image" to clouds.

image67

Change the material of "Qwen-Image" to glass.

Add, delete, modify, and replace

Capability

Input image

Output image

Add element

image

image

Add a small wooden sign in front of the penguin that says "Welcome to Penguin Beach".

Delete element

image

image

Remove the hair from the plate.

Replace element

image

image

Change the peaches to apples.

Portrait modification

image

image

Make her close her eyes.

Pose modification

image8

image9

She raises her hands with palms facing the camera and fingers spread in a playful pose.

Viewpoint transformation

Input image

Output image

Input image

Output image

image

image

Get a front view.

image

image

Face left.

image

image

Get a rear view.

image

image

Face right.

Background replacement

Input image

Output image

image

image

Change the background to a beach.

image

Replace the original background with a realistic modern classroom scene. In the center of the background is a traditional dark green or black blackboard. The Chinese characters "Qwen" are neatly written on the blackboard in white chalk.

Old photo processing

Capability

Input image

Output image

Old photo restoration and colorization

image

image

Restore the old photo, remove scratches, reduce noise, enhance details, high resolution, realistic image, natural skin tone, clear facial features, no distortion.

image31

image32

Intelligently colorize the image based on its content to make it more vivid.

Billing and rate limiting

For information about the free quota and pricing of the model, see Model list and pricing.

For information about the rate limits of the model, see Qwen (Qwen-Image).

Billing description:

  • Billing is based on the number of successfully generated images. Failed model calls or processing errors do not incur fees or consume the free quota.

  • You can enable the Free Quota Only feature to avoid extra charges after your free quota is exhausted. For more information, see Free quota.

API reference

For information about the input and output parameters of the API, see Qwen - image editing.

Error codes

If a call fails, see Error messages for troubleshooting.

FAQ

Q: Does qwen-image-edit support multi-turn conversational editing?

A: No, it does not. The model only supports single-turn execution. Each call is an independent, stateless task. To perform continuous edits, you can use the generated image as a new input for another call.

Q: What languages do the qwen-image-edit and qwen-image-edit-plus series models support?

A: They officially support Simplified Chinese and English. You can try other languages, but their performance has not been fully verified and is not guaranteed.

Q: If I upload multiple reference images with different aspect ratios, which one determines the aspect ratio of the output image?

A: The output image will match the aspect ratio of the last uploaded reference image.

Q: How do I view model usage?

A: Model call data is available after a one-hour delay. You can go to the Model Observation (Singapore or Beijing) page to view metrics such as call usage, number of calls, and success rate. For more information, see How to view model call records.

For more questions, see Image generation FAQ.