Text-to-image

Text-to-image - Z-Image is a lightweight model that quickly generates images, supports Chinese and English text rendering, and adapts to various resolutions and aspect ratios.

Reference: Technical blog

Prerequisites

You must have have created an API key and set the API key as an environment variable.

Examples

Input prompt

Output image

Photo of a stylish young woman with short black hair standing confidently in front of a vibrant cartoon-style mural wall. She wears an all-black outfit: a puffed bomber jacket with a ruffled collar, cargo shorts, fishnet tights, and chunky black Doc Martens, with a gold chain dangling from her waist. The background features four colorful comic-style panels: one reads “GRAND STAGE” and includes sneakers and a Gatorade bottle; another displays green Nike sneakers and a slice of pizza; the third reads “HARAJUKU st” with floating shoes; and the fourth shows a blue mouse riding a skateboard with the text “Takeshita WELCOME.” Dominant bright colors include yellow, teal, orange, pink, and green. Speech bubbles, halftone patterns, and playful characters enhance the urban street-art aesthetic. Daylight evenly illuminates the scene, and the ground beneath her feet is white tiled pavement. Full-body portrait, centered composition, slightly tilted stance, direct eye contact with the camera. High detail, sharp focus, dynamic framing.

b16c8008-83c1-4c80-ae22-786a2299bec3-1-转换自-png

Model overview

Model

Description

Output image specifications

z-image-turbo

A lightweight model for fast image generation

Image resolution: Total pixels between 512*512 and 2048*2048. For recommended resolutions, see size parameter settings.

Image format: png

Number of images: Fixed at 1.

Note

Before making a call, check the list of models supported in each region.

Synchronous HTTP

Singapore region: POST https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

Beijing region: POST https://dashscope.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation

Request parameters	Text-to-image The following example shows how to return an image directly, which results in a faster response time. To enable the intelligent rewriting feature, set `prompt_extend=true`. When enabled, the system returns the optimized prompt and its reasoning process along with the image, but this increases the response time. curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/multimodal-generation/generation' \ --header 'Content-Type: application/json' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --data '{ "model": "z-image-turbo", "input": { "messages": [ { "role": "user", "content": [ { "text": "Photo of a stylish young woman with short black hair standing confidently in front of a vibrant cartoon-style mural wall. She wears an all-black outfit: a puffed bomber jacket with a ruffled collar, cargo shorts, fishnet tights, and chunky black Doc Martens, with a gold chain dangling from her waist. The background features four colorful comic-style panels: one reads “GRAND STAGE” and includes sneakers and a Gatorade bottle; another displays green Nike sneakers and a slice of pizza; the third reads “HARAJUKU st” with floating shoes; and the fourth shows a blue mouse riding a skateboard with the text “Takeshita WELCOME.” Dominant bright colors include yellow, teal, orange, pink, and green. Speech bubbles, halftone patterns, and playful characters enhance the urban street-art aesthetic. Daylight evenly illuminates the scene, and the ground beneath her feet is white tiled pavement. Full-body portrait, centered composition, slightly tilted stance, direct eye contact with the camera. High detail, sharp focus, dynamic framing." } ] } ] }, "parameters": { "prompt_extend": false, "size": "1024*1024" } }'
Request headers
Content-Type `string` (Required) The content type of the request. Must be `application/json`.
Authorization `string` (Required) The authentication credentials using a Model Studio API key. Example: `Bearer sk-xxxx`
Request body
model `string` (Required) The model name. Example: z-image-turbo.
input `object` (Required) The input for the model. Properties messages `array` (Required) An array of request content. Currently, only single-turn conversations are supported. This means the array can contain only one message object. Multi-turn conversations are not supported. Properties role `string` (Required) The role of the message. This parameter must be set to `user`. content `array` (Required) An array of message content. It must contain exactly one text object. Properties text `string` (Required) A positive prompt that describes the desired content, style, and composition of the generated image. Supports Chinese and English. The prompt cannot exceed 800 characters. Each Chinese character, letter, number, or symbol is counted as one character. Any excess characters are automatically truncated. Example: A sitting orange cat, with a happy expression, lively and cute, realistic and accurate. Note: Only one text object is supported. If you pass zero or more than one text object, the system reports an error.
parameters `object` (Optional) Image generation parameters. Properties size `string` (Optional) The resolution of the output image, in the format `widthheight`. The default value and constraints are as follows: Default value: `10241536`. Total pixel range: [512512, 20482048] Recommended resolution range: [512512, 20482048] Example: 10241536. Recommended resolutions for a total of 10241024 pixels:** 1:1: 10241024 2:3: 8321248 3:2: 1248832 3:4: 8641152 4:3: 1152864 7:9: 8961152 9:7: 1152896 9:16: 7201280 9:21: 5761344 16:9: 1280720 21:9: 1344576 Recommended resolutions for a total of 12801280 pixels:** 1:1: 12801280 2:3: 10241536 3:2: 15361024 3:4: 11041472 4:3: 14721104 7:9: 11201440 9:7: 14401120 9:16: 8641536 9:21: 7201680 16:9: 1536864 21:9: 1680720 Recommended resolutions for a total of 15361536 pixels:** 1:1: 15361536 2:3: 12481872 3:2: 18721248 3:4: 12961728 4:3: 17281296 7:9: 13441728 9:7: 17281344 9:16: 11522048 9:21: 8642016 16:9: 20481152 21:9: 2016864 prompt_extend* `bool` (Optional) Important The `prompt_extend` parameter directly affects billing. Setting this parameter to `true` costs more than setting it to `false`. For more information, see Model pricing. Specifies whether to enable intelligent rewriting for the text prompt. If this feature is enabled, a large language model optimizes the prompt and outputs its reasoning process. false: (default) Disables intelligent rewriting. The output includes the image and the original text prompt. true: Enables intelligent rewriting. The output includes the image, the optimized text prompt, and the reasoning process. seed `integer` (optional) Random number seed. Valid range: `[0,2147483647]`. Using the same `seed` value makes results more consistent. If omitted, the algorithm uses a random seed. Note: Because generation is probabilistic, even identical `seed` values do not guarantee identical results.

Response parameters	Successful task execution Task data (task status and image URLs) is retained for only 24 hours and then automatically purged. Save generated images promptly. { "output": { "choices": [ { "finish_reason": "stop", "message": { "content": [ { "image": "https://dashscope-result-bj.oss-cn-beijing.aliyuncs.com/xxx.png?Expires=xxx" }, { "text": "Photo of a stylish young woman with short black hair standing confidently in front of a vibrant cartoon-style mural wall. She wears an all-black outfit: a puffed bomber jacket with a ruffled collar, cargo shorts, fishnet tights, and chunky black Doc Martens, with a gold chain dangling from her waist. The background features four colorful comic-style panels: one reads “GRAND STAGE” and includes sneakers and a Gatorade bottle; another displays green Nike sneakers and a slice of pizza; the third reads “HARAJUKU st” with floating shoes; and the fourth shows a blue mouse riding a skateboard with the text “Takeshita WELCOME.” Dominant bright colors include yellow, teal, orange, pink, and green. Speech bubbles, halftone patterns, and playful characters enhance the urban street-art aesthetic. Daylight evenly illuminates the scene, and the ground beneath her feet is white tiled pavement. Full-body portrait, centered composition, slightly tilted stance, direct eye contact with the camera. High detail, sharp focus, dynamic framing." } ], "reasoning_content": "", "role": "assistant" } } ] }, "usage": { "height": 1024, "image_count": 1, "input_tokens": 0, "output_tokens": 0, "total_tokens": 0, "width": 1024 }, "request_id": "abf1645b-b630-433a-92f6-xxxxxx" } Abnormal task execution If a task fails, the system returns information about the failure. The `code` and `message` fields indicate the cause of the error. For more information, see Error messages. `{ "request_id": "a4d78a5f-655f-9639-8437-xxxxxx", "code": "InvalidParameter", "message": "num_images_per_prompt must be 1" }`
output `object` The output of the task. Properties choices `array` The output content generated by the model. This array contains only one element. Properties finish_reason `string` The reason the task stopped. A value of `stop` indicates normal completion. message `object` The message returned by the model. Properties role `string` The role of the message. The value is always `assistant`. content `array` Properties image `string` The URL of the generated image. The image format is PNG. The link is valid for 24 hours. Download and save the image promptly. text `string` If prompt_extend=false, this is the input prompt. If prompt_extend=true, this is the rewritten prompt. reasoning_content `string` The model's reasoning process. This field is returned only when prompt_extend is set to true.
usage `object` Usage statistics for the task. Only successful tasks are included in the statistics. Properties width `integer` The width of the generated image in pixels. height `integer` The height of the generated image in pixels. image_count `integer` The number of generated images. The value is always 1. input_tokens `integer` The number of input tokens. output_tokens `integer` The number of output tokens. output_tokens_details `object` Details about the output tokens. output_tokens_details.reasoning_tokens `integer` The number of tokens used for reasoning. total_tokens `integer` The total number of tokens.
request_id `string` Unique identifier for the request. Use for tracing and troubleshooting issues.
code `string` The error code. Returned only when the request fails. See error codes for details.
message `string` Detailed error message. Returned only when the request fails. See error codes for details.

Limitations

Data validity: The task_id and image url are retained for only 24 hours. After this period, they cannot be queried or downloaded.
Content moderation: The input prompt and output image are both subject to content moderation. Requests containing prohibited content result in an IPInfringementSuspect or DataInspectionFailed error. See error codes for details.

Network access configuration: Image URLs are stored in OSS. If your business system cannot access external OSS URLs due to security policies, add the following OSS domain names to your whitelist.

# List of OSS domain names
dashscope-result-bj.oss-cn-beijing.aliyuncs.com
dashscope-result-hz.oss-cn-hangzhou.aliyuncs.com
dashscope-result-sh.oss-cn-shanghai.aliyuncs.com
dashscope-result-wlcb.oss-cn-wulanchabu.aliyuncs.com
dashscope-result-zjk.oss-cn-zhangjiakou.aliyuncs.com
dashscope-result-sz.oss-cn-shenzhen.aliyuncs.com
dashscope-result-hy.oss-cn-heyuan.aliyuncs.com
dashscope-result-cd.oss-cn-chengdu.aliyuncs.com
dashscope-result-gz.oss-cn-guangzhou.aliyuncs.com
dashscope-result-wlcb-acdr-1.oss-cn-wulanchabu-acdr-1.aliyuncs.com

Billing and rate limiting

For free quota and unit price, see Model pricing.
For rate limits, see Text-to-image - Z-Image.
Billing details:
- Billing is based on the number of successfully generated images and whether intelligent rewriting is enabled. You are charged only when the query result API returns a task_status of SUCCEEDED and an image is successfully generated.
- Failed model calls or processing errors do not incur any fees and do not consume your free quota.

Error codes

If a model call fails, see Error messages to troubleshoot the issue based on the returned error message.

FAQ

Q: How can I view the model call volume?

A: You can go to the Model Observation (Singapore) or Model Observation (Beijing) page to view metrics such as the number of calls and the success rate. The data is available approximately one hour after the model call is complete. For more information, see How to view model call records?

Prerequisites

Examples

Model overview

Synchronous HTTP

Request parameters

Request headers

Request body

Response parameters

Successful task execution

Abnormal task execution

Limitations

Billing and rate limiting

Error codes

FAQ