×
Community Blog Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

This article introduces Qwen-Image-Layered, an AI model that decomposes an image into multiple, independently editable layers.

1

We are excited to introduce Qwen-Image-Layered, a model capable of decomposing an image into multiple RGBA layers. This layered representation unlocks inherent editability: each layer can be independently manipulated without affecting other content. Meanwhile, such a layered representation naturally supports high-fidelity elementary operations-such as resizing, reposition, and recoloring. By physically isolating semantic or structural components into distinct layers, our approach enables high-fidelity and consistent editing.

Layered Decomposition in Application

Given an image, Qwen-Image-Layered can decompose it into several RGBA layers:

2

After decomposition, edits are applied exclusively to the target layer, physically isolating it from the rest of the content, and thereby fundamentally ensuring consistency across edits.

For example, we can recolor the first layer and keep all other content untouched:

3

We can also replace the second layer from a girl to a boy:

4

Here, we revise the text to “Qwen-Image”:

5

Furthermore, the layered structure naturally supports elemetary operations. For example, we can delete unwanted objects cleanly:

6

We can also resize an object without distortion:

7

After layer decomposition, we can move objects freely within the canvas:

8

Flexible and Iterative Decomposition

Qwen-Image-Layered is not limited to a fixed number of layers. The model supports variable-layer decomposition. For example, we can decompose an image into either 3 or 8 layers as needed:

9

Moreover, decomposition can be applied recursively: any layer can itself be further decomposed, enabling infinite decomposition.

10

Conclusion

Qwen-Image-Layered bridges the gap between raster imagery and structured, editable representations. By reimagining images as composable layers, we hope to enable intuitive, precise, and robust editing capabilities.

Citation

If you find our model useful in your research, please consider citing us 📝 :)

@misc{yin2025qwenimagelayered,
      title={Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition},
      author={Shengming Yin, Zekai Zhang, Zecheng Tang, Kaiyuan Gao, Xiao Xu, Kun Yan, Jiahao Li, Yilei Chen, Yuxiang Chen, Heung-Yeung Shum, Lionel M. Ni, Jingren Zhou, Junyang Lin, Chenfei Wu},
      year={2025},
      eprint={2512.15603},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.15603},
}


Original Source

0 0 0
Share on

Alibaba Cloud Community

1,302 posts | 458 followers

You may also like

Comments