Alibaba Cloud Model Studio offers Qwen and third-party models for text, image, audio, and video.
Text generation
Image & video
Generation
Generate images and videos from text or images, with support for editing, reference, and high-resolution output
More →Audio & speech
Speech recognition
Dedicated ASR and LLM-based approaches — choose based on accuracy and flexibility
More →Omni
Integrates understanding and generation capabilities across text, image, audio, and video modalities
More →Embeddings & reranking
Convert text or multimodal content into vectors, combined with reranking to improve retrieval accuracy
More →View all models
Go to Model Plaza to browse all Qwen, third-party, domain-specific, and legacy models.


