Today, DeepSeek released two models, V4-Pro and V4-Flash. Their architecture and technical advantages can be summarized as follows:
This brings improvements in model performance and cost-effectiveness, including:
DeepSeek-V4 supports the OpenAI ChatCompletions interface and the Anthropic interface. When calling the new model API, the Model parameter needs to be changed to deepseek-v4-pro or deepseek-v4-flash.
Alibaba Cloud AI Gateway provides management capabilities for Model API, Agent API, and MCP Server, and now supports management of the DeepSeek-V4 API first. Through Alibaba Cloud AI Gateway, you can call DeepSeek-V4 API services, including thinking, multi-turn dialogue, Tool Call, Anthropic /v1/messages compatible calls, and more. It also supports integration of DeepSeek-V4 on Claude Code, and additionally implements fallback capabilities between DeepSeek-V4 and other models such as Qwen.
Open the AI Gateway page, click to enter the console, and click the target instance ID. In the left navigation bar, click Model API, then click Create Model API.

After entering the Create Model API form, you can configure it as follows:

BasePath must be unique./. You can choose whether to enable remove when forwarding to backend services.After configuration, run a test case:


Building Cross-Cloud Observability: One Architecture, Unified Analytics
706 posts | 57 followers
FollowAlibaba Cloud Native Community - February 13, 2026
Alibaba Cloud Native Community - February 13, 2025
Alibaba Cloud Native Community - March 10, 2025
Alibaba Container Service - July 10, 2025
Alibaba Container Service - May 27, 2025
Alibaba Cloud Native Community - February 28, 2025
706 posts | 57 followers
Follow
Container Compute Service (ACS)
A cloud computing service that provides container compute resources that comply with the container specifications of Kubernetes
Learn More
Container Service for Kubernetes
Alibaba Cloud Container Service for Kubernetes is a fully managed cloud container management service that supports native Kubernetes and integrates with other Alibaba Cloud products.
Learn More
Tongyi Qianwen (Qwen)
Top-performance foundation models from Alibaba Cloud
Learn More
Alibaba Cloud for Generative AI
Accelerate innovation with generative AI to create new business success
Learn MoreMore Posts by Alibaba Cloud Native Community