New Features

Platform for AI (PAI) - EAS releases distributed inference

EAS offers a multi-machine distributed inference solution that overcomes hardware limitations and efficiently supports the deployment and operation of models with large parameter sizes.

Content

Target customers: Customers who use AI inference, model services, or AIGC.

New features/specifications: Ultra-large-scale MoE models such as Qwen-max and DeepSeek have parameter sizes that are too large for a single device to handle. The EAS multi-machine distributed inference solution overcomes this hardware limitation and efficiently supports deploying and running such models. EAS supports multiple parallelism methods, such as pipeline parallelism, tensor parallelism, and data parallelism. It is also compatible with high-performance inference engines such as BladeLLM, vLLM, and SGLang.
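To make the idea of tensor parallelism more concrete, the following is a minimal sketch that simulates it with plain Python lists. It is not EAS, BladeLLM, vLLM, or SGLang code: the function names (matmul, shard_columns, tensor_parallel_matmul) and the tiny example matrix are assumptions made up for illustration. The sketch splits a weight matrix by columns across simulated devices, lets each one compute its share, and checks that the concatenated result matches the single-device result.

```python
# Illustrative sketch only. Plain Python lists stand in for device memory.
# matmul, shard_columns, and tensor_parallel_matmul are hypothetical helper
# names, not part of EAS or any inference engine.

def matmul(x, w):
    """Multiply a row vector x (length k) by a k-by-n matrix w."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]

def shard_columns(w, num_devices):
    """Split the columns of w into num_devices contiguous, equal-sized shards."""
    n = len(w[0])
    if n % num_devices != 0:
        raise ValueError("column count must be divisible by num_devices")
    step = n // num_devices
    return [[row[d * step:(d + 1) * step] for row in w] for d in range(num_devices)]

def tensor_parallel_matmul(x, w, num_devices):
    """Each simulated device multiplies x by its own shard; outputs are concatenated."""
    partial_outputs = [matmul(x, shard) for shard in shard_columns(w, num_devices)]
    return [value for part in partial_outputs for value in part]

x = [1, 2]
w = [[1, 2, 3, 4],
     [5, 6, 7, 8]]

full = matmul(x, w)                                    # single-device result
sharded = tensor_parallel_matmul(x, w, num_devices=2)  # two simulated devices
print(full == sharded)
```

In this sketch each simulated device holds only half of the weight columns, which is the property that lets a model too large for one device be served by several. Pipeline parallelism instead places different layers on different devices, and data parallelism replicates the model so that separate replicas handle separate requests.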

Help Document

https://www.alibabacloud.com/help/pai/user-guide/multi-machine-distributed-inference
