New Features

Platform for AI (PAI) - EAS upgrades BladeLLM high-performance deployment service

PAI-EAS supports scenario-based deployment of BladeLLM to achieve faster response time and higher throughput for LLM inference.
Content

Target customers: Customers who use EAS to build LLM-driven applications and services, such as intelligent customer service, content generation, and translation. New features /specifications: BladeLLM is an inference engine developed by PAI. It provides efficient runtime, high-performance operator implementation, and extreme hybrid quantization. PAI-EAS fully integrates BladeLLM to launch the LLM high-performance inference service. It supports the deployment of preset models and custom models as well as advanced options such as model parallelism and speculative sampling. This provides customers with efficient LLM deployment solutions.

7th Gen ECS Is Now Available

Increase instance computing power by up to 40% and Fully equipped with TPM chips.
Powered by Third-generation Intel® Xeon® Scalable processors (Ice Lake).

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.