New Features

Platform for AI (PAI) - EAS supports LLM Intelligent Router to improve LLM inference efficiency

LLM Intelligent Router can significantly improve the resource usage of the inference system, reducing costs and increasing efficiency for customers.
Content

Intended customers: Customers who use EAS to build LLM-driven applications and services, such as intelligent customer service, content generation, and translation. LLM Intelligent Router can improve throughput and reduce latency, helping customers process user requests efficiently and stably. LLM Intelligent Router can improve throughput and reduce latency, helping customers process user requests efficiently and stably. New features: When customers deploy LLM services on EAS, they can enable the LLM Intelligent Router feature. LLM Intelligent Router can evenly allocate the computing power and video memory of backend inference instances and improve the resource usage of clusters.

Help Document

https://www.alibabacloud.com/help/pai/user-guide/use-llm-intelligent-router-to-improve-inference-efficiency

7th Gen ECS Is Now Available

Increase instance computing power by up to 40% and Fully equipped with TPM chips.
Powered by Third-generation Intel® Xeon® Scalable processors (Ice Lake).

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.