New Features

Platform for AI (PAI) - Ray on DLC support for dynamic scaling

Ray on DLC now supports autoscaling, allowing users to configure minimum and maximum instance counts. Combined with the quota preemption mechanism, this enables adaptive scaling policies to ensure an optimal balance between job throughput and overall resource utilization.
Content

Target Audience: Large to mid-sized internet companies, AI companies, and research or academic institutions.

New Feature/Specification: Autoscaling is now supported for non-Head roles in Ray on DLC. Users can set a minimum (min) and maximum (max) number of instances. The min value ensures a fast job start, after which the system attempts to scale out to the max instance count.

This feature integrates with quota preemption. When a task is preempted by a higher-priority one (from a parent or high-priority peer quota), the scheduler first deallocates the scaled-out role instances. This approach keeps more tasks running concurrently, optimizing for both system-wide job throughput and resource utilization.
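The scaling and preemption behavior described above can be illustrated with a small model. This is only a sketch under stated assumptions: the `Role` class, its fields, and the `reclaim` function are hypothetical names invented for illustration and are not part of any PAI, DLC, or Ray API. It assumes that a role starts at its minimum, grows toward its maximum as capacity allows, and gives back only the instances above its minimum when a higher-priority task needs resources.

```python
from dataclasses import dataclass


@dataclass
class Role:
    """A non-Head role with autoscaling bounds (hypothetical model, not a PAI API)."""
    name: str
    min_instances: int
    max_instances: int
    running: int = 0

    def start(self) -> None:
        # The job starts once the minimum instance count is available.
        self.running = self.min_instances

    def scale_out(self, free_slots: int) -> int:
        # Grow toward max_instances, limited by the free capacity in the quota.
        added = max(min(self.max_instances - self.running, free_slots), 0)
        self.running += added
        return added

    def release_scaled_out(self, needed: int) -> int:
        # Give back only instances above the minimum, so the job keeps running.
        released = max(min(self.running - self.min_instances, needed), 0)
        self.running -= released
        return released


def reclaim(roles: list[Role], needed: int) -> int:
    """Free up to `needed` instances from scaled-out capacity; return the count freed."""
    freed = 0
    for role in roles:
        if freed >= needed:
            break
        freed += role.release_scaled_out(needed - freed)
    return freed


# Example: a worker role with min=2 and max=8 (assumed values).
worker = Role("worker", min_instances=2, max_instances=8)
worker.start()                        # starts at the minimum
worker.scale_out(free_slots=4)        # grows by the 4 free slots
freed = reclaim([worker], needed=3)   # preemption takes back scaled-out instances
```

In this model a preemption request larger than the scaled-out capacity is only partially satisfied, and the role never drops below its minimum. How the real scheduler handles the remaining demand is not specified in the release note.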
