Platform for AI (PAI) - EAS supports GPU sharing
Jul 02 2024
Platform for AI (PAI)Content
Customers who use generative AI, AI inference, and online model services. New features: When you deploy a model in EAS, you can split and use the computing power based on the ratio of GPU computing power and the memory size. This helps reduce resource costs and improve resource utilization. On the deployment page, you can schedule instances based on GPU memory and computing power. This allows multiple instances to share a single GPU.