ack-ai-installer is a collection of Device Plugins that enhances GPU scheduling capabilities in ACK Managed Cluster Pro Edition and ACK Edge Cluster Pro Edition. It works with ACK Scheduler — a unified scheduling system built on the Kubernetes Scheduling Framework extension — to support shared GPU scheduling and GPU topology-aware scheduling.
Component overview
ack-ai-installer includes the following sub-components. Each works with ACK Scheduler to extend GPU scheduling beyond the default exclusive GPU scheduling available in ACK Managed Cluster Pro Edition and ACK Edge Cluster Pro Edition.
gpushare-device-plugin
gpushare-device-plugin works with ACK Scheduler to enable shared GPU scheduling with sharing isolation. Multiple applications or processes share a single GPU card, improving resource utilization across the cluster.
cgpu-installer
cgpu-installer builds on shared GPU scheduling by integrating with cGPU, Alibaba Cloud's GPU container sharing technology. This adds:
- GPU memory isolation: Different applications or processes are isolated from each other in GPU memory, preventing task interference.
- GPU computing power isolation: Fine-grained allocation policies (average, preemption, and weight) control how computing power is distributed across containers.
For installation methods and scenarios, see Manage the shared GPU scheduling component and Allocate computing power using shared GPU scheduling.
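The idea behind a weight-based allocation policy can be illustrated with a small sketch. This is not cGPU's implementation, which is not described here; the function name `weighted_shares` and the container names are hypothetical, and the sketch only assumes that each container's share is proportional to its weight.

```python
from typing import Dict


def weighted_shares(weights: Dict[str, int]) -> Dict[str, float]:
    """Split one GPU's computing power in proportion to per-container weights.

    Illustrative only: the real cGPU policies are enforced by the cGPU
    component and are not modeled here.
    """
    if not weights:
        return {}
    if any(w <= 0 for w in weights.values()):
        raise ValueError("weights must be positive")
    total = sum(weights.values())
    # Each container receives weight / sum(weights) of the GPU.
    return {name: w / total for name, w in weights.items()}


if __name__ == "__main__":
    # Hypothetical containers sharing one GPU.
    print(weighted_shares({"train": 3, "infer": 1}))
```

In this simplified model, giving every container the same weight splits computing power evenly.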
gputopo-device-plugin
gputopo-device-plugin enables GPU topology-aware scheduling. It selects the GPU combination on a node that provides the optimal training speed.
For installation steps and scenarios, see GPU topology-aware scheduling.
Usage notes
- You can install ack-ai-installer only in ACK Managed Cluster Pro Edition and ACK Edge Cluster Pro Edition from the Cloud-native AI Suite page in the console.
- ack-ai-installer is pre-installed in ACK Lingjun managed clusters.
- For ack-ai-installer versions earlier than 1.12.0, cluster versions 1.18.8 and later are supported.
- For ack-ai-installer versions 1.12.0 and later, only cluster versions 1.20 and later are supported.
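The two version rules above can be expressed as a quick pre-upgrade check. The following is a minimal sketch: the helper names `parse_version` and `is_supported` are hypothetical, and it assumes that cluster versions may carry a leading `v` or a suffix such as `-aliyun.1`, which is ignored.

```python
from typing import Tuple


def parse_version(text: str) -> Tuple[int, int, int]:
    """Turn '1.12.0', '1.20' or 'v1.20.4-aliyun.1' into a 3-part int tuple."""
    core = text.lstrip("v").split("-", 1)[0]  # drop prefix and suffix (assumed format)
    parts = [int(p) for p in core.split(".")]
    parts += [0] * (3 - len(parts))  # pad '1.20' to (1, 20, 0)
    return (parts[0], parts[1], parts[2])


def is_supported(component_version: str, cluster_version: str) -> bool:
    """Apply the usage-note rules for ack-ai-installer and cluster versions."""
    component = parse_version(component_version)
    cluster = parse_version(cluster_version)
    if component < (1, 12, 0):
        # Versions earlier than 1.12.0 support cluster 1.18.8 and later.
        return cluster >= (1, 18, 8)
    # Versions 1.12.0 and later require cluster 1.20 or later.
    return cluster >= (1, 20, 0)


if __name__ == "__main__":
    print(is_supported("1.13.1", "1.20.4-aliyun.1"))
    print(is_supported("1.12.0", "1.18.8"))
```

The check covers only the version constraints listed above. Cluster type, such as Pro Edition, still has to be verified separately.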
Change log
March 2026
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.13.1 | | March 16, 2026 | This upgrade does not affect existing services. |
October 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.13.0 | | October 29, 2025 | This upgrade does not affect existing services. |
August 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.8 | cGPU 1.5.20 update: | August 04, 2025 | This upgrade does not affect existing services. |
July 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.7 | | July 17, 2025 | This upgrade does not affect existing services. |
| 1.12.6 | cGPU 1.5.19 update: | July 16, 2025 | This upgrade does not affect existing services. |
June 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.5 | | June 23, 2025 | This upgrade does not affect existing services. |
| 1.12.4 | | June 19, 2025 | This upgrade does not affect existing services. |
May 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.3 | | May 14, 2025 | This upgrade does not affect existing services. |
March 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.2 | | March 17, 2025 | This upgrade does not affect existing services. |
February 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.1 | | February 18, 2025 | This upgrade does not affect existing services. |
January 2025
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.12.0 | | January 03, 2025 | This upgrade does not affect existing services. |
November 2024
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.11.1 | Releases cGPU 1.5.13. Fixes a rare kernel crash issue that may be caused by residual container processes. | November 19, 2024 | This upgrade does not affect existing services. |
| 1.10.1 | Releases cGPU 1.5.12. Fixes an issue where GPU memory isolation fails for some CUDA APIs on new driver versions such as 535. | November 07, 2024 | This upgrade does not affect existing services. |
September 2024
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.9.16 | | September 26, 2024 | This upgrade does not affect existing services. |
| 1.9.15 | Releases cGPU 1.5.11. Fixes decoding-related issues. | September 19, 2024 | This upgrade does not affect existing services. |
August 2024
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.9.14 | | August 21, 2024 | This upgrade does not affect existing services. |
| 1.9.14 | Releases cGPU 1.5.9. Adds policy 6 to proportionally divide computing power and GPU memory. | August 13, 2024 | This upgrade does not affect existing services. |
May 2024
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.9.11 | Releases cGPU 1.5.7. Supports L-series GPUs and GPU drivers of version 550 and later. | May 14, 2024 | This upgrade does not affect existing services. |
| 1.9.10 | Releases cGPU 1.5.7. Fixes an issue where the | May 09, 2024 | This upgrade does not affect existing services. |
January 2024
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.8.8 | Releases cGPU 1.5.6. A new cGPU License Server policy is released. | January 04, 2024 | This upgrade does not affect existing services. |
December 2023
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.8.7 | | December 20, 2023 | This upgrade does not affect existing services. |
November 2023
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.8.5 | Releases cGPU 1.5.5. Fixes a Kernel Panic issue triggered by | November 23, 2023 | This upgrade does not affect existing services. |
August 2023
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.8.2 | | August 29, 2023 | This upgrade does not affect existing services. |
July 2023
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.7.7 | | July 04, 2023 | This upgrade does not affect existing services. |
April 2023
| Version | Changes | Change Time | Impact |
| --- | --- | --- | --- |
| 1.7.6 | | April 26, 2023 | This upgrade does not affect existing services. |
| 1.7.5 | Releases cGPU 1.5.2. | April 18, 2023 | This upgrade does not affect existing services. |