All Products
Search
Document Center

Container Service for Kubernetes:Add a GPU-accelerated node

Last Updated:Apr 10, 2025

Container Service for Kubernetes (ACK) Edge clusters facilitate the management of on-premises GPU resources within edge node pools. This topic describes how to add a GPU-accelerated node to an edge node pool in an ACK Edge cluster.

Prerequisites

Limits

Procedure

Kubernetes 1.26 or later

When you add a GPU-accelerated node that is equipped with an NVIDIA GPU to an ACK Edge cluster that runs Kubernetes 1.26 or later, you do not need to configure the gpuVersion parameter. The system automatically checks the GPU model and installs the relevant components.

The steps to add a GPU-accelerated node are similar to the steps to add an edge node. Fore more information, see Add an edge node.

Note

ACK Edge clusters that run Kubernetes 1.26 or later support all series of production-grade GPUs provided NVIDIA, including Tesla, Hopper, Ada Lovelace, and L.

Kubernetes versions earlier than 1.26

When you add a GPU-accelerated node to an ACK Edge cluster that runs a Kubernetes version earlier than 1.26, the GPU model must meet the requirements in the following table. If you want to use a GPU model that does not the following requirements, submit a ticket.

OS architecture

GPU model

Kubernetes version

AMD64/x86_64

Nvidia_Tesla_T4

≥1.16.9-aliyunedge.1

AMD64/x86_64

Nvidia_Tesla_P4

≥1.16.9-aliyunedge.1

AMD64/x86_64

Nvidia_Tesla_P100

≥1.16.9-aliyunedge.1

AMD64/x86_64

Nvidia_Tesla_V100

≥1.18.8-aliyunedge.1

AMD64/x86_64

Nvidia_Tesla_A10

≥1.20.11-aliyunedge.1

AMD64/x86_64

Nvidia_L40

≥1.26.3-aliyun.1

  1. Log on to the ACK console. In the left-side navigation pane, click Clusters.

  2. On the Clusters page, find the cluster to manage and click its name. In the left-side navigation pane, choose Nodes > Node Pools.

  3. On the Node Pools page, find the target node pool and choose More > 图标 > Add Existing Node > Add Existing Node in the column.

  4. On the Add Existing ECS Instance page, select Manual as Method and then select an existing instance.

  5. Click Next Step to go to the Specify Instance Information step. You can set the parameters that are used to add the node. For more information about the parameters, see Parameter list.

    image

    Note
    • You must configure the gpuVersion parameter in the script to connect the node to the cloud. For more information about the supported GPU models, see Limits.

    • After you configure the parameters, the connection tool automatically installs nvidia-containerd-runtime. For more information, see nvidia-containerd-runtime.

  6. After you set the parameters, click Next Step. In the Complete step, click Copy to copy the script to the edge node that you want to add. Then, execute the script on the node.

    If the following result is returned, the node is added to the cluster.

    接入成功

References

  • If you have any problems when you add edge nodes, see Diagnose edge node problems.

  • For more information about how to remove an edge node, see Remove edge nodes.

  • ACK Edge clusters support edge node autonomy. Edge node autonomy ensures that applications on an edge node can still run as expected when the edge node is disconnected from the cloud. For more information, see Configure edge node autonomy.