This topic describes how to create an ACK dedicated cluster for heterogeneous computing in the Container Service for Kubernetes (ACK) console.
Background information
ACK performs the following operations when a cluster is created:
- Creates ECS instances, configures a public key to enable SSH logon from master nodes to other nodes, and then configures the ACK cluster through CloudInit.
- Creates a security group that allows access to the VPC over Internet Control Message Protocol (ICMP).
- If you do not specify an existing VPC, ACK creates a VPC and a vSwitch and creates SNAT entries for the vSwitch.
- Adds route entries to the VPC.
- Creates a NAT gateway and EIPs.
- Creates a Resource Access Management (RAM) user and an AccessKey pair. Grants the following permissions to the RAM user: permissions to query, create, and delete ECS instances, permissions to add and delete disks, and full permissions on SLB, CloudMonitor, VPC, Log Service, and Apsara File Storage NAS (NAS). The ACK cluster automatically creates SLB instances, disks, and VPC route entries based on your configuration.
- Creates an internal-facing SLB instance and opens port 6443.
- Creates an Internet-facing SLB instance and opens ports 6443, 8443, and 22. If you enable SSH logon over the Internet when you create the cluster, port 22 is opened. Otherwise, port 22 is not exposed.
Limits
- ACK clusters support only VPCs.
- By default, each account has specific quotas on cloud resources that can be created. You cannot create clusters if the quota is reached. Make sure that you have sufficient resource quotas before you create a cluster.
- For more information, see Resource quota limits. Important By default, you can add up to 200 route entries to a VPC. This means that you can deploy up to 200 nodes in an ACK cluster that uses Flannel. This limit does not apply to ACK clusters that use Terway. To deploy more nodes in a cluster, please go to Quota Center to apply for an increase on the quota of route entries in the VPC where the cluster is deployed.
- By default, you can create at most 100 security groups with each account.
- By default, you can create at most 60 pay-as-you-go SLB instances with each account.
- By default, you can create at most 20 EIPs with each account.
- For more information, see Resource quota limits.
- Limits on ECS instances:
The pay-as-you-go and subscription billing methods are supported.
Note After an ECS instance is created, you can change its billing method from pay-as-you-go to subscription in the ECS console. For more information, see Change the billing method of an ECS instance from pay-as-you-go to subscription.
Procedure
- Log on to the ACK console.
- In the left-side navigation pane of the ACK console, click Clusters.
- In the upper-right corner of the Clusters page, click Cluster Template.
- In the Select Cluster Template dialog box, find Dedicated Cluster for Heterogeneous Computing in the Other Clusters section and click Create.
- On the Dedicated Kubernetes tab, configure the cluster.
- Click Next:Master Configurations to configure master nodes.
Parameter Description Master Node Quantity Specify the number of master nodes. You can create three or five master nodes. Instance Type Select the instance type for master nodes. For more information, see Overview of instance families. Note If no instance type is available, you can change vSwitches on the Cluster Configurations wizard page.System Disk By default, system disks are mounted to master nodes. Standard SSDs, Enhanced SSDs (ESSDs), and ultra disks are supported. Note- You can select Enable Backup to back up disk data.
- If you select ESSD as the system disk type, you can set a custom performance level for the system disk.
You can select higher performance levels for ESSDs with larger storage capacities. For example, you can select performance level 2 for an ESSD with a storage capacity of more than 460 GiB. You can select performance level 3 for an ESSD with a storage capacity of more than 1,260 GiB. For more information, see Capacity and PLs.
- Click Next:Node Pool Configurations to configure worker nodes.
- Click Next:Component Configurations to configure components.
Parameter Description Ingress Specify whether to install an Ingress controller. By default, Install NGINX Ingress Controller is selected. For more information, see Advanced NGINX Ingress configurations. Service Discovery Specify whether to install NodeLocal DNSCache. By default, NodeLocal DNSCache is installed.
NodeLocal DNSCache runs a Domain Name System (DNS) caching agent to improve the performance and stability of DNS resolution. For more information about NodeLocal DNSCache, see Configure NodeLocal DNSCache.
Volume Plug-in Select a volume plug-in. FlexVolume and CSI are supported. ACK clusters can be automatically bound to Alibaba Cloud disks, Apsara File Storage NAS (NAS) file systems, and Object Storage Service (OSS) buckets that are mounted to pods in the clusters. For more information, see Storage management-FlexVolume and Storage management-CSI. Monitoring Agents Specify whether to install the CloudMonitor agent. You can select Install CloudMonitor Agent on ECS Instance. After the CloudMonitor agent is installed, you can view monitoring information about ECS instances in the CloudMonitor console.
Log Service Specify whether to enable Log Service. You can select an existing Log Service project or create one.
If you select Enable Log Service, the Log Service plug-in is automatically installed in the cluster. You can set up Log Service with a few steps when you deploy an application. For more information, see Collect log data from containers by using Log Service.
After you select Enable Log Service, you can specify whether to select Create Ingress Dashboard and Install node-problem-detector and Create Event Center.
Workflow Engine Specify whether to enable Alibaba Cloud Genomics Service (AGS). Note To use this feature, submit a ticket.- If you select this check box, the system automatically installs the AGS workflow plug-in when the system creates the cluster.
- If you clear this check box, you must manually install the AGS workflow plug-in. For more information, see Introduction to AGS CLI.
- Click Next:Confirm Order.
- Select Terms of Service and click Create Cluster. Note It requires approximately 10 minutes for the system to create an ACK managed cluster that contains multiple nodes.
What to do next
After the cluster is created, go to the Clusters page, find the created cluster, and then click the cluster name or click Details in the Actions column. In the left-side navigation pane of the details page, choose . On the page that appears, select a worker node and choose in the Actions column. Then, you can view the GPUs that are provided by the node.