
Container Service for Kubernetes: Service management

Last Updated: Oct 12, 2023

This topic provides answers to some frequently asked questions about authorization, access, scheduling, and resource allocation when you use Container Service for Kubernetes (ACK).

How do I deploy the first application in a cluster?

Package the application code into a container image and push the image to a repository of Container Registry. Then, pull the image from the repository and deploy it in an ACK cluster or ACK Serverless cluster. An example Deployment manifest follows the steps.

  1. Write a Dockerfile for your application and build a container image from it. For more information, see Build an image for a Java application by using a Dockerfile with multi-stage builds.

  2. If the image is built on an on-premises machine, you can push the image to a repository of Container Registry. For more information, see Create an application by using a private image repository.

  3. Pull the image from the repository of Container Registry and deploy the image in an ACK cluster. For more information, see Create a stateless application by using a Deployment.
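
For reference, the following is a minimal sketch of a Deployment manifest that pulls an image from a Container Registry repository. The registry address, namespace, repository, and tag are placeholders that you must replace with your own values.

    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: my-app
    spec:
      replicas: 2
      selector:
        matchLabels:
          app: my-app
      template:
        metadata:
          labels:
            app: my-app
        spec:
          containers:
          - name: my-app
            # Placeholder image address: replace with your Container Registry repository
            image: registry.cn-hangzhou.aliyuncs.com/<namespace>/<repository>:<tag>
            ports:
            - containerPort: 8080

To deploy the manifest, save it to a file such as deployment.yaml and run kubectl apply -f deployment.yaml.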

How does Application A access Application B in a cluster?

Pods are created to run an application after you deploy the application in a cluster. Each pod has its own IP address. However, pods are ephemeral resources: when you release a new application version or dynamically scale pods, the Elastic Compute Service (ECS) instance that hosts the pods reallocates computing resources, and the IP addresses of the pods change. For more information about how rolling updates are performed for applications, see Perform a rolling update.

Kubernetes uses Services to enable communication between pods. In Kubernetes, a Service is an abstraction that defines how an application is exposed. A Service is associated with pods that provide the same feature, serves as a unified ingress for external access, and balances traffic loads among the pods.

The following list describes the Service types that you can use in different scenarios. A minimal example follows the list.

  • Enable communication between applications in a cluster: ClusterIP

  • Enable communication between applications in different clusters that are deployed in the same virtual private cloud (VPC): LoadBalancer (internal-facing SLB instance)

  • Expose applications for external access by using Ingresses: LoadBalancer (Internet-facing SLB instance)
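
For example, a minimal ClusterIP Service that exposes port 80 of the pods labeled app: nginx might look like the following sketch. The Service name and labels are placeholders.

    apiVersion: v1
    kind: Service
    metadata:
      name: nginx-svc
    spec:
      type: ClusterIP
      selector:
        app: nginx        # must match the labels of the backend pods
      ports:
      - port: 80          # port on which the Service is reachable
        targetPort: 80    # container port that receives the traffic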

For more information about Services, see Service.

You can create Services in the ACK console or by using the CLI.

  • Method 1: Create a Service on the Advanced wizard page when you deploy an application. For more information, see Configure advanced settings.

  • Method 2: If an existing application is not associated with a Service, you can create a Service in the ACK console. For more information, see Use Services to expose applications.

  • Method 3: Create a Service for a Deployment by using the CLI.

    Run the following command to create a Service for the NGINX Deployment:

    kubectl expose deployment/nginx --type="LoadBalancer" --port 80
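
    The preceding command is equivalent to applying a manifest similar to the following sketch. The annotation that requests an internal-facing SLB instance is included for reference only; verify the exact annotation key against the CCM documentation for your cluster version.

    apiVersion: v1
    kind: Service
    metadata:
      name: nginx
      annotations:
        # Requests an internal-facing SLB instance; omit this annotation for an
        # Internet-facing instance. Verify the key against your CCM version.
        service.beta.kubernetes.io/alibaba-cloud-loadbalancer-address-type: "intranet"
    spec:
      type: LoadBalancer
      selector:
        app: nginx        # assumes the NGINX Deployment uses this label
      ports:
      - port: 80
        targetPort: 80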

How do I properly allocate node resources in a cluster to pods?

You must configure resource requests and resource limits for each container when you create a pod, including requests and limits for CPU and memory resources.

  • requests: specifies the minimum amount of resources that must be reserved for a pod to run normally. If the node that hosts the pod has sufficient resources, the pod can consume more resources than the value of the requests parameter.

  • limits: specifies the maximum amount of resources that can be allocated to a pod. This prevents a pod from consuming unlimited resources and causing other pods to fail. If a process in a pod attempts to consume more memory than the value of the limits parameter, the kernel of the operating system terminates the process and returns an Out of Memory (OOM) error.

    After you configure resource limits, ACK controls resource usage in a proactive or passive manner, depending on the runtime of the container. With the passive method, ACK limits resource usage only after it detects that a container exceeds its limit. With the proactive method, ACK prevents a container from exceeding its limit in the first place. If you run a Java application, we recommend that you set the -Xmx option in JAVA_OPTS to 70% to 80% of the value specified by the limits parameter. A configuration example follows this list.
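
As an illustration, the following sketch reserves 0.25 vCPU and 256 MiB of memory for a container and caps it at 0.5 vCPU and 512 MiB. The values and the image are placeholders that you should size for your workload.

    apiVersion: v1
    kind: Pod
    metadata:
      name: resource-demo
    spec:
      containers:
      - name: app
        image: registry.example.com/java-app:1.0   # placeholder image
        resources:
          requests:
            cpu: "250m"           # minimum reserved for scheduling
            memory: "256Mi"
          limits:
            cpu: "500m"           # hard cap on CPU usage
            memory: "512Mi"       # exceeding this can trigger an OOM kill
        env:
        # For a Java application, set -Xmx to about 70-80% of the memory limit.
        - name: JAVA_OPTS
          value: "-Xmx400m"       # example value for a 512Mi limit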

You can view the CPU and memory usage of all ECS instances in a cluster. For more information, see View nodes.

The CPU requests/limits/usage and Memory requests/limits/usage fields show the percentages of node resources that are requested, limited, and in use. Because the requests of a container cannot exceed its limits, the requested percentage is equal to or lower than the limited percentage.

Note

We recommend that you configure resource requests and limits for all containers. The node information shows only the ratio of the sum of the resource requests or limits of all containers on the node to the total resources of the node. Containers for which no requests or limits are configured are not counted in this ratio.

How do I schedule pods to a specified node?

Note

ACK Serverless clusters do not allow you to schedule pods to a specified node.

By default, pods are scheduled to different nodes in an ACK cluster to ensure load balancing. If you want to schedule a pod to a specified node, you can specify node affinities, taints, or tolerations for the pod.

  • Node affinity: You can use node affinities to schedule pods only to a specific node or prioritize a specific node when ACK schedules pods. You can configure node affinities by specifying the nodeSelector parameter in the YAML file of a pod. For more information, see node-affinity.

  • Tolerations: You can configure toleration rules to schedule pods to nodes with matching taints. Taints and tolerations prevent the system from scheduling pods to inappropriate nodes. You can add one or more taints to a node, and pods that do not tolerate those taints are not scheduled to that node. For more information, see taint-and-toleration. A configuration sketch follows this list.
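
For reference, the following sketch schedules a pod only to nodes that carry a hypothetical workload=frontend label and tolerates a hypothetical dedicated=frontend:NoSchedule taint. Replace the label and taint with the ones that you manage on your nodes.

    apiVersion: v1
    kind: Pod
    metadata:
      name: frontend-pod
    spec:
      nodeSelector:
        workload: frontend        # hypothetical node label
      tolerations:
      - key: "dedicated"          # hypothetical taint key on the target nodes
        operator: "Equal"
        value: "frontend"
        effect: "NoSchedule"
      containers:
      - name: app
        image: nginx:stable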

When you create a pod, you can configure Scheduling parameters on the Advanced wizard page. For more information, see Scheduling settings.

Log on to the ACK console and click the cluster that you want to manage. On the cluster details page, choose Nodes > Nodes and manage the labels and taints of nodes in the cluster. For more information, see Manage node labels and Manage taints.

You can configure node labels or Deployments to schedule pods to specific nodes. For more information, see Schedule pods to specific nodes.

How do I query the basic information about a Service?

For more information about how to query the basic information about a Service, such as the name, type, creation time, cluster IP address, and external endpoint, see Use Services to expose applications.

How do I expose a containerized application to external access?

You can configure an Internet-facing Server Load Balancer (SLB) instance or configure an internal-facing SLB instance and an elastic IP address (EIP) for a Service to expose an application to external access. For more information, see Use an automatically created SLB instance to expose an application or Use an existing SLB instance to expose an application.

If multiple Services are used in microservices scenarios, you can assign different ports to different Services and configure the Services to use the same EIP. This reduces the cost of EIPs and SLB instances.

Note

You must install the NGINX Ingress controller before you can use Ingresses. You can install the NGINX Ingress controller by using one of the following methods:

  • If you want to use a new cluster, select Install NGINX Ingress Controller in the Ingress section on the Component Configurations wizard page when you create the cluster. For more information, see Create an ACK managed cluster.

  • If you use an existing cluster, you can install the NGINX Ingress controller on the Add-ons page. For more information, see Manage components.
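
For reference, an Ingress similar to the following sketch routes two paths on one domain to two different Services, so both applications share the same SLB instance and EIP. The host, paths, and Service names are placeholders.

    apiVersion: networking.k8s.io/v1
    kind: Ingress
    metadata:
      name: shared-entry
    spec:
      ingressClassName: nginx     # assumes the NGINX Ingress controller is installed
      rules:
      - host: demo.example.com    # placeholder domain
        http:
          paths:
          - path: /app-a
            pathType: Prefix
            backend:
              service:
                name: service-a   # placeholder Service name
                port:
                  number: 80
          - path: /app-b
            pathType: Prefix
            backend:
              service:
                name: service-b   # placeholder Service name
                port:
                  number: 80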

How do I enable auto scaling?

ACK supports auto scaling, which scales computing resources elastically to meet business requirements.

  • Resource layer

    Use ECS instances and elastic container instances together. You can deploy regular workloads on ECS instances and use elastic container instances to handle heavy traffic during peak hours. For more information about how to use elastic container instances, see Use Elastic Container Instance in ACK clusters. For more information about how to use ECS, see What is ECS?

    • ACK uses virtual nodes to allow seamless integration between Kubernetes and elastic container instances. Virtual nodes provide Kubernetes clusters with high elasticity. This way, Kubernetes clusters are no longer limited by the computing capacity of the cluster nodes. For more information, see Deploy the virtual node controller and use it to create Elastic Container Instance-based pods.

    • If only ECS instances are used as computing resources, ACK uses the auto scaling component to scale nodes. Regular instances, GPU-accelerated instances, and preemptible instances can be automatically added to or removed from an ACK cluster as required. This component is suitable for multiple types of instances, instances that are deployed across multiple zones, and various scenarios. For more information, see Auto scaling of nodes.

  • Container layer

    ACK provides general scaling models, such as Horizontal Pod Autoscaler (HPA), Cron Horizontal Pod Autoscaler (CronHPA), Vertical Pod Autoscaler (VPA), and horizontal auto scaling components. ACK also provides scaling models that use specific technologies, such as event-driven scaling, Knative serverless scaling framework, fine-grained scaling of elastic workloads, and Spark and Presto CustomResourceDefinitions (CRDs). For more information, see Auto scaling overview.

You can configure ACK to perform auto scaling at the resource layer only. Alternatively, if resource utilization is low or resources are sufficient, you can configure ACK to perform auto scaling at the container layer only.
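
As a container-layer example, the following sketch defines an HPA that keeps the average CPU utilization of a Deployment at about 70% and scales between 2 and 10 replicas. The Deployment name and thresholds are placeholders.

    apiVersion: autoscaling/v2
    kind: HorizontalPodAutoscaler
    metadata:
      name: nginx-hpa
    spec:
      scaleTargetRef:
        apiVersion: apps/v1
        kind: Deployment
        name: nginx               # placeholder Deployment name
      minReplicas: 2
      maxReplicas: 10
      metrics:
      - type: Resource
        resource:
          name: cpu
          target:
            type: Utilization
            averageUtilization: 70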

How do I expand storage resources?

You can expand only disks. For more information, see Expand a disk volume without service interruptions.

How do I access an ACK cluster?

You can access an ACK cluster by using one of the following methods:

  • Method 1: Connect to an ACK cluster in the ACK console

  • Method 2: Connect to an ACK cluster by using kubectl

    To connect to an ACK cluster from your on-premises machine, we recommend that you use the kubectl command-line tool. For more information, see Obtain the kubeconfig file of a cluster and use kubectl to connect to the cluster.

  • Method 3: Use kubectl on Cloud Shell to manage ACK clusters

    If you use this method, you do not need to install kubectl. For more information, see Use kubectl on Cloud Shell to manage ACK clusters.

  • Method 4: Use SSH to access an ACK cluster

    If you do not enable SSH logon when you create an ACK cluster, you cannot directly connect to the cluster by using SSH or kubectl. To use SSH to connect to an ACK cluster that has SSH logon disabled, you must manually associate EIPs with the ECS instances in the cluster, configure security group rules, and then open SSH port 22. For more information, see Use SSH to connect to the master nodes of an ACK dedicated cluster.

  • Use other methods to access an ACK cluster

    You can also use other methods to access an ACK cluster. For example, you can access a cluster through the API server or by using an SSH key pair.
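
For example, after you obtain the kubeconfig file of a cluster, you can verify the connection from your on-premises machine with commands similar to the following. The file path is a placeholder.

    # Point kubectl at the downloaded cluster credentials
    export KUBECONFIG=$HOME/.kube/ack-config   # placeholder path
    # Verify that the cluster is reachable
    kubectl get nodes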

How do I query the public IP address of an ACK cluster?

The API server handles all data exchange for an ACK cluster. The public IP address of an ACK cluster is the public IP address of the cluster's API server. ACK allows you to use an EIP to expose the API server of a cluster so that the API server can be accessed over the Internet. To query the public IP address of an ACK cluster, perform the following steps:

  1. Log on to the ACK console.

  2. In the left-side navigation pane of the ACK console, click Clusters.

  3. On the Clusters page, find the cluster that you want to manage. Then, click the name of the cluster or click Details in the Actions column.

  4. On the cluster details page, click the Basic Information tab. In the Cluster Information section, you can view the public IP address of the cluster in the API Server Public Endpoint field.

You can associate an EIP with the API server of a cluster when you create the cluster or after the cluster is created. For more information, see Control public access to the API server of a cluster.
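
If you have already configured cluster credentials, you can also query the API server endpoint from the command line:

    # Prints the API server endpoint of the current kubeconfig context
    kubectl cluster-info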

How do I access an ACK cluster over the Internet or an internal network from an on-premises machine?

You must connect to the ACK cluster through the API server.

Access over the Internet

  1. Check whether Internet access is enabled for the cluster.

  2. Connect to the cluster.

    You can connect to the cluster by using one of the methods described in the How do I access an ACK cluster? section of this topic.

Access over an internal network

We recommend that you do not expose the API server of an ACK cluster to the Internet in production environments. Instead, access the cluster over an internal network.

The machine on which the kubectl client is installed and the ACK cluster must be deployed in the same VPC. You must also configure cluster credentials to connect to the cluster. For more information, see Obtain the kubeconfig file of a cluster and use kubectl to connect to the cluster.

How do I grant RAM users different permissions on an ACK cluster?

The authorization mechanism of ACK consists of Resource Access Management (RAM) authorization and role-based access control (RBAC) authorization.

  • O&M engineers manage cloud resources as RAM users. If you want to scale a cluster, add nodes to a cluster, or access a cluster as a RAM user, you must grant the required permissions to the RAM user. For more information, see RAM authorization.

  • If a RAM user requires the permissions to manage resources in a specified cluster and namespace, you must go to the Authorizations page of the ACK console and grant the required permissions to the RAM user. For more information, see RBAC authorization.

You can use an Alibaba Cloud account to grant permissions to a RAM user and then associate the permissions with a cluster role based on RBAC. This allows you to configure different permission policies for different roles to manage resources in a specified cluster and namespace.
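
For reference, the following sketch grants read-only access to pods in one namespace. The namespace and the subject name are placeholders; in ACK, RBAC subjects for RAM users are typically identified by the RAM user ID, so check the Authorizations page or the generated kubeconfig file for the exact value.

    apiVersion: rbac.authorization.k8s.io/v1
    kind: Role
    metadata:
      name: pod-reader
      namespace: demo                  # placeholder namespace
    rules:
    - apiGroups: [""]
      resources: ["pods"]
      verbs: ["get", "list", "watch"]
    ---
    apiVersion: rbac.authorization.k8s.io/v1
    kind: RoleBinding
    metadata:
      name: pod-reader-binding
      namespace: demo                  # placeholder namespace
    subjects:
    - kind: User
      name: "266xxxxxxxxxxxxx"         # placeholder RAM user ID
      apiGroup: rbac.authorization.k8s.io
    roleRef:
      kind: Role
      name: pod-reader
      apiGroup: rbac.authorization.k8s.io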

After the RAM user logs on to the ACK console, the RAM user can obtain the kubeconfig file that corresponds to the granted permissions. For more information, see Step 2: Select a type of cluster credentials.

Note

RAM users can obtain only their own kubeconfig files. The kubeconfig file of each RAM user is different. If a RAM user performs operations on a cluster by using the kubeconfig file, the operations and the RAM user ID are recorded in the audit log.

How do I access workloads over the Internet?

ACK allows you to access cluster workloads over the Internet by using LoadBalancer Services that are associated with SLB instances or by using Ingresses. For more information, see the How do I expose a containerized application to external access? section of this topic.

How do I enable communication among workloads in an ACK cluster?

You can use internal domain names or ClusterIP Services to enable communication among workloads in an ACK cluster.

For example, if you want to allow Workload A in a cluster to access Workload B in the same cluster, you can create a ClusterIP Service for Workload B. For more information, see Use Services to expose applications. After the ClusterIP Service is created, Workload A can access Workload B by using the following connection strings:

  • <ClusterIP Service name>.<Workload namespace>.svc.cluster.local:<Port number>

  • ClusterIP:<Port number>
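
For example, if Workload B is exposed by a ClusterIP Service named nginx-svc in the default namespace on port 80, a container in Workload A can reach it as follows. The names are placeholders.

    # Access Workload B through the cluster-internal DNS name of its Service
    curl http://nginx-svc.default.svc.cluster.local:80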

What do I need to take note of when I expose Services through SLB instances?

When you create a LoadBalancer Service, the cloud controller manager (CCM) automatically creates and configures an SLB instance for the Service. We recommend that you do not modify the SLB instance in the SLB console. Otherwise, access to the Service may be interrupted. For more information, see Considerations for configuring a LoadBalancer type Service.
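
If you must adjust instance or listener settings, do so through Service annotations rather than in the SLB console. For example, to reuse an existing SLB instance instead of letting the CCM create one, a manifest might look like the following sketch; the annotation keys are shown for reference, so verify them against the CCM documentation for your cluster version.

    apiVersion: v1
    kind: Service
    metadata:
      name: nginx
      annotations:
        # Reuse an existing SLB instance instead of creating a new one.
        # Verify these annotation keys against your CCM version.
        service.beta.kubernetes.io/alibaba-cloud-loadbalancer-id: "lb-xxxxxxxx"   # placeholder instance ID
        # Allow the CCM to manage listeners on the existing instance; use with caution.
        service.beta.kubernetes.io/alibaba-cloud-loadbalancer-force-override-listeners: "true"
    spec:
      type: LoadBalancer
      selector:
        app: nginx
      ports:
      - port: 80
        targetPort: 80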