All Products
Search
Document Center

Server Load Balancer:What is ALB?

Last Updated:Aug 22, 2023

Application Load Balancer (ALB) is an Alibaba Cloud service that runs at the application layer and is optimized to balance traffic over HTTP, HTTPS, and Quick UDP Internet Connections (QUIC). ALB is highly elastic and can scale on demand to process large volumes of Layer 7 traffic. ALB also provides an availability SLA of up to 99.995%. ALB supports complex routing. ALB is integrated with other cloud-native services and is designed to serve as an ingress gateway to manage inbound traffic on Alibaba Cloud.

ALB architecture

Why ALB?

As a cloud-native ingress gateway on Alibaba Cloud, Application Load Balancer (ALB) is intended for Layer 7 load balancing over HTTP, HTTPS, and QUIC and provides advanced routing features. For more information about ALB Ingresses, see ALB Ingress overview and ALB Ingress features.

Application layer elasticity

ALB is intended for Layer 7 load balancing and provides domain names and virtual IP addresses to handle large amounts of network traffic at multiple levels. ALB distributes network traffic across groups of backend servers and automatically scales in response to the volume of inbound traffic. This not only improves the availability of your applications, but also helps prevent service interruptions caused by single points of failure (SPOFs). ALB supports custom cross-zone deployment and elastic scaling across zones to prevent resource insufficiency in individual zones.

Advanced protocols

ALB supports HTTP, HTTPS, and QUIC and can process a heavy load of network traffic. You can use ALB to provide fast and secure connections to mobile Internet applications such as real-time audio and video applications, interactive streaming applications, and online game applications. ALB supports Google Remote Procedure Call (gRPC) to facilitate efficient communication among microservices through APIs.

Content-based routing

ALB can route network traffic to different backend servers based on the request content such as the path, HTTP header, query string, HTTP request method, cookie, and source IP address. ALB also supports advanced configurations such as redirects, rewrites, and custom HTTPS headers.

Security and reliability

ALB supports DDoS mitigation and can be integrated with Web Application Firewall (WAF) to provide more security features. In addition, ALB supports end-to-end HTTPS encryption, custom security policies, and efficient encryption protocols such as TLS 1.3. ALB protects your business-critical services and complies with the Zero Trust security framework.

Cloud-native

As more and more cloud-native services are developed, a growing number of customers in the Internet, finance, and enterprise sectors choose to deploy their applications in the cloud or migrate their existing business systems to the cloud. As a cloud-native ingress gateway for Alibaba Cloud, ALB is deeply integrated with Container Service for Kubernetes (ACK), Serverless App Engine (SAE), Function Compute, and Kubernetes.

Flexible billing

ALB uses elastic IP addresses (EIPs) and EIP bandwidth plans to provide Internet-facing services and supports flexible billing for Internet data transfer. ALB also supports the pay-by-LCU metering method, which allows you to pay only for the resources that you use.

Performance metrics

ALB supports the Dynamic IP and Static IP modes. The performance of an ALB instance varies based on the IP mode.

Note

The performance metrics of an ALB instance vary only based on the IP mode of the ALB instance, and are not affected by the edition of the ALB instance.

Table 1. Performance of an ALB instance deployed across two zones in dynamic IP mode

IP mode

Maximum number of queries per second (QPS)

Maximum number of connections per second (CPS)

Maximum number of concurrent connections

Maximum private bandwidth

Maximum Internet bandwidth

Dynamic IP

1,000,000

1,000,000

10,000,000

100 Gbps

4000 Mbps

Note

The sum of the maximum bandwidth of all pay-by-data-transfer EIPs in each Alibaba Cloud account in each region cannot exceed 5 Gbit/s. For more information, see the bandwidth limits in Overview.

If you want to use larger bandwidth, purchase an EIP bandwidth plan. For more information, see Create an EIP bandwidth plan.

Table 2. Performance of an ALB instance deployed across two zones in static IP mode

IP mode

Maximum QPS

Maximum CPS

Maximum number of concurrent connections

Maximum private bandwidth

Maximum Internet bandwidth

Static IP

100,000

100,000

1,000,000

10 Gbps

400 Mbps

Note

The sum of the maximum bandwidth of all pay-by-data-transfer EIPs in each Alibaba Cloud account in each region cannot exceed 5 Gbit/s. For more information, see the bandwidth limits in Overview.

If you want to use larger bandwidth, purchase an EIP bandwidth plan. For more information, see Create an EIP bandwidth plan.

Note
  • In static IP mode, you can use CNAME records or A records to provide services, and an ALB instance deployed across two zones supports up to 100,000 QPS. In dynamic IP mode, you can use CNAME records to provide services, and an ALB instance deployed across two zones supports up to 1,000,000 QPS.

  • ALB supports multi-zone deployment. If the current region has two or more zones, you must select at least two zones to ensure high availability. If you select multiple zones for an ALB instance, you are not charged additional fees.

ALB components

ALB components

Term

Description

Instance

ALB provides ultra-high Layer 7 processing capabilities and can increase the service capacity of your applications by distributing network traffic across different backend servers. Each ALB instance supports up to one million QPS.

Listener

A listener is the smallest configurable unit of ALB. Listeners check for requests over the protocols and ports that you configure. For example, you can configure a listener of ALB to process HTTP requests on port 80. You must add at least one listener to each ALB instance before the ALB instance can distribute network traffic. You can add at most 50 listeners to each ALB instance to distribute network traffic for different workloads.

Forwarding rule

ALB distributes requests to backend servers in one or more server groups based on forwarding rules. ALB provides advanced routing features. In addition to the basic routing features, you can specify conditions such as the HTTP header, cookie, and HTTP method in a forwarding rule to route network traffic to different backend servers.

Server group

Backend servers can be organized into logical groups (also known as server groups) to which requests are distributed. Each server group contains one or more backend servers that process requests distributed by ALB. Server groups of ALB are independent of ALB. You can associate a server group with different ALB instances. You can specify up to 1,000 backend servers in each server group. ALB supports multiple types of backend servers. For example, you can specify Elastic Compute Service (ECS) instances, elastic container instances, and elastic network interfaces (ENIs) as backend servers.

Health check

ALB checks the availability of backend servers by performing health checks. If a backend server is declared unhealthy, ALB does not forward requests to the backend server. ALB supports flexible health check configurations. For example, you can specify the protocol, port, and thresholds for health checks. ALB provides health check templates that can be applied to different server groups.

Types of ALB instances

Alibaba Cloud provides Internet-facing and internal-facing ALB instances. You can specify the type of ALB instance based on your business requirements. Whether EIP Internet Shared Bandwidth instances plans are used is based on the specified type of ALB instance.

Architecture

Term

Description

EIP bandwidth plan

Internet Shared Bandwidth instances allow you to share and transfer bandwidth resources within the same region. Internet Shared Bandwidth instances support multiple metering methods such as pay-by-bandwidth and pay-by-enhanced-95th-percentile. You can use Internet Shared Bandwidth instances to reduce Internet bandwidth costs. Internet-facing ALB instances use EIPs aInternet Shared Bandwidth instancesans that support the pay-by-bandwidth and pay-by-enhanced-95th-percentile metering methods to provide Internet-facing services.

Domain name

A domain name that can be accessed over the Internet or private networks and can be resolved to the virtual IP address of an ALB instance. You can also create a CNAME record to map a readable domain name to the domain name of ALB. For more information, see Configure a CNAME record.

EIP

EIPs are required only by Internet-facing ALB instances. You do not need to associate EIPs with internal-facing ALB instances. An EIP is an IP address that ALB uses to provide services over the Internet. You can associate multiple EIPs with an Internet-facing ALB instance. To ensure high availability, an Internet-facing ALB instance must use at least two EIPs in different zones to provide services.

Virtual IP address

Virtual IP addresses of ALB are used to receive and forward requests. Each virtual IP address is a private IP address that belongs to a virtual private cloud (VPC).

Activate ALB

Click the following button to go to the ALB buy page.

Create ALB instances

Deploy and maintain ALB instances

After you create an Alibaba Cloud account, you can deploy and manage ALB instances in the following ways:

  • ALB console: a web interface that you can use to manage your ALB service. You can create, use, or release ALB instances in the console. For more information, see Create an ALB instance.

  • Alibaba Cloud SDKs: SDKs for Java, Go, Python, and other programming languages.

  • OpenAPI Explorer: allows you to retrieve and call API operations, and dynamically generate SDK sample code.

  • Terraform: uses configuration files to call computing resources of Alibaba Cloud and other platforms that support Terraform. Terraform is an open source tool that implements version control.

References