Automatic elasticity and stable delivery

Solution background

High availability and Auto Scaling

In modern application architectures, high availability is critical for ensuring business continuity and serves as the foundation for horizontal scaling. Horizontal scaling prevents resource waste by precisely matching resource supply with demand. To reduce unnecessary costs, it automatically scales in resources during low-traffic periods. To maintain service quality and user experience, it rapidly scales out server instances during traffic peaks.

Solution advantages

Why choose our solution?

In a traditional approach, you build your own load balancer nodes and manually scale resources out and in to match business volume. This method incurs high hardware costs, introduces management complexity, and results in poor maintainability. It also demands a significant investment of resources, time, and personnel. In contrast, the Alibaba Cloud ALB and Auto Scaling solution automates horizontal application scaling.

Deployment and maintenance complexity

Traditional approach: Complex deployment. Requires manual configuration of multi-level redundant systems and continuous investment from a dedicated operations team for ongoing maintenance.

Alibaba Cloud ALB + Auto Scaling: Simplified configuration. Provides highly integrated tools and services for application high availability, which simplifies the configuration process.

Cost efficiency

Traditional approach: Higher fixed cost. Requires purchasing and maintaining equipment. All hardware must be kept running even during periods of low load, resulting in high fixed costs.

Alibaba Cloud ALB + Auto Scaling: Reduced costs. With the pay-as-you-go model, resources scale automatically based on actual demand, which avoids waste.

Fault recovery

Traditional approach: Slow fault recovery. Requires manual intervention to inspect and replace faulty hardware or restart systems.

Alibaba Cloud ALB + Auto Scaling: Fast automatic failover. Offers fast, automatic failover capabilities, helping you achieve near-zero downtime fault recovery.

Scalability

Traditional approach: Poor scalability. Requires pre-planning and purchasing extra resources to handle potential traffic growth, which often leaves resources idle for long periods.

Alibaba Cloud ALB + Auto Scaling: Excellent scalability. Automatically adjusts resources in real time to handle traffic fluctuations.

Improve availability with ALB and Auto Scaling

High concurrency

A single ALB instance supports up to 10 million concurrent connections, 1 million connections per second (CPS), and 1 million Layer-7 queries per second (QPS).

High availability

ALB offers an SLA of up to 99.995%, while Auto Scaling maintains application availability by using health checks to automatically replace unhealthy ECS instances.
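The health-check-and-replace behavior described above can be illustrated with a minimal sketch. It is not Auto Scaling's actual implementation: the `Instance` class, the `replace_unhealthy` function, the three-failure threshold, and the `-replacement` naming are all hypothetical, chosen only to show the idea of swapping out an instance after repeated failed probes.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Instance:
    """Hypothetical stand-in for an ECS instance in a scaling group."""
    instance_id: str
    consecutive_failures: int = 0


def replace_unhealthy(
    group: List[Instance],
    probe_ok: Dict[str, bool],
    threshold: int = 3,  # assumed value, not a documented default
) -> Tuple[List[Instance], List[str]]:
    """Return the updated group and the IDs of instances that were replaced.

    An instance is replaced once it has failed `threshold` probes in a row.
    A missing probe result is treated as a failure.
    """
    updated: List[Instance] = []
    replaced: List[str] = []
    for inst in group:
        if probe_ok.get(inst.instance_id, False):
            failures = 0  # a successful probe resets the counter
        else:
            failures = inst.consecutive_failures + 1

        if failures >= threshold:
            replaced.append(inst.instance_id)
            # A fresh instance takes the place of the unhealthy one.
            updated.append(Instance(f"{inst.instance_id}-replacement"))
        else:
            updated.append(Instance(inst.instance_id, failures))
    return updated, replaced
```

Calling `replace_unhealthy` once per health-check interval keeps the group size constant while cycling out instances that stay unhealthy.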

Optimize costs with Auto Scaling

Auto Scaling reduces costs by automatically scaling resources to match demand, eliminating the need for manual adjustments.
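To make "scaling resources to match demand" concrete, here is a minimal sketch of a proportional, target-tracking style calculation. It is an illustration under stated assumptions, not the algorithm Auto Scaling uses: the function name `desired_capacity`, its parameters, and the example figures are hypothetical.

```python
import math


def desired_capacity(
    current_instances: int,
    current_metric: float,
    target_metric: float,
    min_size: int,
    max_size: int,
) -> int:
    """Estimate how many instances keep the average metric near the target.

    The metric could be, for example, average CPU utilization in percent.
    The result is clamped to the group's assumed [min_size, max_size] bounds.
    """
    if current_instances <= 0 or target_metric <= 0:
        raise ValueError("current_instances and target_metric must be positive")
    # Total load is roughly current_instances * current_metric; divide it by
    # the per-instance target and round up so the target is not exceeded.
    raw = math.ceil(current_instances * current_metric / target_metric)
    return max(min_size, min(max_size, raw))
```

For example, with 4 instances averaging 90% CPU against a 60% target, the sketch suggests scaling out to 6 instances; at 15% CPU it suggests scaling in, but never below `min_size`.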

Architecture and deployment

On-demand scaling: Automatic elasticity and stable delivery


The ALB + Auto Scaling solution improves application availability. When ALB receives a user request, it routes the traffic to the appropriate server group based on the request content. The ALB server group is associated with an Auto Scaling group, so when business demand fluctuates, Auto Scaling automatically adds or removes ECS instances of the specified types to match demand.
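Routing by request content can be pictured as an ordered list of forwarding rules, each pointing at a server group. The sketch below is only an illustration: the `Rule` class, the `route` function, the host and path-prefix matching, and the group names are hypothetical and do not reflect the ALB API or its full rule syntax.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Rule:
    """Hypothetical forwarding rule: optional host and path-prefix conditions."""
    server_group: str
    host: Optional[str] = None
    path_prefix: Optional[str] = None


def route(rules: List[Rule], host: str, path: str, default_group: str) -> str:
    """Return the server group of the first rule whose conditions all match."""
    for rule in rules:
        if rule.host is not None and rule.host != host:
            continue
        if rule.path_prefix is not None and not path.startswith(rule.path_prefix):
            continue
        return rule.server_group
    # No rule matched: fall back to the default server group.
    return default_group


# Example rule set (all names are made up for illustration).
rules = [
    Rule("api-group", path_prefix="/api/"),
    Rule("static-group", host="static.example.com"),
]
```

Because each server group is backed by an Auto Scaling group, a rule only needs to name the group; the number of ECS instances behind it can change without touching the rules.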

Estimated deployment time: 30 minutes

Estimated cost: CNY 10 (assuming that you select the recommended resource specifications and the uptime does not exceed 60 minutes. The actual cost may vary with the traffic consumed during your operation. Refer to the quotation displayed in the console and your final bill.)

Elastic Compute Service, SLB, Auto Scaling
