Automatic elasticity and stable delivery
This solution uses Application Load Balancer (ALB) and Auto Scaling to distribute network traffic and dynamically scale server resources. This approach improves application high availability and throughput, and it reduces costs by optimizing resource use.
Intended customers
Users who handle high-concurrency traffic
Users who need to respond quickly to market changes
Users who seek to optimize resources and control costs
Solution background
High availability and Auto Scaling
In modern application architectures, high availability is critical for ensuring business continuity and serves as the foundation for horizontal scaling. Horizontal scaling prevents resource waste by precisely matching resource supply with demand. To reduce unnecessary costs, it automatically scales in resources during low-traffic periods. To maintain service quality and user experience, it rapidly scales out server instances during traffic peaks.
Solution advantages
Why choose our solution?
In a traditional approach, you build your own load balancer nodes and manually scale resources out and in to match business volume. This method incurs high hardware costs, introduces management complexity, and results in poor maintainability. It also demands a significant investment of resources, time, and personnel. In contrast, the Alibaba Cloud ALB and Auto Scaling solution automates horizontal application scaling.
Traditional approach
Alibaba Cloud ALB + Auto Scaling
Complex deployment
Require manual configuration of multi-level redundant systems and continuous investment from a dedicated operations team for ongoing maintenance.
Deployment and maintenance complexity
Simplified configuration
Provide highly integrated tools and services for application high availability, which simplifies the configuration process.
Higher fixed cost
Require purchasing and maintaining equipment. Sustain all hardware even during periods of low load, resulting in high fixed costs.
Cost efficiency
Reduces costs
The pay-as-you-go model lets you automatically scale resources based on actual demand, which avoids waste.
Slow fault recovery
Require manual intervention to inspect and replace faulty hardware or restart systems.
Fault recovery
Fast automatic failover
Offer fast, automatic failover capabilities, helping you achieve near-zero downtime fault recovery.
Poor scalability
Pre-plan and purchase extra resources to handle potential traffic growth, often leading to long-term resource idling.
Scalability
Excellent scalability
Automatically adjust resources in real time to handle traffic fluctuations.
Improve availability with ALB and Auto Scaling
![]()
High concurrency
A single ALB instance supports up to 10 million concurrent connections, 1 million connections per second (CPS), and 1 million Layer-7 queries per second (QPS).
![]()
High availability
ALB offers up to a 99.995% SLA. While Auto Scaling ensures application availability by using health checks to automatically replace unhealthy ECS instances.
![]()
Optimize costs with Auto Scaling
Auto Scaling reduces costs by automatically scaling resources to match demand, eliminating the need for manual adjustments.
Architecture and deployment
On-demand scaling: Automatic elasticity and stable delivery
2766666
The ALB + Auto Scaling solution is used to improve the availability of applications. When receiving a request from a user, ALB can route traffic to the target group according to the request content. The ALB server group is associated with the Auto Scaling group, so that when the business demand fluctuates, the Auto Scaling automatically adjusts the specified types of ECS instances to meet the business needs.
30 minutes
CNY 10 (Assume that you select the recommended resource specification and the up time does not exceed 60 minutes. The actual cost may vary due to the actual traffic used during your operation. Please refer to the actual quotation displayed in the console and the final bill.)
Elastic Compute Service SLB Auto Scaling
Recommended solution