All Products
Search
Document Center

Microservices Engine:Traffic protection overview

Last Updated:Mar 11, 2026

When microservices scale, a single slow dependency or unexpected traffic spike can cascade across your entire call chain and bring down production. Traffic protection in Microservices Engine (MSE) takes traffic as the starting point and ensures service stability through real-time throttling, circuit breaking and degradation, and system overload controls across distributed applications. As a commercial product of the open source Sentinel framework, it is the core component behind large-scale promotion events such as Double 11.

How traffic protection works

In a distributed application, a client request passes through multiple layers -- gateway, firewall, and Network Load Balancer (NLB) -- before reaching the frontend network service and then the backend services. Along the way, downstream services and third-party applications may be called.

Traffic protection operates across this entire request flow. It provides professional and stable traffic protection measures, continuously analyzes traffic patterns at each layer, and enforces protection rules within seconds. Rule updates take effect immediately.

The protection model has two core properties:

  • Second-level detection and enforcement: Detects anomalies and applies rules within seconds of a traffic change.

  • Dynamic rule updates: Push new or modified protection rules in real time.

When to use each feature

Traffic protection includes four complementary features. Use the following table to identify the right feature for your scenario:

ScenarioProblemFeatureHow it helps
Flash sales or promotional eventsSudden traffic spikes overwhelm backend capacityIngress throttlingCaps inbound request rates at service entry points to match downstream capacity
Peak-load shifting for messagesBursty message traffic degrades consumer performanceIngress throttling + concurrency isolationLimits request rates and isolates slow consumers so they do not exhaust shared resources
Cluster throttlingAggregate traffic exceeds the total capacity of a service clusterIngress throttlingEnforces request rate limits distributed across cluster nodes
Hotspot parametersA single parameter value (for example, a frequently accessed item ID) generates disproportionate loadHotspot isolationSeparates hotspot traffic from normal traffic so that normal requests retain capacity
Cascading failuresA slow or failing dependency brings down the entire call chainCircuit breakingTrips a circuit breaker on the unstable dependency to limit the blast radius

Feature details

Traffic protection provides professional and diversified preventive measures:

Ingress throttling

Controls inbound request rates based on service capacity. Apply throttling at application entry points such as gateways, frontend applications, and service providers to prevent traffic from exceeding what downstream services can handle.

Ingress throttling is the first line of defense: it rejects excess requests before they enter the system, protecting all downstream services in the call chain.

Concurrency isolation

Limits the maximum number of concurrent threads for a given resource. If a slow call holds a thread for too long, other interfaces stay responsive because the thread pool is not exhausted.

Use concurrency isolation when your services share a common thread pool and a single slow dependency could monopolize all available threads.

Hotspot isolation

Separates hotspot traffic from normal traffic. This prevents invalid hotspots from preempting the capacity of normal traffic. When a specific parameter value -- such as a frequently accessed item ID -- generates disproportionate load, hotspot isolation caps the rate for that parameter so that normal requests retain their share of capacity.

Circuit breaking on unstable dependencies

Applies circuit breaking both between and within applications. When a downstream dependency becomes unstable -- responding slowly or returning errors above a threshold -- the circuit breaker trips to limit the blast radius and prevent cascading failures.

Monitoring

Traffic protection provides real-time traffic monitoring and traffic diagnosis and analysis at the client level. This helps O&M personnel take preventive measures to protect application stability.

Traffic protection provides three levels of observability:

LevelScope
Single-node monitoringReal-time metrics for individual service instances
Aggregate monitoringCluster-wide view that combines metrics across all nodes
Historical data queriesPast traffic data stored for retrospective analysis

References

  • Sentinel -- the open source framework that traffic protection builds on.