When microservices infrastructure encounters capacity limits, performance degradation, or operational state changes, Microservices Engine (MSE) generates events through Event Center. Use this reference to identify each event, understand its severity and impact, and take the appropriate remediation action.
Event Center collects events from three MSE components: Microservices Registry (Nacos, ZooKeeper), cloud-native gateways, and Microservices Governance.
Severity levels
MSE classifies events into three severity levels:
| Severity | Meaning | Action |
|---|---|---|
| Critical | The system actively blocks operations -- rejecting requests, refusing new connections, or preventing resource creation. | Take immediate action. |
| Warning | The system detects a threshold breach or abnormal condition but still allows operations. | Investigate promptly to prevent escalation. |
| Info | A lifecycle event such as a scale-out completion or graceful shutdown. | No action required unless unexpected. |
Two-tier escalation pattern
Several Nacos events follow a two-tier escalation pattern: a Warning fires first when the system enters monitor mode, then a Critical fires if the condition worsens and the system switches to enforcement mode.
The CloudMonitor event name suffix indicates the current state:
| Suffix | Mode | Behavior |
|---|---|---|
:Normal | Monitor (Warning) | Operations continue. The system logs the breach but does not block traffic. |
:Executing | Enforcement (Critical) | The system actively rejects or blocks operations that exceed the threshold. |
Nacos events
Nacos:TPSRateLimit -- Client TPS throttling
Fires when client transactions per second (TPS) against the Nacos engine exceed the configured threshold.
| Severity | CloudMonitor event name | Behavior |
|---|---|---|
| Warning | Nacos:TPSRateLimit:Normal | Monitor mode. Requests above the threshold are still allowed. |
| Critical | Nacos:TPSRateLimit:Executing | Enforcement mode. Requests above the threshold are rejected. |
Remediation:
Upgrade the instance specification or scale out nodes to increase capacity.
Check whether application code is driving the high TPS -- for example, calling a Nacos API in a tight loop or creating multiple Nacos client instances instead of reusing a singleton.
Nacos:ConfigCapLimit -- Configuration count limit
Fires when the number of configurations in a Nacos instance exceeds the default upper limit of 10,000.
| Severity | CloudMonitor event name | Behavior |
|---|---|---|
| Warning | Nacos:ConfigCapLimit:Normal | Monitor mode. New configurations can still be created and released. |
| Critical | Nacos:ConfigCapLimit:Executing | Enforcement mode. New configurations cannot be created or released. |
Remediation:
Delete unused configurations.
Check whether the application is programmatically publishing a large number of configurations -- for example, creating one configuration per request instead of aggregating values.
Nacos:ServiceCapLimit -- Service registration limit
Fires when the number of registered services reaches the instance storage upper limit.
| Severity | CloudMonitor event name | Behavior |
|---|---|---|
| Warning | Nacos:ServiceCapLimit:Normal | Monitor mode. New services and service providers can still register. |
| Critical | Nacos:ServiceCapLimit:Executing | Enforcement mode. New services and service providers cannot register. |
Remediation:
Upgrade the instance specification or scale out nodes.
Check for invalid or duplicate service registrations, or multiple Nacos client instances creating redundant registrations.
Nacos:ConnectionCapLimit -- Connection count limit
Fires when the number of connected clients reaches the instance upper limit.
| Severity | CloudMonitor event name | Behavior |
|---|---|---|
| Warning | Nacos:ConnectionCapLimit:Normal | Monitor mode. New client connections are still allowed. |
| Critical | Nacos:ConnectionCapLimit:Executing | Enforcement mode. New client connections are rejected. |
Remediation:
Upgrade the instance specification or scale out nodes.
Check for multiple Nacos client instances per application or connection leaks caused by clients that are not properly closed.
Nacos:ParametersIllegalDenied -- Invalid request parameters
Fires when a client sends invalid request parameters to the Nacos instance.
| Severity | CloudMonitor event name | Behavior |
|---|---|---|
| Warning | Nacos:ParametersIllegalDenied:Normal | Monitor mode. Normal results are still returned. |
| Critical | Nacos:ParametersIllegalDenied:Executing | Enforcement mode. The server rejects the request and returns an error code. |
Remediation:
Check whether request parameters of the client are valid.
Nacos:CPUAlert -- High CPU load
Fires when CPU load on a Nacos instance is too high and the instance is overloaded. Subsequent requests may respond slowly or fail.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:CPUAlert:Executing |
Remediation:
Upgrade the instance specification or add nodes to distribute load.
Check whether the high CPU load is caused by improper use of the instance.
Nacos:MemoryAlert -- High memory usage
Fires when memory usage on a Nacos instance is too high. An out-of-memory (OOM) killer may terminate processes, causing instance nodes to restart.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:MemoryAlert:Executing |
Remediation:
Upgrade the instance specification. High memory usage typically indicates that the instance is handling more services, configurations, or connections than its current specification supports.
Nacos:FullGc -- Frequent full garbage collections
Fires when frequent full garbage collections (GCs) reduce available capacity. Subsequent requests may respond slowly or fail.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:FullGc:Executing |
Remediation:
Upgrade the instance specification or add nodes.
Check whether the capacity is insufficient due to improper use of the instance.
Nacos:TooManyConfigAlert -- Configuration count exceeded
Fires when the number of configurations exceeds 10,000. Additional configurations cannot be created, and new configuration releases may fail.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:TooManyConfigAlert |
Remediation:
Delete unnecessary configurations.
Nacos:TooManyConfigLongPolling -- Excessive configuration listeners
Fires when too many listeners are configured for a Nacos instance, causing overload. Frequent full GCs may follow, leading to slow or failed responses.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:TooManyConfigLongPolling |
Remediation:
Upgrade the instance specification or add nodes.
Review application code for unnecessary configuration listeners. Each listener maintains a long-polling connection, so reducing listener count directly lowers resource consumption.
Nacos:ProviderDropTooFast -- Sudden decrease in service providers
Fires when the number of registered service providers drops by more than 50% compared to 3 minutes ago. Upstream services may fail to discover downstream providers.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Nacos:ProviderDropTooFast:Executing |
Remediation:
Check for widespread application failures or network issues that caused multiple providers to deregister simultaneously.
If the drop resulted from a planned large-scale deployment or release, this event can be safely ignored.
ZooKeeper events
ZooKeeper:ExceedEphemeralLimit -- Ephemeral node limit exceeded
Fires when a single session creates more than 2,000 ephemeral nodes. Additional ephemeral nodes may fail to be added.
| Severity | CloudMonitor event name |
|---|---|
| Critical | ZooKeeper:ExceedEphemeralLimit |
Remediation:
Review business code for excessive ephemeral node creation within a single session.
If you use Dubbo, check the version. Older Dubbo versions may create excessive ephemeral nodes. Upgrade to the latest version.
Add the
Jute.Maxbufferparameter to ensure successful recovery from failures.
ZooKeeper:FullGc -- Long full garbage collections
Fires when full GCs take too long, increasing processing latency and causing response timeouts. ZooKeeper instance nodes may be recovered from a failure and restarted.
| Severity | CloudMonitor event name |
|---|---|
| Warning | ZooKeeper:CMSGc:Executing |
Remediation:
Check the ZooKeeper version and upgrade if outdated. Newer versions include GC optimizations.
If this event persists, add nodes to the instance to increase capacity.
ZooKeeper:CPUAlert -- High CPU load
Fires when CPU load is too high, increasing processing latency and causing response timeouts. ZooKeeper instance nodes may be recovered from a failure and restarted.
| Severity | CloudMonitor event name |
|---|---|
| Warning | ZooKeeper:CPUAlert:Executing |
Remediation:
If this event persists, add nodes to the instance to increase capacity.
Cloud-native gateway events
Cloud-native gateway events track the status of elastic scaling operations. Failed operations generate Warning-level events; in-progress and completed operations generate Info-level events.
Gateway:ElasticScaleOut -- Elastic scale-out
| Severity | CloudMonitor event name | Description |
|---|---|---|
| Warning | Gateway:ElasticScaleOut:Failed | Scale-out failed. Check event details in the console. |
| Info | Gateway:ElasticScaleOut:Executing | Scale-out in progress. |
| Info | Gateway:ElasticScaleOut:Executed | Scale-out completed. |
Gateway:ElasticScaleIn -- Elastic scale-in
| Severity | CloudMonitor event name | Description |
|---|---|---|
| Warning | Gateway:ElasticScaleIn:Failed | Scale-in failed. Check event details in the console. |
| Info | Gateway:ElasticScaleIn:Executing | Scale-in in progress. |
| Info | Gateway:ElasticScaleIn:Executed | Scale-in completed. |
Gateway:ElasticOpen -- Enable elastic scale-out
| Severity | CloudMonitor event name | Description |
|---|---|---|
| Warning | Gateway:ElasticOpen:Failed | Failed to enable elastic scale-out. Check event details in the console. |
| Info | Gateway:ElasticOpen:Executed | Elastic scale-out enabled. |
Gateway:ElasticClose -- Disable elastic scale-out
| Severity | CloudMonitor event name | Description |
|---|---|---|
| Warning | Gateway:ElasticClose:Failed | Failed to disable elastic scale-out. Check event details in the console. |
| Info | Gateway:ElasticClose:Executed | Elastic scale-out disabled. |
Gateway:ElasticStrategyUpdate -- Elasticity policy update
| Severity | CloudMonitor event name | Description |
|---|---|---|
| Warning | Gateway:ElasticStrategyUpdate:Failed | Failed to update the elasticity policy. Check event details in the console. |
| Info | Gateway:ElasticStrategyUpdate:Executed | Elasticity policy updated. |
Microservices Governance events
Governance:FlowTrafficBlock -- Throttling rule triggered
Fires when an interface throttling rule configured in Microservices Governance is triggered.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Governance:FlowTrafficBlock:Executing |
Remediation:
Burst traffic is hitting the interface. Monitor traffic trends and assess whether the throttling threshold needs adjustment.
Governance:CircleBreakerTrafficBlock -- Circuit breaking rule triggered
Fires when a circuit breaking rule configured in Microservices Governance is triggered.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Governance:CircleBreakerTrafficBlock:Executing |
Remediation:
Check downstream interface dependencies for unstable performance, such as slow calls or service errors.
Governance:ParamsTrafficBlock -- Hotspot parameter protection triggered
Fires when a hotspot parameter protection rule configured in Microservices Governance is triggered.
| Severity | CloudMonitor event name |
|---|---|
| Warning | Governance:ParamsTrafficBlock:Executing |
Remediation:
If hotspot parameters are called frequently, prefetch the relevant data to reduce per-request load.
Lifecycle events
The following Info-level events track microservice lifecycle operations. No action is required unless the event is unexpected.
| Event code | CloudMonitor event name | When it fires |
|---|---|---|
Governance:GracefulShutdown | Governance:GracefulShutdown | A microservice application is gracefully shut down. This feature is automatically enabled with Microservices Governance. Also fires when an instance is manually shut down on the Node details page. |
Governance:RegistrationFinish | Governance:RegistrationFinish | A microservice application completes registration and startup after the graceful start feature is enabled. Also fires when an instance is manually started on the Node details page. |
Governance:WarmupStart | Governance:WarmupStart | A new instance begins traffic warmup after the graceful start feature is enabled. |
Governance:WarmupEnd | Governance:WarmupEnd | Traffic warmup completes for a new instance. |
Governance:ReadinessFinish | Governance:ReadinessFinish | The Kubernetes readiness probe completes for the application. |
Governance:ApplicationStop | Governance:ApplicationStop | The graceful shutdown process completes before the application stops. |