Descriptions, impacts, and handling suggestions of events in MSE Event Center - Microservices Engine

Event Center in the Microservices Engine (MSE) console collects event data that is generated by Microservices Registry, Microservices Governance, and cloud-native gateways. You can view event changes of all resources managed in MSE and troubleshoot issues based on abnormal events. This topic describes the events that may be generated in MSE and provides suggestions on how to handle each event.

Events of Nacos

Event code	Event name	Event severity	CloudMonitor event name	Event description and impact	Event handling suggestion
Nacos:TPSRateLimit	Nacos Client TPS Throttling	Warning	Nacos:TPSRateLimit:Normal	When the client accesses the Nacos engine, the transactions per second (TPS) is excessively high. To ensure stable operations of the Nacos engine, the system is about to perform throttling on requests. In this case, the Nacos instance is in the monitor mode. When the TPS threshold is reached, subsequent requests are still allowed.	Upgrade the instance configuration or scale out nodes, and check whether the high TPS is caused by incorrect usage. For example, a Nacos API operation is frequently called or a Nacos client is created multiple times.
Nacos:TPSRateLimit	Nacos Client TPS Throttling	Critical	Nacos:TPSRateLimit:Executing	When the client accesses the Nacos engine, the TPS is excessively high. To ensure stable operations of the Nacos engine, the system performs throttling on requests. When the TPS threshold is reached, subsequent requests are rejected.	Upgrade the instance configuration or scale out nodes, and check whether the high TPS is caused by incorrect usage. For example, a Nacos API operation is frequently called or a Nacos client is created multiple times.
Nacos:ConfigCapLimit	Nacos Configuration Limiting	Warning	Nacos:ConfigCapLimit:Normal	The number of configurations in a Nacos instance exceeds the default upper limit 10,000. The system is about to perform throttling on new configurations. In this case, the Nacos instance is in the monitor mode. New configurations can still be created and released.	Delete unused configurations at the earliest opportunity and check whether a large number of configurations are automatically released by the application.
Nacos:ConfigCapLimit	Nacos Configuration Limiting	Critical	Nacos:ConfigCapLimit:Executing	The number of default configurations in a Nacos instance exceeds 10,000, and the system no longer allows users to create configurations. As a result, new configurations cannot be released.	Delete unused configurations at the earliest opportunity and check whether a large number of configurations are automatically released by the application.
Nacos:ServiceCapLimit	Nacos Service Limiting	Warning	Nacos:ServiceCapLimit:Normal	An excessive number of services are registered, and the storage upper limit is reached. To ensure stable operations of the Nacos engine, the system is about to perform throttling on service registrations. In this case, the Nacos instance is in the monitor mode. New services or service providers can still be registered with the Nacos instance.	Upgrade the instance configuration or scale out nodes, and check whether invalid service publish occurs, repeated service publish occurs, or a Nacos client is created multiple times.
Nacos:ServiceCapLimit	Nacos Service Limiting	Critical	Nacos:ServiceCapLimit:Executing	An excessive number of services are registered, and the storage upper limit is reached. To ensure stable operations of the Nacos engine, the system no longer allows service registration. As a result, new services or service providers cannot be registered with the Nacos engine.	Upgrade the instance configuration or scale out nodes, and check whether invalid service publish occurs, repeated service publish occurs, or a Nacos client is created multiple times.
Nacos:ConnectionCapLimit	Nacos Connection Limiting	Warning	Nacos:ConnectionCapLimit:Normal	The number of clients that are connected to the Nacos engine reaches the upper limit of the instance. To ensure stable operations of the Nacos engine, the system is about to perform throttling on new connections. In this case, the Nacos instance is in the monitor mode. New Nacos clients can still be connected to the Nacos engine.	Upgrade the instance configuration or scale out nodes, and check whether incorrect usage exists. For example, the Nacos client is created multiple times, or connection leaks occur due to the Nacos client enabled.
Nacos:ConnectionCapLimit	Nacos Connection Limiting	Critical	Nacos:ConnectionCapLimit:Executing	The number of clients that are connected to the Nacos engine reaches the upper limit of the instance. To ensure stable operations of the Nacos engine, the system no longer allows the establishment of new connections. As a result, new Nacos clients cannot be connected to the Nacos engine.	Upgrade the instance configuration or scale out nodes, and check whether incorrect usage exists. For example, the Nacos client is created multiple times, or connection leaks occur due to the Nacos client enabled.
Nacos:ParametersIllegalDenied	Invalid Request Parameters for Nacos Instances	Warning	Nacos:ParametersIllegalDenied:Normal	The request parameters sent by the client to the Nacos instance are invalid. The server is about to reject invalid parameters. In this case, the Nacos instance is in the monitor mode. Normal results can still be returned.	Check whether request parameters of the client are valid.
Nacos:ParametersIllegalDenied	Invalid Request Parameters for Nacos Instances	Critical	Nacos:ParametersIllegalDenied:Executing	The request parameters sent by the client to the Nacos instance are invalid. The server rejects the invalid parameters and returns an error code.	Check whether request parameters of the client are valid.
Nacos:CPUAlert	Excessively High CPU Load in Nacos Instances	Warning	Nacos:CPUAlert:Executing	The CPU load of a Nacos instance is excessively high and the instance is overloaded. This may cause slow responses or even no responses to subsequent requests.	Upgrade the instance configuration or add instance nodes, and check whether the high CPU load is caused by improper use of the instance.
Nacos:MemoryAlert	Excessively High Memory Usage in Nacos Instances	Warning	Nacos:MemoryAlert:Executing	The memory usage of a Nacos instance is excessively high, which may cause an out of memory (OOM) killer to run. This may cause the nodes of the Nacos instance to restart.	Upgrade the instance configuration.
Nacos:FullGc	Full GCs of Nacos Instances	Warning	Nacos:FullGc:Executing	The capacity of a Nacos instance becomes insufficient due to frequent full garbage collections (GCs). This may cause slow responses or even no responses to subsequent requests.	Upgrade the instance configuration or add instance nodes, and check whether the capacity is insufficient due to improper use of the instance.
Nacos:TooManyConfigAlert	Excessive Configurations for Nacos Instances	Warning	Nacos:TooManyConfigAlert	The number of configurations that you attempt to create for a Nacos instance exceeds 10,000. The extra configurations cannot be created. New configurations may fail to be released.	Delete unnecessary configurations.
Nacos:TooManyConfigLongPolling	Excessive Long Pollings Configured for Nacos Instances	Warning	Nacos:TooManyConfigLongPolling	The number of listeners configured for a Nacos instance is excessively large, which results in an overload issue. Full GCs may be frequently triggered, which may cause slow responses or even no responses to subsequent requests.	Upgrade the instance configuration or add instance nodes, and check whether the capacity is insufficient due to improper use of the instance.
Nacos:ProviderDropTooFast	Sudden Decrease of Providers on Nacos Instances	Warning	Nacos:ProviderDropTooFast:Executing	The number of service providers that are registered with a Nacos instance at the current time is decreased by more than 50% compared with 3 minutes ago. When this event is generated, the upstream service may fail to identify the downstream service providers.	Check whether a large number of applications fail or network failures occur. If the event is generated due to the expected release of a large number of applications, you can ignore the event.

Events of Zookeeper

Event code	Event name	Event severity	CloudMonitor event name	Event description and impact	Event handling suggestion
ZooKeeper:ExceedEphemeralLimit	Ephemeral Nodes Created by a Single Session in ZooKeeper Exceed Limit (Limit = 2000)	Critical	ZooKeeper:ExceedEphemeralLimit	The number of ephemeral nodes that are created by a single session is excessively large. This may cause a failure to add ephemeral nodes to the session.	Check whether the number of ephemeral nodes that are created is excessively large based on business code. If you use Dubbo, check the Dubbo version. If the version is out of date, upgrade Dubbo to the latest version. Add the Jute.Maxbuffer parameter to ensure successful recovery from failures.
ZooKeeper:FullGc	Full GCs of ZooKeeper Instances	Warning	ZooKeeper:CMSGc:Executing	Full GCs take a long time. This may increase processing latency and cause response timeouts. As a result, nodes of the ZooKeeper instance are recovered from a failure and restarted.	Check the ZooKeeper version. If the version is out of date, upgrade ZooKeeper to the latest version. This event is generated due to insufficient capacity. If this event persists, we recommend that you add nodes to the instance.
ZooKeeper:CPUAlert	Excessively High CPU Load in ZooKeeper Instances	Warning	ZooKeeper:CPUAlert:Executing	The CPU load is excessively high. This may increase processing latency and cause response timeouts. As a result, nodes of the ZooKeeper instance are recovered from a failure and restarted.	This event is generated due to insufficient capacity. If this event persists, we recommend that you add nodes to the instance.

Events of cloud-native gateways

Event code	Event name	Event severity	CloudMonitor event name	Event description and impact	Event handling suggestion
Gateway:ElasticScaleOut	Elastic Scale-out of Cloud-native Gateways	Warning	Gateway:ElasticScaleOut:Failed	When an elastic scale-out operation is performed on a cloud-native gateway, the cloud-native gateway generates this event based on the scale-out status.	Handle the event based on the event details in the console.
		Info	Gateway:ElasticScaleOut:Executing		Take note of the event information if necessary.
		Info	Gateway:ElasticScaleOut:Executed		Take note of the event information if necessary.
Gateway:ElasticScaleIn	Elastic Scale-in of Cloud-native Gateways	Warning	Gateway:ElasticScaleIn:Failed	When an elastic scale-in operation is performed on a cloud-native gateway, the cloud-native gateway generates this event based on the scale-in status.	Handle the event based on the event details in the console.
		Info	Gateway:ElasticScaleIn:Executing		Take note of the event information if necessary.
		Info	Gateway:ElasticScaleIn:Executed		Take note of the event information if necessary.
Gateway:ElasticOpen	Enable Elastic Scale-out of Cloud-native Gateways	Warning	Gateway:ElasticOpen:Failed	When you attempt to enable elastic scale-out, cloud-native gateways generate this event based on the enabling result.	Handle the event based on the event details in the console.
Gateway:ElasticOpen	Enable Elastic Scale-out of Cloud-native Gateways	Info	Gateway:ElasticOpen:Executed		Take note of the event information if necessary.
Gateway:ElasticClose	Disable Elastic Scale-out of Cloud-native Gateways	Warning	Gateway:ElasticClose:Failed	When you attempt to disable elastic scale-out, cloud-native gateways generate this event based on the disabling result.	Handle the event based on the event details in the console.
Gateway:ElasticClose	Disable Elastic Scale-out of Cloud-native Gateways	Info	Gateway:ElasticClose:Executed		Take note of the event information if necessary.
Gateway:ElasticStrategyUpdate	Elasticity Policy Update of Cloud-native Gateways	Warning	Gateway:ElasticStrategyUpdate:Failed	When elastic scale-out is enabled, cloud-native gateways update the elastic scale-out policy and generate this event based on the update result.	Handle the event based on the event details in the console.
Gateway:ElasticStrategyUpdate	Elasticity Policy Update of Cloud-native Gateways	Info	Gateway:ElasticStrategyUpdate:Executed		Take note of the event information if necessary.

Microservices Governance

Event code	Event name	Event severity	CloudMonitor event name	Event description and impact	Event handling suggestion
Governance:FlowTrafficBlock	Throttling Rules Triggered for Applications	Warning	Governance:FlowTrafficBlock:Executing	This event is generated when the interface throttling rule configured for the application that is enabled with Microservices Governance is triggered.	Burst traffic occurs on the interface. We recommend that you continue to observe the traffic.
Governance:CircleBreakerTrafficBlock	Circuit Breaking Rules Triggered for Applications	Warning	Governance:CircleBreakerTrafficBlock:Executing	This event is generated when the circuit breaking rule configured for the application for which Microservices Governance is enabled is triggered.	If the performance of the downstream interface is unstable, such as slow calls and abnormal services, we recommend that you check dependencies of the downstream interface.
Governance:ParamsTrafficBlock	Hotspot Parameter Protection Rules Triggered for Applications	Warning	Governance:ParamsTrafficBlock:Executing	This event is generated when the hotspot parameter protection rule configured for the application that is enabled with Microservices Governance is triggered.	If hotspot parameters are called, we recommend that you prefetch the relevant data.
Governance:GracefulShutdown	Graceful Shutdown of Microservices	Info	Governance:GracefulShutdown	This event is generated when a microservice application is gracefully shut down. After Microservices Governance is enabled for the application, the graceful shutdown feature is automatically enabled.	Take note of the event information if necessary.
Governance:RegistrationFinish	Microservices Registration Finished	Info	Governance:RegistrationFinish	This event is generated when a microservice application is registered and started after the graceful start feature is enabled.	Take note of the event information if necessary.
Governance:WarmupStart	Microservices Prefetching Started	Info	Governance:WarmupStart	This event is generated when a new instance of the application enabled with Microservices Governance starts to prefetch traffic after the graceful startup feature is enabled.	Take note of the event information if necessary.
Governance:WarmupEnd	Microservices Prefetching Completed	Info	Governance:WarmupEnd	This event is generated when traffic of new instances of the application enabled with Microservices Governance is completely prefetched after the graceful startup feature is enabled.	Take note of the event information if necessary.
Governance:ReadinessFinish	Microservice Application Passed Kubernetes Readiness Check	Info	Governance:ReadinessFinish	This event is generated when a Kubernetes readiness probe is completed for the application enabled with Microservices Governance after the graceful startup feature is enabled.	Take note of the event information if necessary.
Governance:GracefulShutdown	Manually Disable Microservices	Info	Governance:GracefulShutdown	This event is generated when an application instance is manually shut down on the Node details page of Microservices Governance and the operation takes effect.	Take note of the event information if necessary.
Governance:RegistrationFinish	Manually Enable Microservices	Info	Governance:RegistrationFinish	This event is generated when an application instance is manually started on the Node details page of Microservices Governance and the operation takes effect.	Take note of the event information if necessary.
Governance:ApplicationStop	Microservice Application Shutdown	Info	Governance:ApplicationStop	This event is generated when the graceful shutdown process is completely performed on the application enabled with Microservices Governance after the graceful shutdown feature is enabled and before the application stops.	Take note of the event information if necessary.