Network failure is a common fault scenario for Elastic Compute Service (ECS) instances. Causes include hardware link abnormalities, carrier network fluctuations, and system configuration issues, any of which can break network connections and leave an ECS instance unavailable for an extended period. A drill for this scenario tests the monitoring and recovery capabilities of your business when one of its nodes becomes unavailable.
Implementation
A network outage behaves like 100% packet loss, so this drill uses the Cloud Assistant plug-in ACS-ECS-NetLoss with the packet loss ratio set to rate=100. For more information about network packet loss, see Network packet loss drill.
The following measures help keep the ECS instance accessible during the drill:
Configure destination IP addresses.
Use Cloud Assistant for command injection and recovery.
Implement a timeout exit mechanism.
Restart instances in the ECS console.
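The timeout exit mechanism in the list above can be illustrated with a minimal sketch. It does not show how ACS-ECS-NetLoss is implemented. The function name run_fault_with_timeout and its parameters are hypothetical, and the inject and recover callables stand in for whatever commands actually start and stop the fault. The idea is that recovery always runs, either when the timeout expires or when an operator stops the drill early.

```python
import threading
from typing import Callable, Optional


def run_fault_with_timeout(
    inject: Callable[[], None],
    recover: Callable[[], None],
    timeout_seconds: float,
    stop_event: Optional[threading.Event] = None,
) -> str:
    """Inject a fault, then recover on timeout or on an early stop signal.

    All names here are hypothetical; inject and recover are placeholders
    for the real fault injection and recovery commands.
    Returns "manual" if stop_event was set before the timeout,
    otherwise "timeout".
    """
    if stop_event is None:
        stop_event = threading.Event()

    try:
        inject()
        # Block until the stop signal arrives or the timeout expires.
        # Event.wait returns True only if the event was set.
        stopped_early = stop_event.wait(timeout=timeout_seconds)
    finally:
        # Recovery runs even if injection or waiting raises an error,
        # so the fault cannot outlive the drill.
        recover()

    return "manual" if stopped_early else "timeout"
```

Because recovery sits in a finally clause, the fault is removed even if the drill script fails partway through. Restarting the instance in the ECS console remains the fallback if the script itself cannot run.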