Use htop and sar to identify CPU, memory, and I/O bottlenecks that cause high load on Linux instances. Optimize performance by using code refactoring and improvements, resource configuration adjustments, and alert monitoring. -

Symptoms

Slow response: Secure Shell Protocol (SSH) commands are delayed. Website or API access is slow or times out.
High metrics: CPU, memory, and disk I/O metrics consistently exceed 80%.
Service interruption: The system terminates critical processes due to an out-of-memory (OOM) error, and the instance automatically restarts.
Logon failure: SSH connections are refused.

Application issues: The application code has performance bottlenecks or memory leaks.
Traffic spikes: Concurrent access exceeds the processing capacity of the instance.
I/O bottleneck: Disk read and write operations are saturated, which causes high CPU iowait.

Log on to an ECS instance using a VNC connection.
1. Go to ECS console - Instances. In the top navigation bar, select the target region and resource group.
2. Go to the details page of the target instance. Click Connect and select VNC. Enter the username and password to log on to the ECS instance.
Install and run htop.
```
sudo yum install -y htop
htop
```
Analyze the output in the htop interface.
- To find processes with high CPU consumption, press the F6 key and sort by PERCENT_CPU in descending order.
- To find processes with high memory consumption, press the F6 key and sort by PERCENT_MEM in descending order.

After you use htop to identify a symptom, use sar to obtain quantitative data and confirm whether the bottleneck is CPU, memory, or I/O.

Install and enable sysstat.

sudo yum install -y sysstat
systemctl start sysstat && systemctl enable sysstat

For application processes with high CPU consumption:
- Code optimization: Use tools such as perf (C/C++) and jstack (Java) to identify and optimize hot spot code.
- Logic optimization: Check for and fix inefficient operations such as infinite loops and SQL queries that perform full table scans.
For insufficient memory or frequent swapping:
- Investigate leaks: Use tools such as valgrind (C/C++) and jmap (Java) to analyze memory leaks.
- Adjust configurations: Configure application memory parameters, such as the -Xms and -Xmx parameters for a Java Virtual Machine (JVM).
- Upgrade resources: Increase physical memory by changing the instance type. For more information, see Overview of instance type changes.
High disk I/O: For more information, see Troubleshoot high disk I/O load on Linux systems.

Configure monitoring and alerts: Set alert thresholds for key metrics such as CPU, memory, load, and disk to receive early warnings.
Plan for Auto Scaling: For workloads with fluctuations, such as web applications, configure Auto Scaling policies to automatically add or remove instances in response to traffic changes.