Auto Scaling, elastic observable scaling policy - Serverless App Engine

To help you optimize cloud resource utilization, the Change Trend of Instance Quantity With Scaling Metrics chart shows how computing resources were automatically adjusted under your predefined rules during a specified period.

Prerequisites

An application is deployed. For more information, see the topics in the Deploy applications category.
An Auto Scaling policy has been created and enabled. For more information, see Configure an Auto Scaling policy.

Procedure

Log on to the SAE console. In the left-side navigation pane, choose Applications > Applications. On the page that appears, select a region in the top navigation bar and a namespace from the Namespace drop-down list, and then click the desired application name.
On the Basic Information page of the application, click the Auto Scaling tab.
On the Auto Scaling tab, go to the Change Trend of Instance Quantity With Scaling Metrics section and set a time range to view the trend for that period.
- CPU Utilization
  The chart shows CPU utilization over the selected period and includes six key parameters:
  - CPU Utilization: The share of CPU resources in use, a key indicator of system load.
  - Application Instances: The number of instances running the application.
  - Scale-out Interval: The maximum number of application instances that can be added when CPU utilization exceeds the predefined threshold.
  - Scale-in Interval: The maximum number of application instances that can be removed when CPU utilization drops to the predefined threshold.
  - Expected Value: The scale-out and scale-in thresholds set in the HPA configuration.
  - Scaling Description: A description of what caused each change in instance count.
  Note
  - These parameters are configured when you set up HPA. Parameters that are not configured are not displayed on the trend chart.
  - The same applies to the charts of any other metrics that are configured with HPA.
- Memory Usage
  The chart shows memory usage over the selected period, expressed as the ratio of used memory to total available memory.
- Active TCP Connections
  The chart shows the active TCP connections over the selected period, that is, the number of TCP connections handled by the server or application.

- Service Requests
  The chart shows service requests over the selected period, that is, the number of requests received by the application or service.
- Average Response Time
  The chart shows the average response time over the selected period, that is, the average time the service takes to process a request.
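
To make the relationship between the chart parameters easier to follow, the following Python sketch shows how a metric value, the Expected Value, and the Scale-out and Scale-in Intervals could combine into a new instance count. It is an illustration only: it assumes the standard Kubernetes HPA ratio rule (current instances multiplied by the metric value divided by the expected value, rounded up) and omits details such as tolerance and cooldown. The function name `desired_instances` and the `min_instances` and `max_instances` bounds are hypothetical and are not part of SAE.

```python
import math


def desired_instances(current, metric_value, expected_value,
                      scale_out_interval, scale_in_interval,
                      min_instances=1, max_instances=50):
    """Suggest an instance count using a Kubernetes-style HPA ratio rule.

    Assumption: SAE's real algorithm is not documented here; this only
    mirrors the generic formula ceil(current * metric / expected).
    """
    if current <= 0 or expected_value <= 0:
        raise ValueError("current and expected_value must be positive")

    # Raw target: scale proportionally to how far the metric is from the target.
    raw = math.ceil(current * metric_value / expected_value)

    # Limit a single adjustment to the configured scale-out / scale-in step.
    upper = current + scale_out_interval
    lower = current - scale_in_interval
    stepped = max(lower, min(upper, raw))

    # Keep the result inside the (hypothetical) instance bounds.
    return max(min_instances, min(max_instances, stepped))


if __name__ == "__main__":
    # 4 instances at 90% CPU with an expected value of 60%.
    print(desired_instances(4, 90, 60, scale_out_interval=2, scale_in_interval=1))
    # 4 instances at 20% CPU with the same expected value.
    print(desired_instances(4, 20, 60, scale_out_interval=2, scale_in_interval=1))
```

In the first call the ratio rule asks for 6 instances, which is within the scale-out step of 2. In the second call the rule asks for 2 instances, but the scale-in step of 1 limits the result to 3. Steps like these would appear on the trend chart as changes in the Application Instances line.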