Serverless App Engine:Observable scaling policy

Last Updated:May 08, 2025

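To help you optimize cloud resource utilization, the Change Trend of Instance Quantity With Scaling Metrics chart shows how the number of instances changed alongside the scaling metrics over a specified period. This gives you a clear view of how computing resources were automatically adjusted based on predefined rules.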

Prerequisites

Procedure

  1. Log on to the SAE console. In the left-side navigation pane, choose Applications > Applications. On the page that appears, select a region in the top navigation bar and a namespace from the Namespace drop-down list, and then click the desired application name.

  2. On the Basic Information page of the application, click the Auto Scaling tab.

  3. On the Auto Scaling tab, go to the Change Trend of Instance Quantity With Scaling Metrics section and set the time range to view the trend within the specified period.

    • CPU Utilization

      image

      The chart above provides a detailed view of CPU utilization over the selected period, including six key parameters:

      • CPU Utilization: The percentage of CPU resources in use, a critical indicator of system load.

      • Application Instances: The number of instances deployed for application operations.

      • Scale-out Interval: The maximum increase in application instances when CPU utilization surpasses the predefined threshold.

      • Scale-in Interval: The maximum reduction in application instances when CPU utilization drops to the predefined threshold.

      • Expected Value: The scale-out and scale-in thresholds established by the Horizontal Pod Autoscaler (HPA).

      • Scaling Description: A description of the factors that caused the instance count to change.

      Note
      • These parameters are configured during HPA setup. Parameters that are not configured are not displayed on the trend chart.

      • The same applies to the charts of other metrics that are configured with HPA.

    • Memory Usage

      image

      The chart above provides a detailed view of memory usage over the selected period, indicating the ratio of used memory to total available memory.

    • Active TCP Connections

      image

      The chart above provides a detailed view of the active TCP connections over the selected period, indicating the number of TCP connections processed by the server or application.

    • Service Requests

      image

      The chart above provides a detailed view of service requests over the selected period, indicating the number of requests received by the application or service.

    • Average Response Time

      image

      The chart above provides a detailed view of the average response time over the selected period, indicating the time taken on average for the service to process requests.
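
To help interpret the Expected Value, Scale-out Interval, and Scale-in Interval parameters on the CPU Utilization chart, the following is a minimal sketch of how a target-based scaling decision could be computed. It assumes the standard Kubernetes HPA ratio formula (desired = ceil(current × metric / target)) and treats the two intervals as per-step limits, as described above. This document does not confirm that SAE uses exactly this calculation, and the function name, parameter names, and default bounds are hypothetical.

```python
import math


def desired_instances(current, metric_value, expected_value,
                      max_scale_out, max_scale_in,
                      min_instances=1, max_instances=50):
    """Estimate the next instance count for one scaling evaluation.

    Hypothetical helper, not an SAE API:
      current        - number of running application instances
      metric_value   - observed metric, e.g. average CPU utilization (%)
      expected_value - target threshold for the metric (Expected Value)
      max_scale_out  - largest allowed increase per step (Scale-out Interval)
      max_scale_in   - largest allowed decrease per step (Scale-in Interval)
    """
    if current <= 0 or expected_value <= 0:
        raise ValueError("current and expected_value must be positive")

    # Standard Kubernetes HPA ratio formula (tolerance band omitted for brevity).
    raw = math.ceil(current * metric_value / expected_value)

    # Limit how far a single step may move the instance count.
    stepped = max(current - max_scale_in, min(current + max_scale_out, raw))

    # Keep the result within the overall instance bounds.
    return max(min_instances, min(max_instances, stepped))


if __name__ == "__main__":
    # 4 instances at 90% CPU with a 60% target would ideally become 6,
    # but a scale-out limit of 1 caps this step at 5.
    print(desired_instances(4, 90.0, 60.0, max_scale_out=1, max_scale_in=1))
```

Under these assumptions, a sharp rise in the metric line on the chart followed by a smaller rise in the instance line is what you would expect to see when the scale-out limit caps a single scaling step.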