Platform For AI: Appendix: Capacity and QPS of dedicated gateways

Last Updated: Sep 18, 2024

This topic describes the capacity thresholds and queries per second (QPS) performance benchmarks for dedicated gateways of various specifications. We recommend that you choose a gateway specification based on both the capacity thresholds and the QPS benchmarks. In most cases, a specification that meets one of these requirements also meets the other. In some scenarios, however, a specification meets only one of them. In that case, upgrade the specification until both align with your business requirements.

Capacity threshold

The following table lists the capacity thresholds for a single gateway node, by gateway specification. If the gateway capacity metrics stay below the warning thresholds, your service-level agreement (SLA) requirements can be fully met. For key business workloads, we recommend that you keep the gateway capacity metrics below the security thresholds to ensure system stability.

Note
  • Security thresholds: Below these values, a gateway maintains high throughput and low latency even if traffic doubles.

  • Warning thresholds: When usage reaches these values, gateway latency may increase, and traffic surges may cause stability risks.

  • A gateway with a single node may not meet your SLA objective. Deploy at least two nodes for your gateway. If a gateway has two or more nodes, the security and warning thresholds are calculated per node, based on the node specification.

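| Gateway specifications | Threshold | 2 cores and 4 GiB of memory | 4 cores and 8 GiB of memory | 8 cores and 16 GiB of memory | 16 cores and 32 GiB of memory |
|---|---|---|---|---|---|
| Number of nodes | | 1 | 1 | 1 | 1 |
| Number of client connections | Security threshold | 12,000 | 24,000 | 48,000 | 96,000 |
| Number of client connections | Warning threshold | 24,000 | 48,000 | 96,000 | 192,000 |
| Number of new HTTPS connections per second | Security threshold | 400 | 800 | 1,600 | 3,200 |
| Number of new HTTPS connections per second | Warning threshold | 800 | 1,600 | 3,200 | 6,400 |
| Network bandwidth (Gbit/s) | Security threshold | 1 | 2 | 4 | 8 |
| Network bandwidth (Gbit/s) | Warning threshold | 1 | 2 | 4 | 8 |
| CPU utilization | Security threshold | 30% | 30% | 30% | 30% |
| CPU utilization | Warning threshold | 60% | 60% | 60% | 60% |
| Memory usage | Security threshold | 75% | 75% | 75% | 75% |
| Memory usage | Warning threshold | 75% | 75% | 75% | 75% |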
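
The per-node thresholds above could be encoded for an automated check roughly as follows. This is a minimal sketch, not part of any Platform For AI API: the dictionary keys, the `classify` function, and its return strings are hypothetical names chosen for illustration. Only the numbers come from the capacity table.

```python
# Per-node thresholds copied from the capacity table: (security, warning).
# All names here are illustrative assumptions, not a real API.
THRESHOLDS = {
    "2c4g": {
        "client_connections": (12_000, 24_000),
        "new_https_connections_per_second": (400, 800),
        "bandwidth_gbps": (1, 1),
        "cpu_percent": (30, 60),
        "memory_percent": (75, 75),
    },
    "4c8g": {
        "client_connections": (24_000, 48_000),
        "new_https_connections_per_second": (800, 1_600),
        "bandwidth_gbps": (2, 2),
        "cpu_percent": (30, 60),
        "memory_percent": (75, 75),
    },
    "8c16g": {
        "client_connections": (48_000, 96_000),
        "new_https_connections_per_second": (1_600, 3_200),
        "bandwidth_gbps": (4, 4),
        "cpu_percent": (30, 60),
        "memory_percent": (75, 75),
    },
    "16c32g": {
        "client_connections": (96_000, 192_000),
        "new_https_connections_per_second": (3_200, 6_400),
        "bandwidth_gbps": (8, 8),
        "cpu_percent": (30, 60),
        "memory_percent": (75, 75),
    },
}


def classify(spec: str, metric: str, value: float) -> str:
    """Compare one per-node metric reading against the table thresholds."""
    security, warning = THRESHOLDS[spec][metric]
    if value < security:
        return "below security threshold"
    if value < warning:
        return "below warning threshold"
    return "at or above warning threshold"
```

For example, `classify("4c8g", "client_connections", 30_000)` returns `"below warning threshold"`, because 30,000 lies between the security threshold (24,000) and the warning threshold (48,000) for that specification. Where the two thresholds are equal, as for bandwidth and memory, a reading is either below both or at the warning level.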

QPS performance

The QPS throughput of a gateway is affected by various factors, such as the response size, HTTPS support, and GZIP compression. The following table lists conservative, worst-case QPS values for a gateway whose CPU utilization is less than 30%.

Note
  • Establishing new HTTPS connections consumes significant CPU resources. In scenarios where many concurrent HTTPS connections are established at the same time, you can assess gateway capacity based on the Number of short-lived connections rows of the following table.

  • GZIP compression is available only to users on a whitelist. To use the feature, submit a ticket.

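QPS values when the CPU utilization is less than 30%, by gateway specification and number of nodes:

| Connection type | Response size (KBytes) | HTTPS support | GZIP compression | 2 cores and 4 GiB, 3 nodes | 2 cores and 4 GiB, 5 nodes | 4 cores and 8 GiB, 3 nodes | 4 cores and 8 GiB, 5 nodes | 8 cores and 16 GiB, 3 nodes | 8 cores and 16 GiB, 5 nodes | 16 cores and 32 GiB, 3 nodes | 16 cores and 32 GiB, 5 nodes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of short-lived connections | 1 | No | No | 5,200 | 8,700 | 10,500 | 17,500 | 21,000 | 35,000 | 42,000 | 70,000 |
| Number of short-lived connections | 1 | Yes | No | 1,600 | 2,700 | 3,200 | 5,500 | 6,500 | 11,000 | 13,000 | 22,000 |
| Number of persistent connections | 1 | No | No | 6,500 | 10,800 | 13,000 | 21,700 | 26,000 | 43,500 | 52,000 | 87,000 |
| Number of persistent connections | 1 | Yes | No | 6,000 | 10,000 | 12,000 | 20,000 | 24,000 | 40,000 | 48,000 | 80,000 |
| Number of persistent connections | 1 | Yes | Yes | 5,200 | 8,700 | 10,500 | 17,500 | 21,000 | 35,000 | 42,000 | 70,000 |
| Number of persistent connections | 10 | No | No | 5,600 | 9,300 | 11,200 | 18,700 | 22,500 | 37,500 | 45,000 | 75,000 |
| Number of persistent connections | 10 | Yes | No | 5,300 | 9,000 | 10,700 | 18,000 | 21,500 | 36,000 | 43,000 | 72,000 |
| Number of persistent connections | 10 | Yes | Yes | 3,100 | 5,200 | 6,200 | 10,500 | 12,500 | 21,000 | 25,000 | 42,000 |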
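
To shortlist specifications for a given traffic profile, the QPS table can be turned into a simple lookup. The sketch below is illustrative only: the names `COLUMNS`, `QPS`, and `candidates`, and the short labels such as `"2c4g"`, are assumptions made for this example. The QPS figures are copied from the table.

```python
# Table columns in order: (specification, number of nodes).
COLUMNS = [
    ("2c4g", 3), ("2c4g", 5),
    ("4c8g", 3), ("4c8g", 5),
    ("8c16g", 3), ("8c16g", 5),
    ("16c32g", 3), ("16c32g", 5),
]

# Key: (connection type, response size in KBytes, HTTPS, GZIP).
# Value: worst-case QPS at less than 30% CPU, one entry per column.
QPS = {
    ("short", 1, False, False): [5200, 8700, 10500, 17500, 21000, 35000, 42000, 70000],
    ("short", 1, True, False): [1600, 2700, 3200, 5500, 6500, 11000, 13000, 22000],
    ("persistent", 1, False, False): [6500, 10800, 13000, 21700, 26000, 43500, 52000, 87000],
    ("persistent", 1, True, False): [6000, 10000, 12000, 20000, 24000, 40000, 48000, 80000],
    ("persistent", 1, True, True): [5200, 8700, 10500, 17500, 21000, 35000, 42000, 70000],
    ("persistent", 10, False, False): [5600, 9300, 11200, 18700, 22500, 37500, 45000, 75000],
    ("persistent", 10, True, False): [5300, 9000, 10700, 18000, 21500, 36000, 43000, 72000],
    ("persistent", 10, True, True): [3100, 5200, 6200, 10500, 12500, 21000, 25000, 42000],
}


def candidates(profile, target_qps):
    """Return (spec, nodes, qps) for every column whose QPS covers target_qps."""
    return [
        (spec, nodes, qps)
        for (spec, nodes), qps in zip(COLUMNS, QPS[profile])
        if qps >= target_qps
    ]
```

For a profile of persistent HTTPS connections with 1 KByte responses and no GZIP, `candidates(("persistent", 1, True, False), 15000)` returns five options, the first of which is `("4c8g", 5, 20000)`. An empty list means that no benchmarked combination reaches the target. A shortlisted specification still needs to be checked against the capacity thresholds, because meeting the QPS benchmark does not always imply meeting the capacity requirement.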