This topic describes the billable items and information about computing resources of E-MapReduce (EMR) Serverless Spark.
Billable items
The billable items of EMR Serverless Spark are computing resources. You are charged for the actual usage of computing resources based on the pay-as-you-go billing method. The unit is compute unit (CU). You can use a computing resource plan to offset the fees generated for your actual usage of computing resources. For more information, see Resource plan.
Computing resources
CU
A CU is the basic unit of computing capabilities in EMR Serverless Spark workspaces and is billed by minute. The unit price of a CU depends on the CPU architecture of an EMR Serverless Spark workspace and the high availability attribute of the zone. By default, the Intel x86 architecture and a single zone are used. The unit price of a CU also varies based on regions.
Measurement method
CUs reflect the CPU compute power of the underlying system of Serverless Spark. The number of CUs consumed by a computing task varies based on the actual amount of data processed by the task, the computing complexity, the distribution of data, and whether you enable the Fusion engine. If you enable the Fusion engine for acceleration, no additional resource fees are generated, but the time required to run a job is reduced by more than 30% in most cases, which offers a higher cost-performance ratio. You can estimate the number of CUs that you need to purchase based on your business scale and the amount of data that you want to process. By default, one CU is equal to 1 CPU core and 4 GiB of memory. If the ratio of CPU cores to memory is not 1:4, you can calculate the number of CUs based on promotional discounts for CUs. For more information, see Notice on promotional discounts for CUs.
The following table describes the processing capability of one CU.
Scenario | Processing capability (Java Runtime) | Processing capability (Fusion engine) |
Simple data processing, such as filtering and cleansing | One CU can process about 2,000,000 data records per second. | One CU can process about 5,000,000 data records per second. |
Complex data processing, such as aggregation, join, and string-related operations | One CU can process about 700,000 data records per second. | One CU can process about 2,000,000 data records per second. |