Lists new features, component releases, and platform updates for E-MapReduce (EMR) on ECS. For version details, see Overview.
For more information about the versions, see Overview.
2025
October
| Feature | Description | Release date | References |
|---|---|---|---|
| EMR V5.21 and EMR V3.55 | Optimizes Hive, Spark, Tez, and Ranger components and fixes issues from previous versions. | 2025-10-27 | EMR-5.21.x, EMR-3.55.x |
July
| Feature | Description | Release date | References |
|---|---|---|---|
| EMR V5.20 and EMR V3.54 | Fixes issues from previous versions. | 2025-07-10 | Release notes for EMR V5.X, Release notes for EMR V3.X |
April
| Feature | Description | Release date | References |
|---|---|---|---|
| EMR V5.19 and EMR V3.53 | Upgrades multiple service components and fixes issues from previous versions. | 2025-04-24 | EMR v5.19.x, EMR-3.53.x |
2024
December
| Feature | Description | Release date | References |
|---|---|---|---|
| EMR V5.18.1 and EMR V3.52.1 | Updates multiple service versions, fixes issues from previous versions, and discontinues Impala and Kafka. | 2024-12-18 | Release notes for EMR V5.18.X, Release notes for EMR V3.52.X |
| EMR V5.17.4 and EMR V3.51.4 | Updates multiple service versions and fixes issues from previous versions. | 2024-12-18 | Release notes for EMR V5.17.X, Release notes for EMR V3.51.X |
November
| Feature | Description | Release date | References |
|---|---|---|---|
| Release protection | Enable release protection when creating a cluster to prevent resource loss from accidental operations. | 2024-11-28 | Enable and disable release protection |
| Configuration upgrade for pay-as-you-go node groups | Upgrade configurations of pay-as-you-go node groups to handle high load caused by increased business demands. | 2024-11-28 | Upgrade node configurations |
| Elastic resource provisioning for pay-as-you-go node groups | Reserve resources in the Elastic Compute Service (ECS) console in advance, then associate a pay-as-you-go node group with a private pool to ensure stable elastic resource provisioning. | 2024-11-28 | Manage node groups |
| System disk encryption | Bind a customer master key (CMK) from Key Management Service (KMS) to the system disk when creating a cluster to enable encryption. | 2024-11-28 | Enable system disk encryption |
| Bootstrap action script optimization | Adds a new execution timing — after service installation and before service startup — for bootstrap action scripts. | 2024-11-28 | Use bootstrap actions to execute scripts |
October
| Feature | Description | Release date | References |
|---|---|---|---|
| Managed auto scaling | Continuously monitors YARN load in a cluster. Set the maximum and minimum number of task nodes, and the system automatically adjusts the count based on load to maximize resource utilization. | 2024-10-22 | Add managed auto scaling rules |
August
| Feature | Description | Release date | References |
|---|---|---|---|
| Monitoring and diagnostics | Built on a large model and incorporating the knowledge and experience of the Alibaba Cloud EMR team in open source big data, EMR observability, and technical expert diagnostics. Provides real-time health diagnostics to identify abnormal clusters and troubleshoot issues based on diagnostic results. Includes daily cluster reports with global optimization suggestions to reduce O&M costs. | 2024-08-20 | Initiate health diagnostics |
| Cluster cloning optimization | Clones modified service configurations, added node groups, and configured auto scaling rules from cluster creation or cluster use to a new cluster, making it faster to replicate an existing cluster's full setup. | 2024-08-20 | Clone a cluster |
| Up to four security groups per node group | Associate up to four security groups with a node group for flexible access control on ECS instances in a cluster. | 2024-08-20 | Manage node groups |
June
| Feature | Description | Release date | References |
|---|---|---|---|
| Auto-renewal during scale-out | Enable auto-renewal when scaling out an EMR cluster so that newly added nodes renew automatically, reducing asynchronous operations. Modify the renewal duration or disable auto-renewal on the Auto-renewal page. | 2024-06-19 | Scale out an EMR cluster |
| Pay-as-you-go to subscription switchover at the node group level | Switch the billing method of core, task, or gateway node groups in a subscription cluster from pay-as-you-go to subscription for more flexible resource billing. | 2024-06-19 | Switch from pay-as-you-go to subscription |
| Master-Extend node groups | Create Master-Extend node groups and deploy Spark, Hive, and Kyuubi components on them based on business requirements. Configurations sync automatically to nodes that need them, reducing load on the master node group. | 2024-06-19 | Manage node groups |
March
| Feature | Description | Release date | References |
|---|---|---|---|
| OSS-HDFS bucket management in the EMR console | Create Object Storage Service (OSS)-HDFS buckets when creating a cluster, and view storage overviews and object lists on the Services tab without switching to the OSS console. This simplifies bucket usage and prevents misoperations that could make Hadoop Distributed File System (HDFS) unavailable. | 2024-03-14 | Create a cluster |
| Gateway node groups | Add gateway nodes to offload task submission from the master node. Gateway nodes serve as dedicated task submission machines with automatic configuration sync for the task submission environment. | 2024-03-14 | Manage node groups |
| Health check item management | View and modify health check items for cluster nodes and services. EMR checks node and service health against preset items so you can detect and address exceptions early. | 2024-03-14 | Manage health check items |
| Expanded health check items for services and components | Adds more health check items for YARN, HDFS, Hive, Kafka, and ZooKeeper to improve check accuracy for service and component health. | 2024-03-14 | View the health status of services and components |
2023
October
| Feature | Description | Release date | References |
|---|---|---|---|
| Auto scaling rule recommendations | View cluster resource overviews on the Auto Scaling tab. EMR analyzes resource utilization and recommends auto scaling rules for clusters that meet specific conditions, helping improve resource elasticity. | 2023-10-24 | View the overview information about cluster resources |
| Alert rule management | Create and view alert rules for clusters in the EMR console, powered by CloudMonitor. Alerts trigger and CloudMonitor sends notifications when resource metrics meet the configured conditions, so you can identify and handle exceptions early. | 2023-10-24 | Manage alert rules |
| Node health status | View node health status on the Nodes tab to identify abnormal nodes and verify that nodes are running as expected. | 2023-10-24 | View the health status of nodes |
| Disk performance level (PL) configuration | Specify different performance levels for Enhanced SSDs (ESSDs) when creating a cluster or adding a node group to meet varying cluster performance requirements. | 2023-10-24 | Create a cluster |
August
| Feature | Description | Release date | References |
|---|---|---|---|
| Cluster templates | Save an EMR instance configuration as a template and create clusters from it with a few clicks. | 2023-08-29 | Create a cluster template, Create a cluster based on a cluster template |
| Cluster resource overview | View cluster resource utilization on the Auto Scaling tab. EMR analyzes utilization and provides auto scaling rules for qualifying clusters to improve resource elasticity. | 2023-08-29 | View the overview information about cluster resources |
| Node group and node-level configuration visibility | View node group-level and node-level configuration overrides on the Configure tab by selecting Node Group Configuration or Independent Node Configuration from the Default Cluster Configuration drop-down list. | 2023-08-29 | Manage configuration items |
July
| Feature | Description | Release date | References |
|---|---|---|---|
| Auto scaling management | Manage auto scaling rules, view elastic resource usage, and analyze cost allocation from a dedicated auto scaling module. Evaluate cost savings from auto scaling and optimize cluster resource utilization. | 2023-07-12 | Add auto scaling rules, View auto scaling activities, View auto scaling cost analysis in a visualized manner |
| Automatic supplementation optimization | Replaces abnormal nodes in a cluster automatically. Adds information prompts and event notifications so you can track how automatic supplementation is performed. Note From 18:00 (UTC+8) on July 10, 2023, Automatic Compensation is enabled by default for new pay-as-you-go task node groups. | 2023-07-12 | Manage automatic supplementation |
| Service configuration prompts | Adds To Be Delivered and Not Effective Yet prompts when configurations are modified, guiding you on next steps to make changes take effect. | 2023-07-12 | Manage configuration items |
| Stateless clusters | Remove the core node group to build a fully stateless cluster using EMR's default data lake architecture, which does not depend on HDFS. Reduces O&M costs for workloads that don't require core nodes. | 2023-07-12 | Create a cluster |
| YARN partition and queue association | Associate YARN partitions with queues and allocate capacity directly in the EMR console without manual configuration. | 2023-07-12 | Manage resource queues |
| Per-second billing for pay-as-you-go resources | Pay-as-you-go resources are billed per second, providing finer billing granularity to help reduce resource costs. | 2023-07-12 | Pay-as-you-go, Subscription |
June
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.12.0 and EMR V3.46.0. | 2023-06-01 | Release notes for EMR V5.12.X, Release notes for EMR V3.46.X |
| Paimon | Adds Apache Paimon, a data lake platform for streaming and batch data processing with high-throughput writes and low-latency queries. | 2023-06-01 | Paimon overview, Integrate Paimon with Flink, Integrate Paimon with Spark, Integrate Paimon with Hive, Integrate Paimon with Trino |
| Presto | Adds Presto (PrestoDB), a flexible and scalable distributed SQL query engine. | 2023-06-07 | Overview, Use the CLI to connect to Presto, Use JDBC to access Presto, Configure connectors, Manage LDAP authentication |
April
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.11.1 and EMR V3.45.1. | 2023-04-03 | Release notes for EMR V5.11.X, Release notes for EMR V3.45.X |
| Data lakehouse support for Hologres and MaxCompute | Access Hologres and MaxCompute tables using the Spark and Trino compute engines. | 2023-04-03 | New capability in data lakehouse scenarios: EMR supports Hologres and MaxCompute data sources |
| Spark access to Hologres | Read data from Hologres tables using Spark. | 2023-04-03 | Use Spark to access Hologres |
| Node configuration upgrade | Upgrade the ECS instance configurations of a node group. | 2023-04-03 | Upgrade node configurations |
| YARN partition management in the EMR console | Manage YARN partitions visually in the EMR console and map multiple node groups to partitions at once. | 2023-04-13 | Manage YARN partitions in the EMR console |
March
| Feature | Description | Release date | References |
|---|---|---|---|
| Flink Table Store | Adds Flink Table Store, a unified data lake storage for streaming and batch processing with high-throughput writes and low-latency queries. | 2023-03-03 | — |
| Service configuration export and import | Export service configurations in XML or JSON format for backup, migration, or restore. | 2023-03-02 | Export and import service configurations |
February
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.11.0 and EMR V3.45.0. | 2023-02-28 | Release notes for EMR V5.11.X, Release notes for EMR V3.45.X |
2022
December
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.10.0 and EMR V3.44.0. | 2022-12-01 | Release notes for EMR V5.10.X, Release notes for EMR V3.44.X |
| YARN node labels | Manage nodes running NodeManager in a cluster by partition using YARN node labels. | 2022-12-14 | Node labels |
November
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.9.1 and EMR V3.43.1. | 2022-11-08 | Release notes for EMR V5.9.X, Release notes for EMR V3.43.X |
| Log management | Query logs generated by open source components directly in the EMR console. | 2022-11-29 | Manage logs |
October
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.9.0 and EMR V3.43.0. | 2022-10-14 | Release notes for EMR V5.9.X, Release notes for EMR V3.43.X |
| HBase Shell | Connect to HBase deployed in an EMR cluster using HBase Shell. | 2022-10-21 | Use HBase Shell |
| DataServing cluster | Provides DataServing clusters based on Apache HBase. | 2022-10-28 | DataServing cluster |
September
| Feature | Description | Release date | References |
|---|---|---|---|
| Automatic supplementation | Automatically replaces abnormal ECS instances in a cluster when EMR detects they can no longer run engine services as expected. | 2022-09-07 | Manage automatic supplementation |
| Cluster cloning | Create a new cluster based on an existing cluster's configuration. | 2022-09-09 | Clone a cluster |
August
| Feature | Description | Release date | References |
|---|---|---|---|
| Version update | Releases EMR V5.8.0 and EMR V3.42.0. | 2022-08-05 | Release notes for EMR V5.8.X, Release notes for EMR V3.42.X |
| Deployment sets | Use Alibaba Cloud ECS deployment sets to manage the distribution of ECS instances, improving disaster recovery capability and availability. | 2022-08-05 | Add nodes to the deployment set |
| Gateway deployment using EMR-CLI | Deploy a gateway on an ECS instance using the EMR-CLI tool. | 2022-08-05 | Use EMR-CLI to deploy a gateway |
July
| Feature | Description | Release date | References |
|---|---|---|---|
| EMR Doctor | An intelligent O&M system developed by the Alibaba Cloud EMR team for open source big data clusters. | 2022-07-25 | Overview |
June
| Feature | Description | Release date | References |
|---|---|---|---|
| DataLake cluster | A big data computing cluster for flexible, reliable, and efficient data analysis. DataLake clusters are available only in the new EMR console. | 2022-06-01 | DataLake cluster |
| Spark cluster and Shuffle Service cluster association | Associate a Spark cluster created on the EMR on ACK page with a Shuffle Service cluster. Remote Shuffle Service (RSS) improves the stability and performance of Spark Shuffle. | 2022-06-09 | Associate a Spark cluster with a Shuffle Service cluster |
May
| Feature | Description | Release date | References |
|---|---|---|---|
| StarRocks memory management | Covers memory usage categories, memory configuration parameters for a backend (BE) in StarRocks, and how to view memory usage. | 2022-05-10 | Manage memory resources |