All Products
Search
Document Center

E-MapReduce:Release notes for EMR V5.8.X

Last Updated:Apr 27, 2023

This topic describes the release notes for E-MapReduce (EMR) V5.8.X, including the release date, updates, and release version information.

Release date

August 5, 2022 for EMR V5.8.0

Updates

Service

Description

Spark

LDAP authentication can be enabled with one click.

Hive

LDAP authentication can be enabled with one click.

Presto

  • Presto is updated to 389.

    Delta Lake and Hudi connectors that are provided by the Presto community can be used.

    • The time travel and Z-Order features are not supported by the Delta Lake connector.

    • The Hudi connector cannot be used to query Merge on Read (MOR) tables.

  • LDAP authentication can be enabled with one click.

Delta Lake

  • Delta Lake is interconnected with Data Lake Formation (DLF) to support automatic management of tables in data lakes.

  • The issue that the partition information cannot be automatically synchronized by using the CREATE TABLE AS statement is fixed.

  • Metric information can be returned by the OPTIMIZE and VACUUM commands.

Hudi

Hudi is updated to 0.11.1.

Hadoop Common

The Hadoop Common service is added. This way, the issue that the configurations of HDFS, YARN, and JindoSDK are overwritten is fixed.

YARN

The auto scaling feature is enhanced.

Ranger

  • Spark 2 and Spark 3 are supported.

  • LDAP authentication can be enabled for Ranger UserSync with one click.

Kafka

The Kafka service is added. The service version ranges from 2.12 to 2.4.1.

HBase

The HBase service is added. The service version is 2.3.4.

Phoenix

The Phoenix service is added. The service version is 5.1.2.

Doris

Doris is updated to 1.1.1.

StarRocks

  • StarRocks is updated to 2.3.0.

  • The primary key model supports the DELETE WHERE syntax. The primary key indexes are stored in the persistent storage to reduce memory usage. For more information, see StarRocks version 2.3.

ClickHouse

  • ClickHouse is updated to 22.3.8.39.

  • The following issue is fixed: An out-of-memory (OOM) error occurs when you read large files from Object Storage Service (OSS).

Release version information

DataLake clusters

Service

Version

HDFS

3.2.1

YARN

3.2.1

Hive

3.1.3

Spark 2

2.4.8

Spark 3

3.2.1

TEZ

0.10.1

Presto

389

Delta Lake

1.1.0

Hudi

0.11.1

Iceberg

0.13.1

JindoData

4.4.2

Kyuubi

1.5.2

Knox

1.5.0

Impala

3.4.0

OpenLDAP

2.4.44

Ranger

2.1.0

Sqoop

1.4.7

DLF-Auth

2.0.0

ZooKeeper

3.6.3

RSS

0.1.1

OLAP clusters

Service

Version

ZooKeeper

3.6.3

ClickHouse

22.3.8.39

Doris

1.1.1

StarRocks

2.3.0

Dataflow clusters

Service

Version

HDFS

3.2.1

YARN

3.2.1

Knox

1.5.0

OpenLDAP

2.4.44

ZooKeeper

3.6.3

Flink

1.13-vvr-4.0.13

Kafka

2.12-2.4.1

Kafka Manager

2.0.0.2

DataServing clusters

Service

Version

HDFS

3.2.1

JindoData

4.4.2

ZooKeeper

3.6.3

HBase

2.3.4

Phoenix

5.1.2