Alibaba Cloud Elastic MapReduce (EMR) is a big data processing solution that runs on the Alibaba Cloud platform. EMR is built on Alibaba Cloud ECS instances and is based on open-source Apache Hadoop and Apache Spark. EMR allows you to use the Hadoop and Spark ecosystem components, such as Apache Hive, Apache Kafka, Flink, Druid, and TensorFlow, to analyze and process data. You can use EMR to process data stored on different Alibaba Cloud data storage service, such as Object Storage Service (OSS), Log Service (SLS), and Relational Database Service (RDS).
Benefits
- Easy-to-use
You can quickly create clusters without the need to configure hardware and software. All maintenance operations are completed on its Web interface.
- Cost-effectiveness
You can create clusters and dynamically scale in and out the number of compute nodes based on current computing needs.
- Stability
EMR provides a deeply optimized cluster environment, automated background maintenance, and multiple online support channels.
- Security
EMR supports Kerberos authentication and data encryption. You can use RAM users to refine the management of service permissions.