
Working with E-MapReduce in Alibaba Cloud

This article walks through creating an Alibaba Cloud EMR cluster step by step.

Introduction

E-MapReduce (EMR) is a cloud-native open-source big data platform that provides easy-to-integrate open-source big data computing and storage engines such as Hadoop, Hive, Spark, Flink, Presto, and ClickHouse. EMR lets you adjust computing resources based on your business needs and deploy them on Alibaba Cloud Elastic Compute Service (ECS), Alibaba Cloud Container Service for Kubernetes (ACK), and Apsara Stack. This post shows how to create an Alibaba Cloud EMR cluster, connect to its master node, and run Hadoop and Spark commands on it.

Step-1: Select Alibaba Cloud EMR from the Alibaba Cloud console.

[Figure 1]

Step-2: In the Alibaba Cloud EMR console, select the Create Cluster option, as shown in the figure below.

[Figure 2]

Step-3: Provide the basic configuration for the Alibaba Cloud EMR cluster to be created, as shown in the figures below.

[Figure 3]
[Figure 4]

Step-4: Provide the hardware configuration for both the master node and the core node of the Alibaba Cloud EMR cluster to be created.

[Figure 5]
[Figure 6]

Step-5: Assign a public IP address to the master node of the Alibaba Cloud EMR cluster.

[Figure 7]

Step-6: Provide the configuration for the core node of the Alibaba Cloud EMR cluster.

[Figure 8]

Step-7: Provide the other basic settings, such as the cluster name and password, for the Alibaba Cloud EMR cluster to be created.

[Figure 9]

Step-8: Finally, confirm the configuration by clicking the Confirm button in the Alibaba Cloud EMR console.

[Figure 10]
[Figure 11]

Step-9: Once cluster creation is complete, the new cluster is listed with the status Running, as shown in the figure below.

[Figure 12]

Step-10: After cluster creation is complete, we can view the master node and the core node, as shown in the figure below.

[Figure 13]

Step-11: Get the public IP address of the master node from the basic configuration panel of the Alibaba Cloud EMR console.

[Figure 14]

Step-12: Connect to the master node over SSH from a Windows client, using the public IP address from Step-11, as shown below.

[Figure 15]
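
The SSH login in Step-12 can also be scripted. The following is a minimal Python sketch that builds an OpenSSH command line from the master node's public IP address. The helper name build_ssh_command, the root login user, port 22, and the sample address 203.0.113.10 (a documentation-only address) are assumptions for illustration; substitute the values shown in your own EMR console.

```python
import ipaddress
import shlex
from typing import List


def build_ssh_command(public_ip: str, user: str = "root", port: int = 22) -> List[str]:
    """Return the argument list for an OpenSSH login to the master node.

    `user` and `port` are assumed defaults, not values taken from the article.
    """
    # Fail early if the value copied from the console is not a valid IP address.
    ipaddress.ip_address(public_ip)  # raises ValueError on malformed input
    return ["ssh", "-p", str(port), f"{user}@{public_ip}"]


if __name__ == "__main__":
    # 203.0.113.10 is a placeholder; use the public IP address from Step-11.
    command = build_ssh_command("203.0.113.10")
    # Print a copy-and-paste form of the command instead of running it.
    print(shlex.join(command))
```

Printing the command rather than executing it keeps the sketch safe to run anywhere; the printed line can be pasted into any terminal that has an OpenSSH client.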

Step-13: On the master node, work with the Hadoop cluster by running the hadoop command at the command prompt, as shown below.

[Figure 16]
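
As a rough illustration of Step-13, the Python sketch below wraps two standard Hadoop CLI invocations, hadoop version and hadoop fs -ls /, and runs them only when the hadoop binary is on the PATH. The exact command used in the original screenshot is not known, so these two subcommands and the helper names hadoop_command and run_if_available are assumptions chosen for illustration.

```python
import shutil
import subprocess
from typing import List, Optional


def hadoop_command(*args: str) -> List[str]:
    """Build an argument list for the hadoop CLI, e.g. hadoop_command("version")."""
    return ["hadoop", *args]


def run_if_available(cmd: List[str]) -> Optional[str]:
    """Run `cmd` and return its standard output.

    Returns None when the executable is not on PATH, which is the
    expected result on any machine other than a cluster node.
    """
    if shutil.which(cmd[0]) is None:
        return None
    result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    return result.stdout


if __name__ == "__main__":
    # Print the Hadoop version, then list the HDFS root directory.
    for cmd in (hadoop_command("version"), hadoop_command("fs", "-ls", "/")):
        output = run_if_available(cmd)
        if output is None:
            print(" ".join(cmd), "-> hadoop not found on PATH")
        else:
            print(output)
```

Run on the master node, the script prints the output of both commands; elsewhere it only reports that hadoop is not installed.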

Step-14: Finally, run a Spark job on the new EMR cluster by using the spark command, as shown below.

[Figure 17]
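
One common way to launch a Spark job from the master node is spark-submit. The Python sketch below assembles such a command for the SparkPi example class that ships with Apache Spark. The original post does not say which Spark command it ran, so the use of spark-submit, the helper name build_spark_submit, the YARN master, and the jar path /path/to/spark-examples.jar are all assumptions; the jar path in particular is a placeholder that must be replaced with the real location on your cluster.

```python
import shlex
from typing import List, Sequence


def build_spark_submit(
    app_jar: str,
    main_class: str,
    app_args: Sequence[str] = (),
    master: str = "yarn",
    deploy_mode: str = "client",
) -> List[str]:
    """Return the argument list for a spark-submit invocation."""
    # spark-submit accepts only these two deploy modes.
    if deploy_mode not in ("client", "cluster"):
        raise ValueError(f"unsupported deploy mode: {deploy_mode!r}")
    return [
        "spark-submit",
        "--class", main_class,
        "--master", master,
        "--deploy-mode", deploy_mode,
        app_jar,
        *app_args,
    ]


if __name__ == "__main__":
    command = build_spark_submit(
        app_jar="/path/to/spark-examples.jar",  # hypothetical placeholder path
        main_class="org.apache.spark.examples.SparkPi",
        app_args=["10"],  # number of partitions used by the Pi estimate
    )
    # Print the command so it can be reviewed before running it on the cluster.
    print(shlex.join(command))
```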

Conclusion

Alibaba Cloud E-MapReduce (EMR) is a cloud-native open-source big data platform that provides easy-to-integrate computing and storage engines such as Hadoop, Hive, Spark, Flink, Presto, and ClickHouse. With the EMR service, a cluster can be created within minutes with just a few mouse clicks. This post covered the steps involved: configuring and creating the cluster, connecting to the master node over SSH, and running Hadoop and Spark commands on it.


GAVASKAR S
