All Products
Search
Document Center

E-MapReduce:Scale out an EMR cluster

Last Updated:Sep 15, 2023

You can scale out an E-MapReduce (EMR) cluster that has insufficient computing or storage resources. This topic describes how to scale out an EMR cluster.

Prerequisites

An EMR cluster is created. For more information, see Create a cluster.

Limits

  • You can add only core nodes and task nodes to an existing EMR cluster. By default, the configurations of an added node are the same as those of the existing nodes.

  • For Hadoop clusters, you cannot scale out a node group that is created on the Auto Scaling tab. For more information, see Configure auto scaling (only for Hadoop clusters).

Procedure

Note
  • If you want to scale out a core node group of an EMR cluster and the cluster uses Hadoop Distributed File System (HDFS) to store data, you must use HDFS Balancer to rebalance HDFS after you scale out the node group. For more information, see HDFS Balancer.

  • The cluster scale-out operation does not restart the application processes on existing nodes.

  1. Go to the Nodes tab.

    1. Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.

    2. In the top navigation bar, select the region in which your cluster resides and select a resource group based on your business requirements.

    3. On the EMR on ECS page, find the cluster that you want to scale out and click Nodes in the Actions column.

  2. On the Nodes tab, find the desired node group and click Scale Out in the Actions column.

  3. In the Scale Out dialog box, configure the parameters based on your business requirements.

    Parameter

    Description

    Node Group Name

    The name of the node group.

    Node Type

    The type of the node group.

    Current Instance Type

    The information about instances in the node group.

    Billing Method

    The billing method of the cluster. The billing method of a new node is the same as that of the cluster and cannot be changed. If the billing method is subscription, you can configure the Subscription Duration parameter for the new node.

    vSwitch

    The information about the vSwitch that is deployed for the node group.

    Current Quantity

    The number of instances in the node group.

    Added Instances

    The number of instances that you want to add to the node group. Click the upward or downward arrow or enter a number in the Added Instances field.

    Cluster Expiration Time

    The expiration time of the subscription cluster.

    Terms of Service

    Read and agree to the terms of service.

  4. Click OK.

    For more information about how to log on to a new node, see Log on to a cluster.