When you use E-MapReduce (EMR), cluster instability may occur or clusters may become unavailable due to unexpected operations. Take note of the information in this topic to avoid these issues. This topic describes the limits of EMR.
High-risk operations (prohibited)
|Delete or modify the hosts file that is stored in the etc/ directory.||You cannot find the services that run on the nodes of your cluster, which causes service exceptions.||Add information to the hosts file.|
|Modify parameters in component configuration files in the ECS console.||After specific services are restarted, the settings of the parameters that are modified are overwritten.||Modify the parameters in the EMR console.|
|Attach disks to the nodes of your EMR cluster in the ECS console.||The disks are unavailable because EMR cannot recognize and initialize the disks.||Add data disks in the EMR console.|
|Detach disks from the nodes of your EMR cluster in the ECS console.||This may cause data loss because EMR is unaware of the disk detaching operation.||Perform the following operations on specific nodes:
|Remove core nodes in the ECS console.||This causes data loss, and execution failures of jobs on the removed nodes.||Disable the HDFS, YARN, HBase, Kudu, or Kafka component.|
|Remove master nodes in the ECS console.||
|Remove task nodes in the ECS console.||The jobs that you run on the removed nodes fail.||Stop the NodeManager of YARN.|
|Stop the MySQL service of the master node. (Type is set to Built-in MySQL when you create an EMR cluster.)||The MySQL service deployed on the emr-header-1 node is associated with Hive MetaStore, Oozie, and Ranger. If you stop the MySQL service, the associated components cannot access the specific database.||N/A|
|Change the password of the root user that is used to access the MySQL service deployed on the emr-header-1 node. (Type is set to Built-in MySQL when you create an EMR cluster.)||The associated component such as Hue or Ranger fails.||N/A|
|Modify the security group to which ECS instances belong when an EMR cluster is running.||
|Insufficient disk capacity||Increase the capacity of a single disk or add core nodes in the EMR console. EMR clusters do not support the addition of disks.|
|Excess disk capacity||Purchase a new cluster and release the original one. For more information, see Create a cluster. EMR clusters do not support scale-down of disk capacity.|
|Insufficient computing capabilities||Add task nodes in the EMR console. For more information, see Scale out a cluster.|
|Excess computing capabilities||
Solutions to clusters with different billing methods:
|Outdated component versions||Purchase a cluster of a later version. For more information, see Create a cluster. Existing clusters do not support the version update of a single component.|
|Conversion from a non-HA cluster to an HA cluster||We recommend that you purchase an HA cluster. Existing non-HA clusters cannot be converted into HA clusters.|
|Deployment of third-party software or services on EMR||We recommend that you perform bootstrap actions to install third-party software or
services when you create a cluster.
If you manually install third-party software or services after you create a cluster, when you add nodes, you must manually install the third-party software or services again on the added nodes.