When you create an E-MapReduce (EMR) cluster, the service components that are deployed on cluster nodes vary based on the cluster type. For example, the NameNode component of the HDFS service is deployed on the master node of a Hadoop cluster. This topic describes how to view the deployment information of service components on each node of an EMR cluster.

Prerequisites

An EMR cluster is created. For more information, see Create a cluster.

Procedure

  1. Go to the Cluster Overview page of your cluster.
    1. Log on to the Alibaba Cloud EMR console by using your Alibaba Cloud account.
    2. In the top navigation bar, select the region where your cluster resides and select a resource group based on your business requirements.
    3. Click the Cluster Management tab.
    4. Find your cluster and click Details in the Actions column.
  2. In the left-side navigation pane, click Cluster Service and select the service whose component deployment information you want to view
  3. On the page that appears, click the Component Deployment tab.
    The deployment information is displayed on the Component Deployment tab. The following sections provide the deployment information of each cluster type:

Hadoop cluster

The following tables provide the deployment information of service components on each node of a Hadoop cluster in EMR V3.29.0.
  • Required services
    Service Component on the master node Component on the core nodes
    HDFS
    • KMS
    • SecondaryNameNode
    • HttpFS
    • HDFS Client
    • NameNode
    • DataNode
    • HDFS Client
    YARN
    • ResourceManager
    • App Timeline Server
    • JobHistory
    • WebAppProxyServer
    • Yarn Client
    • Yarn Client
    • NodeManager
    Hive
    • Hive MetaStore
    • HiveServer2
    • Hive Client
    Hive Client
    Spark
    • Spark Client
    • SparkHistory
    • ThriftServer
    Spark Client
    Knox Knox None
    Tez
    • Tomcat
    • Tez Client
    Tez Client
    Ganglia
    • Gmond
    • Httpd
    • Gmetad
    • Ganglia Client
    • Gmond
    • Ganglia Client
    Sqoop Sqoop Client Sqoop Client
    Bigboot
    • Bigboot Client
    • Bigboot Monitor
    • Bigboot Client
    • Bigboot Monitor
    OpenLDAP OpenLDAP None
    Hue Hue None
    SmartData
    • Jindo Namespace Service
    • Jindo Storage Service
    • Jindo Client
    • Jindo Storage Service
    • Jindo Client
  • Optional services
    Service Component on the master node Component on the core nodes
    LIVY Livy None
    Superset Superset None
    Flink
    • FlinkHistoryServer
    • Flink Client
    Flink Client
    RANGER
    • RangerPlugin
    • RangerAdmin
    • RangerUserSync
    • Solr
    RangerPlugin
    Storm
    • Storm Client
    • UI
    • Nimbus
    • Logviewer
    • Storm Client
    • Supervisor
    • Logviewer
    Phoenix Phoenix Client Phoenix Client
    Kudu
    • Kudu Master
    • Kudu Client
    • Kudu Tserver
    • Kudu Master
    • Kudu Client
    HBase
    • HMaster
    • HBase Client
    • ThriftServer
    • HBase Client
    • HRegionServer
    ZooKeeper
    • ZooKeeper follower
    • ZooKeeper Client
    • ZooKeeper follower
    • ZooKeeper leader
    • ZooKeeper Client
    Oozie Oozie None
    Presto
    • Presto Client
    • PrestoMaster
    • Presto Client
    • PrestoWorker
    Impala
    • Impala Runtime and Shell
    • Impala Catalog Server
    • Impala StateStore Server
    • Impala Runtime and Shell
    • Impala Daemon Server
    Pig Pig Client Pig Client
    Zeppelin Zeppelin None
    FLUME
    • Flume Agent
    • Flume Client
    • Flume Agent
    • Flume Client

Druid cluster

The following tables provide the deployment information of service components on each node of a Druid cluster in EMR V3.29.0.
  • Required services
    Service Component on the master node Component on the core nodes
    Druid
    • Druid Client
    • Coordinator
    • Overlord
    • Broker
    • Router
    • MiddleManager
    • Historical
    • Druid Client
    HDFS
    • KMS
    • SecondaryNameNode
    • HttpFS
    • HDFS Client
    • NameNode
    • DataNode
    • HDFS Client
    Ganglia
    • Gmond
    • Httpd
    • Gmetad
    • Ganglia Client
    • Gmond
    • Ganglia Client
    ZooKeeper
    • ZooKeeper follower
    • ZooKeeper Client
    • ZooKeeper leader
    • ZooKeeper follower
    • ZooKeeper Client
    OpenLDAP OpenLDAP None
    Bigboot
    • Bigboot Client
    • Bigboot Monitor
    • Bigboot Client
    • Bigboot Monitor
    SmartData
    • Jindo Namespace Service
    • Jindo Storage Service
    • Jindo Client
    • Jindo Storage Service
    • Jindo Client
  • Optional services
    Service Component on the master node Component on the core nodes
    YARN
    • ResourceManager
    • App Timeline Server
    • JobHistory
    • WebAppProxyServer
    • Yarn Client
    • Yarn Client
    • NodeManager
    Superset Superset None

Dataflow-Kafka cluster

The following tables provide the deployment information of service components on each node of a Dataflow-Kafka cluster in EMR V3.29.0.
  • Required services
    Service Component on the master node Component on the core nodes
    Kafka-Manager Kafka Manager None
    Kafka
    • Kafka Client
    • KafkaMetadataMonitor
    • Kafka Rest Proxy
    • Kafka Broker broker
    • Kafka Schema Registry
    • Kafka Broker broker
    • Kafka Client
    Ganglia
    • Gmond
    • Httpd
    • Gmetad
    • Ganglia Client
    • Gmond
    • Ganglia Client
    ZooKeeper
    • ZooKeeper follower
    • ZooKeeper Client
    • ZooKeeper leader
    • ZooKeeper follower
    • ZooKeeper Client
    OpenLDAP OpenLDAP None
  • Optional services
    Service Component on the master node Component on the core nodes
    RANGER
    • RangerPlugin
    • RangerUserSync
    • RangerAdmin
    • Solr
    RangerPlugin
    Knox Knox None

Flink cluster

The following tables provide the deployment information of service components on each node of a Flink cluster in EMR V3.30.0.
  • Required services
    Service Component on the master node Component on the core nodes
    HDFS
    • KMS
    • SecondaryNameNode
    • HttpFS
    • HDFS Client
    • NameNode
    • DataNode
    • HDFS Client
    YARN
    • ResourceManager
    • App Timeline Server
    • JobHistory
    • WebAppProxyServer
    • Yarn Client
    • Yarn Client
    • NodeManager
    Ganglia
    • Gmond
    • Httpd
    • Gmetad
    • Ganglia Client
    • Gmond
    • Ganglia Client
    ZooKeeper
    • ZooKeeper
    • ZooKeeper Client
    • ZooKeeper
    • ZooKeeper Client
    Knox Knox None
    Flink-Vvp Flink-Vvp None
    OpenLDAP OpenLDAP None
  • Optional services
    Service Component on the master node Component on the core nodes
    PAI-Alink Alink None

Data Science cluster

The following tables provide the deployment information of service components on each node of a Data Science cluster in EMR V3.29.1.
  • Required services
    Service Component on the master node Component on the core nodes
    HDFS
    • HDFS Client
    • KMS
    • HttpFS
    • NameNode
    • SecondaryNameNode
    • HDFS Client
    • DataNode
    YARN
    • WebAppProxyServer
    • JobHistory
    • App Timeline Server
    • ResourceManager
    • Yarn Client
    • NodeManager
    • Yarn Client
    ZooKeeper
    • ZooKeeper Client
    • ZooKeeper follower
    • ZooKeeper Client
    • ZooKeeper leader
    • ZooKeeper follower
    Knox Knox None
    Tensorflow on YARN
    • TensorFlow-On-YARN-Gateway
    • TensorFlow-On-YARN-History-Server
    • TensorFlow-On-YARN
    • TensorFlow-On-YARN-Client
    • TensorFlow-On-YARN-Gateway
    SmartData
    • Jindo Namespace Service
    • Jindo Storage Service
    • Jindo Client
    • Jindo Storage Service
    • Jindo Client
    Bigoot
    • Bigboot Monitor
    • Bigboot Client
    • Bigboot Monitor
    • Bigboot Client
    PAI-EASYREC Easyrec Easyrec
    PAI-EAS PAIEAS PAIEAS
    PAI-Faiss Faiss Faiss
    PAI-Redis Redis Redis
    PAI-Alink Alink None
    Flink-Vvp Flink-Vvp None
    OpenLDAP OpenLDAP None
    Jindo SDK Jindo SDK Jindo SDK
  • Optional services
    Service Component on the master node Component on the core nodes
    Zeppelin Zeppelin None
    PAI-REC Rec None
    AUTOML AUTOML AUTOML
    TensorFlow TensorFlow TensorFlow

ClickHouse cluster

The following table provides the deployment information of service components on each node of a ClickHouse cluster in EMR V3.35.0.
Service Component on the master node Component on the core nodes
Ganglia
  • Gmond
  • Httpd
  • Gmetad
  • Ganglia Client
  • Gmond
  • Ganglia Client
ZooKeeper
  • ZooKeeper Client
  • ZooKeeper follower
  • ZooKeeper Client
  • ZooKeeper leader
  • ZooKeeper follower
ClickHouse
  • ClickHouse Server
  • ClickHouse Client
  • ClickHouse Server
  • ClickHouse Client

Data Development cluster

The following table provides the deployment information of service components on each node of a Data Development cluster in EMR V3.33.102.
Service Component on the master node Component on the core nodes
Ganglia
  • Gmond
  • Httpd
  • Gmetad
  • Ganglia Client
  • Gmond
  • Ganglia Client
Zeppelin ZooKeeper Master ZooKeeper Worker
JupyterHub JupyterHub None
RabbitMQ RabbitMQ
MySQL MySQL
Data Development Center Data Development Center
Airflow
  • Airflow Client
  • Airflow Scheduler
  • Airflow WebServer
  • Airflow Client
  • Airflow Worker