E-MapReduce versions 2.0 and higher support presto. Presto can be used in E-MapReduce by checking presto software when selecting the mirror image.
After cluster creation, log on to the master node. Presto software will be installed in the directory
/usr/lib/presto-current, and PrestoServer process can be seen by command
Presto service process can be divided into coordinator and worker. Coordinator is started on the master (the HA cluster is the master node of hostname starting with emr-header-1), and worker process is started on the core node. The service process configuration is under the directory
/usr/lib/presto-current/etc, and coordinator uses coordinator-config.properties while worker uses worker-config.preperties, and other configuration files are used for public. The web port is set as 9090.
By default, presto service is set with support from Hive. Connect the metastore of hive on the cluster to read the table information of Hive and perform querying. The cluster is pre-installed with presto cli and can directly execute “show tables” to check the Hive list. Note that a delay of several seconds will occur when synchronizing the Hive table.
presto -server localhost:9090 -catalog hive -schema default -user hadoop -execute 'show tables'