Tez is a computing framework that is built on Apache Hadoop and supports distributed directed acyclic graphs (DAGs). Tez can use a complex DAG to describe and process big data tasks.

Background information

Tez is used in Apache Hive. Tez serves as a runtime engine of Hive and can optimize the query engine of Hive SQL. Hive on Tez features higher query performance and stability than Hive on MapReduce.

The following figure shows how Hive submits tasks based on MapReduce and Tez. Hive-MapReduce-Tez

For more information about Tez, see Apache TEZ.

Enable the Tez engine

Hive allows you to use Tez to execute SQL statements. Before you execute an SQL statement, you must perform the following steps to enable the Tez engine:

  1. Go to the Services tab.
    1. Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.
    2. In the top navigation bar, select the region where your cluster resides and select a resource group based on your business requirements.
    3. On the EMR on ECS page, click Services in the Actions column of the cluster that you want to manage.
  2. On the Services tab, click Configure in the Hive section.
  3. Modify and save the hive.execution.engine parameter.
    1. In the search box, enter hive.execution.engine.
    2. Set the hive.execution.engine parameter to tez and click Save.
    3. In the dialog box that appears, enter a reason in the Execution Reason field and click Save.
  4. Restart Hive.
    1. Click the Status tab.
    2. On the Services tab, choose More > Restart in the upper-right corner.
    3. In the dialog box that appears, enter a reason in the Execution Reason field and click OK.
    4. In the Confirm message, click OK.

Access the web UI of Tez

On the Access Links and Ports tab, click the access URL that corresponds to Tez UI to access the web UI of Tez.

For more information, see Access the web UIs of open source components.