All Products
Search
Document Center

Migration Hub:Fields collected from Hive

Last Updated:Mar 03, 2025

This topic describes the key fields that are collected by the hive-scanner tool of Cloud Migration Hub (CMH) from your Hive system.

Overview

The hive-scanner tool accesses the metastore database of your Hive system to collect resource information. The metastore database must be a MySQL database.

Basic information

The hive-scanner tool scans objects in your Hive system and collects the following key fields:

  • hiveVersion: the Hive version.

  • dbName: the name of the Hive database.

  • dbSize: the size of the Hive database.

  • tableNum: the number of tables.

  • partitionTableNum: the number of partitioned tables.

  • externalTableNum: the number of external numbers.

  • transactionalTableNum: the number of transactional tables.

  • viewTableNum: the number of views.

  • functionNum: the number of functions.

Top objects

The hive-scanner tool collects metadata statistics on the databases in your Hive system to obtain the following key fields about top objects:

  • top10TableBySize: the top 10 tables by size.

  • top10TableByPartNum: the top 10 tables by the number of partitions.

  • top10PartBySize: the top 10 partitions by size.