This topic describes the key fields that are collected by the hive-scanner tool of Cloud Migration Hub (CMH) from your Hive system.
Overview
The hive-scanner tool accesses the metastore database of your Hive system to collect resource information. The metastore database must be a MySQL database.
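As a rough illustration of what reading resource information from a metastore database can look like, the following sketch counts tables per database and table type. It is not the query that hive-scanner runs, which this topic does not document. The DBS and TBLS table names and the TBL_TYPE values follow the standard Hive metastore schema, the function name count_tables_by_type and the sample rows are hypothetical, and an in-memory SQLite database stands in for MySQL so that the example runs without a server.

```python
import sqlite3

# Stand-in for the MySQL metastore database (assumption: SQLite is used only
# so the sketch is runnable; only the columns needed below are created).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DBS (DB_ID INTEGER PRIMARY KEY, NAME TEXT);
CREATE TABLE TBLS (TBL_ID INTEGER PRIMARY KEY, DB_ID INTEGER,
                   TBL_NAME TEXT, TBL_TYPE TEXT);
INSERT INTO DBS VALUES (1, 'sales'), (2, 'logs');
INSERT INTO TBLS VALUES
  (1, 1, 'orders', 'MANAGED_TABLE'),
  (2, 1, 'orders_ext', 'EXTERNAL_TABLE'),
  (3, 1, 'orders_v', 'VIRTUAL_VIEW'),
  (4, 2, 'events', 'EXTERNAL_TABLE');
""")


def count_tables_by_type(connection):
    """Return {db_name: {tbl_type: count}} from the DBS and TBLS tables."""
    rows = connection.execute(
        "SELECT d.NAME, t.TBL_TYPE, COUNT(*) "
        "FROM TBLS t JOIN DBS d ON t.DB_ID = d.DB_ID "
        "GROUP BY d.NAME, t.TBL_TYPE"
    ).fetchall()
    result = {}
    for db_name, tbl_type, n in rows:
        result.setdefault(db_name, {})[tbl_type] = n
    return result


counts = count_tables_by_type(conn)
```

Against a real metastore, the same SELECT statement could be issued through a MySQL client library instead of sqlite3.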
Basic information
The hive-scanner tool scans objects in your Hive system and collects the following key fields:
hiveVersion: the Hive version.
dbName: the name of the Hive database.
dbSize: the size of the Hive database.
tableNum: the number of tables.
partitionTableNum: the number of partitioned tables.
externalTableNum: the number of external tables.
transactionalTableNum: the number of transactional tables.
viewTableNum: the number of views.
functionNum: the number of functions.
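The fields above can be pictured as one record per Hive database. The following is a minimal sketch of such a record and of how the counts might be derived from per-table metadata. Only the field names come from this topic. The BasicInfo class, the summarize function, and the shape of the input dictionaries are hypothetical. The sketch also assumes that dbSize is a byte count equal to the sum of the table sizes and that views are excluded from tableNum, neither of which this topic states.

```python
from dataclasses import dataclass


@dataclass
class BasicInfo:
    # Field names mirror the key fields listed in this topic.
    hiveVersion: str
    dbName: str
    dbSize: int  # assumption: bytes; the unit is not stated in this topic
    tableNum: int
    partitionTableNum: int
    externalTableNum: int
    transactionalTableNum: int
    viewTableNum: int
    functionNum: int


def summarize(hive_version, db_name, tables, function_num):
    """Build a BasicInfo record from hypothetical per-table dictionaries."""
    views = [t for t in tables if t["type"] == "VIRTUAL_VIEW"]
    real_tables = [t for t in tables if t["type"] != "VIRTUAL_VIEW"]
    return BasicInfo(
        hiveVersion=hive_version,
        dbName=db_name,
        dbSize=sum(t["size"] for t in real_tables),
        tableNum=len(real_tables),  # assumption: views are not counted here
        partitionTableNum=sum(1 for t in real_tables if t["partitioned"]),
        externalTableNum=sum(1 for t in real_tables if t["type"] == "EXTERNAL_TABLE"),
        transactionalTableNum=sum(1 for t in real_tables if t["transactional"]),
        viewTableNum=len(views),
        functionNum=function_num,
    )


sample_tables = [
    {"name": "orders", "size": 100, "type": "MANAGED_TABLE",
     "partitioned": True, "transactional": True},
    {"name": "orders_ext", "size": 50, "type": "EXTERNAL_TABLE",
     "partitioned": False, "transactional": False},
    {"name": "orders_v", "size": 0, "type": "VIRTUAL_VIEW",
     "partitioned": False, "transactional": False},
]
info = summarize("3.1.2", "sales", sample_tables, function_num=2)
```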
Top objects
The hive-scanner tool collects metadata statistics on the databases in your Hive system to obtain the following key fields about top objects:
top10TableBySize: the top 10 tables by size.
top10TableByPartNum: the top 10 tables by the number of partitions.
top10PartBySize: the top 10 partitions by size.
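A top-10 ranking of this kind can be computed with a bounded selection instead of a full sort. The sketch below shows one possible way to do this with heapq.nlargest from the Python standard library. It does not describe how hive-scanner is implemented. The top_n helper, the record keys (name, size, partNum), and the sample values are hypothetical.

```python
import heapq


def top_n(items, key, n=10):
    """Return up to n items with the largest item[key], in descending order."""
    return heapq.nlargest(n, items, key=lambda item: item[key])


# Hypothetical per-table and per-partition statistics.
tables = [
    {"name": "orders", "size": 300, "partNum": 12},
    {"name": "events", "size": 900, "partNum": 4},
    {"name": "users", "size": 50, "partNum": 0},
]
partitions = [
    {"name": "orders/dt=2024-01-01", "size": 40},
    {"name": "orders/dt=2024-01-02", "size": 75},
    {"name": "events/dt=2024-01-01", "size": 210},
]

top10TableBySize = top_n(tables, "size")
top10TableByPartNum = top_n(tables, "partNum")
top10PartBySize = top_n(partitions, "size")
```

When fewer than 10 objects exist, as in this sample, all of them are returned in ranked order.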