This topic describes how to convert a text file to the RCFile format. The nation.tbl file is used as an example.
Create an Object Storage Service (OSS) schema.
CREATE SCHEMA dla_oss_db with DBPROPERTIES( catalog='oss', location 'oss://bucket-name/TPC-H/' )
Create the nation_txt table in the Data Lake Analytics (DLA) console and set the
LOCATIONparameter to the path of nation.tbl in an OSS bucket.
CREATE EXTERNAL TABLE nation_txt( n_nationkey int, n_name string, n_id string, n_regionkey int, n_comment string ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' STORED AS TEXTFILE LOCATION 'oss://bucket-name/TPC-H/nation/nation.tbl'
Create a table named nation_RCFile in the DLA console and set the
LOCATIONparameter to the path where you want to store the RCFile file in the OSS bucket.
Note: You must set the
LOCATIONparameter to a path that exists in the OSS bucket and the path must end with a forward slash (
CREATE EXTERNAL TABLE nation_RCFile( n_nationkey int, n_name string, n_id string, n_regionkey int, n_comment string ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' STORED AS RCFile LOCATION 'oss://bucket-name/TPC-H/nation_RCFile/'
STORED AS RCFile: indicates that the file is stored in the RCFile format.
INSERT...SELECTstatement to insert data of the nation_txt table to the nation_RCFile table.
INSERT INTO nation_RCFile SELECT * FROM nation_txt;
View the data of the nation_RCFile table.
INSERT...SELECTstatement is executed, you can view the generated RCFile table in the OSS bucket.