Apace Drill provides support to the Parquet columnar storage format. In this section, we will create a table in a columnar storage format and run the select query on it.
Create a Table in Parquet Columnar Storage Format
Now we will create the table in the Parquet File format using the below command. We will use the existing "emp_detail.json" file to create the table.
For Parquet columnar storage format, we need to change the storage format of Apache Drill using the below command.
Command:
apache drill> alter sessionset `store.format`= 'parquet';
Output:
data:image/s3,"s3://crabby-images/e053e/e053e62cf69e4eb5abd3693f51139cd293f0641c" alt="drill file storage format parquet cloudduggu"
Now let's create the table using the below command.
Command:
apache drill> CREATE TABLE dfs.tmp.`/home/cloudduggu/drill/sample-data/` as SELECT * FROM dfs.`/home/cloudduggu/drill/sample-data/emp_detail.json`;
Output:
data:image/s3,"s3://crabby-images/89dbd/89dbdc21fecf18e6da34aaeb4893f85e89e8d240" alt="drill create parquet table cloudduggu"
The table has been created, lets run the below command to see the content of the table.
Command:
apache drill> SELECT * FROM dfs.tmp.`/home/cloudduggu/drill/sample-data/`;
Output:
data:image/s3,"s3://crabby-images/fcd8d/fcd8da8388212c60839d8487cf2f621cac4bdb74" alt="drill query parquet file cloudduggu"