Skip to content

Latest commit

 

History

History

Advanced Test Suites

Datasets for some of the Advanced Test Suites are not shipped with the repository. You can get them as follows:

TPC-H SF 100

For the TPC-H SF100 Parquet tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/tpch100/parquet

TPC-DS SF 100

For the TPC-DS SF100 tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/tpcds_sf100/parquet

Mondrian

For the Mondrian tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/mondrian

Metadata Caching

Download the required data set from https://s3.amazonaws.com/apache-drill/files/tpch100_dir_partitioned_50000files-lineitem.tgz Extract this compresses file and copy over files to "/drill/testdata/tpch100_dir_partitioned_50000files/lineitem"

Data-shapes widestring

For the data-shapes widestring 100000rows parquet tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/data-shapes/wide-columns/5000/100000rows/parquet