AWS
S3
Glue
Presto
Spark
SparkSQL
Alluxio
Parquet
TPC-H
A comparative analysis of Distributed SQL Engines: SparkSQL and Presto
- Dataset: TPC-H
- Automate the cluster provisioning
- Automate experiments and stats collection
  - Config-driven framework
- Run federated queries
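
One way to picture the config-driven experiment and stats collection is the minimal Python sketch below. It is an illustration under stated assumptions, not the project's actual framework: the config keys, the `run_experiments` function, and the executor callables are all hypothetical, and the executors are stand-ins for whatever client would submit SQL to SparkSQL or Presto.

```python
import json
import statistics
import time
from typing import Callable, Dict, List

# Hypothetical config: keys, engine names, query IDs and the repetition
# count are illustrative assumptions, not taken from the original project.
CONFIG_JSON = """
{
  "dataset": "TPC-H",
  "engines": ["sparksql", "presto"],
  "queries": {"q_count": "SELECT count(*) FROM lineitem"},
  "repetitions": 3
}
"""


def run_experiments(
    config: dict,
    executors: Dict[str, Callable[[str], object]],
    clock: Callable[[], float] = time.perf_counter,
) -> List[dict]:
    """Run every configured query on every configured engine and
    return one row of wall-clock timing stats per (engine, query)."""
    results = []
    for engine in config["engines"]:
        execute = executors[engine]  # callable that submits SQL to the engine
        for query_id, sql in config["queries"].items():
            timings = []
            for _ in range(config["repetitions"]):
                start = clock()
                execute(sql)
                timings.append(clock() - start)
            results.append({
                "engine": engine,
                "query": query_id,
                "runs": len(timings),
                "mean_s": statistics.mean(timings),
                "min_s": min(timings),
                "max_s": max(timings),
            })
    return results


if __name__ == "__main__":
    config = json.loads(CONFIG_JSON)
    # Stand-in executors that do nothing; a real run would plug in
    # engine-specific clients here.
    executors = {name: (lambda sql: None) for name in config["engines"]}
    for row in run_experiments(config, executors):
        print(row)
```

Keeping the engines, queries, and repetition count in the config means a new experiment only needs a new config file, while the runner and the stats format stay unchanged.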