An implementation of the algorithms presented in the paper "Cardinality Estimation Done Right: Index-Based Join Sampling"
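For orientation, the general idea behind index-based join sampling can be sketched as follows. This is a minimal illustration, not this repository's code. It assumes rows are plain dictionaries with distinct column names and uses an in-memory hash index; the names `build_index` and `sample_join_step` are hypothetical. One step takes a sample of an intermediate result, counts join partners through index lookups, scales that count up to a cardinality estimate, and keeps a bounded random sample of the joined tuples for the next step.

```python
import random
from collections import defaultdict


def build_index(rows, key):
    """Hash index: join-key value -> list of rows holding that value."""
    index = defaultdict(list)
    for row in rows:
        index[row[key]].append(row)
    return index


def sample_join_step(sample, input_card, index, left_key, max_sample, rng):
    """Extend a sample of an intermediate result by one indexed join.

    sample      -- list of sampled tuples (dicts) of the intermediate result
    input_card  -- (estimated) cardinality of that intermediate result
    index       -- index on the join column of the table being joined
    Returns (new_sample, estimated cardinality of the joined result).
    """
    if not sample:
        return [], 0.0
    # Look up the join partners of every sampled tuple in the index.
    matches = [index.get(t[left_key], []) for t in sample]
    total = sum(len(m) for m in matches)
    # Scale the partner count of the sample up to the full input.
    estimate = input_card * total / len(sample)
    # Keep at most max_sample joined tuples, chosen uniformly at random.
    chosen = set(rng.sample(range(total), min(max_sample, total)))
    new_sample, pos = [], 0
    for t, partners in zip(sample, matches):
        for r in partners:
            if pos in chosen:
                new_sample.append({**t, **r})
            pos += 1
    return new_sample, estimate
```

Chaining calls to `sample_join_step`, each time passing the returned sample and estimate as the next input, gives estimates for multi-way joins. When the sample happens to contain the whole input, the estimate equals the exact join size.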
Steps:
-
Download the dataset from http://homepages.cwi.nl/~boncz/job/imdb.tgz and extract all CSV files to data/csv. For more information, refer to join-order-benchmark.
-
Install dependencies
# Update job repository
git submodule init
git submodule update
# Install Python dependencies
pip install -r requirements.txt
-
Generate csv_schema.txt and all-queries-filtered.sql
python3 extract.py
-
Generate pkl data for every relation
python3 DataLoader.py
-
Run main and test
python3 main.py
python3 test.py
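The data preparation steps above (extracting the CSV files and writing one pkl file per relation) can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names `extract_csv_files` and `pickle_relations` are hypothetical, each relation is stored as a plain list of string rows, and the CSV files are read with the default dialect of the `csv` module. The actual format written by DataLoader.py may differ.

```python
import csv
import pickle
import shutil
import tarfile
from pathlib import Path


def extract_csv_files(archive_path, csv_dir):
    """Copy every *.csv member of a .tgz archive into csv_dir (flattened)."""
    csv_dir = Path(csv_dir)
    csv_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            if member.isfile() and member.name.endswith(".csv"):
                # Keep only the base name so members cannot escape csv_dir.
                target = csv_dir / Path(member.name).name
                with tar.extractfile(member) as src, open(target, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                extracted.append(target)
    return sorted(extracted)


def pickle_relations(csv_dir, pkl_dir):
    """Write one <relation>.pkl per <relation>.csv, holding a list of rows."""
    pkl_dir = Path(pkl_dir)
    pkl_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for csv_path in sorted(Path(csv_dir).glob("*.csv")):
        # Assumption: UTF-8 text, no header row, default csv dialect.
        with open(csv_path, newline="", encoding="utf-8") as f:
            rows = list(csv.reader(f))
        pkl_path = pkl_dir / (csv_path.stem + ".pkl")
        with open(pkl_path, "wb") as f:
            pickle.dump(rows, f)
        written.append(pkl_path)
    return written
```

With the archive saved locally as imdb.tgz (an assumed file name), calling `extract_csv_files("imdb.tgz", "data/csv")` followed by `pickle_relations("data/csv", "data/pkl")` would mirror the download and pkl generation steps. The output directory data/pkl is likewise an assumption, not a path taken from this repository.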