Data for benchm-ml, gbm-perf etc. (samples from the airline dataset) TODO: Add 10M large file (using Git LFS)