-
Notifications
You must be signed in to change notification settings - Fork 646
/
build-docker-image.sh
executable file
·39 lines (33 loc) · 2.02 KB
/
build-docker-image.sh
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
#!/bin/bash -e
# Licensed to Modin Development Team under one or more contributor license agreements.
# See the NOTICE file distributed with this work for additional information regarding
# copyright ownership. The Modin Development Team licenses this file to you under the
# Apache License, Version 2.0 (the "License"); you may not use this file except in
# compliance with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software distributed under
# the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF
# ANY KIND, either express or implied. See the License for the specific language
# governing permissions and limitations under the License.
cd "`dirname \"$0\"`"
docker build -t modin-omnisci .
echo -e '\nNYC TAXI BENCHMARK
User is responsible for preparing the dataset.
It Can be generated by following the instructions on the link:
https://github.com/toddwschneider/nyc-taxi-data#instructions
To run the benchmark execute:
\tdocker run --rm -v /path/to/dataset:/dataset modin-omnisci python nyc-taxi-omnisci.py <name of file starting with /dataset>
CENSUS BENCHMARK
User is responsible for preparing the dataset.
It can be downloaded from the following link:
https://rapidsai-data.s3.us-east-2.amazonaws.com/datasets/ipums_education2income_1970-2010.csv.gz
To run the benchmark execute:
\tdocker run --rm -v /path/to/dataset:/dataset modin-omnisci python census-omnisci.py <name of file starting with /dataset>
PLASTICC BENCHMARK
User is responsible for preparing the datasets.
The datasets must include four files: training set, test set,
training set metadata and test set metadata.
To run the benchmark execute:
\tdocker run --rm -v /path/to/dataset:/dataset modin-omnisci python plasticc-omnisci.py <training set file name starting with /dataset> <test set file name starting with /dataset> <training set metadata file name starting with /dataset> <test set metadata file name starting with /dataset>\n'