GitHub

Each call of run_test_automl.py will run the next algorithm in the experiments list that does not yet have results. Each script calls benchmark.evaluate on only one algorithm one time and then exits. Script can also be executed multiple times in parallel.

The two .sh files includes the settings for the Slurm scheduler as well as flags passed into run_test_automl.py .

DIGEN_Generate_test_set_and_Measure_Noise.ipynb generates the new test set and measures the amount of noise in each dataset.

retest_on_larger_testset.py and retest_on_larger_testset_autosklearn.py load the saved models and retests them on the new test set.

Generate_Plots_retested.ipynb makes the plots in the paper using the new test set.

Generate_Plots.ipynb makes the plots using the original test set.

Environments

Using the latest master branch on the TPOT repository https://github.com/epistasislab/tpot

Using a modified DIGEN fork (found in this repository), to add in a termination signal.

Environments were set up as follows:

conda create --name tpot_digen_env_final -c h2oai -c plotly -c conda-forge xgboost dask dask-ml scikit-mdr skrebate dill jupyter seaborn optuna pandas scipy seaborn matplotlib plotly h2o=3.38.0.1
conda activate tpot_digen_env_final
pip install -e tpot
pip install -e digen
pip install -U kaleido

conda create --name autosklearn_digen_env_final python
pip install auto-sklearn=0.14.5
pip install -e digen
conda install -c conda-forge dill jupyter plotly pynisher=0.6.4
conda activate autosklearn_digen_env_final

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
digen		digen
DIGEN_Generate_test_set_and_Measure_Noise.ipynb		DIGEN_Generate_test_set_and_Measure_Noise.ipynb
Generate_Plots.ipynb		Generate_Plots.ipynb
Generate_Plots_retested.ipynb		Generate_Plots_retested.ipynb
all_new_test_sets.pkl		all_new_test_sets.pkl
all_new_test_sets_no_noise.pkl		all_new_test_sets_no_noise.pkl
digen_noise.csv		digen_noise.csv
readme.md		readme.md
retest_on_larger_testset.py		retest_on_larger_testset.py
retest_on_larger_testset_autosklearn.py		retest_on_larger_testset_autosklearn.py
retest_results.csv		retest_results.csv
run_tests_automl.py		run_tests_automl.py
slurm_retests_automl.sh		slurm_retests_automl.sh
slurm_retests_automl_autosklearn.sh		slurm_retests_automl_autosklearn.sh
slurm_run_tests_automl.sh		slurm_run_tests_automl.sh
slurm_run_tests_automl_autosklearn.sh		slurm_run_tests_automl_autosklearn.sh
tpot_retest_results.csv		tpot_retest_results.csv

perib/automl_digen_benchmark

Folders and files

Latest commit

History

Repository files navigation

Environments

About

Resources

Stars

Watchers

Forks

Languages