Skip to content

Extended DOME (data, optimization, model, and evaluation)

License

Notifications You must be signed in to change notification settings

azevedolab/EDOME

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

EDOME

Extended DOME (data, optimization, model, and evaluation)

EDOME computes metrics specified by Walsh et al. 2021 to predict the performance of regression models of biological systems. Our goal is to assess the Euclidean distance (L2 norm) of machine-learning models from an ideal model with the following coordinates: mean absolute error (MAE) = 0.0, coefficient of determination (R2) = 1.0, and Root-mean squared error (RMSE) = 0.0. We also have metrics combining the DOME strategy with the estimation of Spearman's (r) and Pearson's (r) correlations named EDOME (Extended DOME).
The metrics EDOMEr2 and EDOMEr add r2 (squared Pearson correlation) and r (Spearman correlation) to the Euclidean distance equation (DOME), respectively. The last metric is EDOME. We take the following metrics: r2, r, RMSE, MAE, and R2. The goal is to evaluate the distance of a machine-learning model from an ideal model with the following coordinates: r2 = 1.0, r = 1.0, RMSE = 0.0, MAE = 0.0, and R2 = 1.0.

Reference Walsh I, Fishman D, Garcia-Gasulla D, Titma T, Pollastri G; ELIXIR Machine Learning Focus Group; Harrow J, Psomopoulos FE, Tosatto SCE. DOME: recommendations for supervised machine learning validation in biology. Nat Methods. 2021; 18(10): 1122-1127. PubMed

Installing EDOME (Linux)

You should type all commands shown here in a Linux terminal. The easiest way to open a Linux terminal is to use the Ctrl+Alt+T key combination.

Step 1. Download Anaconda Installer for Linux or newer.

Go to the directory where you have the installer file and type the following commands:

    chmod u+x Anaconda3-2021.11-Linux-x86_64.sh
    ./Anaconda3-2021.11-Linux-x86_64.sh

Follow the instructions of the installer. You may use a newer installer, but be sure to have the right installer in the above command lines.

Step 2. To run EDOME properly, you need Scikit-Learn 1.4.0. To be sure you have version 1.4.0, open a terminal and type the following commands:

    python3 -m pip uninstall scikit-learn
    python3 -m pip install scikit-learn==1.4.0

Step 3. Download edome.zip. Copy the EDOME zipped directory (edome.zip) to wherever you want it and unzip the zipped directory.

Type the following command:

    unzip edome.zip

We have a CSV file (bind_Ki_test_set.csv) and three input files (edome.in, edome_plot1.in, and edome_plot2.in) besides the Python scripts in the zipped folder.

cd to edome directory then, type:

python3 edome.py edome.in

EDOME will generate the following plots (bind_Ki_test_set_log(RMSE)_log(MAE).png and bind_Ki_test_set_r2_rho.png) and a csv file (bind_Ki_test_set_stats.csv).

About

Extended DOME (data, optimization, model, and evaluation)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published