Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can not reproduce the result on DrugOOD. #1

Open
panmianzhi opened this issue Jan 27, 2024 · 1 comment
Open

Can not reproduce the result on DrugOOD. #1

panmianzhi opened this issue Jan 27, 2024 · 1 comment

Comments

@panmianzhi
Copy link

panmianzhi commented Jan 27, 2024

I ran your code on DrugOOD with 10 different seeds (0 ~ 9), but the results are significantly lower than those reported in your paper. For example, the ROC-AUC on ec50_assay and ec50_size are:

  • ec50_assay: 76.997 73.57 71.06 72.65 75.16 73.07 71.84 71.92 73.67 76.86
  • ec50_size : 61.71 61.93 60.12 61.95 62.50 61.92 57.81 62.97 61.85 64.20

these results are all below those reported in your paper.
I have used the same hyper-parameters provided in the Appendix. Does that mean your proposed method is sensitive to the random seed?
Thanks!

@toooooodo
Copy link
Collaborator

Hi! Thank you for your interest in our work.

On the DrugOOD dataset, we searched for all hyper-parameters using the grid search method. Here is the demo script:

#!/bin/bash
hyper_list1=(choice1, choice2, ...)
hyper_list2=(choice1, choice2, ...)

# grid search on all hyper-parameters
for p1 in ${hyper_list1[*]}
do
for p2 in ${hyper_list2[*]}
do
python run.py --random_seed 0 --h1 p1 --h2 p2
python run.py --random_seed 1 --h1 p1 --h2 p2
python run.py --random_seed 2 --h1 p1 --h2 p2
done
done

The choices of hyper-parameters are detailed in the appendix.

We repeated the experiments with different random seeds for each group of hyper-parameters, and the results reported in the paper are for the best group. Also, we observed that due to the small size of the drugood dataset, it is prone to larger variance. But because of running many groups results out, the best one will be the one where the results are all a bit better.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants