-
Notifications
You must be signed in to change notification settings - Fork 200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to use MOSES train/test/testSF dataset in Torchdrug #19
Comments
MOSES
train/test/testSF dataset in Torchdrug
Hi! There is a predefined split for MOSES implemented in TorchDrug. I am not sure if this is what you want. You can get it by
Sorry I am not an expert in molecule generation. Maybe @shichence knows more about the dataset and evaluation setting on MOSES? |
Thank you for your quick response! I have three questions as following:
|
|
Thank you for the quick response! I have the following update:
Since the split in
|
Pretrain:
Finetune:
The same procedure can be applied to resume training.
|
TorchDrug
implementsMOSES
dataset, but doesn't distinguish betweentrain
/test
/testSF
which MOSES has. To train GCPN on Moses, I think the correct order is to pretrain the model bytrain
dataset at first, then train it ontest
/testSF
dataset and finally generate the molecules. But how to do this inTorchDrug
? There's only one dataset namedMOSES
.I have this question because when I generate molecules by MOSES, the statistics doesn't look correct if compared to other models on MOSEC, especially the
Scaf/Test
property in the table, which tries to find out if there are same scaffolds in test dataset and generated molecules. It's 0 for GCPN model after training onTorchDrug
, following the tutorial. I think the problem is thatTorchDrug
only uses thetrain
dataset but nottest
dataset. How can I explicitly use it? Thanks in advance!The text was updated successfully, but these errors were encountered: