-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About the number of interactions and the substructure dimension #2
Comments
Hi. |
If we want to reproduce the results in your paper or develop our own method, should the dataset keep 37264 in this repo or aug to 74528? But the latter may cause a different split and test set ... |
I think you should keep 37264, because it may be easier for the model to predict (drugB,drugA) if (drugA,drugB) already exists in the training set. This will lead to false high accuracy. |
OK, understand. Thanks for your answer~ |
Hi, thanks for your kind reply. I have two questions:
The number of interactions in your paper is 74528 while the data in the code is 37264. I guess the "74528" actually contains the "drugA-drugB" and the "drugB-drugA"?Can we understand that the "drugA-drugB" and the "drugB-drugA" are actually the same event with same label?
The substructure dimension when I run the code is 583 instead of the 881 in your paper. Is something wrong?
The text was updated successfully, but these errors were encountered: