Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

example with a chemical dataset #1

Closed
thegodone opened this issue Nov 4, 2019 · 5 comments
Closed

example with a chemical dataset #1

thegodone opened this issue Nov 4, 2019 · 5 comments

Comments

@thegodone
Copy link

thegodone commented Nov 4, 2019

In order to reproduce the work on Chemicals:

Can you share a example of data preparation ?

  • with dummy 3D coordinate or rdkit 3D coordinate
  • for one molecule

how look like your data preparation dataloader.

BR
Guillaume

@mokosaur
Copy link
Collaborator

@thegodone ,
thank you for your interest. We are going to upload a chemical example soon, probably by the end of this month.

@thegodone thegodone reopened this Dec 9, 2019
@thegodone
Copy link
Author

@mokosaur any update there ?

@mokosaur
Copy link
Collaborator

mokosaur commented Mar 3, 2020

In the last update, I uploaded loaders for the chemical datasets. See the project README for a usage example. Thank you for your patience as it took a while to gather the code.

In the meantime, our efforts were focused on finishing our other work on another molecular prediction model which uses the Transformer architecture to extract information from molecular graphs. You can read more here: https://arxiv.org/abs/2002.08264.

@thegodone
Copy link
Author

thegodone commented Mar 20, 2020

In your last preprint paper, Do you provide RMSE values at scaled range or at orignal data range ?

@mokosaur
Copy link
Collaborator

The RMSE is calculated for the standardized values. The datasets that we uploaded contain values that are already standardized. I think you can recover the RMSE of non-standard data by simply multiplying the RMSE by the standard deviation of the original data. However, these numbers still vary across different random data splits, and thus, unfortunately, they cannot be compared to other results in the literature.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants