Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for the pretrained model and instructions on getting own proteins' embeddings. #11

Closed
yunxiangz opened this issue Nov 22, 2022 · 2 comments

Comments

@yunxiangz
Copy link

Thanks for the wonderful work!

I am trying the use the learned embeddings for a downstream protein classification problem on my own datasets. Since training the model requires a good HPC, I am wondering:

  1. whether you could kindly upload your pretrained model.
  2. could you explain how to generate the training and testing datasets (the pkl.gz file) from our own PDB files.
  3. based on the generated pkl.gz file in Q1, how to apply the trained model to get the final embedding vectors (512 dimensions) for our own PDB files.
@Oxer11
Copy link
Collaborator

Oxer11 commented Nov 22, 2022

Hi! Thanks for raising these questions!

1.We have a plan for releasing model checkpoint, but it still takes some time for preparation. If it is urgent for you, you can send an email to me (zuobai.zhang@mila.quebec) and I'll personally share a (good but not ready) checkpoint with you.
2&3.To define a customized dataset, you can follow the tutorials in TorchDrug and TorchProtein. The Customize Models & Tasks is about how to define a customized module in TorchDrug and Tutorial 3-Structure-based Protein Property Prediction is about how to define a dataset and run the GearNet model on it.

For your case, you can first define your customized dataset following the code of datasets.EnzymeCommission, which will show you how to generate .pkl.gz file with the API in TorchProtein. Then, you can load the task and model as the script in repo and then get the final embedding by calling task.model() (refer to tasks.MultipleBinaryClassification).

@yunxiangz
Copy link
Author

Thank you Zuobai!
I have sent you a request email~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants