Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to extract the list of interacting Amino Acid Residues between the protein and the ligand through PDBQT #80

Closed
eschoo opened this issue Aug 12, 2022 · 3 comments
Labels
how-to-use How to correctly use this tool

Comments

@eschoo
Copy link

eschoo commented Aug 12, 2022

I have a Collection (in .tar) that is composed of hundreds of PDBQT files of the resulting docking poses between a protein and a ligand through an HPC Cluster. How would I go about extracting a list of all of the interacting amino acid residues for each of the docked PDBQT files (around 1 million docked files)?

@cbouy
Copy link
Member

cbouy commented Aug 12, 2022

Hi @eschoo,

Have you looked at this part of the documentation?
The only catch is that you'll require either a MOL2 file or SMILES to map the bond orders and formal charges to your PDBQT poses for each ligand.

Best, Cédric

@cbouy cbouy added the how-to-use How to correctly use this tool label Aug 12, 2022
@eschoo
Copy link
Author

eschoo commented Aug 14, 2022

Hi @cbouy ,

Thank you for the help! My issue is how I would go about this for a large collection of PDBQT files with docked compounds. Would ProLIF have a way of looking into each .tar collection, which is composed of thousands of PDBQT files, and obtain the amino acid residues for each of the PDBQT files and document them? Thank you!

@cbouy
Copy link
Member

cbouy commented Oct 1, 2022

Sorry for the very late response, I'm quite busy these days!

ProLIF doesn't have a way to directly read files from a tar collection, but the Python standard library has a tarfile module that might have helped. Unfortunately, MDAnalysis (the underlying library used by the prolif.pdbqt_supplier class) does not support multi-model PDBQT files, you have to split them so that they only contain a single docking pose per file. If you've used Vina for docking, there's a vina_split command that you could use to split poses into single files, but it would defeat the purpose of reading directly from a tar file :/

@cbouy cbouy closed this as completed Apr 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
how-to-use How to correctly use this tool
Projects
None yet
Development

No branches or pull requests

2 participants