Empty dataframe output when generating interaction fingerprints #87

shinoxide · 2022-10-23T20:58:53Z

Thank you for your efforts. Prolif is quite easy to install.
Please the issue I have is that I keep getting empty dataframe output when I generate the fingerprints and read into pandas dataframe. My protein and ligands are okay, I confirmed interactions elsewhere. Please kindly help me out. See image please, thank you

cbouy · 2022-10-23T22:50:02Z

Hi @shinoxide ,

Does your protein PDB file have explicit hydrogens on all atoms? It's a requirement for ProLIF to work correctly. If not, you can use webservers like PypKa or PropKa (also available as command line tools) for this matter.

Another possible explanation would be that your protein PDB file has CONECT records for some of the bonds but not all, in which case you might want to load the PDB file with:

prot = mda.Universe("myprot2.pdb", guess_bonds=True)

Tell me if this works.

Best,
Cédric

shinoxide · 2022-10-24T12:18:12Z

Hi Cédric,

This is great! It worked! The "guess_bonds=True" worked for me. See picture. Thank you very much 😊 I'm grateful.

Adeshina

shinoxide · 2022-10-24T17:46:31Z

Hi @shinoxide ,

Does your protein PDB file have explicit hydrogens on all atoms? It's a requirement for ProLIF to work correctly. If not, you can use webservers like PypKa or PropKa (also available as command line tools) for this matter.

Another possible explanation would be that your protein PDB file has CONECT records for some of the bonds but not all, in which case you might want to load the PDB file with:
prot = mda.Universe("myprot2.pdb", guess_bonds=True)
Tell me if this works.

Best, Cédric

Hi Cédric

Thanks again for your response. I have 2 questions please.

Can I very much rely on the accuracy of guess_bonds?
Do you know tools I can use to capture CONECT records for all bonds?
Thank you.

cbouy · 2022-10-24T21:44:29Z

Can I very much rely on the accuracy of guess_bonds?

MDAnalysis uses the same algorithm as VMD and many other tools for guessing bonds based on 3D coordinates and elements. so although it's not going to be completely failure-proof, I'd say it's pretty safe.
Also when you run plf.Molecule.from_mda(prot) it actually converts the full protein into an RDKit molecule, and RDKit will complain if it sees atoms with incorrect valences, so if there's a problem during bond guessing it will very likely result in an error on the RDKit side and you will know immediatly.

Do you know tools I can use to capture CONECT records for all bonds?

I'm not sure I understand the question, do you mean extracting all the lines that start with CONECT in your PDB file?
If so, on a Linux/Mac shell, the command grep 'CONECT ' myprot2.pdb should do the trick.

shinoxide · 2022-10-24T22:06:15Z

Can I very much rely on the accuracy of guess_bonds?

MDAnalysis uses the same algorithm as VMD and many other tools for guessing bonds based on 3D coordinates and elements. so although it's not going to be completely failure-proof, I'd say it's pretty safe. Also when you run plf.Molecule.from_mda(prot) it actually converts the full protein into an RDKit molecule, and RDKit will complain if it sees atoms with incorrect valences, so if there's a problem during bond guessing it will very likely result in an error on the RDKit side and you will know immediatly.

Do you know tools I can use to capture CONECT records for all bonds?

I'm not sure I understand the question, do you mean extracting all the lines that start with CONECT in your PDB file? If so, on a Linux/Mac shell, the command grep 'CONECT ' myprot2.pdb should do the trick.

Thank you very much for your response.
For my second question... You said earlier that it is possible that my protein PDB file doesn't have CONECT records for all bonds and indeed I have only two lines starting with 'CONECT' in the file. So I thought there might be ways to generate complete CONECT records.

cbouy · 2022-10-25T00:50:54Z

MDAnalysis and RDKit can both read and write PDB files and have that functionality, VMD and probably PyMol or Avogadro also have that.

shinoxide · 2022-10-25T08:12:36Z

Thank you very much for your support.

shinoxide closed this as completed Oct 24, 2022

shinoxide reopened this Oct 24, 2022

shinoxide closed this as completed Oct 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Empty dataframe output when generating interaction fingerprints #87

Empty dataframe output when generating interaction fingerprints #87

shinoxide commented Oct 23, 2022

cbouy commented Oct 23, 2022

shinoxide commented Oct 24, 2022

shinoxide commented Oct 24, 2022 •

edited

Loading

cbouy commented Oct 24, 2022

shinoxide commented Oct 24, 2022

cbouy commented Oct 25, 2022

shinoxide commented Oct 25, 2022

Empty dataframe output when generating interaction fingerprints #87

Empty dataframe output when generating interaction fingerprints #87

Comments

shinoxide commented Oct 23, 2022

cbouy commented Oct 23, 2022

shinoxide commented Oct 24, 2022

shinoxide commented Oct 24, 2022 • edited Loading

cbouy commented Oct 24, 2022

shinoxide commented Oct 24, 2022

cbouy commented Oct 25, 2022

shinoxide commented Oct 25, 2022

shinoxide commented Oct 24, 2022 •

edited

Loading