Retrieve chemical classes (ClassyFire/NPClassifier), as well as BGC classifications for MIBiG BGCs and match them.

louwenjjr/mibig_classifications

mibig_classifications

A series of Jupyter notebooks and scripts to retrieve chemical classes (ClassyFire/NPClassifier) and BGC classifications for MIBiG BGCs and their products. The two types of classification are then connected to each other by counting how often each chemical class occurs together with each BGC class, ideally matching certain chemical classes to certain BGC classes. The relative counts of each matched pair of class terms are used to assess how well the two terms match each other. These matched classes can then be used to rule out unreliable BGC-compound matches, as is done in the NPClassScore method implemented in the NPLinker framework. Read about it here!

Class_matching_tables_fig.png
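The counting and relative scoring described above can be sketched in a few lines of Python. This is a minimal illustration, not the code from the notebooks: the function names `count_class_pairs` and `relative_scores`, the toy class labels, and the choice to normalise each pair count by the totals of its BGC class and of its chemical class are all assumptions made for the example.

```python
from collections import Counter

def count_class_pairs(links):
    """Count how often each (BGC class, chemical class) pair occurs together.

    `links` is an iterable of (bgc_classes, chem_classes) tuples, one per
    BGC-product pair. This input layout is hypothetical.
    """
    pair_counts = Counter()
    for bgc_classes, chem_classes in links:
        # Use sets so a class repeated within one link is counted once.
        for bgc_class in set(bgc_classes):
            for chem_class in set(chem_classes):
                pair_counts[(bgc_class, chem_class)] += 1
    return pair_counts

def relative_scores(pair_counts):
    """Turn raw pair counts into two relative scores per pair.

    Returns {(bgc_class, chem_class): (fraction_of_bgc, fraction_of_chem)},
    where each fraction is the pair count divided by the summed pair counts
    of that BGC class or chemical class (an assumed normalisation).
    """
    bgc_totals = Counter()
    chem_totals = Counter()
    for (bgc_class, chem_class), n in pair_counts.items():
        bgc_totals[bgc_class] += n
        chem_totals[chem_class] += n
    return {
        (bgc_class, chem_class): (n / bgc_totals[bgc_class], n / chem_totals[chem_class])
        for (bgc_class, chem_class), n in pair_counts.items()
    }

# Toy data, invented for illustration only.
links = [
    (["NRP"], ["Peptides"]),
    (["NRP"], ["Peptides"]),
    (["NRP", "Polyketide"], ["Polyketides"]),
    (["Polyketide"], ["Polyketides"]),
]
pair_counts = count_class_pairs(links)
scores = relative_scores(pair_counts)
```

In this toy example the pair (NRP, Peptides) accounts for all Peptides counts and two thirds of the NRP counts, so it would be considered a strong match, whereas (NRP, Polyketides) scores one third in both directions.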

Environment

To run the code in this repo, set up a conda environment as follows (activate the environment before installing into it, so the packages do not end up in the base environment):

conda create -n myenv python=3.7.2 rdkit
conda activate myenv
conda install -c plotly plotly=4.14.3
conda install -c plotly plotly-orca==1.3.1 psutil
pip install jupyter

To run the analysis in this repo, you will need to download a version of MIBiG (JSON format), like here.
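As a rough sketch of how the downloaded JSON files might be read, the snippet below collects the BGC classes and product structures per entry. The helper names `parse_mibig_entry` and `load_mibig_dir` are hypothetical, and the field names (`cluster`, `mibig_accession`, `biosyn_class`, `compounds`, `chem_struct`) are assumptions about the MIBiG JSON layout that may differ between MIBiG versions, so check them against the files you downloaded.

```python
import json
from pathlib import Path

def parse_mibig_entry(entry):
    """Extract accession, BGC classes and product SMILES from one entry.

    The keys used here are assumed; missing keys yield empty values
    instead of raising an error.
    """
    cluster = entry.get("cluster", {})
    compounds = cluster.get("compounds", [])
    return {
        "mibig_accession": cluster.get("mibig_accession"),
        "biosyn_class": cluster.get("biosyn_class", []),
        # Keep only compounds that actually carry a structure string.
        "smiles": [c["chem_struct"] for c in compounds if c.get("chem_struct")],
    }

def load_mibig_dir(json_dir):
    """Parse every *.json file in `json_dir` (one MIBiG entry per file)."""
    records = []
    for path in sorted(Path(json_dir).glob("*.json")):
        with open(path, encoding="utf-8") as handle:
            records.append(parse_mibig_entry(json.load(handle)))
    return records

# Invented entry with placeholder values, for illustration only.
example_entry = {
    "cluster": {
        "mibig_accession": "BGC0000000",
        "biosyn_class": ["Polyketide"],
        "compounds": [
            {"compound": "example compound", "chem_struct": "CCO"},
            {"compound": "compound without structure"},
        ],
    }
}
record = parse_mibig_entry(example_entry)
```

Calling `load_mibig_dir` on the unpacked MIBiG directory would then give one record per BGC, which can be passed on to the chemical classification steps.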

Citation

Louwen, J. J. R., Medema, M. H., & van der Hooft, J. J. J. (2022). Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.

Acknowledgements

We would like to thank Oscar Hoekstra for his initial efforts (https://github.com/OscarHoekstra/ClassifyNPDB).
