Skip to content

michal-brylinski/efindsite

Repository files navigation

eFindSite

eFindSite is a tool that predicts binding pockets, residues and ligands from a given protein structure by threading methods.

This README file is written by Snigdha Thumma.

If you find this tool useful, please cite these following papers:

  1. Brylinski M, Feinstein WP. (2013) eFindSite: Improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands. J Comput Aided Mol Des 27 (6): 551-67.

  2. Feinstein WP, Brylinski M. (2014) eFindSite: Enhanced fingerprint-based virtual screening against predicted ligand binding sites in protein models. Mol Inf 33 (2): 135-50.

Prerequisites:

  • GCC compiler version 4.8.5+
  • Perl version 5.16.3+

Example:

  1. First you have to download eFindSite from here https://github.com/michal-brylinski/efindsite and unzip:

    • [local]$ unzip efindsite-master.zip
    • [local]$ cd efindsite-master/
    • [efindsite-master]$ make
    • [efindsite-master]$ ls bin/
    • [bin]$ ./efindsite
  2. Then install the eFindSite template libraries from here https://osf.io/mp343/( download the latest available library ) and unpack the tarball:

    • [local]$ mkdir libraries
    • [local]$ tar xzf efindsite-lib-2018-04.tar.gz -C /usr/local/libraries/
    • [local]$ tar xzf efindsite-mod-2016-06.tar.gz -C /usr/local/libraries/
  3. Before running eFindSite, you need to set several environmental variables.

    • eFindSite
      • [home]$ export EF_LIB=/usr/local/libraries/efindsite-lib-2018-04
      • [home]$ export EF_MOD=/usr/local/libraries/efindsite-mod-2016-06
  4. After the environmental variables are set, it’s time to run the eFindSite and efindsite_screen

    • [library]$ /usr/local/efindsite-master/bin/efindsite -s 13gsA.pdb -t 13gsA-efindsite.lst -e 13gsA.profile -o 13gsA-efindsite_test
    • [library]$ /usr/local/efindsite-master/bin/efindsite_screen -p 13gsA-efindsite.pockets.dat -s escreen-keggcomp-mar2012.gz -o 13gsA-escreen-keggcomptest
    • [library]$ /usr/local/efindsite-master/bin/efindsite_screen -p 13gsA-efindsite.pockets.dat -s escreen-zinc12_nr-mar2012.gz -o 13gsA-escreen-zinc12_nrtest

Mandatory arguments for eFindSite

Parameter Optional Name Description
-s N input_file Is the target protein structure in PDB format; you can use either experimental structure or protein model
-t N template-fun Is a text file that contains information on protein templated identified for your target e.g. by eThread
-e N seq.profile Is a sequence profile of the target; it can be generated by PROFILPRO
-o N output_name Is used to save five output files with different extensions that store dinging site prediction results

Mandatory arguments for efindsite_scr

Parameter Optional Name Description
-p N pocket.dat pocket.dat is a file containing the predicted binding pockets by eFindSite
-s N cmp_lib cmp_lib is a compound library used for virtual screening
-o N output_file output_file will contain ranked library compounds assigned with a Tanimoto score and Z-score

  • If eFindSite runs successfully, it should yield 5 output files with user defined input name. ** For example: the user defined input file, 13gsA-efindsite_test, has 5 outputs as seen below*

    • 13gsA-efindsite_test.pockets.dat (detailed info on predicted pockets)
    • 13gsA-efindsite_test.pockets.pdb (predicted pockets in PDB format)
    • 13gsA-efindsite_test.alignments.dat (structure alignments of templates into the target in FASTA format)
    • 13gsA-efindsite_test.templates.pdb (template structures aligned onto the target in PDB format)
    • 13gsA-efindsite_test.ligands.sdf (extracted binding ligands in SDF format)
  • Open the output files and check to make sure the files contain data.

  • If the output files are empty, go back and check the script.

About

Ligand binding site prediction and virtual screening

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages