dlab-public

Code to run the antibody virtual screening pipeline described in the paper "DLAB - Deep learning methods for structure-based virtual screening of antibodies" (bioRxiv)[1]

Install

Install libmolgrid [2] by building from source according to the instructions on the authors github repository. Use the source code provided in this repository, as a small change was made to accomodate the centering method employed for DLAB.
Get ZDock and copy the source code files into the folder external/zdock-3.0.2-src.
Install the python requirements for running data_prep_pipeline.py (which can be run entirely on CPU and is therefore kept seperate from dlab_re_vs_pipeline.py, which requires a GPU):
```
 pip install -r requirements_cpu.txt  
```
Install the python requirements for running the dlab_re_vs_pipeline:
```
 pip install -r requirements_gpu.txt
```
Install openbabel, including the python bindings.

Test your install

The folder tests/ contains scripts you can use to check if your folder structure is set up correctly. Run them with pytest:

pytest tests/

Data preparation

Prepare your input antibody and antigen structures by running the mark_sur script as per the ZDock README. If you want to limit docking to the (predicted) interaction site, block atoms outside the interaction site by changing column 55-56 after running mark_sur to 19 as per the ZDock README. There is more detail on this in the paper.
Generate a .csv file containing all antibody and antigen pairings you want to investigate. An example is shown in example_input_files/pairings.csv.
Use the data_prep_pipeline.py script to run ZDock and generate types files. This uses a python implementation of the atomtyper functionality in libmolgrid [2]:
```
python data_prep_pipeline.py -c data_prep_config.yaml
```
data_prep_config.yaml configures the pipeline (see example_input_files/data_prep_config.yaml).

Running DLAB

For this script, you will need to be on a machine with GPU and CUDA, which is why this is seperate from the data preperation script (docking and preperation can be run on cpu compute servers before running GPU computations). Run DLAB (both rescoring and virtual screening in one go) using this command:

python dlab_re_vs_pipeline.py -c dlab_config.yaml

Where the yaml file defines the locations and types of models used as well as the input data and output file. An example can be found in example_input_files/dlab_config.yaml.

Analysing the output

The pipeline generates a csv file with the columns

name
dlab-re-max
dlab-vs
zdock_score

To follow the approach used in the paper ("DLAB-VS+ZDock"), a final score can be calculated by minmax scaling both DLAB-VS and ZDock score for each target antigen and averaging the two scores. For the approach denoted "DLAB-Re-max thresholding" in the paper, discard all antibody-antigen pairings not falling in the top 20% of DLAB-Re-max scores.

References

[1] DLAB - Deep learning methods for structure-based virtual screening of antibodies. C Schneider, A Buchanan, B Taddese, CM Deane, bioRxiv, 2021

[2] libmolgrid: Graphics Processing Unit Accelerated Molecular Gridding for Deep Learning Applications. J Sunseri, DR Koes. Journal of Chemical Information and Modeling, 2020

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
example_input_files		example_input_files
external/psa_execs		external/psa_execs
libmolgrid		libmolgrid
models		models
pretrained_models		pretrained_models
tests		tests
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_prep_pipeline.py		data_prep_pipeline.py
dlab_re_vs_pipeline.py		dlab_re_vs_pipeline.py
environment_cpu.yml		environment_cpu.yml
environment_gpu.yml		environment_gpu.yml
requirements_cpu.txt		requirements_cpu.txt
requirements_gpu.txt		requirements_gpu.txt

License

oxpig/dlab-public

Folders and files

Latest commit

History

Repository files navigation

dlab-public

Install

Test your install

Data preparation

Running DLAB

Analysing the output

References

About

Resources

License

Stars

Watchers

Forks

Languages