Skip to content

Data and code to reproduce the results of Dens, C. et al. The pitfalls of negative data bias for the T-cell epitope specificity challenge. Nat Mach Intell (2023). https://doi.org/10.1038/s42256-023-00727-0 The article is available for free at https://rdcu.be/dnO0c

License

Notifications You must be signed in to change notification settings

PigeonMark/PanPep-Shuffled-Negatives

 
 

Repository files navigation

PanPep tested on Shuffled Negatives

This repository contains the data and code to test PanPep (Gao et al., 2023) on data with negatives generated by shuffling. If you want to use or test PanPep on your own data, we recommend using the original repository (https://github.com/bm2-lab/PanPep).

This repository contains additions on the original PanPep software. These are also licensed under the GPL-3.0 license. An overview of all changes can be found in commit 7ecbc99.

Installation

First install PanPep. We suggest installing PyTorch by following the instruction on their official website (https://pytorch.org/get-started/locally/). Our tests were run with PyTorch 2.0.0.

On top off the requirements in the PanPep README, we also had to install these additional python packages to run PanPep:

  • joblib==1.1.1
  • matplotlib==3.5.1
  • scikit-learn==1.2.2

Finally, for plotting our ROC-AUC curves, installing this package is required:

  • seaborn==0.12.2

Usage

Although the output files of all our tests are included, they can be reproduced by running these scripts from the project root directory.

For the first test (cross-validation with shuffled negatives) run

bash predict_cross-validation_shuffled-negatives.sh

and for the second test (zero-shot with shuffled negatives) run

bash predict_zeroshot_shuffled-negatives.sh

To print all performance results and to create the ROC curves run

python shuffled-negatives_roc-auc.py

To print the data overlap statistics run

python check_data_overlap.py

About

Data and code to reproduce the results of Dens, C. et al. The pitfalls of negative data bias for the T-cell epitope specificity challenge. Nat Mach Intell (2023). https://doi.org/10.1038/s42256-023-00727-0 The article is available for free at https://rdcu.be/dnO0c

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.2%
  • Shell 0.8%