
Question regarding pretraining #6

Closed
paulrbuckley opened this issue Apr 12, 2022 · 1 comment

@paulrbuckley

Hi,

Thanks for your help with the python version issue.

I have a question about the workflow for training models from scratch. From what I gather, the flexible_training.py script trains a TITAN model. Does this script pretrain the model on BindingDB? I assume that one would then finetune this model on TCR sequence and epitope data of choice, e.g. using the semi_frozen_finetuning.py script?

Best,

Paul

@jannisborn (Member)

Hi @paulrbuckley,

Indeed, flexible_training.py trains a TITAN model from scratch. Whether pretraining happens depends on how you use the script. You can pass the TCR-epitope binding data to it directly, in which case you omit the pretraining. Alternatively, you can use the script to pretrain a TITAN model on BindingDB and afterwards use semi_frozen_finetuning.py to finetune the model on TCR-epitope binding data. As per the results in our paper, the latter should perform best. Just keep in mind that we did not release the preprocessed BindingDB data for this paper. If you want to redo this step, I recommend looking at our related paper in the Journal of Chemical Information & Modeling and its codebase: https://github.com/PaccMann/paccmann_kinase_binding_residues
If you follow the Box link there, you can access processed BindingDB data that should be quite straightforward to feed to the flexible training script.
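
For concreteness, here is a rough sketch of that two-stage workflow as shell commands. The positional arguments, file names, and the `bimodal_mca` model type below are assumptions on my part, not the confirmed CLI; check this repository's README and the argparse setup of each script for the exact interface.

```sh
# Stage 1: pretrain on processed BindingDB (file names are placeholders).
# Assumed argument order: train data, test data, receptor file,
# ligand/epitope file, output dir, params file, run name, model type.
python scripts/flexible_training.py \
    bindingdb_train.csv bindingdb_test.csv \
    proteins.csv ligands.smi \
    trained_models/ params/params.json pretrained_bindingdb bimodal_mca

# Stage 2: finetune the pretrained model on TCR-epitope binding data.
# Again, the arguments (e.g. the path to the pretrained model) are
# illustrative only.
python scripts/semi_frozen_finetuning.py \
    tcr_train.csv tcr_test.csv \
    tcrs.csv epitopes.smi \
    trained_models/pretrained_bindingdb \
    trained_models/ params/finetune_params.json tcr_finetuned bimodal_mca
```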

Depending on your needs, it might be easier to start from the pretrained model that we provide rather than redoing the pretraining on BindingDB. Also, keep the learning rate (LR) low during finetuning, otherwise you might induce catastrophic interference and overfit on your TCR data. Hope this helps!
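
On the low learning rate: as a minimal sketch, you could lower the LR in the finetuning parameter file before launching the script. The file path, the "lr" key, and the value 1e-5 are assumptions here; inspect the params JSON files shipped with the repo for the real schema and sensible values.

```python
import json

# Hypothetical tweak: lower the learning rate in the finetuning params
# file to reduce the risk of catastrophic interference. The "lr" key
# and the value 1e-5 are assumptions, not confirmed repo defaults.
with open("params/finetune_params.json") as f:
    params = json.load(f)

params["lr"] = 1e-5  # e.g. roughly 10x lower than a from-scratch LR

with open("params/finetune_params.json", "w") as f:
    json.dump(params, f, indent=2)
```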
