Skip to content

kaichen127/PtmPredictionModel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧬 Post-Translational Modification (PTM) Site Prediction Model

This project uses Facebook’s ESM2 protein language model as a backbone to predict potential post-translational modifications (PTMs) in protein sequences.

🚀 Features

  • ⚙️ Built on top of Facebook’s ESM2 transformer model.
  • 🧩 Supports multi-PTM classification through an optional embedding table and Transformer decoder head.
  • 🛠 Adjustable configurations via configs/config.yaml including:
    • ESM model size
    • Multi-PTM embedding
    • Custom decoder layers
    • Training hyperparameters
  • 📁 Includes a sample CSV dataset from the UniProt Consortium for quick testing and exploration.

📂 Dataset

A small example CSV dataset is provided from the UniProt database, including:

  • Protein sequences
  • Site labels

📝 Usage

Modify configs.yaml, dataset.py, and model.py as needed and run train.py to begin training. Included in training are multiple toggleable flags that allow for loading checkpoints, visualization, and evaluating on the test set. To toggle the model's multi-PTM capabilities, change the value in configs.yaml -> model -> use_decoder_block.

📚 Reference

About

PTM Site Classifier using ESM2 backbone

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages