DeepSub

Introduction

DeepSub is a tool that predicts the number of subunits in a homo-oligomeric protein from its sequence.

Installation

$ git clone https://github.com/tibbdc/DeepSub.git

$ cd DeepSub

$ conda create -n deepsub python=3.9

$ conda activate deepsub

$ pip install -r requirements.txt

Notebooks

01_GetData.ipynb
- Obtains and processes the datasets.
02_SeqIdentity.ipynb
- Sequence identity comparison results.
03_DeepSub.ipynb
- DeepSub model and cross-validation results.
04_Queen.ipynb
- Queen model for model comparison.
05_OpenSet.ipynb
- Evaluation on the OpenSet dataset.
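
For readers unfamiliar with the sequence identity comparison mentioned for 02_SeqIdentity.ipynb, the following is a minimal sketch of one common definition: align two sequences globally, then divide the number of identical aligned residues by the alignment length. It is an illustration only and is not taken from the notebook. The function names, the scoring values (match 1, mismatch -1, gap -1), and the choice of denominator are all assumptions; the notebook may use a dedicated alignment tool and a different identity definition.

```python
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Needleman-Wunsch global alignment; returns the two gapped strings.

    Scoring values are illustrative assumptions, not DeepSub settings.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def sequence_identity(a: str, b: str) -> float:
    """Fraction of alignment columns holding the same residue in both sequences."""
    aligned_a, aligned_b = global_align(a, b)
    if not aligned_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return matches / len(aligned_a)
```

For example, `sequence_identity("MKTA", "MKA")` aligns `MKTA` with `MK-A` and returns 0.75. The pure-Python alignment runs in time proportional to the product of the two sequence lengths, so it is only practical for small comparisons.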

Scripts

featurizer.py
- Extracts sequence features before model training.
trainer.py
- Single training function.
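
To make the idea of extracting sequence features concrete, here is a minimal sketch that turns a protein sequence into a fixed-length numeric vector of amino acid frequencies. It is a hypothetical stand-in and does not reproduce featurizer.py, which may compute entirely different features. The name `composition_features` and the 20-letter alphabet ordering are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids, in an arbitrary but fixed order (an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_features(sequence: str) -> list[float]:
    """Return the relative frequency of each standard amino acid.

    Hypothetical illustration only; not the features used by DeepSub.
    Characters outside the 20-letter alphabet are ignored.
    """
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        # Empty sequence or no recognised residues: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]
```

Every sequence maps to a vector of the same length regardless of how long the protein is, which is the property a model input needs.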

Notice

A trained model is provided at DeepSub/model/deepsub.h5. To start making predictions, run the test.ipynb notebook. To retrain the model on your own dataset, refer to the instructions in the "Usage" section and adjust accordingly.
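
Before running predictions, it can help to confirm that the model file was actually downloaded with the clone. The sketch below checks that a file exists and begins with the standard HDF5 signature, which is what an .h5 file is expected to start with. The helper name `looks_like_hdf5` is hypothetical, and the relative path `model/deepsub.h5` assumes the script is run from the repository root.

```python
from pathlib import Path

# Standard 8-byte HDF5 file signature, found at offset 0 in the common case.
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def looks_like_hdf5(path) -> bool:
    """Return True if `path` is an existing file that starts with the HDF5 signature."""
    p = Path(path)
    if not p.is_file():
        return False
    with p.open("rb") as fh:
        return fh.read(len(HDF5_SIGNATURE)) == HDF5_SIGNATURE


if __name__ == "__main__":
    # Assumed location relative to the repository root.
    model_path = "model/deepsub.h5"
    status = "found" if looks_like_hdf5(model_path) else "missing or not an HDF5 file"
    print(f"{model_path}: {status}")
```

This check only inspects the file header; it does not load the model or validate its contents. A file that fails the check may be missing or may not have downloaded completely.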
