KO-Identification

Requirements

See requirements.txt

Getting Started

# get ProtT5 embedding (input format: fasta, output format: h5)
python prott5_seq2embedding.py seq_file.fa embedding_file.h5

# use the trained classifier to make predictions (output format: csv)
python prediction.py model/mlp_pipe.pt embedding_file.h5 cls_result.csv

# cluster (result file format: csv)
python cluster.py embedding_file.h5 reference.h5 cluster_result.csv

# train and test the classifier (The paths to the model and dataset can be modified in the python file)
python cls/mlp_train.py
python cls/mlp_test.py

Models

If you want to apply the model directly, you should use mlp_pipe.pt.

└── Model
    ├── att_cls.pt          # attention model
    ├── lstm_cls.pt         # LSTM model
    ├── mlp_cls.pt          # MLP model (used for classifier testing)
    └── mlp_pipe.pt         # MLP model (used for testing the entire pipeline)

Data availability

Publicly available datasets were analyzed in our paper. These datasets were collected from the KEGG database, the PDB database, and the AFDB database.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cls

cls

model

model

README.md

README.md

binary_classifier.py

binary_classifier.py

cluster.py

cluster.py

prediction.py

prediction.py

prott5_seq2embedding.py

prott5_seq2embedding.py

requirements.txt

requirements.txt

Repository files navigation

KO-Identification

Requirements

Getting Started

Models

Data availability

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
cls		cls
model		model
README.md		README.md
binary_classifier.py		binary_classifier.py
cluster.py		cluster.py
prediction.py		prediction.py
prott5_seq2embedding.py		prott5_seq2embedding.py
requirements.txt		requirements.txt

wuhaoyu3/KO-Identification

Folders and files

Latest commit

History

Repository files navigation

KO-Identification

Requirements

Getting Started

Models

Data availability

About

Resources

Stars

Watchers

Forks

Languages