Skip to content
Ronesh Sharma edited this page Aug 24, 2020 · 6 revisions

Welcome to the Predict-DNA-binding-proteins overview page

OVERVIEW

DNA-binding proteins perform important roles in cellular processes and are involved in many biological activities. These proteins include crucial protein-DNA binding domains and can interact with single-stranded or double-stranded DNA, and accordingly classified as single-stranded DNA-binding proteins (SSBs) or double-stranded DNA-binding proteins (DSBs). Computational prediction of SSBs and DSBs helps in annotating protein functions and understanding of protein-binding domains.

DATASETS

Train data

  • Train set The sequences are provided in fasta format. First 872 protein sequences are DSBs and the remaining (183 protein sequences) are SSBs.

Independent data

  • Independent set The sequences are provided in fasta format. First 125 protein sequences are DSBs and the remaining (41 protein sequences) are SSBs.

New test data

  • New test set The sequences are provided in fasta format. First 64 protein sequences are SSBs and the remaining (53 protein sequences) are DSBs.

Source code

CITATION

Sharma, R., Kumar, S., Thirumananseri, K. and Sharma, A., Single-stranded and double-stranded DNA-binding protein prediction using HMM profiles, 2020.

Clone this wiki locally