Skip to content

weasteam/miRPara

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

miRPara

MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences

BACKGROUND: MicroRNAs are a family of ~22nt small RNAs that can regulate gene expression at the post-transcriptional level. Identification of these molecules and their targets can aid understanding of regulatory processes. Recently, high throughput sequencing (HTS) has become a common identification method but there are two major limitations associated with the technique. Firstly, the method has low efficiency, with typically less than 1 in 10,000 sequences representing miRNA reads and secondly the method preferentially targets highly expressed miRNAs. If sequences are available, computational methods can provide a screening step to investigate the value of an HTS study and aid interpretation of results. However, current methods can only predict miRNAs for short fragments and have usually been trained against small datasets which don't always reflect the diversity of these molecules.

** RESULTS:** We have developed a software tool, miRPara, that predicts most probable mature miRNA coding regions from genome scale sequences in a species specific manner. We classified sequences from miRBase into animal, plant and overall categories and used a support vector machine (SVM) to train three models based on an initial set of 77 parameters related to the physical properties of the pre-miRNA and its miRNAs. By applying parameter filtering we found a subset of ~25 parameters produced higher prediction ability compared to the full set. Our software achieves an accuracy of up to 80% against experimentally verified mature miRNAs, making it one of the most accurate methods available.

CONCLUSIONS: miRPara is an effective tool for locating miRNAs coding regions in genome sequences and can be used as a screening step prior to HTS experiments. It is available at http://www.whiov.ac.cn/bioinformatics/mirpara.

BMC Link: http://www.biomedcentral.com/1471-2105/12/107/

Please Cite: MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences. Wu Y., Wei B., Liu H., Li T., Rayner S. BMC Bioinformatics. 2011 Apr 19; 12(1):107

Documentation

  • miRPara
  • [miRPara Model Trainer](/docs/miRPara Model Trainer.md)
  • [Parameters Description.md](/docs/Parameters Description.md)

About

MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages