Skip to content

Tools for improving and annotating de novo RNA-seq assemblies

License

Notifications You must be signed in to change notification settings

MaxiEstravis/lenga

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

58 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lenga

This repository contains scripts aimed at improving and annotating de novo RNA-seq assemblies

  • lenga.py (with its companion folder scripts) improves de novo RNA-seq assemblies by eliminating redundant contigs and extending those with more than a certain overlap but considered separate contigs by the assembler.

  • annotation.py (with its companion script merge_psl_by_query.php) allows the researcher to annotate an RNA-seq assembly against any sub-set of UniProt peptidic sequences. The script translates each contig into its 6 frames and retains the longest ORF per frame for annotation.

Third party software requirements:

Command line examples:

python lenga.py name_of_experiment /path/to/FASTA/file /path/to/BLAT/binary /path/to/Exonerate/suite

python annotation.py /path/to/FASTA/file /path/to/UniProt/reference/file /path/to/BLAT/binary /path/to/TranSeq/binary/from/EMBOSS /path/to/merge_psl_by_query.php

About

Tools for improving and annotating de novo RNA-seq assemblies

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published