Skip to content
/ tweed Public

Database to store and analyze the results of oases de novo assembly pipeline

License

Notifications You must be signed in to change notification settings

eco32i/tweed

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tweed

A database to store and analyze the results of oases de novo assembly pipeline. Inspired by oases2csv. The database is built by parsing the following files from the standard oases output:

  • stats.txt
  • contig-ordering.txt
  • transcripts.fa

The resulting transcripts are annotated with coverage information calculated as geometric mean of the nodes comprising the tranascript and grouped by loci.

The best transcript in the locus is selected as the transcript in top 20% by length having the maximum coverage.

If transcripts.fa is blasted against a genome of interest, the resulting .xml output file can be imported to annotate transcripts that produced blast hits.

Dependencies

TODO

  • views: by assembly, by locus, best transcript in locus
  • blast annotations for transcripts
  • tunable length cutoff for the best in locus transcript selection

About

Database to store and analyze the results of oases de novo assembly pipeline

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published