Skip to content

Latest commit

 

History

History
70 lines (52 loc) · 3.38 KB

index.rst

File metadata and controls

70 lines (52 loc) · 3.38 KB

TAQLoRE - Transcript Annotation and Quantification using Long Reads

TAQLoRE is a Snakemake-based pipeline to improve existing annotations and to quantify transcripts coming from long read amplicon-based cDNA sequencing technologies (Oxford Nanopore Technologies, PacBio). It was tested on Linux (CentOS 6) but it should work on Mac as well. Briefly, it uses LAST to align all reads to the transcriptome, then it discovers new exons by looking at insertions in alignments, it creates meta-gene with all known and novel exons, aligns all reads to it and generates a TMM-normalised read counts, together with expression heatmaps and PCA plots. It also identifies new splice sites by looking at perfectly aligned reads to the genome, and correcting all splice sites to the closest most abundant canonical ones. For more information, refer to :doc:`description`.

Installation

Citations of dependencies

Our pipeline is based on following software:

Papers using the pipeline

The following papers/pre-prints that use our pipeline has been published:

Authors

Developers:

  • Wilfried Haerty (Earlham Institute)
  • Tomasz Wrzesinski (Earlham Institute)

Contributors:

  • Elizabeth Tunbridge (University of Oxford)
  • Michael Clark (University of Melbourne)
  • Nicola Hall (University of Oxford)
  • Syed Hussain (University of Oxford)
  • Hami Lee (University of Oxford)

Things to add

  • Splice-site-based pipeline (part4 and part5).
  • Usage of splice-site-based approach (part4 and part5).
  • Description of output files.
  • Description of scripts.
  • Description of example dataset.
.. toctree::
   :maxdepth: 2
   :caption: Documentation index

   description
   installation
   usage_exon_based
   usage_splice_site_based
   output_files_exon_based
   output_files_splice_site_based