Skip to content
/ dirseq Public

Work out whether RNAseq reads in general agree with the direction of the gene predicted

License

Notifications You must be signed in to change notification settings

wwood/dirseq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dirseq

Build Status

DirSeq work out whether RNAseq reads from metatranscriptomes are generally in the same direction as the ORF predicted, and provide gene-wise coverages using DNAseq mappings.

Note: this software is under active development!

Installation

Install some prerequisites via conda, and then dirseq itself:

conda create -c bioconda -n dirseq -y ruby samtools bedtools'>'2.24
conda activate dirseq
gem install dirseq

The following dependencies are installed above, but for completeness of documentation, dirseq requires these dependencies, on top of the Ruby ones:

  • samtools (tested with 0.1.19 and 1.0+)
  • bedtools (tested with 2.24.0) - old versions won't work.
  • Ruby (tested with 2.1.1)

Usage

Example usage:

Download the example data:

git clone https://github.com/wwood/dirseq
cd dirseq

Then run dirseq:

dirseq --bam spec/data/eg.bam --gff spec/data/eg.gff --measure-type count

Full usage help:

$ dirseq -h

    Usage: dirseq <arguments>

    Reports the coverage of a mapping in against each gene given in a GFF file

        --bam FILE                   path to mapping file [required]
        --gff FILE                   path to GFF3 file [required]

Optional parameters:

        --forward-read-only          consider only forward reads (i.e. read1) and ignore reverse reads. [default false]
        --ignore-directions          ignore directionality, give overall coverage [default: false i.e. differentiate between directions]
        --measure-type TYPE          what to count for each gene [options: count, coverage][default: coverage]
        --accepted-feature-types TYPE
                                     Print only features of these type(s) [default CDS]
        --comment-fields             Print elements from the comments in the GFF file [default ID]
        --sam-filter-flags           Apply these samtools filters [default: -F0x100 -F0x800]
        
Verbosity:

    -q, --quiet                      Run quietly, set logging to ERROR level [default INFO]
        --logger filename            Log to file [default stderr]
        --trace options              Set log level [default INFO]. e.g. '--trace debug' to set logging level to DEBUG

Running on EnrichM output, the output columns are changed relative to PROKKA-generated GFF files:

dirseq --bam spec/data/eg.bam --gff spec/data/eg.gff --measure-type count --comment-fields seq_id,annotations

Project home page

Information on the source tree, documentation, examples, issues and how to contribute, see

http://github.com/wwood/dirseq

The BioRuby community is on IRC server: irc.freenode.org, channel: #bioruby.

Cite

If you use this software, please cite

Woodcroft, B.J., Singleton, C.M., Boyd, J.A. et al. Genome-centric view of carbon processing in thawing permafrost. Nature 560, 49–54 (2018). https://doi.org/10.1038/s41586-018-0338-1

Copyright

Copyright (c) 2014-2021 Ben J. Woodcroft. See LICENSE.txt for further details.

About

Work out whether RNAseq reads in general agree with the direction of the gene predicted

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages