Skip to content

An R package which stores the code used to process Long Life Family Study RNA sequencing from fastq to analysis releases

License

Notifications You must be signed in to change notification settings

cmatKhan/llfsRnaseq

Repository files navigation

R-CMD-check

Introduction

This repository documents how the Long Life Family Study RNA sequencing data is processed.

There is no data stored in this repository. If you feel like you need the data, and don't already have access, then you should talk to your lab and ask how to get access. Please see the Getting Started section of the documentation for an explanation of how this data is tracked in relation to the versioned software releases.

Wait -- this is different

I used to distribute this data as a tarred, zipped R project. There was a subdirectory called data which had both the DeseqDataSet gene and transcript objects (see Why Use DESeqDataSet Objects) and csv files.

That data directory is now what will be distributed through box and on the dsg cluster. This repository will store the versioned code which corresponds to a given data release, and the documentation.

Installation and Usage

You can use this in a couple different ways, but the starting point for all of those ways is the served documentation. I'd start with Getting Started, myself.

Why Use DESeqDataSetObjects

Even if you aren't going to do differential expression analysis using DESeq2, it is useful to use the DESeqDataSet object. DESeqDataSet objects inherit from the basebioconductor object SummarizedExperiment. And, SummarizedExperiment is a core component of Bioconductor's Scalable Genomics Toolset.

The LLFS dds object has the GRCh38 annotations stored in the rowRanges. The metadata rows also are forced to correspond to the count columns. So, you could choose a single gene in a single sample like this:

dds['ENSG....', dds$library_id==2]

Some final notes

if the R CMD Check CI is passing, that means that this package successfully builds on the latest Windows, latest Mac, and the two most current Ubuntu OS's. If you are using one of these operating systems, then this will install on your system. If you are using an older, or different, OS, then there are no promises. Go ahead and open a bug report if this is the case, and I will create a Docker and/or singularity container for you.

About

An R package which stores the code used to process Long Life Family Study RNA sequencing from fastq to analysis releases

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published