How to distinguish Stand/Unstrand? #258

Closed
Lei-Tian opened this issue Apr 21, 2017 · 7 comments

Comments

@Lei-Tian

I am using STAR to align my RNA-seq data. How can I tell whether my RNA-seq library is stranded or unstranded? Thanks!

@DarioS

DarioS commented Apr 24, 2017

The fastest and easiest way is to view the alignments in the BAM files using IGV. First activate read colouring by strand: right-click on each BAM file track and ensure that Colour Alignments By -> First-of-Pair Strand is selected. Below is a screenshot of two samples that are stranded and two that are not, all from a publicly available cancer RNA sequencing dataset. Note how each gene has approximately half blue and half red reads for the unstranded samples. The authors misleadingly did not explain in the journal article's Methods section that they used different RNA-seq kits, which causes problems for reanalysis unless the difference is identified and handled specially.

[Screenshot: mapkinhibitorstrands]

@alexdobin
Owner

Hi Lei-Tian,

Great suggestion from Dario - you can use STAR to generate the wiggle files.
Another (less visual) possibility is to run STAR with the --quantMode GeneCounts option and sum the read counts over all genes in the 3rd and 4th columns of the output (these columns correspond to the two possible library strand orientations). For stranded data, one of the columns should be much larger than the other. You can use this to estimate the strandedness of the data quantitatively.
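
The column comparison described above could be sketched in Python roughly as follows. This is a minimal illustration, not part of STAR: the function names and the 0.8 cut-off are assumptions made up for this example. It relies on the ReadsPerGene.out.tab layout described in the STAR manual (column 1 gene ID, column 2 unstranded counts, column 3 counts for the 1st read strand aligned with the RNA, column 4 counts for the 2nd read strand aligned with the RNA, preceded by summary rows whose names start with N_).

```python
from typing import Iterable, Tuple


def strand_totals(lines: Iterable[str]) -> Tuple[int, int, int]:
    """Sum columns 2, 3 and 4 of ReadsPerGene.out.tab over gene rows only."""
    col2 = col3 = col4 = 0
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        # Skip short lines and the summary rows (N_unmapped, N_multimapping,
        # N_noFeature, N_ambiguous). Assumes no gene ID starts with "N_".
        if len(fields) < 4 or fields[0].startswith("N_"):
            continue
        col2 += int(fields[1])
        col3 += int(fields[2])
        col4 += int(fields[3])
    return col2, col3, col4


def guess_strandedness(col3: int, col4: int, threshold: float = 0.8) -> str:
    """Classify a library from the column 3 / column 4 totals.

    The 0.8 threshold is an arbitrary assumption for illustration only.
    """
    total = col3 + col4
    if total == 0:
        return "undetermined"
    fraction = col3 / total
    if fraction >= threshold:
        return "stranded, column 3 dominant"
    if fraction <= 1 - threshold:
        return "stranded, column 4 dominant"
    return "unstranded"
```

To try it on real output, one could open the file and pass the handle directly, for example `with open("ReadsPerGene.out.tab") as handle: _, c3, c4 = strand_totals(handle)`, then call `guess_strandedness(c3, c4)`. Borderline fractions are worth checking visually in IGV rather than trusting the cut-off.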

Cheers
Alex

@Lei-Tian
Author

Lei-Tian commented Apr 26, 2017 via email

@DarioS

DarioS commented May 5, 2017

If your issue has been resolved, then please close it.

@Lei-Tian
Author

Lei-Tian commented May 5, 2017 via email

@Lei-Tian Lei-Tian closed this as completed May 5, 2017
@Lei-Tian
Author

Lei-Tian commented May 5, 2017 via email

@xinl22

xinl22 commented Jun 25, 2018

@alexdobin
Hi Alex, if I have stranded reads and run STAR with the --quantMode GeneCounts option, which value should I use to represent the expression level of a specific gene in downstream analysis? The 3rd or the 4th column?
