filterRNAstrand processe the bad strand #376

lelouar · 2016-06-20T15:09:18Z

Hi,

I tried to use bamCoverage for single end RNA-seq data and test the --filterRNAstrand option but the results are inverted as what I expected. I plot this picture to show you the behavior :

What is wrong ? Is it a mistake of me ?

bamCoverage --version : 2.2.4

dpryan79 · 2016-06-20T20:45:26Z

That looks correct. Read #1 is antisense in 99.99% of currently produced libraries. So, if you exclude alignments with bit 16 set, then you're getting only reads arising from the - strand, which is why the track is identical to that produced by --filterRNAstrand reverse. Yes, this looks backward, but it's correct (and also one of the single most confusing things in bioinformatics).

lelouar · 2016-06-21T07:27:47Z

Sorry but flag 16 is read mapped on reverse strand (-) so if you exclude 16 you are in the forward strand (+). I read an other time the doc of samtools to be sure that I am not confusing and also the readthedoc of DeepTools for bamCoverage and indeed you write that --samFlagExclude 16 = forward strand and --samFlagInclude 16 = reverse one.
Thx

dpryan79 · 2016-06-21T07:36:08Z

A flag of 16 only means that a read is reverse complemented. In RNAseq, that happens for read #1 when it arises from a + strand fragment. This is due to how library prep. works. This is also why you have to set -s reverse with current RNAseq datasets if you use htseq-count.

Further, the results visibly agree with the genes. Your reverse (-) strand gene visibly corresponds to the reverse track, and the forward strand gene to the forward track.

lelouar · 2016-06-21T08:43:43Z

It totally depend on the RNA protocol of library preparation. There is currently 2 different protocol. The picture from my first post is a reverse strand protocol meaning that read mapped in the forward strand came from genes positioned in the minus strand (and the filterRNAstrand give you, in this case, the "good result" but not what you expected when you take in account the protocol). I add a new picture made with the other protocol of library preparation (read and transcript came from the same strand) :

In this case the result is not what we expected.

For the htseq-count software you have to take in account of the protocol of library preparation to be sure that you count the "right" reads and use the good parameters.

So, if you want to keep read on the + strand when you use --filterRNAstrand forward you do a mistake, if you want to filter/remove such reads it's only a misunderstanding of me.

dpryan79 · 2016-06-21T09:09:30Z

As the overwhelming majority of produced libraries are dUTP-based, that's what the --filterRNAstrand option assumes. This will not be changed and people will need to simply understand their data.

vivekbhr · 2016-06-21T09:11:50Z

That's right the filterRNAstrand method is based on standard Illumina protocol (R2 in strand orientation), while other protocols also exist. we should probably mention this in the documentation.

lelouar · 2016-06-21T12:20:55Z

ok, sorry. I agree with vivekbhr that you should at least mention this in the documentation.

dpryan79 closed this as completed Jun 20, 2016

vivekbhr mentioned this issue Jun 22, 2016

Clarify libraryType assumption for strand-filtering #377

Merged

vertesy mentioned this issue Nov 22, 2019

Strand specific mapping location of reads via stranded bigwig files vertesy/pseudoBulk#5

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

filterRNAstrand processe the bad strand #376

filterRNAstrand processe the bad strand #376

lelouar commented Jun 20, 2016 •

edited

Loading

dpryan79 commented Jun 20, 2016

lelouar commented Jun 21, 2016

dpryan79 commented Jun 21, 2016

lelouar commented Jun 21, 2016

dpryan79 commented Jun 21, 2016

vivekbhr commented Jun 21, 2016

lelouar commented Jun 21, 2016

filterRNAstrand processe the bad strand #376

filterRNAstrand processe the bad strand #376

Comments

lelouar commented Jun 20, 2016 • edited Loading

dpryan79 commented Jun 20, 2016

lelouar commented Jun 21, 2016

dpryan79 commented Jun 21, 2016

lelouar commented Jun 21, 2016

dpryan79 commented Jun 21, 2016

vivekbhr commented Jun 21, 2016

lelouar commented Jun 21, 2016

lelouar commented Jun 20, 2016 •

edited

Loading