-
Notifications
You must be signed in to change notification settings - Fork 0
Could you add a ribosomal RNA content information #263
Comments
How do you calculate ribosomal RNA content? |
According to Maggie they are already calculated. She asked to put them into
multiqc. Thanks.
…On Fri, Aug 4, 2017, 11:35 AM Vishal Koparde ***@***.***> wrote:
How do you calculate ribosomal RNA content?
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#263 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AGkVDJiLACN2NRIx56n5SVhyY6kUB1t6ks5sUzorgaJpZM4OrneJ>
.
|
Hmm. I dont know. I will have to find out. |
Vishal:
For my project, I didn’t actually use a rRNA intervals list. What I did was to filter out all rRNAs that were the source of a 90% GC-peak on the GC distribution (I filtered for reads mapping to a single locus rDNA that gives rise to various rRNAs).
For our pipeline, we are assessing (not filtering for) rRNA contamination. Essentially, for any gtf file that has the standard ‘gene_type’ column, the rRNA intervals are created by grepping for ‘rRNA’ on the gene_type.
Here is someone’s script that we can use to create the rRNA-intervals list (BED format) for all the GTF files used in our pipeline:
https://gist.github.com/adomingues/57e3c2f9f4aca8dfde7b
Does that help?
Thanks,
Parthav
From: Vishal Koparde <notifications@github.com>
Reply-To: CCBR/Pipeliner <reply@reply.github.com>
Date: Friday, August 4, 2017 at 11:50 AM
To: CCBR/Pipeliner <Pipeliner@noreply.github.com>
Cc: Parthav Jailwala <parthav.jailwala@nih.gov>, Mention <mention@noreply.github.com>
Subject: Re: [CCBR/Pipeliner] Could you add a ribosomal RNA content information (#263)
Hmm. I dont know. I will have to find out.
@pajailwala<https://github.com/pajailwala> used in a recent project. Does he know where?
--Vishal
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#263 (comment)>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ABc1_o8Aqj3Tcvh8rpvwXSs6Bw9sCM9-ks5sUz2wgaJpZM4OrneJ>.
|
Hi Parthav, I need to double check with Maggie, but I thought she was mentioning Picard RNASeqMetrics. Don't we have the RNAseqMetrics calculated in the pipeline? Bong-Hyun |
Yes. Bong-Hyun, it does seem to report it. |
My understanding is that rRNA metrics will only be reported by CollectRnaSeqMetrics, if the rRNA intervals file is passed to RIBOSOMAL_INTERVALS parameter.
In our RNASeq QC pipeline, I think we do assess rRNA metrics, but they are reported as part of the excel file (SF format). Still, we need to ensure that the ribosomal intervals file is accurate (based on the respective GTF that is used) and also see if we can have multiQC parse these metrics.
java -jar picard.jar CollectRnaSeqMetrics \
I=input.bam \
O=output.RNA_Metrics \
REF_FLAT=ref_flat.txt \
STRAND=SECOND_READ_TRANSCRIPTION_STRAND \
RIBOSOMAL_INTERVALS=ribosomal.interval_list
From: Vishal Koparde <notifications@github.com>
Reply-To: CCBR/Pipeliner <reply@reply.github.com>
Date: Friday, August 4, 2017 at 12:46 PM
To: CCBR/Pipeliner <Pipeliner@noreply.github.com>
Cc: Parthav Jailwala <parthav.jailwala@nih.gov>, Mention <mention@noreply.github.com>
Subject: Re: [CCBR/Pipeliner] Could you add a ribosomal RNA content information (#263)
Yes. Bong-Hyun, it does seem to report it.
https://broadinstitute.github.io/picard/picard-metric-definitions.html#RnaSeqMetrics
--Vishal
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#263 (comment)>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ABc1_vBCftKJKylEhEtLxa0M3x3iUeiaks5sU0r8gaJpZM4OrneJ>.
|
Yes Parthav, Thats right. We do have RIBOSOMAL_BASES showing number of RNA bases covered in the bam file. and it is dependent on the RRNALIST provided, which generally are :
As per picard, RIBOSOMAL_BASES --> Number of bases in primary alignments that align to ribosomal sequence. This to me only indicates if the rDNA bases that are covered by sequencing, but does not give any idea about the depth at these bases. I think one is going to have some coverage over these bases, no matter what. It is how often they are covered that is important to us. Isnt it? --Vishal |
Added rRNA index in FqScreen. This close #263 |
So that we can see in the Multiqc table.
The text was updated successfully, but these errors were encountered: