Hybrid-selection metrics from alfred qc #18

evanbiederstedt · 2019-06-28T00:03:19Z

Hi there

I was curious about how Alfred worked with hybrid-selection metrics. The motivation was we were discussing the differences between Alfred and Picard/GATK4's "CollectHSMetrics" for WES samples:

https://software.broadinstitute.org/gatk/documentation/tooldocs/4.0.5.1/picard_analysis_directed_CollectHsMetrics.php

java -jar picard.jar CollectHsMetrics \
      I=input_reads.bam \
      O=output_hs_metrics.txt \
      R=reference.fasta \
      BAIT_INTERVALS=bait.interval_list \
      TARGET_INTERVALS=target.interval_list

This requires both target and bait intervals.

It's not clear to what extent Alfred provides HS metrics, or whether the metrics output by e.g. CollectHSMetrics would differ that much from Alfred.

Somewhat related to this issue: #17

Any insight on your part is appreciated. Thank you for the help

The text was updated successfully, but these errors were encountered:

tobiasrausch · 2020-07-23T11:48:08Z

I am not using picard but I would assume the bait intervals are the probes and the targets something like exons? Alfred only needs a single bed file of target regions.

okbys · 2021-12-13T10:36:35Z

I think there is a difference between Alfred and Picard when the mean depth is greater than 65535(0xFFFF).
The limit of "#AlignedBasesInBed" seems to be "#TotalBedBp * 65535".
Is there limit for "#AlignedBasesInBed"?

tobiasrausch · 2021-12-14T08:23:26Z

The basepair level coverage uses a 16bit integer by default. You can change this code fragment to uint32_t and recompile to allow coverages up to 2^32.

okbys · 2021-12-15T06:54:58Z

@tobiasrausch
Thank you for your solution.
However it's more complicated than I expected and it's not working...

At first, I changed this code from uint16_t to uint32_t, but alfred unexpectedly stopped without any output files.
Next, I changed this code from uint32_t to uint64_t, so alfred is never finished. (infinite loop?)
I checked the output file during the generation process, and the data in the ME section was as expected.

tobiasrausch · 2021-12-15T07:23:05Z

Let me check, I probably forgot somewhere in the code to refer back to this type definition.

tobiasrausch · 2021-12-15T16:06:37Z

You can specify now at compile time if you want to use 32bit integers for ultra-deep coverage. Just pull the last code and then use:

make all COVTYPE=uint32_t

Can you please try again if that works for you? Thanks.

okbys · 2021-12-16T04:14:46Z

It worked excellently!
The result is also as expected.
Thank you for your quick response!

tobiasrausch closed this as completed May 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hybrid-selection metrics from alfred qc #18

Hybrid-selection metrics from alfred qc #18

evanbiederstedt commented Jun 28, 2019 •

edited

Loading

tobiasrausch commented Jul 23, 2020

okbys commented Dec 13, 2021

tobiasrausch commented Dec 14, 2021

okbys commented Dec 15, 2021

tobiasrausch commented Dec 15, 2021

tobiasrausch commented Dec 15, 2021

okbys commented Dec 16, 2021

Hybrid-selection metrics from alfred qc #18

Hybrid-selection metrics from alfred qc #18

Comments

evanbiederstedt commented Jun 28, 2019 • edited Loading

tobiasrausch commented Jul 23, 2020

okbys commented Dec 13, 2021

tobiasrausch commented Dec 14, 2021

okbys commented Dec 15, 2021

tobiasrausch commented Dec 15, 2021

tobiasrausch commented Dec 15, 2021

okbys commented Dec 16, 2021

evanbiederstedt commented Jun 28, 2019 •

edited

Loading