-
Notifications
You must be signed in to change notification settings - Fork 354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LCR regions still in final VCF #3609
Comments
Hi @adammaikai Thanks for reporting. A fair point! I think the filtration does happen, but what we see is a boundary effect: there should not be many of these leftover variants. I've checked it on a smaller WES chr22 example. Please note that I reported both $ bedtools intersect -wa -wb -a NA12878-gatk-haplotype.vcf.gz -b /n/app/bcbio/1.2.9/genomes/Hsapiens/hg38/coverage/problem_regions/repeats/LCR.bed.gz | head | awk -F '\t' '{print $1,$2,$4,$5,$(NF-2),$(NF-1),$NF}'
chr22 20953424 AAAC A chr22 20953424 20953452
chr22 21614427 CA C chr22 21614427 21614460 So these two variants are right on the left boundary of the interval, and the exclusion logic did not capture them. In your case: So I think these leftover variants are reported due to discrepancies between You may inspect the nolcr bed files in `work/gatk-haplotype/chrXX/nolcr.bed. chr22 20952753 20953424 (note that the interval > chr2:20953424 is gone). So this bed file is applied to filter the result vcf and Sergey |
Thank you for the clarification, that was very helpful. |
My germline WGS run completed successfully, however, I double checked the LCR regions using a bedtools intersectBed of the final vcf and the provided LCR.bed.gz, and variants were still present.
I even see the bed file being used in the pipeline from the commands log, for example:
Did I misunderstand that step in the pipeline?
Version info
bcbio_nextgen.py --version
): 1.2.8lsb_release -ds
): Ubuntu 16.04.7 LTSTo Reproduce
Exact bcbio command you have used:
Your yaml configuration file:
Log files (could be found in work/log)
Please attach (10MB max):
bcbio-nextgen-commands.log
, andbcbio-nextgen-debug.log
.bcbio-nextgen-commands.log
bcbio-nextgen.log
The text was updated successfully, but these errors were encountered: