Open
Description
I could be very wrong. The syntax and threshold values in the example might be outdated in Filtering variants on https://samtools.github.io/bcftools/howtos/variant-calling.html:
bcftools filter -sLowQual -g3 -G10 \
-e'%QUAL<10 || (RPB<0.1 && %QUAL<15) || (AC<2 && %QUAL<15) || %MAX(DV)<=3 || %MAX(DV)/%MAX(DP)<=0.3' \
calls.vcf.gz
I am using the following on bcftools_filterVersion=1.18+htslib-1.18 with a public GRCh38 dataset from PacBio:
bcftools filter -sLowQual -g3 -G10 \
-e 'QUAL<100 || (RPBZ<0.1 && QUAL<150) || (AC<2 && QUAL<150) || VDB<1.0e-04' \
"${input_vcf}" -Oz -o "${output_dir}/filtered.vcf.gz" --write-index
So might be the other examples in "Filtering variants".
P.S. I am going to try the filter on the G1K dataset next to understand if the filter can be generic enough.