Skip to content

Updating filter syntax and threshold ? #2101

Open
@davidyuyuan

Description

@davidyuyuan

I could be very wrong. The syntax and threshold values in the example might be outdated in Filtering variants on https://samtools.github.io/bcftools/howtos/variant-calling.html:

bcftools filter -sLowQual -g3 -G10 \
    -e'%QUAL<10 || (RPB<0.1 && %QUAL<15) || (AC<2 && %QUAL<15) || %MAX(DV)<=3 || %MAX(DV)/%MAX(DP)<=0.3' \
    calls.vcf.gz

I am using the following on bcftools_filterVersion=1.18+htslib-1.18 with a public GRCh38 dataset from PacBio:

bcftools filter -sLowQual -g3 -G10 \
    -e 'QUAL<100 || (RPBZ<0.1 && QUAL<150) || (AC<2 && QUAL<150) || VDB<1.0e-04' \
    "${input_vcf}" -Oz -o "${output_dir}/filtered.vcf.gz" --write-index

So might be the other examples in "Filtering variants".

P.S. I am going to try the filter on the G1K dataset next to understand if the filter can be generic enough.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions