Skip to content

High number of Indeterminate consistency in merged VCF #311

@Roj4ck

Description

@Roj4ck

I recently took four cram files (one family) and put them through the deep variant pipeline in accordance with the documentation. I then followed the instructions for "Best practices for multi-sample variant calling with DeepVariant (WES trio demonstration)" and tried two sets of of "trios" and one comparison containing all four. Each of them however came up with a substanial amount of mendelian constraints which I'll list below:

Checking: /home/username/deepvariant-run/output/RBAs.cohort.vcf.gz
Family: [2114337 + 2114302] -> [2115432]
12 non-pass records were skipped
Concordance 2115432: F:127216/127862 (99.49%) M:121292/121882 (99.52%) F+M:79650/80917 (98.43%)
Sample 2115432 has less than 99.0 concordance with both parents. Check for incorrect pedigree or sample mislabelling.
4249/338863 (1.25%) records did not conform to expected call ploidy
228759/338863 (67.51%) records were variant in at least 1 family member and checked for Mendelian constraints
147067/228759 (64.29%) records had indeterminate consistency status due to incomplete calls
1838/228759 (0.80%) records contained a violation of Mendelian constraints

Checking: /home/username/deepvariant-run/output/RBNs.cohort.vcf.gz
Family: [2114337 + 2114302] -> [2009617]
18 non-pass records were skipped
Concordance 2009617: F:124581/125097 (99.59%) M:120027/120545 (99.57%) F+M:79289/80523 (98.47%)
Sample 2009617 has less than 99.0 concordance with both parents. Check for incorrect pedigree or sample mislabelling.
3705/295693 (1.25%) records did not conform to expected call ploidy
184648/295693 (62.45%) records were variant in at least 1 family member and checked for Mendelian constraints
103541/184648 (56.07%) records had indeterminate consistency status due to incomplete calls
1603/184648 (0.87%) records contained a violation of Mendelian constraints

Checking: /home/username/deepvariant-run/output/RBNAs.cohort.vcf.gz
Family: [2114337 + 2114302] -> [2009617, 2115432]
24 non-pass records were skipped
Concordance 2009617: F:125675/126263 (99.53%) M:121023/121623 (99.51%) F+M:79766/81139 (98.31%)
Sample 2009617 has less than 99.0 concordance with both parents. Check for incorrect pedigree or sample mislabelling.
Concordance 2115432: F:128175/128895 (99.44%) M:122173/122833 (99.46%) F+M:80026/81440 (98.26%)
Sample 2115432 has less than 99.0 concordance with both parents. Check for incorrect pedigree or sample mislabelling.
4510/361421 (1.25%) records did not conform to expected call ploidy
286909/361421 (79.38%) records were variant in at least 1 family member and checked for Mendelian constraints
197595/286909 (68.87%) records had indeterminate consistency status due to incomplete calls
3231/286909 (1.13%) records contained a violation of Mendelian constraints

I'm not sure if the issue is within the deepvariant scan or perhaps the merge within RTG tools. If you have seen this problem before i'm open to any suggestions. Additionally if this issue is more so on the RTG tools side if things please feel free to close this out and I'll open the issue there. Thank you for your time and have a great day.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions