Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add fixes to improve vcf_caller in calling mode
PiperOrigin-RevId: 266165858
Showing 11 changed files with 94 additions and 16 deletions.
Binary file added (+2.9 KB, not shown): deepvariant/testdata/golden.vcf_caller_postprocess_single_site_input.tfrecord.gz
deepvariant/testdata/golden.vcf_caller_postprocess_single_site_output.vcf (36 changes: 36 additions & 0 deletions)
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
##fileformat=VCFv4.2 | ||
##FILTER=<ID=PASS,Description="All filters passed"> | ||
##FILTER=<ID=RefCall,Description="Genotyping model thinks this site is reference."> | ||
##FILTER=<ID=LowQual,Description="Confidence in this variant being real is below calling threshold."> | ||
##INFO=<ID=END,Number=1,Type=Integer,Description="End position (for use with symbolic alleles)"> | ||
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> | ||
##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Conditional genotype quality"> | ||
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Read depth"> | ||
##FORMAT=<ID=MIN_DP,Number=1,Type=Integer,Description="Minimum DP observed within the GVCF block."> | ||
##FORMAT=<ID=AD,Number=R,Type=Integer,Description="Read depth for each allele"> | ||
##FORMAT=<ID=VAF,Number=A,Type=Float,Description="Variant allele fractions."> | ||
##FORMAT=<ID=PL,Number=G,Type=Integer,Description="Phred-scaled genotype likelihoods rounded to the closest integer"> | ||
##contig=<ID=chr20,length=63025520> | ||
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT HG002 | ||
chr20 59777010 pbsv.INS.45658 C CCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACCAGGATCCCCTGTAAGTGTCACCTCCATCCTCACCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCA 1 RefCall . GT:GQ:DP:AD:PL ./.:7:249:192,57:0,11,7 | ||
chr20 59777010 pbsv.INS.45657 C CCTCACTCAGGACCCTCCATGAGTGCCACCTCCATCCTCACCAGGATCCCCTGTAAGTGTCACCTCCATCCTCACCAGGACCCTCCATGAGTGTCACCTCCATCCTCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCGCCATCCTCA 1 RefCall . GT:GQ:DP:AD:PL ./.:7:240:194,46:0,11,7 | ||
chr20 59777010 pbsv.INS.45656 C CCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGGTGTCACCTCCATCCTCA 1 RefCall . GT:GQ:DP:AD:PL ./.:7:240:207,33:0,11,7 | ||
chr20 59777553 pbsv.INS.45659 G GTGTCACCTCCATCCTCACTCAGGACCCTCCATGGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGAGTGTCACCTCCATCCTCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACACTCCATGGTGTCACCTCCATCCTTACTCAGGACCCTCCATGGTGTCACCGCCATCCTCACTCAGGACCCTCCATGAGTGCCACCTCCATCCTCACCAGGATCCCCTGTAAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGAGTGTCACCTCCATCCTCAGGACCCTCCATGGTGTCACCTCCATCCTCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGAGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACCCTCCATGGTGTCACCTCCATCCTCACTCAGGACACTCCATGGTGTCACCTCCATCCTTACTCAGGACCCTCCATGGTGTCACCGCCATCCTCACTCAGGACCCTCCATGAGTGCCACCTCCATCCTCACCAGGATCCCCTGTAA 0.7 RefCall . GT:GQ:DP:AD:PL ./.:8:258:232,26:0,12,8 | ||
chr20 59804673 pbsv.DEL.45636 GATATATATATATATATATATATATATATATATATATATATATATAT G 4.5 PASS . GT:GQ:DP:AD:PL 1/1:2:184:99,85:0,1,0 | ||
chr20 59848584 pbsv.INS.45660 T TTGGGGAGTACCGTGTGCAGTTTGGGGGAGTATTGGGGGAGTACCATGTGCAGTTTGGGGGAGGACCATGTGCAGTTTAGGGGAGTACCGTGTGCAGTTTGGGGGAGTATTGGGGAAGTACCATGTGCAGTTTGGGGGAATACCATGTGCAACTTGGGGGAGTACCATGCACAGCTT 0.7 RefCall . GT:GQ:DP:AD:PL ./.:9:202:121,81:0,13,9 | ||
chr20 59858360 pbsv.INS.45661 G GGTGTGTGATGTACGTGCATTTGCACGCGTGTGCTGTGGC 0.6 RefCall . GT:GQ:DP:AD:PL ./.:9:242:136,106:0,13,9 | ||
chr20 59858389 pbsv.INS.45662 C CGTGTGTGATGTGTGTGCGTTTGCACGCGCGTGCTGTGGCGTGTGTGATGTGTGTGCGTTTGCACGCGCGTGCTGTGGCGTGTGTGATGTGTGTGCGTTTGCACGCGCGTGCTGTGGCGTGTGTGATGTGTGTGCGTTTGCACGCGCGTGCTGTGGCGTGTGTGAT 0.9 RefCall . GT:GQ:DP:AD:PL ./.:7:239:155,84:0,11,7 | ||
chr20 59865298 pbsv.DEL.45637 GTTTATCCCGATATCCAATGAGACTTGCTGGA G 1 RefCall . GT:GQ:DP:AD:PL ./.:7:209:169,40:0,11,7 | ||
chr20 59884412 pbsv.DEL.45638 CGAGTTTCCAGAACCATGCAGGTGTGCATAGAGGTGTTCAGCTGTCTTCCTT C 0.6 RefCall . GT:GQ:DP:AD:PL ./.:9:222:120,102:0,13,9 | ||
chr20 59904402 pbsv.INS.45691 T TCCCTCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCGTGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCTCATGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCTCATGGGTCTGGGACGTGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCAGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCTCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCGTGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCTCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCTAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCTCATGGGTCTGGGACGTGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTTGCCTCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCACTGTGCTGCAGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTTCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCACTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCAGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTTCCCTCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCGTGCGTGGAGGGCACCCAGATCCTCATGGTTCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCAGTGGATTGCAGCATGCGTGGAGGGCACCCAGATCCTCATGTTTCCCCCACGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCGTGCGTGGAGGGCACCCAGGTCCTCATGGTTCCCTCGTGGGTCTGGGACGTGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCAGTGGATTGCAGCGTGCGTGGAGGGCACCCAGGTCCTCATGGTC 0.7 RefCall . GT:GQ:DP:AD:PL ./.:8:241:216,25:0,12,8 | ||
chr20 59904405 pbsv.INS.45692 C CCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGATCCTCATGGTTCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCACTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTTCCCCCACGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCGTGCGTGGAGGGCACCCAGGTCCTCATGGTCCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGATCCTCATGGTCCCCCCGTGGGTCTGGGACGCGGGTCGAGGTGGCTTTAAGCCCAGTGTGCTGCGGTGGATTGCAGCATGCGTGGAGGGCACCCAGGTCCTCATGGTCCCC 0.9 RefCall . GT:GQ:DP:AD:PL ./.:7:241:196,45:0,11,7 | ||
chr20 59906638 pbsv.INS.45693 C CCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCTCAGATCTACCTCCTGCCCCGGCCACCTGCCCCAGACCCACCTCCTGCCCCGGCCACCTGCCCCAGACCCACCTCCTGACCCGACCACCTCCCCCAGGCTCACCTCCTGCCCCGGCCACCTGCCCTAGAGCCACCTCCTGCCCCGGCCACCTGCCTCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGATCTACCTCCTGACCCGACCACCTCCCCCAGGCTCACCTCCTGCCCTGGCCACCTGCCCCAGACCCACCTCCTGCCCCGGCCACCTGCCCCAGAGCCACCTCCTGACCCGACCACCTCCCCCAGGCTCACCTCCTGCCCCGACCACCTGCCCCAGGCCCACCTCCTGCCCCAGACCCACCTCCTGCCTCAGCCACCTGCCCCAGACTCACCTCCTGCCCCGGCCACCTGCCCCAGAGCCACCTCCTGCCCCGGCCACCTACCCCAGACCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCAGACCCACCTCCTGCCTCAGCCACCTGCCCCAGACTCACCTCCTGCCCCGGCCACCTGCCCCAGAGCCACCTCCTGCCCCGGCCACCTACCCCAGACCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCCGCCAACTGCCCCAGGTCTACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGTCACCTGTCCCAGCCCACCTCCTGCCCCGGCCACCTGCCCCAGGCCCACCTCCTGCCCCGGCCACCTGT 1.1 RefCall . GT:GQ:DP:AD:PL ./.:7:239:214,25:0,11,7 | ||
chr20 59912354 pbsv.INS.45694 A ATGATGATAATGATGGTGGTATGATGATGTGGATGGTAATGGTGATGATAACAGTGGATAATGATGGCAATGCAAGTGATGATATTGATGATGATGGTGATGACAGTGCTGATAATGATCATAATGGTGATGATGATGATGGTGATTATTGTAATAGTGATGACAGTGATGATGATAATGATGGTGATAAGAGTGGATAATGATAATAATGATGATGGTGATTTTAATAGTGATGATGATGTTAAGATGATTATGGTGATGATGATAGTGATGATGATGGTGATGATAACAGTGGATAC 0.6 RefCall . GT:GQ:DP:AD:PL ./.:9:222:124,98:0,12,9 | ||
chr20 59928659 pbsv.INS.45695 A ACTGCCTTCCTCCCCCTCCTCCCCATCTTTCTCCCCTTCCCCCTCCCCCCTCCTCTGTCTTCCCCCTACTCCTCCCCTTCTCCCTCCCCCTTCTCCCCCTCTTCCTCCCCTTCCCCCCTCCTCTTCCCCCCTCCTCCTCCTCCCCTTCCCCCTCCCATTACCCCTTCTCCTCCTCCTCTTCCTCCCCCTCCTCCTCTTCTTCCTCCTCTTCCCTCTCCTCCTCCTCCCCCTCCTCCCCCTCCCCCTGTTCCCCTTTTCCTCTGGAAGGCGATGAGAATACAGTATGATGATTCACCCTTCCCCTCCCCCTCCCCCTCCCTCCTCCTCCCCCTACTCTTCCTCCCCCTCCTCCTCCCTCTCCCCTCCCTCTCCTTCCCCTCCCTCCTCCTCCTCCCCCTCCCCTCCTCCTCCTCTTCCCCCTCCTCCTCTTCTTCCTCCCTTCCCCCTCCTCCTCCTCCCTCTCCTCCTCCTCCCCCTGTTCCCCTTTTCCTCTGGAAGGTGATGAGAGTACAGTATGATGATTCACCCTCCCCCCCCCTCCTCCCTCCCCCTCCCCTCCTCCTCCTCTTCCCCCTCCACCTCTT 0.6 RefCall . GT:GQ:DP:AD:PL ./.:9:96:61,35:0,13,9 | ||
chr20 59951039 pbsv.INS.45696 T TTGATGGTAGTGTGATGGTCTTGGTGCTGGTGGTGATGATAGTGGGGTTTATTGATGGTAGTGTGATGGTCTTGGTGGTGCTGATAATGGTGTGGTTTGTTGATGATAGTGTGATGGTCTTGGTGGTGGTGGCGTGGTTTGTTGATGGTAGTGTGATGGTCTTGGTGCTGGTGGTATGGTTTGTTGATGGTAGTGTGATGGTCTTGGTGCTGGTGGTGGTGGTGATGATGTAGTTTGTTGATGGTAGCGTGATGTTCTTGGTGCTGGTGGTGGTGATGATGTGGTTTGTTGATGGTAGTGTGATGGTCTTGGTGGTGGTGGTGGTGGTGATAGTGTGGTTTGTTGATGGTAGTGTGATGGTCTTGATGCTGGTGATGGTATTGGTGATGGTGTGGTTTGTTGATGGTAGTGTGATGGTCTTCGTGGTGGTGGTGTGGTTTGTCGATGGTAGTGTGGTGGTCTTGGTGCTGGTGGTTGTCGTGTGGTTTGTTGATGGTAGTGTGATGGTCTTGGTGCTGGTGGTGGTGATGTGGTTTGTTGATGATAGTGTGATGGTCTTGGTGGTGGTAGTGATGGTGTGGTTTGC 0.5 RefCall . GT:GQ:DP:AD:PL ./.:10:243:159,84:0,14,11 | ||
chr20 59958678 pbsv.INS.45697 G GATACATATCTATATATGATATATATGAATATATGAATATATGATACGTATGAATATATATGATATATATGGATATATGATACGTATGAATATATATCATATATGTATATATGATATGATATATATATGAATATATGATATATATGAACATATATATGGATATTATGAATATATATGATATATATGAATATATATGGATATATATGAATATATATGATATATGAATATATATGGATATATATGAATATATATGATTGATATATATCATATATGTATGCATATATGTGATATATATCATATATGA 0.3 RefCall . GT:GQ:DP:AD:PL ./.:12:212:129,83:0,16,13 | ||
chr20 59958834 pbsv.DEL.45664 TATATATGAAGATATATATGC T 0.2 RefCall . GT:GQ:DP:AD:PL ./.:14:208:118,90:0,18,16 | ||
chr20 59965316 pbsv.INS.45698 C CGGTCATAGGGTGCCCATAGTGCCGTGTCTGGGAAACCCCGATTGAGATCGTGCAGTCATAGGGTGCCCATAGCACCATGTCTAGGAAACCCCGATTGGGTTCGTGCAGTCGTAGGGTGCCCATAGTGCCGTGTCTGGGAAACCCCAATTGAGATCGTAT 1 RefCall . GT:GQ:DP:AD:PL ./.:7:201:116,85:0,11,7 | ||
chr20 59965633 pbsv.INS.45699 G GCGGTCATAGGGTGCCCATAGTCCCGTGTCTGGGAAGCCCCAATTGAGATCGTA 0.7 RefCall . GT:GQ:DP:AD:PL ./.:8:210:150,60:0,12,8 | ||
chr20 59965721 pbsv.INS.45700 A AACCCCAATTGAGATCGTACGGTCATAGGGTGCCCATAGTGCCGTGTCTGGGAAACCCCGATTGAGATCGTGCGGTCATAGGGTGCCCATAGCACCGTGTCTGGGAG 1.1 RefCall . GT:GQ:DP:AD:PL ./.:7:206:174,32:0,11,6 | ||
chr20 59974166 pbsv.DEL.45665 AAAAACCCACTTTTTTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTCACTCTGTCGCCCAGGCCGGACTGCGGACTGCAGTGGCGCAATCTCGGCTCACTGCAAGCTCCGCTTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCGCCACCATGCCCGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCTTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCATGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCC A 1.8 RefCall . GT:GQ:DP:AD:PL ./.:5:262:209,53:0,8,4 |
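
The records above follow the single-sample layout declared in the header: eight fixed columns, a `FORMAT` key list (`GT:GQ:DP:AD:PL`), and one sample column for `HG002`. As a reading aid, here is a minimal Python sketch that splits one such record and pairs each `FORMAT` key with its sample value. The helper name `parse_vcf_record` is hypothetical and is not part of DeepVariant. The sketch also assumes whitespace splitting is safe, which holds for these records but not for arbitrary VCF lines.

```python
def parse_vcf_record(line):
    """Split one single-sample VCF data line into a dict (illustrative helper)."""
    # Real VCF files are tab-separated; split() is used here because the
    # records as displayed above are separated by single spaces.
    fields = line.split()
    chrom, pos, vid, ref, alt, qual, flt, info, fmt, sample = fields[:10]
    # Pair each FORMAT key (GT, GQ, DP, AD, PL) with the matching sample value.
    sample_values = dict(zip(fmt.split(":"), sample.split(":")))
    return {
        "chrom": chrom,
        "pos": int(pos),
        "id": vid,
        "ref": ref,
        "alt": alt,
        "qual": float(qual),
        "filter": flt,
        "info": info,
        "sample": sample_values,
    }


# One of the shorter records from the golden output above.
record = parse_vcf_record(
    "chr20 59958834 pbsv.DEL.45664 TATATATGAAGATATATATGC T 0.2 "
    "RefCall . GT:GQ:DP:AD:PL ./.:14:208:118,90:0,18,16"
)

# AD holds one read depth per allele (reference first, then each ALT).
allele_depths = [int(x) for x in record["sample"]["AD"].split(",")]
print(record["filter"], record["sample"]["GT"], allele_depths)
```

In this record the `RefCall` filter is paired with a no-call genotype (`./.`), and the two `AD` values add up to the reported `DP`.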
Binary file modified (+48 Bytes, 100%, not shown): deepvariant/testdata/ucsc.hg19.chr20.unittest.fasta.gz.gzi