Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
storage: Handle overlap with symbolic NO_VARIATION #877
- Loading branch information
Showing
4 changed files
with
106 additions
and
9 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
15 changes: 15 additions & 0 deletions
15
...ncga-storage-hadoop/opencga-storage-hadoop-core/src/test/resources/gaps2/file1.genome.vcf
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
##fileformat=VCFv4.2 | ||
##ALT=<ID=NON_REF,Description="Represents any possible alternative allele at this location"> | ||
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> | ||
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Filtered basecall depth used for site genotyping"> | ||
##FORMAT=<ID=AD,Number=R,Type=Integer,Description=""> | ||
##INFO=<ID=END,Number=1,Type=Integer,Description="End position of the region described in this record"> | ||
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT s1 | ||
1 1 . N <NON_REF> . . END=10003 GT:DP .:. | ||
1 10004 . C <NON_REF> . . END=10010 GT:DP:AD 0/0:3:2,1 | ||
1 10011 . ATTT A 2 . . GT:DP:AD 0/1:41:20,21 | ||
1 10015 . A <NON_REF> . . END=10020 GT:DP:AD 0/0:7:6,1 | ||
1 10020 . A T 2 . . GT:DP:AD 0/1:42:20,22 | ||
1 10021 . A <NON_REF> . . END=10030 GT:DP:AD 0/0:7:6,1 | ||
1 10031 . T TAAA 1 . . GT:DP:AD 0/1:43:20,23 | ||
1 10032 . A <NON_REF> . . END=10043 GT:DP:AD 0/0:5:4,1 |
18 changes: 18 additions & 0 deletions
18
...ncga-storage-hadoop/opencga-storage-hadoop-core/src/test/resources/gaps2/file2.genome.vcf
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
##fileformat=VCFv4.2 | ||
##ALT=<ID=NON_REF,Description="Represents any possible alternative allele at this location"> | ||
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> | ||
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Filtered basecall depth used for site genotyping"> | ||
##FORMAT=<ID=AD,Number=R,Type=Integer,Description=""> | ||
##INFO=<ID=END,Number=1,Type=Integer,Description="End position of the region described in this record"> | ||
##GAPS=1:10015-10030 with 1:10020:A:T | ||
##MULTI_OVERLAP=1:10013:T:C and 1:10014:A:T with 1:10011:ATTT:A | ||
##INSERTION_GAP=1:10031:T:TAAA does not overlap with any from here | ||
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT s2 | ||
1 1 . N <NON_REF> . . END=10003 GT:DP .:. | ||
1 10004 . C <NON_REF> . . END=10012 GT:DP:AD 0/0:3:2,1 | ||
1 10013 . T C 2 . . GT:DP:AD 0/1:30:10,20 | ||
1 10014 . T A 2 . . GT:DP:AD 0/1:31:11,21 | ||
1 10031 . T G 1 . . GT:DP:AD 0/1:32:12,22 | ||
1 10032 . A <NON_REF> . . END=10034 GT:DP:AD 0/0:6:5,1 | ||
1 10035 . A G 1 . . GT:DP:AD 0/1:33:13,23 | ||
1 10036 . A <NON_REF> . . END=10043 GT:DP:AD 0/0:5:4,1 |