Expected output? #7

wdecoster · 2020-09-17T08:16:00Z

Hi,

I expected to get a VCF with per variants-file a new column with genotypes ('wide'-format?), but what I get is a VCF with just a single "sample" (identifier taken from the first VCF), and a long list of variants which seem to iterate through all variants I had in my files. It starts with chr1-2-3-4-5 etc for sample1, then restarts at chr1 for the second sample,... etc. The only way for me to connect variants in the merged file with the original sample is by using the SUPP_VEC?

Or did I do something wrong?

Thanks,
Wouter

mkirsche · 2020-09-18T15:10:49Z

Hi Wouter,

Yes, what you described is the expected output from Jasmine by default. The intent there is to avoid extremely large VCFs in the case where there are many samples. As you pointed out, the SUPP_VEC (and IDLIST) allows you to trace back to the original VCF entries. If you prefer the output in the more traditional one-sample-per-column format, you can use the --output_genotypes flag which outputs the additional columns.

I hope that helps!
Melanie

wdecoster · 2020-09-18T18:16:05Z

Aha, I'll give that a try. Thanks!

wdecoster closed this as completed Sep 18, 2020

brentp mentioned this issue Jul 1, 2021

documentation help #19

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expected output? #7

Expected output? #7

wdecoster commented Sep 17, 2020

mkirsche commented Sep 18, 2020

wdecoster commented Sep 18, 2020

Expected output? #7

Expected output? #7

Comments

wdecoster commented Sep 17, 2020

mkirsche commented Sep 18, 2020

wdecoster commented Sep 18, 2020