Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Joint genotyping tool #45

Closed
llllaaaa opened this issue Jan 31, 2018 · 5 comments
Closed

Joint genotyping tool #45

llllaaaa opened this issue Jan 31, 2018 · 5 comments
Assignees

Comments

@llllaaaa
Copy link

Is there a recommended tool for joint genotyping the gvcfs of a trio?

Thanks

@llllaaaa
Copy link
Author

llllaaaa commented Feb 7, 2018

@rpoplin Do you have any suggestions? Thanks

@cmclean
Copy link
Collaborator

cmclean commented Feb 7, 2018

At present we do not have a specific recommendation for joint genotyping DeepVariant gVCFs. Given the accurate genotype likelihood calibration of single-sample DeepVariant calls it may be better to simply merge calls without computing genotype posteriors based on population allelic frequencies and then altering the genotypes. We are actively investigating the performance of different methods to be able to provide a set of best practices.

That said, the gVCF outputs of DeepVariant are syntactically and semantically equivalent to those produced by other tools like GATK, so can be used by any existing joint genotyping tools.

@jaqueytw
Copy link

jaqueytw commented Apr 9, 2019

Hi there!

I tried to combine the deepvariant's gvcf files using GATK and it returned a error because of the "NON_REF".
Do you have any other recommendation? Or did you tested your gvcf in GATK?

This is the error that I got using GATK:
A USER ERROR has occurred: The list of input alleles must contain <NON_REF> as an allele but that is not the case at position 10325413; please use the Haplotype Caller with gVCF output to generate appropriate records

The DeepVariant "<NON_REF>" allele is "<*>"

@AndrewCarroll
Copy link
Collaborator

Hi @jaqueytw

To combine DeepVariant gVCFs together, we currently recommend using GLnexus with the --config deepvariant flag. This should combine DeepVariant gVCFs in a valid manner without filtering or re-genotyping.

We have used GLnexus in this way in some of our early cohort investigations (for example in this blog on a mosquito pedigree).

We are currently in the process of more fully evaluating the optimal ways to use GLnexus to merge DeepVariant calls. As this functionality matures, we will update our recommendations.

Finally, to clarify our recommendations regarding GATK-based methods to combine gVCFs. Although these may run, in these sense that the gVCF format used by DeepVariant can be successfully used, we do not currently recommend their use to combine DeepVariant gVCFs.

@AndrewCarroll
Copy link
Collaborator

Hi @llllaaaa

I wanted to update this issue with recent developments for cohort merging. With the DeepVariant v0.9 release, we recommend Best practices for multi-sample variant calling with DeepVariant

We encourage you or other users interested in multi-sample calling to follow these recommendations to combine multiple DeepVariant gVCFs

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants