-
Notifications
You must be signed in to change notification settings - Fork 702
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
haploid contigs and PAR region options for DeepTrio #816
Comments
Hi @prasundutta87 , given that `--postprocess_variants_extra_args="--haploid_contigs=yourValue,--par_regions_bed=yourValue"" If you're using Docker, you likely need to make sure the par_regions_bed path are something that the run in Docker can access. |
And actually, if it's on the child sample or parent samples, you should use |
Hi @pichuan , thanks a lot for the quick reply. I am running the "run_deeptrio" script from DeepTrio docker (v1.6.1). As I have trios, I am running it all together in one command as described in https://github.com/google/deepvariant/blob/r1.6.1/docs/deeptrio-pacbio-case-study.md. I will then be merging the single sample gvcfs using GLnexus as described there. Since, I am using trios, will just using `--postprocess_variants_extra_args="--haploid_contigs=yourValue,--par_regions_bed=yourValue"" help or I need to use the second suggestion? I have attached the example command I am running for your reference:
|
Hi @prasundutta87 , Right, given that you're running on trio, you'll for sure want to specify different things for the parent1 and parent2: only the dad would be using And, for the child: you should only use those flags if the child is male. |
Sounds good, thanks a lot @pichuan . Just an additional query. If I want to joint call SNVs/indels in a set of multiple trios, which pipeline is recommended? Deepvariant or DeepTrio. My understanding is that if we joint call multiple samples together, the cohort allele frequency could be used to filter rare variants. If we use DeepTrio, AF calculation will be restricted to only the trio or family. What is the best practice here in terms of Deepvariant? |
Both DeepVariant and DeepTrio can produce gVCFs, so you can joint call in a similar manner. In both cases you should use the DeepVariant_unfiltered preset for GLnexus, because there is family structure present which the other filtering presets wouldn't know about. Because you can joint genotype multiple trio gVCFs from either DeepVariant or DeepTrio in the same way, I would use DeepTrio to produce the gVCFs, take all of the gVCFs and run them together through glnexus, and then you can still use allele frequency information. To be clear, the unfiltered preset looks like this:
|
Hi @AndrewCarroll ,..sorry for the late reply, but aren't the inherent models different for DeepTrio and DeepVariant? |
Yes, they are different models. You would presumably want to use either one or the other (all DeepVariant or all DeepTrio). You can merge either with glnexus though. |
Sounds good, thanks a lot, @AndrewCarroll |
Hi @pichuan, I tried to use here's my example code where parent1 is the father in the trio
but I got the following error:
checking --helpfull, it seems only --postprocess_variants_extra_args is available. Thank you. |
Hi @zihhuafang , Note that From there, you can modify to make sure the postprocess_variants has the correct flag for the corresponding samples. Sorry for the inconvenience for now. The |
Hi @pichuan, Thanks a lot for the explanation. |
If I generated a VCF file for a trio (with a father using deeptrio) or a solo male (using deepvariant) without chrX,chrY option. Can I fix the VCF after the run is finished? |
Hi,
How do I use --haploid_contigs="chrX,chrY" and --par_regions_bed parameters in DeepTrio? This option is not present in DeepTrio for sex chromosomes.
Regards,
Prasun
The text was updated successfully, but these errors were encountered: