-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assertion `GRvar_genotypes[vr][tidx].prob >= 0.0f' failed, when phasing rare alleles #20
Comments
Hi,
Can you also send the command line you run, please? For both phase_common
and phase_rare?
Best,
Olivier Delaneau
Le mar. 4 avr. 2023 à 15:11, Tanguy Lallemand ***@***.***> a
écrit :
… Hello, @odelaneau <https://github.com/odelaneau> ,
Thanks for the great work with shapeit5!!
I'm looking into implementing it in a nextflow pipeline (from modules made
by @LouisLeNezet <https://github.com/LouisLeNezet>, thanks also to him!).
I have a problem when phasing rare variants with shapeit5 either 5.0 or
5.1.
Indeed the HMM computations are not done. Do you have any idea why this
might be?
``
[SHAPEIT5] Phase_rare (phase rare variants onto a haplotype scaffold)
- Authors : Simone RUBINACCI & Olivier DELANEAU, University of Lausanne
- Contact : ***@***.*** & ***@***.***
- Version : 5.1.0 / commit = e4f5e58
<e4f5e58>
/ release = 2023-03-29
- Run date : 04/04/2023 - 13:02:34
Files:
- Unphased data : [split.chr18.vcf.gz]
- Scaffold data : [chr18.ref.common.phased.vcf.gz]
- Output data : [chr18_11000001_12000000.vcf.gz]
Parameters:
- Seed : 15052011
- Threads : 8 threads
- PBWT : [depth = 2,2 / modulo = 0.1 / mac = 2 / mdr = 0.3]
- HMM : [Ne = 15000 / Constant recombination rate of 1cM per Mb]
Parsing specified genomic regions
- Scaffold region [chr18:10500001-12500000]
- Input region [chr18:11000001-12000000]
Reading genotype data
- BCF scanning ...
[W::hts_idx_load3] The index file is older than the data file:
split.chr18.vcf.gz.tbi
[W::hts_idx_load3] The index file is older than the data file:
chr18.ref.common.phased.vcf.gz.tbi
- BCF scanning (35.15s)
- Variant sites: [#scaffold=37997 | #rare=13362 | #all=51359]
- 21696 rare variants in buffer regions excluded
- 13362 variants found in both files [priority to scaffold]
- 5265 multi-allelic variants excluded
- HAP allocation [#scaffold=13362 / #samples=1842] (0.00s)
- Genotype set allocation [#rare=37997 / #samples=1842] (0.00s)
- BCF parsing ...
- BCF parsing [100%]
- Plain VCF/BCF parsing (36.04s)
- Scaffold : [0/0=23191904 0/1=821979 1/1=598921]
- Rare : [0/0=65436271 0/1=1925356 1/1=1326764 ./.=1302083]
Initializing sparse genotypes
- 0 missing genotypes imputed at monomorphic sites
- Genotype set transpose V2I (0.05s)
Setting up genetic map
- Region length [1499962 bp / 1.5 cM (assuming 1cM per Mb)]
- HMM parameters [Ne=15000 / Error=0.0001]
- V2H transpose (0.03s)
PBWT pass
-
PBWT initialization [#eval=10283 / #select=6] (0.00s)
-
IBD2 constraints [100%]
-
IBD2 constraints [#inds=0 / #pairs=0] (0.42s)
-
PBWT forward selection [100%]
-
PBWT forward selection (2.65s)
-
PBWT backward selection [100%]
-
PBWT backward selection (2.87s)
- #states=229.97+/-88.68
- #collisions = 17789 / #pushes = 70642 / rate = 79.88%
HMM computations
- Processing [0%]
phase_rare_static:
src/containers/genotype_set/genotype_set_phasing.cpp:63: void
genotype_set::phaseLiAndStephens(unsigned int, unsigned int,
aligned_vector32&, aligned_vector32&, std::vector&, float): Assertion
`GRvar_genotypes[vr][tidx].prob >= 0.0f' failed.
``
—
Reply to this email directly, view it on GitHub
<#20>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AD4XTIO4X66UFKGXP6MVTVLW7QMY5ANCNFSM6AAAAAAWSWIMTE>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Hi, Yes, sure here they are. phase_common_static Between this two line i have ligated all file produced with phase common, producing chr18.ref.common.phased.vcf.gz. phase_rare_static I am running those files in a docker container based on your released one. Regards, T |
Sorry, I come back to this quite late. Is this problem still happening even with last version. Two comments anyway:
[W::hts_idx_load3] The index file is older than the data file: split.chr18.vcf.gz.tbi |
I am having the same problem with
After
I can share via email this test case if you want to reproduce the issue. |
Thanks for SHAPEIT. It is a great tool. I'm having the same issue described here. Same version as above:
Commands used:
|
Any news on this error? I'm also getting this error, but for some chunks only. What do you think might be the source of the error? Could it be some weird variants/format in the chunk that is being processed? |
Hi all! I was getting the same error when trying to phase rare variants. Actually, I was getting two flavors of the same error:
I managed to fix it by polishing the vcf. In particular, I updated the INFO tags (with bcftools +fill-tags). It would be nice if the upcoming versions of shapeit5 were a bit more explicit about this kind of fails or at least the requirements for the vcf were outlined in the documentation :). |
I was having the same issue for some of the chunks when using phase rare. I fixed it by increasing the chunk size. Best |
@jalghor mean you generated the chunks for rare variants with more than |
Hi all! Are there any news on this error? I have tried increasing the chunk size, but still got the error. However, independent of chunk size, I am getting the error only for some chunks, so I think in my case the chunk size is not the issue (generated chunks with glimpse2 as suggested). I also tried bcftools +fill-tags and VCF format is fine. I would appreciate any update. Thanks! |
Hello, @odelaneau ,
Thanks for the great work with shapeit5!!
I'm looking into implementing it in a nextflow pipeline (from modules made by @LouisLeNezet, thanks also to him!).
I have a problem when phasing rare variants with shapeit5 either 5.0 or 5.1.
Indeed the HMM computations are not done. Do you have any idea why this might be?
``
[SHAPEIT5] Phase_rare (phase rare variants onto a haplotype scaffold)
Files:
Parameters:
Parsing specified genomic regions
Reading genotype data
[W::hts_idx_load3] The index file is older than the data file: split.chr18.vcf.gz.tbi
[W::hts_idx_load3] The index file is older than the data file: chr18.ref.common.phased.vcf.gz.tbi
Initializing sparse genotypes
Setting up genetic map
PBWT pass
PBWT initialization [#eval=10283 / #select=6] (0.00s)
IBD2 constraints [100%]
IBD2 constraints [#inds=0 / #pairs=0] (0.42s)
PBWT forward selection [100%]
PBWT forward selection (2.65s)
PBWT backward selection [100%]
PBWT backward selection (2.87s)
HMM computations
phase_rare_static: src/containers/genotype_set/genotype_set_phasing.cpp:63: void genotype_set::phaseLiAndStephens(unsigned int, unsigned int, aligned_vector32&, aligned_vector32&, std::vector&, float): Assertion `GRvar_genotypes[vr][tidx].prob >= 0.0f' failed.
``
The text was updated successfully, but these errors were encountered: