Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Substr outside of string in TranscriptVariationAllele.pm line 906 #1013

Closed
ckandoth opened this issue Jun 23, 2021 · 5 comments
Closed

Substr outside of string in TranscriptVariationAllele.pm line 906 #1013

ckandoth opened this issue Jun 23, 2021 · 5 comments
Assignees

Comments

@ckandoth
Copy link

ckandoth commented Jun 23, 2021

Same as issue #1006 but now with details on how to reproduce.

System

  • VEP version: 102.0
  • VEP Cache version: 102
  • Perl version: 5.26.2
  • OS: Ubuntu
  • tabix is installed

Full VEP command line

~/mambaforge/bin/vep --species homo_sapiens --assembly GRCh38 --no_progress --no_stats --buffer_size 5000 --sift b --ccds --uniprot --hgvs --symbol --numbers --domains --gene_phenotype --canonical --protein --biotype --uniprot --tsl --variant_class --shift_hgvs 1 --check_existing --total_length --allele_number --no_escape --xref_refseq --failed 1 --vcf --flag_pick_allele --pick_order canonical,tsl,biotype,rank,ccds,length --dir /home/ckandoth/.vep --fasta /ngs/ref/GCA_000001405.15_GRCh38_full_plus_hs38d1_analysis_set.fna --format vcf --input_file test.vcf --output_file ./test.vep.vcf --force_overwrite --offline --pubmed --polyphen b --af --af_1kg --af_esp --af_gnomad --regulatory

Full error message

Possible precedence issue with control flow operator at /home/ckandoth/mambaforge/lib/site_perl/5.26.2/Bio/DB/IndexedBase.pm line 805.
substr outside of string at /home/ckandoth/mambaforge/share/ensembl-vep-102.0-0/Bio/EnsEMBL/Variation/TranscriptVariationAllele.pm line 906, <__ANONIO__> line 3.
Use of uninitialized value in uc at /home/ckandoth/mambaforge/share/ensembl-vep-102.0-0/Bio/EnsEMBL/Variation/TranscriptVariationAllele.pm line 906, <__ANONIO__> line 3.
substr outside of string at /home/ckandoth/mambaforge/share/ensembl-vep-102.0-0/Bio/EnsEMBL/Variation/TranscriptVariationAllele.pm line 906, <__ANONIO__> line 3.

Input VCF

##fileformat=VCFv4.2
##reference=GRCh38
#CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	TUMOR
chr10	80081665	rs113172526	C	CAA,CA	2482.52	PASS	.	GT:AD:DP	0/1:17,0,25:42

Based on the allele depths above, the GT field should really be 0/2. I don't know if that is causing the VEP error. The error does not occur if the two ALT alleles are split into separate VCF lines:

##fileformat=VCFv4.2
##reference=GRCh38
#CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	TUMOR
chr10	80081665	rs113172526	C	CAA	2482.52	PASS	.	GT:AD:DP	0/0:17,0:42
chr10	80081665	rs113172526	C	CA	2482.52	PASS	.	GT:AD:DP	0/1:17,25:42
@aparton
Copy link
Contributor

aparton commented Jun 24, 2021

Hi @ckandoth,

Thank you for this report. I'm able to reproduce the issue and we'll continue to investigate.

The issue seems to be tied to the --no_stats flag. If you remove this option, then this should allow you to continue your analyses in the meantime.

Kind Regards,
Andrew

@Anahit19
Copy link

Dear @ckandoth,

Thank you very much for your kind help!

@Stikus
Copy link

Stikus commented Jul 5, 2021

Hello, we're experiencing the same problem with VEP 104.3.

substr outside of string at /soft/ensembl-vep-104.3/Bio/EnsEMBL/Variation/TranscriptVariationAllele.pm line 906, <$fh> line 2757660.
Died in forked process 10571

Our run command:
/soft/ensembl-vep-104.3/vep --fork 96 --input_file /inputs/sample.PASSonly.vcf.gz --format vcf --cache --dir_cache /ref/.vep --dir_plugins /soft/ensembl-vep-104.3/Plugins --assembly GRCh38 --offline --output_file /trash/sample.PASSonly.annot-vep.vcf --vcf --no_stats --force_overwrite --symbol --check_existing --terms SO --tsl --hgvs --hgvsg --transcript_version --failed 1 --af --max_af --af_1kg --af_esp --af_gnomad --pubmed --sift b --polyphen b --variant_class --gene_phenotype --regulatory --numbers --domains --protein --canonical --ccds --uniprot --biotype --shift_hgvs 1 --xref_refseq --flag_pick_allele --no_escape --minimal --allele_number --total_length --plugin CADD,/ref/CADD/1.6/GRCh38/whole_genome_SNVs.tsv.gz --custom /ref/ClinVar/GRCh38/clinvar_20210626.vcf.gz,ClinVar,vcf,exact,0,CLNDN,CLNDNINCL,CLNDISDB,CLNDISDBINCL,CLNHGVS,CLNREVSTAT,CLNSIG,CLNSIGCONF,CLNSIGINCL,CLNVC,CLNVCSO,CLNVI

And here is another issue about this problem: #818

@aparton
Copy link
Contributor

aparton commented Jul 5, 2021

Hi @Stikus,

Thank you for bringing the duplicated issue to our attention. Unfortunately we do not have a fix in place for this issue yet. As a workaround, running your VEP command without the --no_stats option enabled should prevent this issue from happening in the short term.

Please let me know if there's anything else we can do to help.

Kind Regards,
Andrew

@aparton
Copy link
Contributor

aparton commented Jul 5, 2021

Closing this issue now as it's a duplicate of #818.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants