Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GTC converter - WARNING - Reference is missing entry for chromosomes #38

Open
cbiOPela opened this issue Mar 28, 2019 · 8 comments
Open
Labels

Comments

@cbiOPela
Copy link

Hi Dr. Kelley!
I get this error when using your tools in a Linux Virtual Machine hosted in windows. When downloading genome fasta file i found some warnings related to "grep" command at the end of the process. I continue as usual with the installation and when running the script i get this error.
Could you please help me to fix that? I re-download the reference, the manifest and the git repo and i still having the same problem.

Thank you in advance.

Pelayo

$ ./gtc_to_vcf.py --gtc-paths /home/alfonso/Escritorio/GTCs/ --manifest-file /home/alfonso/Escritorio/GSA-24v2-0_A1.csv --genome-fasta-file /home/alfonso/Escritorio/GrCh37/hg19.fa --output-vcf-path /home/alfonso/Escritorio/GTCs

@jjzieve
Copy link
Contributor

jjzieve commented Mar 28, 2019

@cbiOPela Can you post the specific errors you're getting?

@AlfonsoICM
Copy link

last 2 lines pasted:

GTC converter - WARNING - Failed to process entry for record rs12868621: string index out of range.
GTC converter - ERROR - Reference is missing entry for chromosome 12

@AlfonsoICM
Copy link

I tried also with test files and i have the same error.

./gtc_to_vcf.py --manifest-file /home/alfonso/GTCtoVCF_GrCh37/tests/data/small_manifest.bpm --genome-fasta-file /home/alfonso/GTCtoVCF_GrCh37/tests/data/test_fasta.fa --skip-indels --output-vcf-path ./

@jjzieve
Copy link
Contributor

jjzieve commented Mar 30, 2019

@AlfonsoICM and @cbiOPela I'm having trouble reproducing this issue. It would seem the genome fasta file you're using doesn't have chromosome 12 (based on the error you provided)? Can you include the logs from download_reference.sh?

@jjzieve
Copy link
Contributor

jjzieve commented Mar 30, 2019

@AlfonsoICM and @cbiOPela I was able to reproduce some issues with download_reference.sh on my mac. I opened this PR #39 that fixed the issues I experienced. It may be useful to you. If that doesn't help, I'll need the specifics of your Linux VM (distro and version) to see if I can reproduce the problem.

@cbiOPela
Copy link
Author

cbiOPela commented Apr 1, 2019

Hi jjzieve!
Thank you for your quick response! For my part, I checked the parameters and files. I used another reference genome (always hg19) and changed the chip manifest again. The last error we get is this:

GTC converter - ERROR -

There are no specifications about the error at the prompt. It loads the GTC file, reads the reference and everything seems correct until we get this error. I've been able to use this tool many times on linux and I haven't had any problems. Is it possible that the error is defined by using a VM on a Windows 10?
This is the version of Oracle VM i have used:

  • VirtualBox 6.0.4 platform packages

  • lBox 6.0.4 Oracle VM VirtualBox Extension Pack

@jjzieve jjzieve mentioned this issue Apr 16, 2019
@jjzieve
Copy link
Contributor

jjzieve commented Apr 16, 2019

Hi @cbiOPela, I was not able to reproduce your specific issue. If you're able to use docker, this PR I opened (#40) may be of use to you.

@jjzieve
Copy link
Contributor

jjzieve commented Apr 30, 2019

@cbiOPela The log file should have additional information not printed to stdout. Can you attach that file by re-running the tool with --log-file?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants