-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Trouble generating images from a VCF file: Doesn't work on my data or sample data #101
Comments
Hi, Matthew, sorry about this problem. There is a bug in that version of
the samplot_vcf code and we're in the middle of a version transition. Could
you try installing via conda instead? This will give you a different
version of samplot in which that problem has been fixed. It will also
slightly alter the way samplot is invoked.
To install with conda, just run "conda install samplot"
To run the samplot vcf command: "samplot vcf" instead of "python
samplot_vcf.py".
Please let me know how that works for you! Happy to assist further.
…On Tue, Apr 14, 2020, 20:27 Matthew J. Oldach ***@***.***> wrote:
I'm able to run the samplot.py on the example data and on my own data but
am having issues when I try the samplot_vcf.py script.
As far as I know there is no Repeat Masker track for *C. elegans* (at
least not on UCSC) so the --important_regions regions.bed has been
omitted.
python ~/bin/samplot/src/samplot_vcf.py \
--vcf Pindel.vcf\
-d test/\
-O png\
-b 470.sorted.dedupped.bam > samplot_commands.sh
[W::bcf_hdr_check_sanity] PL should be declared as Number=G
I see an empty samplot_commands.sh file and when I go to open the
index.html I see zero records.
Because this did not work I want to validate on the example data; however,
I get similar results using all of the .bam files provided with samplot:
cd ~/bin/samplot/test/data/
python ~/bin/samplot/src/samplot_vcf.py --vcf test.vcf -d vcf-test/ -O png -b NA12878_restricted.bam > samplot_commands.sh
Your help would be greatly appreciated
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#101>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABSSNRYAEA6XFJIVQUSLBY3RMULPXANCNFSM4MIGVLLQ>
.
|
I've installed viz conda and run into the same issue (both on my data and test data from
|
My bad, I forgot to mention an additional change. The default behavior is now to autorun instead of printing out the commands. Did it create the images? You can also specify that you want to run manually by using the Again, my apologies |
Hi @jbelyeu So when you say autorun instead of printing out the commands I assume you meant to omit the
The output directory
|
Yes, and I'm not sure what the issue is now. Can you cd into the samplot directory and run the tests ( Also check that the version is 1.0.12 ( |
Hi @jbelyeu okay so
This also works. Pindel
Here I don't see anything in NGSep
LumpyTrying
Okay let's follow the prompt:
All of the other call sets produce either no-output like |
Ok, well, that narrows it down to data-specific problems. The other vcfs have different isssues. Lumpy.vcf probably just needs to be sorted before indexing (check this out: https://www.biostars.org/p/299659/). I'm not familiar with ngsep personally, but it appears not to include the SVTYPE field, which samplot vcf requires. |
Is sharing a subset of your pindel data an option? If so I'd be happy to try to reproduce the issue myself |
There are SVs larger than 20 bp in these files.
Earlier I ran into similar problems for the
After sorting and indexing the
However, following the biostars advice to sort/index the
|
What error did that last command generate? |
This is a tricky one that passes with no error message |
As I understand it, the first issue with the lumpy data was a failure to read the vcf, which threw an error. Looks like the sort/index steps solved that and we're back to the original problem that also occurred with the pindel vcf. That's what I call progress! Another thing you might check is that there's an exact match between the maddog sample id and the column name in the vcf |
Science typically progresses funeral by funeral so were on a better track!
Looks like it's the same name in the column as was used in the What's your e-mail? I have permission to share the Pindel.vcf. I can share a dropbox link with you. |
I'll also need part of the alignment file. You can use samtools to extract a region ( |
Ok, the issue is that in this Pindel vcf, all of the the genotype for this sample, for all variants, is homref. Samplot vcf only plots when it finds a sample recorded as het/homalt. Is this also the case for the Lumpy vcf? You may want to look into https://github.com/hall-lab/svtyper |
I'm able to run the
samplot.py
on the example data and on my own data but am having issues when I try thesamplot_vcf.py
script.As far as I know there is no Repeat Masker track for C. elegans (at least not on UCSC) so the
--important_regions regions.bed
has been omitted.I see an empty
samplot_commands.sh
file and when I go to open theindex.html
I see zero records.Because this did not work I want to validate on the example data; however, I get similar results using all of the
.bam
files provided withsamplot
:Your help would be greatly appreciated
The text was updated successfully, but these errors were encountered: