vcf output file can't be indexed with IGV - malformed header #29
Thanks for looking into this. I haven't used IGV before, so I'm not sure exactly what requirements it has for the VCF file. The pyrad VCF for de novo data is a little different from a standard VCF since there isn't a real reference, but rather just a pseudo-reference that we make up from the most common base at each site. I added the format option to pyrad because it was requested, but I have not tested it rigorously, so I'm not surprised the format might be incompatible with some software. I'm interested in fixing whatever the problem is, though.
I've already made some changes to the VCF format in our new software ipyrad, which I encourage you guys to check out (http://ipyrad.readthedocs.io). We now store read depth information in the VCF, so it's a lot more data-rich. Looks like we've removed the blank line after the headers too, which might fix the problem. I'll try to check out IGV when I get a chance.
Here's the first few lines of the ipyrad VCF output:
@atcg - Thanks for pointing out that extra line after the #CHROM line! I must've accidentally added it (a recent PyRad run did NOT have that extra line). When I remove that empty line AND the four extra spaces after the INFO column name, there are no issues with IGV.
@dereneaton - Thanks for your work on this and for the heads up on ipyrad; will check it out!
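For anyone hitting the same error, the manual fix described above (dropping the empty line after the header and trimming the extra spaces after the INFO column name) could be scripted roughly like this. This is only a sketch, not part of pyrad: the function name `clean_vcf_lines` is made up for illustration, and it assumes the header columns are tab-separated and that the stray spaces are padding around a column name.

```python
def clean_vcf_lines(lines):
    """Drop blank lines and strip stray padding from the #CHROM header fields.

    Hypothetical helper, not part of pyrad or ipyrad.
    """
    cleaned = []
    for line in lines:
        text = line.rstrip("\r\n")
        if not text.strip():
            # Skip empty lines, e.g. the one right after the #CHROM line.
            continue
        if text.startswith("#CHROM"):
            # Assumption: columns are tab-separated, so padding such as
            # "INFO    " can be removed by stripping each field.
            text = "\t".join(field.strip() for field in text.split("\t"))
        cleaned.append(text + "\n")
    return cleaned
```

To use it on a file, read the lines with `open(path).readlines()`, pass them through `clean_vcf_lines`, and write the result to a new file with `writelines` before loading it in IGV. Data lines are left untouched; only blank lines and the column header line are changed.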