You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running vg construct (https://github.com/vgteam/vg), the vcflib component checking appears to identify the vcf file as having an incorrect field order. A similar bug has previously been filed and addressed within htslib:
In the example case a single new vg file, describing the variants in a graph format should be produced.
The 1k genomes repository is an excellent test set, that was designed to aid and accelerate tools development and is widely used. The current changes to the order testing will therefore be an issue that impacts anyone looking to use vcf tools based on the current vcflib releases
It has been suggested that the issue may be related to the vcflib code here:
With thanks to local team members for their help in tracking this issue to the above code base and many thanks to the vcflib team for making these tools available
Pete
The text was updated successfully, but these errors were encountered:
Thanks @PeteClapham. This refers to some 12 year old code ;). I haven't read everything, but I think the gist is that vcflib should not check for the field position, though it may check whether the field exists or not. Note, this refers to the VCF header, not to the column layout of the file itself. I think that is fair. Do you want to try and write a patch?
commit 6e3ede4
Author: Pjotr Prins <pjotr.public01@thebird.nl>
Date: Sat Jan 14 23:29:50 2023 -0600
Allow for different field order - see vcflib#365
because it introduced a bug discovered by the Debian packagers. Thank atille and
garguatua_kerr!
Only bug reports!
When running vg construct (https://github.com/vgteam/vg), the vcflib component checking appears to identify the vcf file as having an incorrect field order. A similar bug has previously been filed and addressed within htslib:
VCF Header: must Number be before Type? · Issue #642
samtools/hts-specs#642
The over strict checking rejects files where the field order may vary in the header. An example of the impact using vg, is described below:
The vcf's used are from the 1k genomes project and are of known verified quality. Downloads are available from here:
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000G_2504_high_coverage/working/20220422_3202_phased_SNV_INDEL_SV/
In the example case a single new vg file, describing the variants in a graph format should be produced.
The 1k genomes repository is an excellent test set, that was designed to aid and accelerate tools development and is widely used. The current changes to the order testing will therefore be an issue that impacts anyone looking to use vcf tools based on the current vcflib releases
It has been suggested that the issue may be related to the vcflib code here:
vcflib/src/Variant.cpp
Line 1839 in 6dbe2f6
With thanks to local team members for their help in tracking this issue to the above code base and many thanks to the vcflib team for making these tools available
Pete
The text was updated successfully, but these errors were encountered: