Skip to content

Exomiser VCF output includes whitespace in INFO field which are forbidden in VCF<4.3 #486

@ielis

Description

@ielis

Hi, I think there may be a bug in VCF file that is produced by Exomiser.

Specifically, the EXOMISER_ACMG_DISEASE_NAME sub-field may include a value such as "Presynaptic congenital myasthenic syndromes". However, the disease name will frequently contain whitespace characters which are not allowed in VCF<4.3.

The section 1.4.1 (8) of the VCF4.2 specs forbids presence of whitespace characters.

INFO - additional information: (String, no whitespace, semicolons, or equals-signs permitted; commas are
permitted only as delimiters for lists of values) INFO fields are encoded as a semicolon-separated series of short
keys with optional values in the format: =[,data]. ...

However, the restriction was apparently lifted in VCF 4.3:

INFO — additional information: Semicolon-separated series of additional information fields, or the MISS-
ING value ‘.’ if none are present.
...
Space characters are allowed in values.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions