Single-byte character set #95

@donmendelson

In SBE, the default encoding for a single char and for a fixed-length character array was specified as US-ASCII. The motivation was compatibility with FIX tagvalue encoding. When FIX started over 20 years ago, its character set was specified simply as ASCII, originally a 7-bit encoding. However, some FIX engines now support 8-bit "extended ASCII". Exactly what that means must be defined carefully, because there are several related but slightly incompatible encoding standards. Some have interpreted it as ISO/IEC 8859-1. There is also the later ISO/IEC 8859-15. Some discussion involves UTF-8, but that strikes me as wrong for this usage because it is a multi-byte encoding, even though its first 128 code points coincide with ASCII.
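To make the incompatibility concrete, here is a minimal sketch using Python's standard codecs. The byte value 0xA4 and the helper names `decode_single_byte` and `encoded_length` are chosen only for illustration; nothing here is taken from the SBE or FIX specifications.

```python
# Illustrative only: how the candidate encodings treat the same byte
# and the same character. Helper names are hypothetical.

RAW = bytes([0xA4])  # a byte above the 7-bit ASCII range


def decode_single_byte(raw: bytes, encoding: str) -> str:
    """Decode a byte string with the named codec."""
    return raw.decode(encoding)


def encoded_length(text: str, encoding: str) -> int:
    """Number of bytes the text occupies in the named codec."""
    return len(text.encode(encoding))


# The same byte maps to different characters in the two 8859 parts.
latin1_char = decode_single_byte(RAW, "iso8859-1")   # U+00A4 CURRENCY SIGN
latin9_char = decode_single_byte(RAW, "iso8859-15")  # U+20AC EURO SIGN
print(hex(ord(latin1_char)), hex(ord(latin9_char)))

# ASCII characters stay one byte in UTF-8, but an accented letter does not,
# so it would no longer fit in a single-byte char field.
print(encoded_length("A", "utf-8"))          # one byte
print(encoded_length("\u00e9", "utf-8"))     # two bytes
print(encoded_length("\u00e9", "iso8859-1")) # one byte
```

A decoder that assumes strict US-ASCII rejects the byte outright, which is why the choice has to be stated in the specification rather than left to each implementation.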

Within the FIX standard, this will be specified more precisely for future use. Meanwhile, I would like to solicit feedback on how it should be handled in SBE.
