First-class string type in serialization specification #13

pgriess · 2010-07-23T15:48:14Z

Packing all strings as raw byte arrays makes it very difficult to figure out how to unpack them correctly. In particular, it is impossible to know what encoding was used when encoding the string as a sequence of bytes. To address this, it would be nice to have a first-class MSGPACK_OBJECT_STRING type with a mandatory encoding (say, UTF-8).
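To make the ambiguity concrete, here is a minimal sketch using only the Python standard library (no MessagePack library is involved, and the variable names are purely illustrative). The same two bytes decode cleanly under two different encodings, and nothing in the bytes says which reading was intended:

```python
# The sender encodes the one-character string "é" as UTF-8.
payload = "é".encode("utf-8")  # two bytes: 0xC3 0xA9

# A receiver that only sees raw bytes has to guess the encoding.
as_utf8 = payload.decode("utf-8")      # the intended text
as_latin1 = payload.decode("latin-1")  # also succeeds, but yields different text

print(repr(as_utf8), repr(as_latin1))
```

Both decodes succeed without error, so the receiver gets no signal that it guessed wrong.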

aaronblohowiak · 2011-05-25T07:07:51Z

Any news on this?

andrewschaaf · 2011-08-31T15:13:57Z

Packing all strings as raw byte arrays makes it very easy to figure out how to unpack them correctly: as Buffers/byte[]s/....

Libraries could have

unpackRaw (doin' it rite)
unpackUnicode (attempting to autodetect encodings, starting with UTF-8)

instead of unpack
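A rough sketch of what that split could look like, assuming the raw payload has already been extracted from the message. The names `unpack_raw` and `unpack_unicode` are hypothetical Python spellings of the functions suggested above, not part of any existing MessagePack library, and the list of candidate encodings is an assumption:

```python
from typing import Tuple


def unpack_raw(payload: bytes) -> bytes:
    """Return the bytes untouched; the caller decides what they mean."""
    return payload


def unpack_unicode(payload: bytes) -> Tuple[str, str]:
    """Guess an encoding, starting with UTF-8.

    Returns the decoded text and the name of the encoding that worked.
    The candidate list is illustrative; latin-1 is last because it
    accepts every byte sequence and so acts as a catch-all.
    """
    for encoding in ("utf-8", "utf-16", "latin-1"):
        try:
            return payload.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    raise AssertionError("unreachable: latin-1 decodes any byte sequence")
```

Note that the guess can succeed and still be wrong, which is the weakness of any autodetection approach.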

tracker1 · 2012-07-26T17:58:07Z

@andrewschaaf The issue is more about dealing with cross-system messages. For example, one system may have a native in-memory representation of strings as UTF-16, while another may use UTF-8. Since UTF-8 is usually the most efficient, it would make sense to have a string type that is always UTF-8 encoded without a BOM.
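As an illustration of that cross-system case, the sketch below simulates a sender whose in-memory strings are UTF-16 and transcodes to BOM-less UTF-8 for the wire. It uses only Python's built-in codecs, and the variable names are illustrative:

```python
import codecs

text = "hello, world"

# Simulated in-memory form on a system that stores strings as UTF-16.
utf16_in_memory = text.encode("utf-16-le")

# Transcode to UTF-8 for the wire. Python's "utf-8" codec writes no BOM
# (the separate "utf-8-sig" codec is the one that adds it).
wire = utf16_in_memory.decode("utf-16-le").encode("utf-8")

print(len(utf16_in_memory), len(wire))
print(wire.startswith(codecs.BOM_UTF8))
```

For ASCII-heavy text like this, UTF-8 takes half the bytes of UTF-16. The "usually" matters, though: for text dominated by CJK characters, UTF-8 needs three bytes per character where UTF-16 needs two.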

tracker1 · 2012-09-14T16:53:03Z

For that matter, you could just put the UTF-8 encoded byte order mark (BOM) at the beginning of your raw data; when reading it out, you'll "know" that it's a UTF-8 string.
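A minimal sketch of that convention, assuming both sides agree on it out of band. `tag_utf8` and `read_raw` are hypothetical helper names, and `codecs.BOM_UTF8` is the standard library constant for the bytes EF BB BF:

```python
import codecs
from typing import Union


def tag_utf8(text: str) -> bytes:
    """Encode text as UTF-8 and prefix it with the UTF-8 BOM."""
    return codecs.BOM_UTF8 + text.encode("utf-8")


def read_raw(payload: bytes) -> Union[str, bytes]:
    """Treat BOM-prefixed payloads as UTF-8 text, everything else as bytes."""
    if payload.startswith(codecs.BOM_UTF8):
        return payload[len(codecs.BOM_UTF8):].decode("utf-8")
    return payload
```

This is a convention rather than a type: binary data that happens to begin with EF BB BF would be misread as text, which is why the "know" above stays in quotes.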

cabo · 2013-02-20T13:30:10Z

The discussion of this issue is just exploding in #121

(And I'll plug http://tools.ietf.org/html/draft-bormann-apparea-bpack here, too.)

cabo · 2013-02-24T19:46:37Z

Well, it seems we are continuing the technical discussion in #128 today.

kuenishi · 2013-08-17T08:01:31Z

See the new spec

kuenishi closed this as completed Aug 17, 2013
