CBOR Research #342

vinniefalco · 2020-09-15T14:40:57Z

We need preliminary research into CBOR. Is the mapping to and from a json::value clean? Are there any obstacles to implementing a parser and serializer for CBOR? This is research-only, no implementation.

The text was updated successfully, but these errors were encountered:

vinniefalco · 2020-09-15T14:44:07Z

https://cbor.io/spec.html

There are extensible "tags", which might not map: https://www.iana.org/assignments/cbor-tags/cbor-tags.xhtml

rainerdeyke · 2020-09-16T09:22:43Z

A CBOR implementation is not expected to know all semantic tags (and generally cannot, since new semantic tags can be added at any time). It is perfectly valid to ignore all semantic tags completely, and this is explicitly stated in the CBOR spec. Section 2.4, paragraph 3: "Understanding the semantic tags is optional for a decoder; it can just jump over the initial bytes of the tag and interpret the tagged data item itself." This seems like the best approach for Boost.JSON, since none of the types used by JSON require semantic tags.

vinniefalco · 2020-09-16T14:08:26Z

The one interesting decision that needs to be made here is how to handle CBOR byte strings (major type 3), as they aren't representable in JSON or in the current boost::json::value.

vinniefalco · 2020-09-20T20:45:03Z

Well Peter did this

ecorm · 2024-04-22T21:23:36Z

A CBOR implementation is not expected to know all semantic tags (and generally cannot, since new semantic tags can be added at any time). It is perfectly valid to ignore all semantic tags completely, and this is explicitly stated in the CBOR spec. Section 2.4, paragraph 3: "Understanding the semantic tags is optional for a decoder; it can just jump over the initial bytes of the tag and interpret the tagged data item itself." This seems like the best approach for Boost.JSON, since none of the types used by JSON require semantic tags.

Sorry for the late comment. There are a number of CBOR tags where it's impossible to reconstitute the value without knowing the tag. For example, tags 2 (unsigned bignum) and 3 (negative bignum). I've pointed out this flaw years ago on the CBOR mailing list, yet new tags keep popping up where part of the information (beyond semantics) is encoded in the tag. They are too obsessed over saving every possible byte, and don't seem care about being able to transcode to JSON without losing unrecoverable information.

vinniefalco assigned sdkrystian Sep 15, 2020

vinniefalco closed this as completed Sep 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CBOR Research #342

CBOR Research #342

vinniefalco commented Sep 15, 2020

vinniefalco commented Sep 15, 2020

rainerdeyke commented Sep 16, 2020

vinniefalco commented Sep 16, 2020

vinniefalco commented Sep 20, 2020

ecorm commented Apr 22, 2024

CBOR Research #342

CBOR Research #342

Comments

vinniefalco commented Sep 15, 2020

vinniefalco commented Sep 15, 2020

rainerdeyke commented Sep 16, 2020

vinniefalco commented Sep 16, 2020

vinniefalco commented Sep 20, 2020

ecorm commented Apr 22, 2024