dag-cbor,dag-json: relax strictness rules for decoders #214

rvagg · 2022-05-10T11:15:58Z

Fixes: #196

Current text reflects an idealistic perspective, not a pragmatic one that
accounts for historical data and handling data from encoders that don't quite
follow the rules. Nor do the current statements requiring strictness reflect
the reality of our current decoders, which are lose and don't even yet have
full opt-in strictness modes; even though we note the lack of strictness in
the implementation notes.

willscott

I'm happy about having this as a must on encode and should on decode.

rvagg · 2022-05-10T11:56:56Z

I'm happy about having this as a must on encode and should on decode.

so I was just writing a response to this and getting references to existing code when I noticed that I actually do some decode strictness in JS for dag-cbor:

> dagCBOR.decode(Buffer.from('190064', 'hex'))
Uncaught:
Error: CBOR decode error: integer encoded in more bytes than necessary (strict decode)

When I wrote the new codec stack for dag-cbor I started building in a decode strictness option but only did the easiest bits, with the hopes of getting to the rest later (I still hope to); and I enabled that strictness all the way up to the decode() people use. It's only doing checks on int sizes, but that also applies to lengths and tags. And this has been the dominant JS decoder for nearly 2 years now without anyone reporting problems with that, which is nice.

So the non-strict pieces for the JS codec is:

Map key ordering: map entries may be accepted in any order
~~Integer encoding lengths need not be as short as possible~~
~~Length descriptors of major types 2 through 5 need not be as short as possible~~
~~The expression of tag 42 need not be as short as possible (0xd82a)~~
Floating point values may be represented as single and half precision

Floating point values are rarely used and we recommend against them; but map key ordering is something we can't afford to break, cbor-gen does it wrong, the Filecoin genesis block has it wrong, 🤷.

So I've pushed another commit that adjusts things somewhat. The decode strictness section now starts with:

Due to the existence and active use of historical data, and the existence and active use of non-conforming encoders, DAG-CBOR decoders may relax strictness requirements by default. A strictness opt-in may be offered for systems where round-trip determinism is a desirable feature and backward compatibility with old, non-strict data is unnecessary.

And I've also added cbor-gen to the implementations list while I'm in there, which lead me to add a recommendation for what to use for both JS and Go.. I'm not comfortable recommending cbor-gen while whyrusleeping/cbor-gen#56 remains unmerged.

vmx · 2022-05-10T12:32:06Z

specs/codecs/dag-cbor/spec.md

@@ -68,12 +68,12 @@ Therefore the DAG-CBOR codec must:

 ### Decode strictness

-DAG-CBOR decoders should not enforce all of the above strictness requirements by default, but may provide an opt-in for systems where round-trip determinism is a desireable feature and backward compatibility with old, non-strict data is unnecessary.
+Due to the existence and active use of historical data, and the existence and active use of non-conforming encoders, DAG-CBOR decoders may relax strictness requirements by default. A strictness opt-in may be offered for systems where round-trip determinism is a desirable feature and backward compatibility with old, non-strict data is unnecessary.


I like this wording even more! I was close commenting that I would prefer a "may" over a "should", but then didn't want to block this PR.

RangerMauve

LGTM. The wording and explanation is clear and concise.

Fixes: #196 Current text reflects an idealistic perspective, not a pragmatic one that accounts for historical data and handling data from encoders that don't quite follow the rules. Nor do the current statements requiring strictness reflect the reality of our current decoders, which are lose and don't even yet have full opt-in strictness modes; even though we note the lack of strictness in the implementation notes.

rvagg requested review from vmx, aschmahmann and a team May 10, 2022 11:15

willscott reviewed May 10, 2022

View reviewed changes

vmx approved these changes May 10, 2022

View reviewed changes

rvagg force-pushed the rvagg/strictness-nope branch from 8a6ce41 to c8c08e1 Compare May 10, 2022 11:54

vmx reviewed May 10, 2022

View reviewed changes

RangerMauve approved these changes May 10, 2022

View reviewed changes

rvagg added 2 commits May 13, 2022 18:04

fixup! dag-cbor,dag-json: relax strictness rules for decoders

34d125e

rvagg force-pushed the rvagg/strictness-nope branch from c8c08e1 to 34d125e Compare May 13, 2022 08:05

rvagg merged commit 756520d into master May 13, 2022

rvagg deleted the rvagg/strictness-nope branch May 13, 2022 08:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dag-cbor,dag-json: relax strictness rules for decoders #214

dag-cbor,dag-json: relax strictness rules for decoders #214

rvagg commented May 10, 2022

willscott left a comment

rvagg commented May 10, 2022

vmx May 10, 2022

RangerMauve left a comment

dag-cbor,dag-json: relax strictness rules for decoders #214

dag-cbor,dag-json: relax strictness rules for decoders #214

Conversation

rvagg commented May 10, 2022

willscott left a comment

Choose a reason for hiding this comment

rvagg commented May 10, 2022

vmx May 10, 2022

Choose a reason for hiding this comment

RangerMauve left a comment

Choose a reason for hiding this comment