Spec: Refactor description of deserialization #128

nomeata · 2020-11-02T15:53:11Z

This is a significant rewrite of the section in the spec describing
deserialization. Some points worth noting:

* This replaces the elaboration relation with one that takes a value
  and a type as input, and returns a value. This makes it clearer that
  there is no “input _type_” of deeper relevance.
* One great benefit: No more long prose about how the rules assume
  these types to be principal, even if they aren’t.
  This clarifies questions like [this one](#126 (comment)).
* Also, the existing rules about “elaborating function values” were
  kinda bogus, as function values are just accepted as they are. This
  is much easier now, also good.
* Unfortunately for this application, our textual representation has
  overloading. Instead of defining yet another abstract value algebra,
  I did a bit of hand-waving, and defined an “overloading-free
  fragment” of the textual format for this section.
* A fair number of rules become simpler. Promising!
* I was able to phrase some interesting properties more formally. Also
  promising! I didn’t actually prove them, though we should.
* Still unclear to me what we mean by “subtyping is complete”, see
  the section there.
* Decoding is still an inductive relation, so not fully algorithmic.
  This may be an issue with a straightforward formalization; not all
  theorem provers like negative occurrences of inductive relations in
  their rules.
* The syntax of the relations is up for discussion.
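
To illustrate the first point, here is a minimal sketch of a decoding relation as a function that takes only a value and an expected type and returns a value. It is a toy model, not the rules from the spec: the tagged-tuple encoding of values and types, the name `coerce`, and the small set of supported types are all assumptions made for this example.

```python
# Hypothetical value encoding (not from the spec): tagged tuples.
#   ("nat", 5)  ("int", -3)  ("null",)  ("reserved",)
#   ("opt", v)  ("vec", [v1, v2, ...])
# Hypothetical type encoding: "nat", "int", "null", "reserved",
#   ("opt", t), ("vec", t)

def coerce(v, t):
    """Return the value that v decodes to at expected type t, or None on failure.

    Note that there is no "input type" argument: only the value and the
    expected type are inspected.
    """
    if t == "reserved":
        # In this toy model, every value is accepted at reserved.
        return ("reserved",)
    tag = v[0]
    if t == "nat":
        return v if tag == "nat" else None
    if t == "int":
        # A nat value is accepted at int and re-tagged.
        return ("int", v[1]) if tag in ("nat", "int") else None
    if t == "null":
        return v if tag == "null" else None
    if isinstance(t, tuple) and t[0] == "opt":
        if tag == "null":
            return ("null",)
        if tag == "opt":
            inner = coerce(v[1], t[1])
            return None if inner is None else ("opt", inner)
        return None
    if isinstance(t, tuple) and t[0] == "vec":
        if tag != "vec":
            return None
        out = []
        for x in v[1]:
            y = coerce(x, t[1])
            if y is None:
                return None
            out.append(y)
        return ("vec", out)
    return None
```

For example, `coerce(("vec", [("nat", 1)]), ("vec", "int"))` yields a vector of `int` values, while `coerce(("int", -3), "nat")` fails.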

nomeata · 2020-11-02T15:54:19Z

@rossberg @chenyan-dfinity , what do you think of this?

As an implementor, I think I’d find this presentation more helpful. And if I were to prove soundness etc. formally, then too.

nomeata · 2020-11-02T15:57:43Z

If the diff is too confusing to read, then head to https://github.com/dfinity/candid/blob/joachim/rewrite/spec/Candid.md#decoding

rossberg

Good stuff, thanks!

spec/Candid.md

rossberg · 2020-11-03T10:40:00Z

spec/Candid.md

+  ```
+  (∀ v. v :? T ~> _ ⇒ v :? T' ~> _) ⇒ T <: T'
+  ```
+  This does not hold as state, because of counter examples involving the empty type. For example we do not have `opt empty <: null`, or `Empty <: t` where `type Empty = rec { Empty }`


I would expect the following formulation:

(∀ v, v'. v : T /\ v :? T' ~> v') ⇒ T <: T'

Is there anything wrong with that? And should it not be a requirement?

That formulation doesn't make sense to me, the antecedent is never true.

Should /\ be →? Or should v' be behind some ∃?

Ah, oops, wrong quantifier:

(∃ v, v'. v : T /\ v ~> v' : T') ⇒ T <: T'

Now the antecedent is far too weak, it says “T is a subtype of T' if some values of type T make sense at type T'”. This would, for example, relate all list types (use v = [] and v' = []).

How about I remove this section (so this can land if all else is fine), and we discuss this in a separate issue?

Eh, you're right, I should turn my brain on. Okay, last try, if that's wrong as well, feel free to remove it. :)

(∃ v. v : T /\ ∀ v. v : T ⇒ ∃ v'. v ~> v' : T') ⇒ T <: T'

This looks a bit like monster-barring the uninhabited case, but I think it is a natural definition, saying: if serialised values exist for type T, and all such values can be deserialised at type T', then T ought to be considered a subtype of T'.

Uninhabited types tend to make everything harder...

Nope, sorry still not good.

In the antecedent, instead of ∀ v. v : T ⇒ I am sure you mean ∀ v. v ~> _ : T ⇒ (i.e. not just the “canonical” values, but really all values that can be deserialized at T). But I think that’s what you mean anyway.

But empty types still break it: This definition would still imply vec Empty <: vec t for all t.

And even then: Remember that coercion never fails for reference values (because we don't do a subtyping check when decoding and coercing), so any definition of completeness that is based purely on the _ ~> _ : _ relation will probably relate all function types, which is obviously not sound (in the non-local, IDL-Soundness.md sense).

Let’s take this into a separate issue.

Moved to #132
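
The two counterexamples from this thread (the empty list relating all list types, and `vec Empty`) can be sketched with a toy decodability check. This is only an illustration under stated assumptions: the predicate `decodes`, the tuple encoding of types, and the finite `SAMPLES` set are hypothetical, and checking a finite sample is not a proof about the actual relation.

```python
def decodes(v, t):
    """Toy check: can Python value v be decoded at type t?

    Types (hypothetical encoding): "empty", "nat", "text", ("vec", t).
    """
    if t == "empty":
        return False  # no value decodes at the empty type
    if t == "nat":
        return isinstance(v, int) and not isinstance(v, bool) and v >= 0
    if t == "text":
        return isinstance(v, str)
    if isinstance(t, tuple) and t[0] == "vec":
        return isinstance(v, list) and all(decodes(x, t[1]) for x in v)
    return False

# A small, arbitrary universe of candidate values.
SAMPLES = [0, 7, "a", [], [0], ["a"], [0, "a"], [[]]]

def exists_common(t1, t2):
    """The existential reading: some sample decodes at both types."""
    return any(decodes(v, t1) and decodes(v, t2) for v in SAMPLES)

def forall_implies(t1, t2):
    """The universal reading: every sample decodable at t1 decodes at t2."""
    return all(decodes(v, t2) for v in SAMPLES if decodes(v, t1))

# The empty list witnesses the existential antecedent for unrelated list types.
print(exists_common(("vec", "nat"), ("vec", "text")))

# vec empty is inhabited (by []), and everything decodable at it
# also decodes at vec t, for any t.
print(forall_implies(("vec", "empty"), ("vec", "text")))
print(forall_implies(("vec", "empty"), ("vec", "nat")))
```

In this model the only sample that decodes at `("vec", "empty")` is `[]`, so an inhabitedness guard on the antecedent does not exclude that case.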

spec/Candid.md

Co-authored-by: Andreas Rossberg <rossberg@dfinity.org>

nomeata

Mostly answers (and questions), will perform the changes after lunch.

spec/Candid.md

nomeata · 2020-11-03T11:25:26Z

rossberg · 2020-11-03T11:38:56Z

Btw, nit: various inference bars are too short.

spec/Candid.md

Co-authored-by: Andreas Rossberg <rossberg@dfinity.org>

nomeata · 2020-11-03T14:52:16Z

Ok, all comments handled one way or another

nomeata · 2020-11-03T15:02:21Z

I’ll give @chenyan-dfinity a chance to comment before merging

chenyan-dfinity

LGTM. It's indeed much cleaner!

spec/Candid.md

chenyan-dfinity · 2020-11-03T18:41:17Z

spec/Candid.md


+ * Number literals (`<primval>`) must be immediately enclosed with an `<annval>` that clarifies the precise number type.
+ * The canonical value of type `reserved` is expressed as `(null : reserved)`.
+ * No other use of `<annval>`.


We also need annval for null : opt _

No, I am pretty sure we don't! That’s because

* On the left of ~>, it doesn't matter which null it is (any null behaves the same).
* On the right of ~>, the type is given, so again there is only one null value.

So it is fine to have a single null value in the V set.

This is not true for overloaded numbers; (5 : nat) ~> and (5 : int) ~> behave differently.
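
A small sketch of that difference, under assumed simplified rules (a `nat` value is accepted at `int`, any other number needs an exact type match, and `null` is accepted at `null` and at any `opt` type). The function names and the tuple encoding are hypothetical and only serve this illustration.

```python
def decode_number(annotated, expected):
    """annotated is (n, numtype), e.g. (5, "nat"); returns the result or None."""
    n, ty = annotated
    if expected == ty:
        return (n, ty)
    if ty == "nat" and expected == "int":
        return (n, "int")  # nat values are accepted at int
    return None

def decode_null(expected):
    """A single, unannotated null: the result depends only on the expected type."""
    is_opt = isinstance(expected, tuple) and expected[0] == "opt"
    if expected == "null" or is_opt:
        return "null"
    return None

# The same literal 5 behaves differently depending on its annotation:
print(decode_number((5, "nat"), "nat"))
print(decode_number((5, "int"), "nat"))

# null needs no annotation; the outcome is the same at every opt type:
print(decode_null(("opt", "nat")), decode_null(("opt", "text")))
```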

In other words, null has a principal type, numbers don't. (Which makes me wonder, why do we need a special case for reserved?)

Because the algebra of binary values has a dedicated reserved value (DIDL\x00\x01\x70), but our textual representation doesn’t. The (null : reserved) could just as well be ("reserved" : reserved) or (record {} : reserved); it’s arbitrary and, in a way, always a lie.

For me, the conceptual “abstract value data model” has a dedicated reserved value, and I think it would be good to have it in the textual representation as well, to avoid the above wart in the formalism, but also to not force debug output to come up with such arbitrary values when decoding a message with a reserved value:

~/dfinity/haskell-candid $ cabal -v0 run hcandid -- --decode --rust "DIDL\x00\x01\x70"
((null : reserved))

More evidence that it might be useful to have it: All implementations that have a “value” type seemed to feel the need to add it to their data type:

candid/rust/candid/src/parser/value.rs

Line 37 in 03773cc

Reserved,

https://github.com/nomeata/haskell-candid/blob/36202a9aef4387925c0d0de23a0e3645ed7d2b3e/src/Codec/Candid/Types.hs#L142

spec/Candid.md

Co-authored-by: Yan Chen <48968912+chenyan-dfinity@users.noreply.github.com>

nomeata mentioned this pull request Nov 2, 2020

Some more tests related to subtyping #126

Merged

nomeata added 2 commits November 2, 2020 17:11

tidbits

4c9bc55

Tweaks to Completeness of subtyping

63fc356

nomeata requested a review from rossberg November 3, 2020 07:25

Link to IDL-Soundness.md

6305239

osa1 mentioned this pull request Nov 3, 2020

Implement CandidType for std::time::SystemTime #130

Merged

rossberg reviewed Nov 3, 2020

nomeata changed the title ~~Spec: Refactor description of deserializatoin~~ Spec: Refactor description of deserialization Nov 3, 2020

nomeata commented Nov 3, 2020

Apply suggestions from code review

4aa6b2f

Co-authored-by: Andreas Rossberg <rossberg@dfinity.org>

nomeata commented Nov 3, 2020

rossberg mentioned this pull request Nov 3, 2020

Subtyping: Missing record fields must decode at type reserved for transitivity #131

Closed

nomeata added 3 commits November 3, 2020 12:53

Allow removal of reserved, fixes #131

03d8950

Different relation, <numtype>, fix record field and reserved…

e49e2df

whitespace

f52fe2a

rossberg reviewed Nov 3, 2020

nomeata commented Nov 3, 2020

nomeata and others added 2 commits November 3, 2020 15:44

Apply suggestions from code review

4d2a742

Co-authored-by: Andreas Rossberg <rossberg@dfinity.org>

Remove section on completeness

e8d564d

rossberg approved these changes Nov 3, 2020

nomeata mentioned this pull request Nov 3, 2020

What is “completeness of subtyping” #132

Open

nomeata requested a review from chenyan-dfinity November 3, 2020 15:02

nomeata added a commit that referenced this pull request Nov 3, 2020

Reserved fields may be missing

429ea4a

(see #131, spec update in #128)

chenyan-dfinity approved these changes Nov 3, 2020

Apply suggestions from code review

e349bdb

Co-authored-by: Yan Chen <48968912+chenyan-dfinity@users.noreply.github.com>

nomeata added automerge-squash and removed automerge-squash labels Nov 4, 2020

Merge branch 'master' into joachim/rewrite

088402a

nomeata merged commit 03773cc into master Nov 4, 2020

nomeata deleted the joachim/rewrite branch November 4, 2020 09:19

nomeata added a commit that referenced this pull request Nov 21, 2020

Tests for new subtyping rules (#126)

f6f794f

this adds some tests to account for the new behaviour of #110 and #128 and #134 and #137. Co-authored-by: chenyan-dfinity <yan.chen@dfinity.org>
