Start of new types on the client #4254

pbiggar · 2022-07-09T04:16:45Z

This started as an attempt to move all the client types to match the backend ProgramTypes and RuntimeTypes, but it got to be a lot, so I stopped early. So far, all this really does is update ID/TLIDs to use int32s and fix the fallout (it also adds a few types that aren't used yet).

pbiggar · 2022-07-10T02:16:27Z

The integration tests are failing because some packages have uint64 tlids. In dev, the one package created has a 64bit tlid. In production, about half the packages have 64bit tlids.

It also seems that 1000 production toplevels have tlids above 2147483647 (all, for some reason, numbers like 1587041099384, beginning with 156, 157, or 158).

Also, int_of_string is crashing so we shouldn't use it anyway. But it looks like it will make sense to switch to 64 bit ints here if possible anyway :(

pbiggar · 2022-07-10T02:28:05Z

Wait, if we json parse 64-bit ints, what makes it to rescript? 64bit ints or strings, or floats with missing precision?

Does it matter whether they're a field in an array (as in the json representation of variants) or ids in a dictionary?

pbiggar · 2022-07-10T03:21:09Z

Parsing 64 bit ints gets us floats with a loss of precision.

I wonder if we're safe to just convert the tlids in the DB to a lower precision (except for users editing). I don't think tlids are used except to tell the apiserver what db/handler is being deleted.

pbiggar · 2022-07-10T03:23:17Z

They are binary serialized into ops. Ugh.

pbiggar · 2022-07-10T03:31:14Z

Related: the expression 923483483489348934 saves properly but evaluates to 923483483489349000. This is due to loss of precision when encoding Dvals as int64 in json.
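The precision loss described here can be reproduced in a few lines. This is a minimal sketch and not code from this PR; `lossyRoundTrip` and `exactRoundTrip` are hypothetical names used only for illustration.

```typescript
// Sketch: a JSON number round-trips through a JS double,
// so integers above 2^53 can come back changed.
function lossyRoundTrip(source: string): string {
  const parsed: number = JSON.parse(source); // parsed as a double
  return JSON.stringify(parsed);
}

// Keeping the digits in a string and reading them with BigInt preserves every digit.
function exactRoundTrip(source: string): string {
  return BigInt(source).toString();
}

const big = "923483483489348934";
console.log(Number.isSafeInteger(JSON.parse(big))); // false
console.log(lossyRoundTrip(big) === big); // false
console.log(exactRoundTrip(big) === big); // true
```

Small values such as 2147483647 survive both paths, which is presumably why the problem only shows up for the large ids and integers.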

OCamlTypes has a string in EInteger and FPInteger, which is why programs stay valid using OCamlTypes (when sent back and forth from the apiserver). But dvals don't! Either when sent from a trace, or when sent back from Wasm.

But, I think if we encoded tlid and int64 as strings when they're too big, we could handle this ok.

pbiggar · 2022-07-11T02:28:55Z

OK, the plan here is to serialize int64s and uint64s (including ids and tlids) using strings when they're over the max that can fit in an int. That should fix a bunch of small bugs. I'll do that in a separate PR, and then in this PR I'll switch the client to using uint64s (like the backend).

pbiggar · 2022-07-12T18:05:57Z

This addresses all the issues by creating a UInt64 type (backed by a combination of BigInt for printing/serialization as strings, and Int64 for use in-memory), and then using UInt64 for the TLID/ID types.

Numbers over 2^53 are serialized as strings, because JavaScript parses JSON numbers as doubles, which represent integers exactly only up to 2^53.
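A rough sketch of that serialization rule, not the PR's actual encoder: `encodeUint64` and `decodeUint64` are hypothetical names, and the threshold used here is `Number.MAX_SAFE_INTEGER` (2^53 - 1), which is an assumption about where exactly the cutoff sits.

```typescript
// Assumption: values up to Number.MAX_SAFE_INTEGER go out as JSON numbers,
// anything larger goes out as a decimal string.
const MAX_SAFE = BigInt(Number.MAX_SAFE_INTEGER);

type JsonUint64 = number | string;

function encodeUint64(value: bigint): JsonUint64 {
  return value <= MAX_SAFE ? Number(value) : value.toString();
}

function decodeUint64(json: JsonUint64): bigint {
  if (typeof json === "string") {
    return BigInt(json); // large values arrive as strings, digits intact
  }
  if (!Number.isSafeInteger(json) || json < 0) {
    throw new Error(`not a safely representable uint64: ${json}`);
  }
  return BigInt(json);
}

// Round trip through real JSON text.
const id = BigInt("923483483489348934");
const wire = JSON.stringify(encodeUint64(id)); // a quoted string on the wire
console.log(decodeUint64(JSON.parse(wire)) === id); // true
```

The decoder accepts both shapes, so existing small ids that are already stored as plain JSON numbers would keep working.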

Corrects some other types on the way

Test suite slowed down 3x just from using a U.UInt64.t from bs-zarith. However, it still makes sense to use the string functions from U.UInt64.

Also improve comments and remove dead code

pbiggar · 2022-07-12T20:57:50Z

Summary and description

This has gone on a while, so I'm going to restate it all.

Originally, this started as an attempt to replace client types with the types used by the server. And indeed, this adds those types. However, after I made a few subsequent changes, it got big enough that now it's time to stop and get this merged.

The major change was changing the type of TLIDs (and IDs). The backend used uint64s, and the client used strings. Using strings led to some pretty bad situations: while it was mostly fine to use uint64-encoded-as-strings, in practice we snuck in lots of things that weren't uint64s.

In some cases, those were test items like ID("fake-ac-data1"). But in one place we actually put both traceIDs and IDs into the same data by wrapping the traceID in the ID constructor and pretending everything was OK. That has been fixed.

The attempt to switch to a real uint64 may have been too much work for the value we got out of it, but it's done now. I initially tried to store it as a bs-zarith U.UInt64.t type, but this was incredibly slow in practice. I also tried to store it as a JS BigInt, but using a JS type broke the comparators used in TLID.Dict (and lots of other things probably, there were thousands of broken tests).

As a result, it made more sense to come back to using uint64s-in-theory but int64s-in-practice. That is, I'm using an int64 as a uint64. Since int64 is a known type in Rescript, comparators work. Since it's simply two ints together, the performance is good.

Both uint64 and int64 use 64 bits. There wasn't tooling to just cast them between each other, so I explicitly made the translation (using the negative int64 range to hold the values above the positive int64 range).

The complexity here was converting negative int64s to positive uint64s, especially during stringification (needed for encoders and decoders). For that, I used JS BigInts - this allowed me more range, and also to use their toString functions to avoid needing to create my own.
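The mapping between the two ranges can be sketched as follows. This is an illustration under assumptions, not the code in this PR: the helper names are hypothetical, and the stored value is modelled as a `bigint` holding a signed 64-bit number, whereas the PR keeps a Rescript int64 in memory and only uses BigInt for stringification.

```typescript
// Hypothetical helpers illustrating the two's-complement reinterpretation.
const UINT64_MAX = (BigInt(1) << BigInt(64)) - BigInt(1);

// Parse a decimal uint64 string into the signed int64 with the same bit pattern.
// Values above the positive int64 range land in the negative range.
function storedFromString(text: string): bigint {
  if (!/^[0-9]+$/.test(text)) {
    throw new Error(`not a decimal uint64: ${text}`);
  }
  const unsigned = BigInt(text);
  if (unsigned > UINT64_MAX) {
    throw new Error(`out of uint64 range: ${text}`);
  }
  return BigInt.asIntN(64, unsigned);
}

// Turn the stored signed value back into its unsigned decimal string.
function storedToString(stored: bigint): string {
  return BigInt.asUintN(64, stored).toString();
}

console.log(storedFromString("18446744073709551615")); // -1n
console.log(storedToString(BigInt(-1))); // "18446744073709551615"
```

Values at or below the positive int64 maximum map to themselves, so only ids in the top half of the uint64 range are stored as negatives.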

Along the way I tried a bunch of other things, but they all came out more complex in one way or another: encoding it as a tuple of 2 int32s, manually scaling int64s, my own version of toString, using bs-zarith, etc. In retrospect, leaving them as strings and just making the type opaque might have worked, but I'd probably have missed some edge cases so this isn't a terrible outcome.

This PR also:

renames dval encode to ocamlDval
adds RuntimeTypes to the client
recompiles on .resi file changes
adds tuple encoding/serialization tests
refactors client-side encoding/decoding tests (and TestRpcs)

client/test/TestEncoder.res

StachuDotNet · 2022-07-13T15:12:26Z

client/src/core/RuntimeTypes.res

+  | Rail
+  | NoRail
+
+and expr =


Should this file include some Tuple types? It doesn't right now.

pbiggar · 2022-07-13

Yes it should. Wrote this before they merged, nice catch.

StachuDotNet

It feels a bit off-putting to use negative values to represent those out of bounds, but I can't think of anything better. Looks good!

pbiggar · 2022-07-13T16:13:58Z

Yeah. The other option is to keep the type opaque and use strings. Any preferences?

StachuDotNet · 2022-07-13T16:16:42Z

Not really, they're both kinda bad :) So might as well go with the thing we've already coded.

pbiggar requested a review from StachuDotNet July 10, 2022 01:51

pbiggar removed the request for review from StachuDotNet July 10, 2022 02:16

pbiggar force-pushed the paul/pass-functions-to-client branch from 971e1ac to bb21b0e Compare July 10, 2022 23:48

pbiggar mentioned this pull request Jul 11, 2022

64 bit serializers #4257

Merged

pbiggar force-pushed the paul/pass-functions-to-client branch from bb21b0e to 795dcc6 Compare July 12, 2022 01:39

pbiggar mentioned this pull request Jul 12, 2022

backend IDs should be smaller #4253

Closed

pbiggar force-pushed the paul/pass-functions-to-client branch 2 times, most recently from 72f4b2c to 09a04b5 Compare July 12, 2022 18:40

pbiggar added 12 commits July 12, 2022 18:49

Rename old encoders and decoders to ocamlDval

d6c6a52

Add runtimetypes to client

e92ebb4

Also compile on .resi changes

b8cad7e

Move encode and decode functions into ID and TLID modules

e4f26cd

Corrects some other types on the way

Remove TLIDDict and TLIDSet (-> TLID.Dict and TLID.set)

1f071e8

Use int32s for IDs and TLIDs

a422dd0

Don't throw exceptions when parsing TLIDs

fda964c

Switch representation of ID and TLID to U.Uint64.t

87f5167

For speed, use an int64 to represent a UInt64.t

960e53b

Test suite slowed down 3x just from using a U.UInt64.t from bs-zarith. However, it still makes sense to use the string functions from U.UInt64.

99% working solution

a2f79d2

Fix the bugs

c776cd7

Also improve comments and remove dead code

Fix types

ca198a4

pbiggar force-pushed the paul/pass-functions-to-client branch from 09a04b5 to ca198a4 Compare July 12, 2022 18:51

pbiggar added 2 commits July 12, 2022 19:00

Add more roundtrips and move them to TestEncoder

61057db

Make test data match up on frontend and backend

71aa12b

pbiggar added 4 commits July 12, 2022 19:13

New serializations data files for new serialization test data

03a73ec

Combine testRpcs into TestEncoder

25b2246

Rename to TestJsonEncoding and roll TestDecoder into it

68b18c0

Add tuples to serialization tests

f26851f

pbiggar requested a review from StachuDotNet July 12, 2022 20:58

pbiggar marked this pull request as ready for review July 12, 2022 20:58

StachuDotNet reviewed Jul 12, 2022

StachuDotNet reviewed Jul 13, 2022

StachuDotNet self-requested a review July 13, 2022 15:15

StachuDotNet approved these changes Jul 13, 2022

StachuDotNet and others added 4 commits July 13, 2022 13:23

Disable git restore-mtime in F# build

228ca27

Add new serialization data files with tuples

4b7a002

Delete dead code (test moved elsewhere)

5886371

Add in Tuple typles

4e80ac3

pbiggar force-pushed the paul/pass-functions-to-client branch from 10035f5 to 4e80ac3 Compare July 13, 2022 22:56

pbiggar merged commit 30a6c23 into main Jul 13, 2022

pbiggar deleted the paul/pass-functions-to-client branch July 13, 2022 23:08
