ssz: switch integer encoding to little endian #139

arnetheduck · 2018-11-16T21:32:59Z

the choice between little and big endian is arbitrary from a functional
point of view, but practially:

most commodity hardware these days is either little- or biendian
mechanical sympathy between encoding and hardware allows a wider range
of tricks to be used when encoding and decoding data leading to better
efficiency
we're developing a format that favors "decoding-free" access to data

the choice between little and big endian is arbitrary from a functional point of view, but practially: * most commodity hardware these days is either little- or biendian * mechanical sympathy between encoding and hardware allows a wider range of tricks to be used when encoding and decoding data leading to better efficiency * we're developing a format that favors "decoding-free" access to data

JustinDrake · 2018-12-05T14:46:15Z

Pinging @AlexeyAkhunov and @karalabe. Big or little endian?

djrtwo · 2018-12-05T21:05:48Z

The main push-back I heard on this was that eth1.0 uses big-endian so don't introduce a different-endian encoding in eth2.0. The small gain in efficiency is not worth the potential confusion and overhead of having to remember which is which.

arnetheduck · 2018-12-05T21:18:24Z

Well, there's a fairly clean break here, considering SSZ vs RLP - it's also an unfortunate fact of life that you have to remember endianness whenever you deal with binary protocols in general.

The main performance difference will be that adjacent integers can either be bulk-copied or will have to be byte-flipped one-by-one. It affects both network serialization and hashing.

JustinDrake · 2018-12-06T21:46:33Z

The main performance difference will be that adjacent integers can either be bulk-copied or will have to be byte-flipped one-by-one.

Can this be quantified? What is the performance difference?

It affects both network serialization and hashing.

Would it negatively affect light clients of Ethereum 2.0 built in Ethereum 1.0 contracts?

mkalinin · 2018-12-07T13:22:16Z

It could be that a win in efficiency gained with this optimization is too low comparing with efficiency of other operations, for example, calculating a hash of validators registry.

Another thing is that all big number implementations in Java that I've seen uses big-endian to encode/decode numbers to/from byte arrays. So does Milagro, even in C implementation. What about other languages, btw? And in our case it results in reversing signature bytes on each encoding/decoding since signature has uint384 type. There is a workaround to it, use byteN type for all big numbers starting from uint72. Otherwise, reversing 48 bytes could be much less efficient than reversing byte order in several adjacent primitives that I believe could be done with bitwise arithmetic.

arnetheduck · 2018-12-07T13:52:49Z

Can this be quantified? What is the performance difference?

I'll see if I can pull up some numbers, but we're really not on that stage yet (it's a pretty low-level / final-touch optimization) - the idea itself is mainly taken from other modern serialization formats that state "direct access" as one of their design goals, for example flatbuffers.

all big number implementations

yep - though here the machine endianess no longer matters - there's no mechanical sympathy to consider because you can't directly use these numbers anyway, and at this point, it's kind of.. arbitrary.

arnetheduck · 2018-12-07T13:55:40Z

anyway, if there's pushback, we can certainly drop this - it's a drop in the sea, as @mkalinin points out (or one of many paper cuts).

mkalinin · 2018-12-08T07:23:32Z

yep - though here the machine endianess no longer matters

Agree. We may use whichever endiannes for big numbers depending on the case. For instance, BLS12-381#Serialization defines that Fq elements are encoded in big-endian form. And endiannes could be explicitly defined in our spec for this particular value.

The main push-back I heard on this was that eth1.0 uses big-endian so don't introduce a different-endian encoding in eth2.0.

As for me, this is not a strong argument. Cause eth2.0 has many differences wrt eth1.0 and that's even great.

I am not opposed to little endian. Indeed, it's better to have an optimization opportunity even if doesn't seem too valuable at the moment. Possible solution for big numbers would be in representing them with bytesN type in the spec.

JustinDrake · 2018-12-10T09:42:46Z

one of many paper cuts

@arnetheduck Is there anything beyond your current 3 issues and 1 PR? I'm keen to address as many issues as possible before we declare the spec a release candidate, so now is a good time to flag things. 👍

arnetheduck · 2018-12-10T16:45:46Z

one of many paper cuts
Is there anything beyond

ah, I hope that it did not came across wrong - it was intended as a general comment and not to say that there are necessarily many in the spec as of now :)

I'll go over my notes and see what is still relevant after the latest refactorings (:+1: good work!), and post ASAP!

arnetheduck · 2019-01-06T03:25:08Z

It's worth noting WASM is little-endian also: https://github.com/WebAssembly/design/blob/master/Semantics.md#linear-memory-accesses

sorpaas · 2019-01-14T16:11:45Z

SSZ and RLP are already vastly different, so I think using big-endian because eth1.0 uses big-endian may not be that of a really strong argument.

Besides all the architectures using little-endian, for Parity there's also a really specific reason we would prefer little-endian -- our parity-codec format uses little-endian, and parity-codec and ssz are nearly identical in all the basic forms, just except the endianness! By using little-endian, we can unify those two formats.

JustinDrake · 2019-01-14T21:44:36Z

Pros of little-endian:

Consistent with WASM
Consistent with commodity hardware
Consistent with parity-codec

Cons of little-endian:

Inconsistent with RLP (I agree this is a weak argument)
Inconsistent with big number implementations (can be worked around with byteN)

Little-endian feels on net positive :)

JustinDrake · 2019-01-17T15:33:39Z

Consensus reached on the Eth2.0 call. Thanks @djrtwo 🎉

ethereum/consensus-specs#139

hwwhww requested a review from vbuterin November 17, 2018 08:35

Merge branch 'master' into little-endian

d28c11f

hwwhww added the general:RFC Request for Comments label Nov 30, 2018

Merge branch 'master' into little-endian

c9b3a7b

JustinDrake approved these changes Jan 14, 2019

View reviewed changes

zilm13 mentioned this pull request Jan 15, 2019

Switch integer encoding in SSZ to little endian harmony-dev/beacon-chain-java#19

Closed

hwwhww added the scope:SSZ Simple Serialize label Jan 15, 2019

terencechain mentioned this pull request Jan 17, 2019

Switch from Big Endian to Small Endian prysmaticlabs/prysm#1338

Closed

JustinDrake merged commit a80f271 into ethereum:master Jan 17, 2019

hwwhww mentioned this pull request Jan 17, 2019

Switch byte_order to little endian ethereum/py-ssz#27

Closed

wemeetagain mentioned this pull request Jan 19, 2019

Switch to integer encoding to little-endian ChainSafe/ssz-js#23

Closed

arnetheduck added a commit to status-im/nimbus-eth2 that referenced this pull request Jan 22, 2019

ssz: switch to little-endian

523a990

ethereum/consensus-specs#139

arnetheduck mentioned this pull request Jan 22, 2019

ssz: switch to little-endian status-im/nimbus-eth2#65

Merged

JustinDrake mentioned this pull request May 5, 2019

Little-endian vs big-endian (take 2) #1046

Closed

LefterisJP mentioned this pull request Jan 25, 2023

Add eth2 deposits decoder rotki/rotki#5470

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ssz: switch integer encoding to little endian #139

ssz: switch integer encoding to little endian #139

arnetheduck commented Nov 16, 2018

JustinDrake commented Dec 5, 2018

djrtwo commented Dec 5, 2018

arnetheduck commented Dec 5, 2018

JustinDrake commented Dec 6, 2018

mkalinin commented Dec 7, 2018

arnetheduck commented Dec 7, 2018

arnetheduck commented Dec 7, 2018

mkalinin commented Dec 8, 2018

JustinDrake commented Dec 10, 2018

arnetheduck commented Dec 10, 2018 •

edited

Loading

arnetheduck commented Jan 6, 2019

sorpaas commented Jan 14, 2019 •

edited

Loading

JustinDrake commented Jan 14, 2019

JustinDrake commented Jan 17, 2019

ssz: switch integer encoding to little endian #139

ssz: switch integer encoding to little endian #139

Conversation

arnetheduck commented Nov 16, 2018

JustinDrake commented Dec 5, 2018

djrtwo commented Dec 5, 2018

arnetheduck commented Dec 5, 2018

JustinDrake commented Dec 6, 2018

mkalinin commented Dec 7, 2018

arnetheduck commented Dec 7, 2018

arnetheduck commented Dec 7, 2018

mkalinin commented Dec 8, 2018

JustinDrake commented Dec 10, 2018

arnetheduck commented Dec 10, 2018 • edited Loading

arnetheduck commented Jan 6, 2019

sorpaas commented Jan 14, 2019 • edited Loading

JustinDrake commented Jan 14, 2019

JustinDrake commented Jan 17, 2019

arnetheduck commented Dec 10, 2018 •

edited

Loading

sorpaas commented Jan 14, 2019 •

edited

Loading