
Points and scalars are serialized as 40 bytes (instead of 32) #265

Closed
trevp opened this issue Jul 12, 2019 · 11 comments

trevp commented Jul 12, 2019

RistrettoPoint, CompressedRistretto, and Scalar are serialized into 40 bytes (instead of 32) when using serde with efficient encodings like bincode.

This is because they are handled as variable-length "bytes" in serde terminology, instead of fixed-length "tuples", so they are given an 8-byte length field.

This is fairly easy to fix using serialize_tuple / deserialize_tuple (I've done so in a local branch; could submit a PR), but I don't know what fixes are possible given stability guarantees.
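
As a rough sketch of the difference (hypothetical wrapper types; bincode 1.x with its default u64 length prefix):

```rust
use serde::ser::{Serialize, SerializeTuple, Serializer};

// Hypothetical wrappers, purely to contrast the two serde data-model choices.
struct AsBytes([u8; 32]);
struct AsTuple([u8; 32]);

impl Serialize for AsBytes {
    fn serialize<S: Serializer>(&self, serializer: S) -> Result<S::Ok, S::Error> {
        // "bytes" is variable-length in the serde data model, so bincode
        // prepends a u64 length: 8 + 32 = 40 bytes on the wire.
        serializer.serialize_bytes(&self.0)
    }
}

impl Serialize for AsTuple {
    fn serialize<S: Serializer>(&self, serializer: S) -> Result<S::Ok, S::Error> {
        // A tuple's arity is fixed by the type, so no length field is
        // written: exactly 32 bytes on the wire.
        let mut tup = serializer.serialize_tuple(32)?;
        for byte in self.0.iter() {
            tup.serialize_element(byte)?;
        }
        tup.end()
    }
}

fn main() {
    let b = [0u8; 32];
    assert_eq!(bincode::serialize(&AsBytes(b)).unwrap().len(), 40);
    assert_eq!(bincode::serialize(&AsTuple(b)).unwrap().len(), 32);
}
```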

@hdevalence (Contributor)

I'm not sure that we can fix this without breaking API stability, but I recall that the overhead is reduced when using other encodings (like CBOR, which I think uses as little as 1 byte).

I think that some amount of overhead is inevitable when using Serde, because Serde operates on self-describing data structures, not ones specified in advance. If the data format is specified elsewhere, it should be possible to use the to/from byte conversion methods to achieve a precise and compact encoding for a more complex object like a proof or a signature (we do this in Bulletproofs). This aggregate can then implement the Serde traits, using its own encoding internally, instead of using a derived encoding of all of its components that may not be as efficient.
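
One way to picture that aggregate approach (a hypothetical Sig type, not the actual Bulletproofs code):

```rust
use curve25519_dalek::ristretto::RistrettoPoint;
use curve25519_dalek::scalar::Scalar;
use serde::ser::{Serialize, SerializeTuple, Serializer};

// Hypothetical Schnorr-style signature, for illustration only.
pub struct Sig {
    r: RistrettoPoint,
    s: Scalar,
}

impl Serialize for Sig {
    fn serialize<S: Serializer>(&self, serializer: S) -> Result<S::Ok, S::Error> {
        // The aggregate defines its own 64-byte wire format (R || s) via the
        // byte conversion methods, then hands serde a fixed-arity tuple
        // rather than a derived encoding of its components.
        let mut bytes = [0u8; 64];
        bytes[..32].copy_from_slice(self.r.compress().as_bytes());
        bytes[32..].copy_from_slice(self.s.as_bytes());
        let mut tup = serializer.serialize_tuple(64)?;
        for b in bytes.iter() {
            tup.serialize_element(b)?;
        }
        tup.end()
    }
}
```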

trevp commented Jul 16, 2019

> I think that some amount of overhead is inevitable when using Serde, because Serde operates on self-describing data structures, not ones specified in advance.

I don't think that's true; Serde works great with non-self-describing formats like bincode:

https://serde.rs/impl-deserialize.html

Indeed, if points and scalars serialize to 32 bytes, it's easy to write structs consisting of points and scalars and use bincode with #[derive] to get efficient, packed representations of signatures, proofs, etc.

To me, it seems better to serialize crypto objects (keys, signatures, proofs) into fixed-length, non-self-describing, efficiently-packed blobs.

These blobs can then be embedded into JSON, Protobufs, XML, etc., keeping a separation between your crypto format and code on the one hand and your application protocol on the other. Serializing crypto directly into self-describing formats is usually a worse idea (think ASN.1-encoded DSA sigs, or XML-DSig).

> If the data format is specified elsewhere, it should be possible to use the to/from byte conversion methods to achieve a precise and compact encoding for a more complex object like a proof or a signature (we do this in Bulletproofs).

Yes, one could write serialize/deserialize code by hand, but that defeats the point of serde. If points and scalars serialize to 32 bytes, then you just #[derive] these implementations instead of writing them.
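
For example (a hypothetical Signature type, assuming points and scalars serialize as fixed 32-byte tuples, as in the eventual 2.0 fix):

```rust
use curve25519_dalek::ristretto::RistrettoPoint;
use curve25519_dalek::scalar::Scalar;
use serde::{Deserialize, Serialize};

// Hypothetical type: once RistrettoPoint and Scalar encode as fixed
// 32-byte tuples, the derived impls already give the packed wire format.
#[derive(Serialize, Deserialize)]
pub struct Signature {
    r: RistrettoPoint,
    s: Scalar,
}

fn roundtrip(sig: &Signature) -> Signature {
    let blob = bincode::serialize(sig).unwrap();
    assert_eq!(blob.len(), 64); // no per-field length prefixes
    bincode::deserialize(&blob).unwrap()
}
```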

@tarcieri (Contributor)

I think Serde is agnostic in the self-describing vs. "schema" debate, although using it with the latter has generally been somewhat cumbersome (and squaring it with a format like Protobuf is still an unsolved challenge).

It seems like the problem here is that the serialization treats fixed-width fields as variable-width. I think it'd be better for the serialization to be more specific in this case (i.e., what @trevp wants). For self-describing formats, which lack a schema language to describe the precise length in advance, this shouldn't be a breaking change; it would be a breaking change for formats like bincode.

My suggestion would be: if you release a curve25519-dalek 2.x for other reasons (e.g., the RNG APIs), that would be a good time to address this issue too.

@hdevalence (Contributor)

This would be good to fix, but I'm not sure there's an easy way to do so without breaking compatibility. One option would be a Cargo feature that changes the serialization format, but because Cargo features are additive, that seems like a recipe for future pain (e.g., one crate enables the feature, another doesn't, and now the code breaks).

tarcieri commented Jul 31, 2019

I guess it depends on what sort of compatibility you want. If you don't mind changing the on-wire serialization, it may still be compatible with the existing deserializers with no other changes.

That's all going to depend on the specific formats, I'm afraid, but it's entirely possible that, for all practical intents and purposes, it would be backwards compatible.

Seems like it'd need testing with all serialization formats you intend to support.
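
A sketch of the kind of per-format check that could pin this down (assuming the serde feature is enabled and the tuple encoding has landed):

```rust
use curve25519_dalek::scalar::Scalar;

// Round-trip a Scalar through each format the crate intends to support,
// and pin the encoded size so a data-model change shows up in tests.
#[test]
fn scalar_roundtrip_and_size() {
    let s = Scalar::from(42u64);

    // bincode: fixed-arity tuple, so no length prefix.
    let bin = bincode::serialize(&s).unwrap();
    assert_eq!(bin.len(), 32); // would be 40 under the old "bytes" encoding
    assert_eq!(s, bincode::deserialize::<Scalar>(&bin).unwrap());

    // JSON (self-describing): encodes as an array of 32 integers; the size
    // varies, but the round trip must still hold.
    let json = serde_json::to_vec(&s).unwrap();
    assert_eq!(s, serde_json::from_slice::<Scalar>(&json).unwrap());
}
```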

@hdevalence (Contributor)

Hi all, I think the best path forward for this is:

  1. Check that using Serde tuples produces acceptable results on the backends we care about (bincode and others, i.e., do a survey of a few commonly used ones);

  2. Release a 2.0.0 version with the fixed serialization, the updated rand_core API, and "Remove build.rs?" (#217), and advise in the changelog that it is a compatible upgrade except for users of the Serde traits, due to the change in data model.

I will do this next week.

@hdevalence (Contributor)

Missed my window to do maintenance work last Friday due to other work, but I'm hoping to get to this tomorrow.

goldenMetteyya commented Oct 10, 2019

> Hi all, I think the best path forward for this is:
>
>   1. Check that using Serde tuples produces acceptable results on the backends we care about (bincode and others, i.e., do a survey of a few commonly used ones);
>   2. Release a 2.0.0 version with the fixed serialization, the updated rand_core API, and "Remove build.rs?" (#217), and advise in the changelog that it is a compatible upgrade except for users of the Serde traits, due to the change in data model.
>
> I will do this next week.

@hdevalence Can we add the Rust 2018 upgrade as well?
And potentially @tarcieri can finish zeroize v1.

If 2.0.0 is being cut, we might as well make these additions. :)

@hdevalence (Contributor)

I think neither of those is a breaking change, so they can be done independently.

@hdevalence (Contributor)

I got stuck on a large ZF project and kept missing my weekly window to get this done, but now that that project is complete I've fixed this in #297.

@hdevalence (Contributor)

The fix can be tested in 2.0.0-alpha.0; closing this issue for now.

pinkforest pushed a commit to pinkforest/curve25519-dalek that referenced this issue on Jun 27, 2023:
* Added and cleaned up some verification docs

Co-authored-by: Michael Rosenberg <michael@mrosenberg.pub>