Re-implementing public key ordering using underlying FFI functions #309

dr-orlovsky · 2021-06-19T09:34:00Z

No description provided.

dr-orlovsky · 2021-06-19T12:33:52Z

Not sure I understand why fuzzification destroys comparison operation in tests. Should I move ffi:*_cmp functions to #[cfg(not(fuzzify))] section and mark all Ord/PartialOrd impl and corresponding tests as #[cfg(not(fuzzify))]?

TheBlueMatt · 2021-06-19T15:27:12Z

Can you write up a benchmark for this in some context (eg inserting public keys into a BtreeMap)? I'm somewhat worried the lack of ability to inline here may hurt some common use-cases.

dr-orlovsky · 2021-06-20T16:01:26Z

Added tests in commit 76cfe2b before the rest of commits so benchmarking with previous ordering version can be measured by checking it out.

Before (with autoderived cmp):

test key::benches::bench_pk_ordering  ... bench:       5,608 ns/iter (+/- 771)

After (with libsecp256k1 backed cmp):

test key::benches::bench_pk_ordering  ... bench:       4,107 ns/iter (+/- 855)

So with C function it is actually faster :)

dr-orlovsky · 2021-06-27T18:03:36Z

@TheBlueMatt any ideas on this? #309 (comment)

TheBlueMatt · 2021-06-28T01:01:34Z

So with C function it is actually faster :)

Huh. That sounds like a rather significant bug in LLVM or at least rustc, then, actually.

As for fuzz, I think best to re-implement it as a extern C function in fuzz_dummy module in secp256k1-sys if you need it to behave differently.

real-or-random · 2021-06-28T07:43:28Z

Oh, so this means that we the current derived implementation of Ord just compares the internal FFI bytes, which is platform-dependent...?

Concept ACK then ;) But this is an API change then.

dr-orlovsky · 2021-08-09T14:11:14Z

@TheBlueMatt I was finally able to make this PR to pass CI :) Pls review

apoelstra · 2021-09-08T21:46:10Z

Since there is no way to access a "real" representation of the keys without going through the C API, there is no way we can get the kind of inlining behavior that Matt wants. (Ok, we could make assumptions about the internal/opaque representation of the pubkey types, which we could get away with since we vendor specific versions of the library, but this representation is not supposed to be user-visible so we should really not do this.)

I am also very surprised by @dr-orlovsky benchmark results. They do not replicate for me. With auto-derive I get

test key::benches::bench_pk_ordering  ... bench:       3,344 ns/iter (+/- 38)

and with the FFI version I get

test key::benches::bench_pk_ordering  ... bench:       5,057 ns/iter (+/- 284)

But anyway, there is no non-FFI way to implement this so I think we are stuck.

apoelstra · 2021-09-08T21:49:46Z

With cfg(fuzzing) now there is no implementation of Ord for PublicKey, which will cause downstream compliation failures during fuzzing.

@dr-orlovsky can you implement some sort of ordering for fuzzable public keys? You can even just auto-derive one.

TheBlueMatt · 2021-09-08T22:17:28Z

My understanding of the libsecp API is that it *does* guarantee that the public key is a byte array of a certain number of bytes. Given we’re not seeking any specific order, but only *some* order, I don’t see why we’re not allowed to simply memcmp? It seems that would be API guaranteed to not be defined, though the specific order may vary between versions or be random.

…

On Sep 8, 2021, at 11:46, Andrew Poelstra ***@***.***> wrote: Since there is no way to access a "real" representation of the keys without going through the C API, there is no way we can get the kind of inlining behavior that Matt wants. (Ok, we could make assumptions about the internal/opaque representation of the pubkey types, which we could get away with since we vendor specific versions of the library, but this representation is not supposed to be user-visible so we should really not do this.) I am also very surprised by @dr-orlovsky benchmark results. They do not replicate for me. With auto-derive I get test key::benches::bench_pk_ordering ... bench: 3,344 ns/iter (+/- 38) and with the FFI version I get test key::benches::bench_pk_ordering ... bench: 5,057 ns/iter (+/- 284) But anyway, there is no non-FFI way to implement this so I think we are stuck. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

apoelstra · 2021-09-08T22:19:50Z

Hmm, yeah, that makes sense to me. cc @real-or-random what do you think? If we're comparing public keys from exactly the same build of libsecp, and we know that they are fixed-size byte arrays, and we don't care about the specific order, surely we can just use memcmp?

elichai · 2021-09-09T08:46:50Z

I remember we had a discussion around that on IRC, can we even assume that equal public keys have the same byte representation? because even for "some order" that should be important

real-or-random · 2021-09-10T15:58:09Z

My understanding of the libsecp API is that it does guarantee that the public key is a byte array of a certain number of bytes. Given we’re not seeking any specific order, but only some order, I don’t see why we’re not allowed to simply memcmp? It seems that would be API guaranteed to not be defined, though the specific order may vary between versions or be random.

I don't understand, maybe I lack context.

Why aren't we seeking any specific order? I think we do. Wouldn't it be strange to expose different order on every platform? We're doing this currently but I think it's a bug.

If performance is really an issue (I can believe it is), wouldn't it be possible to offer a verbose wrapper type with a fast Ord?

But assuming we think memcmp is fine, then why can't we stick with currently autoderived Ord which I think is essentially memcmp?

And yeah, I think @elichai is right. The secp256k1 API does nowhere state the guarantee that equal keys have an equal byte representation. Although we may extend secp256k1 to guarantee this, or simply rely on it: If you look at the implementation, those 64 bytes will (of course) hold the x-coordinate and the y-coordinate. This is not guaranteed either but if we make this assumption, then this already enough to deduce that there are no padding bytes in the FFI type (since both values are 32-byte integers), and this implies it's safe to use memcmp.

apoelstra · 2021-09-13T22:05:28Z

@real-or-random we cannot fix the performance issues by using wrapper types. The problem is that to get from the platform-dependent representation to any other representation, we have to go through the C layer, which makes inlining impossible.

TheBlueMatt · 2021-09-13T22:50:40Z

Why aren't we seeking any specific order? I think we do. Wouldn't it be strange to expose different order on every platform? We're doing this currently but I think it's a bug.

Hmm, why is this a bug? I don't see why a user would ever materially care, unless they're trying to be very specific, in which case you probably want to do it manually (Eg lightning cares about the order, serialized as 33 bytes, but we're not gonna get that from libsecp).

apoelstra · 2021-09-14T00:41:34Z

Because it may make it hard to send sorted data structures, e.g. binary search trees, across the wire.

TheBlueMatt · 2021-09-14T00:52:01Z

Because it may make it hard to send sorted data structures, e.g. binary search trees, across the wire.

Hmm, so that would be a bug for any platform-specific behavior but not for a specific ordering? I suppose we're kinda screwed either way there, though.

Of course the real bug here is that rust encourages rustup so heavily instead of encouraging distro packages, where cross-language LTO is a totally reasonable thing to expect, and then we wouldn't care.

This is somewhat annoying either way - for reference, rust-lightning does have a large BTreeMap with a PublicKey key. We don't care about the specific order, but do need reliable ordering when walking the map, so we'd be impacted by any performance change here. https://github.com/rust-bitcoin/rust-lightning/blob/75f7af6/lightning/src/routing/network_graph.rs#L58

real-or-random · 2021-09-14T07:35:47Z

@real-or-random we cannot fix the performance issues by using wrapper types. The problem is that to get from the platform-dependent representation to any other representation, we have to go through the C layer, which makes inlining impossible.

What I meant is this:

We can have a PublicKey and implement Ord using the FFI functions as in this in PR.
And then we can have a new struct PublicKeyUnstableOrd(PublicKey) where we implement Ord by comparing bytes. Here we don't need to go through the FFI functions, so this should be fast.

It's certainly not elegant but it does the job. A more elegant option would be comparators but nothing like this is in std. I read somewhere that a single Ord was chosen over comparators (maybe with a default comparator) because it makes the syntax simpler in most cases, and one can still emulate comparators using newtypes.

I don't see why a user would ever materially care, unless they're trying to be very specific, in which case you probably want to do it manually (Eg lightning cares about the order, serialized as 33 bytes, but we're not gonna get that from libsecp).

It's simply what (I personally) would expect from an order as I expect -3 < 17 to be true on every platform. In general, I don't expect Rust code to behave differently depending on the platform. That's a major inconvenience when you try to write portable code.

elichai · 2021-09-14T08:37:50Z

We could also have a PublicKeyStableOrd([u8; 64]) that stores uncompressed bytes and de-serializes when converted back to PublicKey (which is very cheap)
(So here you pay a small price on converting and then very cheap comparing, can also go with compressed and pay bigger price on converting but even cheaper comparing)

real-or-random · 2021-09-14T11:51:11Z

We could also have a PublicKeyStableOrd([u8; 64]) that stores uncompressed bytes and de-serializes when converted back to PublicKey (which is very cheap)
(So here you pay a small price on converting and then very cheap comparing, can also go with compressed and pay bigger price on converting but even cheaper comparing)

That's also reasonable. When you store the keys in a collection, you probably want to compare each key at least once, so you anyway want to serialize.

apoelstra · 2021-09-14T13:53:51Z

If somebody implements, say, a BST which is serialized as a flat vector, then sends it across the wire to a machine with a different ordering, the receiving machine will get mysteriously incorrect behavior when trying to use the BST.

I'm not sure how to parse

Hmm, so that would be a bug for any platform-specific behavior but not for a specific ordering?

Yes, any platform-specific behavior would result in these sorts of bugs ... but AFAIK the pubkey ordering is the only such behavior in the library right now.

dr-orlovsky · 2021-09-27T12:06:28Z

Can we summarize somehow to which specific conclusion we have arrived on this matter, such that I can update the PR?

@apoelstra @TheBlueMatt @real-or-random @elichai

real-or-random · 2021-09-27T14:20:38Z

I don't think we have arrived at a conclusion yet. The three proposals are:

Don't care that Ord will be different across machines: Re-implementing public key ordering using underlying FFI functions #309 (comment)
Introduce a struct PublicKeyUnstableOrd(PublicKey) with faster Ord, keep the normal Ord of PublicKey slow but consistent (Re-implementing public key ordering using underlying FFI functions #309 (comment))
Do it the other way around: introduce PublicKeyStableOrd([u8; 64]) and keep the normal Ord of PublicKey fast but inconsistent (Re-implementing public key ordering using underlying FFI functions #309 (comment))
Make Ord consistent and do not produce extra types.

dr-orlovsky · 2021-09-27T15:04:34Z

Is the option "let's make ordering consistent and not produce extra types" off the table?

real-or-random · 2021-09-27T15:06:01Z

Is the option "let's make ordering consistent and not produce extra types" off the table?

I added that one to the list.

dr-orlovsky · 2021-11-06T13:58:00Z

How we can move forward with making a decision of which way to follow?

real-or-random · 2021-11-08T10:43:38Z

I don't think I'm in the position to take a decision but (unsurprisingly) I still favor my own solution. @apoelstra Do you want to have a look and take a decision?

apoelstra · 2021-11-12T13:06:12Z

My decision for now is to punt for the next major release (which implicitly means "don't care that Ord is inconsistent"). But I'd like to revisit this in the future.

TheBlueMatt · 2021-11-12T16:14:25Z

Following up on this - we moved to a NodeId([u8; 33]) and stopped using PublicKeys in places where these things matter. In part so this can move forward and in part because deserializing a million PublicKeys on startup is slow and in part because it cuts the size of key performance-critical maps down substantially. I'm fine with this moving forward either way now.

apoelstra · 2021-12-20T14:41:25Z

I think we should implement (my) slow solution, and try to document somewhere that if you need fast sorting and searching of keys then you should pre-serialize them like this.

Kixunil · 2022-06-10T12:24:51Z

I think that focusing on correctness (cross-platform, stable ordering) is better for Bitcoin stuff. :) To avoid too many types, we could add a method cmp_unstable(&self, other: &PublicKey) -> cmp::Ordering that can be used in sort functions and such and also document that pre-serializing may be better anyway.

Is there a chance to get this in the next release? From what I understand the only difference between @apoelstra and @real-or-random opinions is that @real-or-random wants to provide the unstable wrappers while @apoelstra doesn't want the burden with providing them (LMK if I misunderstood). Maybe my suggestion is an acceptable middle ground? I personally don't care which of the three options is chosen as long as the default ordering is the stable one.

Regarding cross-language LTO, unless we're making it cumbersome, it should be the responsibility of the application developers to setup their environment to allow it if their code is performance-sensitive.

apoelstra · 2022-06-10T18:41:53Z

Yeah, I think if this were rebased I'd be happy to take it.

Kixunil · 2022-06-10T23:50:09Z

To my knowledge Maxim is busy at least until June 24. If someone has the time to rebase and open a new PR I think it'd be great. I may be able to do it in upcoming days too but not sure.

@dr-orlovsky

…ctions 13af519 Make key comparison non-fuzzable (Dr Maxim Orlovsky) 7396604 Implement PublicKey ordering using FFI (Dr Maxim Orlovsky) 0faf404 Benchmark for key ordering (Dr Maxim Orlovsky) 999d165 FFI for pubkey comparison ops (Dr Maxim Orlovsky) Pull request description: Re-base #309 for @dr-orlovsky on request by @Kixunil. To do the rebase I just had to change instances of cfg_attr to use `v0_5_0` instead of `v0_4_1` e.g., ``` #[cfg_attr(not(rust_secp_no_symbol_renaming), link_name = "rustsecp256k1_v0_5_0_xonly_pubkey_cmp")] ``` And drop the changes to `src/schnorrsig.rs`, all these changes are covered by the changes in `key.rs` I believe. ACKs for top commit: Kixunil: ACK 13af519 apoelstra: ACK 13af519 Tree-SHA512: 3054fcbc1707679f54466cdc91162c286394ad691e4f5c8ee18635a22b0854a4e60f1186ef3ca1532aacd8a637d0a153601ec203947e9e58dfcebf1bcb619955

apoelstra · 2022-06-15T15:46:54Z

Superceded by #449

FFI for pubkey comparison ops

45e89df

dr-orlovsky force-pushed the extrakeys/cmp branch from cea9f79 to 8c741de Compare June 19, 2021 12:01

dr-orlovsky added 4 commits June 20, 2021 17:58

Benchmark for key ordering

76cfe2b

Re-implementing PublicKey ordering using FFI

1bd98bc

Re-implementing X-only PublicKey ordering using FFI

ba1395c

Removing ordering and hashing for KeyPair's

3f063c7

dr-orlovsky force-pushed the extrakeys/cmp branch from 8c741de to 3f063c7 Compare June 20, 2021 16:00

dr-orlovsky force-pushed the extrakeys/cmp branch from fda53a6 to e001be2 Compare August 1, 2021 19:56

dr-orlovsky force-pushed the extrakeys/cmp branch from e001be2 to 7476c6a Compare August 9, 2021 13:09

Making key comparison non-fuzziable

84aa321

dr-orlovsky force-pushed the extrakeys/cmp branch from 7476c6a to 84aa321 Compare August 9, 2021 14:03

tcharding mentioned this pull request Jan 25, 2022

Improve documentation #340

Merged

Kixunil mentioned this pull request Jun 10, 2022

Bump crate version to 0.23.0 #444

Merged

tcharding mentioned this pull request Jun 14, 2022

Re-implement public key ordering using underlying FFI functions #449

Merged

apoelstra closed this Jun 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Re-implementing public key ordering using underlying FFI functions #309

Re-implementing public key ordering using underlying FFI functions #309

dr-orlovsky commented Jun 19, 2021

dr-orlovsky commented Jun 19, 2021

TheBlueMatt commented Jun 19, 2021

dr-orlovsky commented Jun 20, 2021

dr-orlovsky commented Jun 27, 2021

TheBlueMatt commented Jun 28, 2021

real-or-random commented Jun 28, 2021

dr-orlovsky commented Aug 9, 2021

apoelstra commented Sep 8, 2021

apoelstra commented Sep 8, 2021

TheBlueMatt commented Sep 8, 2021 via email

apoelstra commented Sep 8, 2021

elichai commented Sep 9, 2021

real-or-random commented Sep 10, 2021

apoelstra commented Sep 13, 2021

TheBlueMatt commented Sep 13, 2021

apoelstra commented Sep 14, 2021

TheBlueMatt commented Sep 14, 2021

real-or-random commented Sep 14, 2021

elichai commented Sep 14, 2021

real-or-random commented Sep 14, 2021

apoelstra commented Sep 14, 2021

dr-orlovsky commented Sep 27, 2021 •

edited

real-or-random commented Sep 27, 2021 •

edited

dr-orlovsky commented Sep 27, 2021 •

edited

real-or-random commented Sep 27, 2021

dr-orlovsky commented Nov 6, 2021

real-or-random commented Nov 8, 2021

apoelstra commented Nov 12, 2021

TheBlueMatt commented Nov 12, 2021

apoelstra commented Dec 20, 2021

Kixunil commented Jun 10, 2022

apoelstra commented Jun 10, 2022

Kixunil commented Jun 10, 2022

apoelstra commented Jun 15, 2022

Re-implementing public key ordering using underlying FFI functions #309

Re-implementing public key ordering using underlying FFI functions #309

Conversation

dr-orlovsky commented Jun 19, 2021

dr-orlovsky commented Jun 19, 2021

TheBlueMatt commented Jun 19, 2021

dr-orlovsky commented Jun 20, 2021

dr-orlovsky commented Jun 27, 2021

TheBlueMatt commented Jun 28, 2021

real-or-random commented Jun 28, 2021

dr-orlovsky commented Aug 9, 2021

apoelstra commented Sep 8, 2021

apoelstra commented Sep 8, 2021

TheBlueMatt commented Sep 8, 2021 via email

apoelstra commented Sep 8, 2021

elichai commented Sep 9, 2021

real-or-random commented Sep 10, 2021

apoelstra commented Sep 13, 2021

TheBlueMatt commented Sep 13, 2021

apoelstra commented Sep 14, 2021

TheBlueMatt commented Sep 14, 2021

real-or-random commented Sep 14, 2021

elichai commented Sep 14, 2021

real-or-random commented Sep 14, 2021

apoelstra commented Sep 14, 2021

dr-orlovsky commented Sep 27, 2021 • edited

real-or-random commented Sep 27, 2021 • edited

dr-orlovsky commented Sep 27, 2021 • edited

real-or-random commented Sep 27, 2021

dr-orlovsky commented Nov 6, 2021

real-or-random commented Nov 8, 2021

apoelstra commented Nov 12, 2021

TheBlueMatt commented Nov 12, 2021

apoelstra commented Dec 20, 2021

Kixunil commented Jun 10, 2022

apoelstra commented Jun 10, 2022

Kixunil commented Jun 10, 2022

apoelstra commented Jun 15, 2022

dr-orlovsky commented Sep 27, 2021 •

edited

real-or-random commented Sep 27, 2021 •

edited

dr-orlovsky commented Sep 27, 2021 •

edited