secp256k1: Optimize precomp values to use affine. #2690

davecgh · 2021-07-31T08:34:04Z

This optimizes the pre-computed byte points used to accelerate scalar base multiplication by storing them in affine coordinates instead of Jacobian coordinates. Doing so reduces the memory usage requirement to roughly 66% of what it currently requires and also has the important benefit of further speeding up the computation.
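As a rough check on the 66% figure, the following sketch compares the size of a table that stores three field values per point (Jacobian X, Y, Z) with one that stores two (affine X, Y). The table shape (32 byte windows of 256 points each) and the 32-byte coordinate size are assumptions made for illustration only; the ratio itself does not depend on either.

```go
package main

import "fmt"

const (
	numWindows      = 32  // assumed: one window per byte of a 256-bit scalar
	pointsPerWindow = 256 // assumed: one entry per possible byte value
	fieldValSize    = 32  // hypothetical bytes per coordinate; cancels out of the ratio
)

// tableBytes returns the size in bytes of a table that stores
// coordsPerPoint field values for every precomputed point.
func tableBytes(coordsPerPoint int) int {
	return numWindows * pointsPerWindow * coordsPerPoint * fieldValSize
}

func main() {
	jacobian := tableBytes(3) // X, Y, Z
	affine := tableBytes(2)   // X, Y
	fmt.Printf("jacobian=%d B, affine=%d B, ratio=%.1f%%\n",
		jacobian, affine, 100*float64(affine)/float64(jacobian))
}
```

Dropping the Z coordinate removes one of three stored values per point, which is where the two-thirds ratio comes from.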

The computation is faster because projecting affine coordinates into Jacobian space is essentially free, and the point doubling and addition routines have optimizations that allow them to avoid additional operations when the Z coordinate is 1, which is always the case for an initial affine projection.

Further, since the compressed table is stored in the string table of the binary, it also reduces the size of the final binary by ~385KiB.

There are also a couple of preparatory commits to ease the review process. They separate the code that loads the pre-computed byte points from the elliptic adaptor code, which further makes the internals of the package independent of the crypto/elliptic and crypto/ecdsa stdlib interfaces.

The following benchmark shows a before and after comparison of scalar base multiplication as well as how that translates to signature verification:

name                 old time/op  new time/op  delta
-----------------------------------------------------------------------
ScalarBaseMult       34.5µs ± 1%  24.7µs ± 1%  -28.43%  (p=0.008 n=5+5)
ScalarBaseMultLarge  48.2µs ± 1%  38.0µs ± 1%  -21.08%  (p=0.008 n=5+5)
SigVerify             181µs ± 5%   163µs ± 2%   -9.86%  (p=0.008 n=5+5)

While 18 µs less per signature verification might not seem like much on the surface, consider that every transaction requires at least one signature operation, so a sync without checkpoints performs an enormous number of them. For a concrete number, verifying 100 million signatures would take 30 minutes less.

davecgh · 2021-08-02T23:20:05Z

Rebased to the latest master. No changes.

dcrec/secp256k1/loadprecomputed.go

matheusd

Tested regenerating the compressedpoints.go file (after locally removing it) and it matches the one in the commit.

rstaudt2

Looks good to me! Ran a full sync on mainnet with --nocheckpoints without issue.


This separates the code that loads the pre-computed byte points used to accelerate scalar base multiplication from the elliptic adaptor code, which further makes the internals of the package independent of the crypto/elliptic and crypto/ecdsa stdlib interfaces. It also takes this opportunity to improve the related code a bit by making it less dependent on magic numbers and by defining a proper type for the data table. Finally, it retains and improves the logic that only loads the data on first use: a closure now houses the loaded data and provides access to it, so it is no longer possible to accidentally access the uninitialized pointer.
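The load-on-first-use closure described above can be sketched roughly as follows. This is an illustration of the pattern, not the code from the commit: `bytePointTable`, `decodeTable`, `precomputedPoints`, and `loadCount` are hypothetical names, the table contents are placeholders, and the use of `sync.Once` is an assumption about how the first-use guard might be built.

```go
package main

import (
	"fmt"
	"sync"
)

// bytePointTable is a hypothetical stand-in for the real table type:
// 32 byte windows, 256 points each, 2 affine coordinates per point.
type bytePointTable [32][256][2]uint64

// loadCount records how often the expensive load ran (demonstration only).
var loadCount int

// decodeTable stands in for decompressing and deserializing the table.
func decodeTable() *bytePointTable {
	loadCount++
	t := new(bytePointTable)
	t[0][1] = [2]uint64{1, 2} // placeholder data
	return t
}

// precomputedPoints returns the table, loading it on first use. The table
// pointer lives inside the closure, so no other code in the package can
// reach it before it has been initialized.
var precomputedPoints = func() func() *bytePointTable {
	var once sync.Once
	var table *bytePointTable
	return func() *bytePointTable {
		once.Do(func() { table = decodeTable() })
		return table
	}
}()

func main() {
	a := precomputedPoints()
	b := precomputedPoints()
	fmt.Println(a == b, loadCount)
}
```

Compared with a package-level pointer plus a separate init function, this leaves the accessor as the only way to reach the data.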

This removes the code that deals with only initializing the adaptor instance on first use since the adaptor code no longer houses the pre-computed byte points which motivated that behavior.


davecgh added the optimization label Jul 31, 2021

davecgh added this to the 1.7.0 milestone Jul 31, 2021

davecgh force-pushed the secp256k1_optimize_precomps_and_base_mult branch 3 times, most recently from af58a2d to d886f32 Compare August 2, 2021 23:18

davecgh mentioned this pull request Aug 3, 2021

secp256k1: Optimize NAF conversion. #2695

Merged

jrick reviewed Aug 4, 2021


matheusd approved these changes Aug 5, 2021


rstaudt2 approved these changes Aug 10, 2021


JoeGruffins approved these changes Aug 11, 2021


davecgh added 3 commits August 11, 2021 14:08

secp256k1: Always initialize adaptor instance.

8cef307

This removes the code that deals with only initializing the adaptor instance on first use since the adaptor code no longer houses the pre-computed byte points which motivated that behavior.

davecgh force-pushed the secp256k1_optimize_precomps_and_base_mult branch from d886f32 to fdfae1a Compare August 11, 2021 19:10

davecgh merged commit fdfae1a into decred:master Aug 11, 2021

davecgh deleted the secp256k1_optimize_precomps_and_base_mult branch August 11, 2021 19:17
