
Pasta / Halo2 MSM bench #243

Merged
mratsim merged 7 commits into master from pasta-bench on Jun 4, 2023
Conversation

@mratsim (Owner) commented May 31, 2023

Benchmarks and optimization of MSM for the Halo2 Pasta curves

Current results on an 8-core i9-11980HK with Clang:

(screenshot: Constantine MSM benchmark results)

TODO

  • Bench vs the Zcash / Privacy & Scaling Explorations implementation
  • Make "no ASM" an environment variable to halve the number of nimble tasks
  • Generate coefficient-point pairs in parallel (see the sketch after this list)
  • Bench on a large core count
  • Tune if needed
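
On the "generate coefficient-point pairs in parallel" item: Constantine's own bench generator is Nim, but the idea is sketched below in Rust with rayon and the pasta_curves crate, purely as an illustration of the approach.

```rust
// Illustrative sketch only: parallel generation of (scalar, point) MSM inputs
// with rayon and the pasta_curves crate. Not Constantine's actual (Nim) generator.
use ff::Field;
use group::{Curve, Group};
use pasta_curves::pallas;
use rand_core::OsRng;
use rayon::prelude::*;

fn gen_msm_inputs(n: usize) -> (Vec<pallas::Scalar>, Vec<pallas::Affine>) {
    // Scalars and points are independent, so both loops parallelize trivially.
    let coeffs: Vec<pallas::Scalar> = (0..n)
        .into_par_iter()
        .map(|_| pallas::Scalar::random(OsRng))
        .collect();
    let bases: Vec<pallas::Affine> = (0..n)
        .into_par_iter()
        .map(|_| pallas::Point::random(OsRng).to_affine())
        .collect();
    (coeffs, bases)
}
```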

@mratsim (Owner, Author) commented May 31, 2023

I might have missed something, but there seems to be no MSM benchmark in the Halo2 or pasta_curves repos?

Used this PR: https://github.com/zcash/halo2/pull/619/files
file: https://github.com/zcash/halo2/blob/b131df023c9a860244b5b0a24d03a1c249f4c82c/halo2_proofs/benches/multiexp.rs
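
For context, the rough shape of that Criterion benchmark, re-sketched below (assumed halo2_proofs API of that era, not a verbatim copy of the linked file); the key question raised in the next comment is whether input/parameter generation ends up inside the timed closure.

```rust
// Re-sketch of a halo2 multiexp Criterion benchmark (assumed API, not the linked file verbatim).
use criterion::{criterion_group, criterion_main, BenchmarkId, Criterion};
use ff::Field;
use group::{Curve, Group};
use halo2_proofs::arithmetic::best_multiexp;
use halo2_proofs::pasta::{Eq, EqAffine, Fp};
use rand_core::OsRng;

fn bench_multiexp(c: &mut Criterion) {
    let mut group = c.benchmark_group("multiexp");
    for k in 8..=16u32 {
        let n = 1usize << k;
        // Generate inputs once, outside the timed closure; if point/parameter generation
        // lands inside the measurement, the numbers no longer reflect the MSM itself.
        let coeffs: Vec<Fp> = (0..n).map(|_| Fp::random(OsRng)).collect();
        let bases: Vec<EqAffine> = (0..n).map(|_| Eq::random(OsRng).to_affine()).collect();
        group.bench_with_input(BenchmarkId::from_parameter(k), &k, |b, _| {
            b.iter(|| best_multiexp(&coeffs, &bases))
        });
    }
    group.finish();
}

criterion_group!(benches, bench_multiexp);
criterion_main!(benches);
```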

(screenshot: halo2 multiexp benchmark results)

@mratsim (Owner, Author) commented May 31, 2023

Rebenched at the same scale as Halo2:

(screenshots: Constantine MSM benchmark results at Halo2 scale)

Something is really strange.

For 2⁸ = 256 inputs it takes 13 ms, which is even slower than my naive scalar mul at 8.7 ms. Multithreaded, Constantine is at 313 µs, so 41x faster.

The code in the PR is indeed multithreaded: https://github.com/zcash/halo2/blob/b131df0/halo2_proofs/src/arithmetic.rs#L28-L30

Is the benchmark flawed, measuring the trusted setup as well? https://github.com/zcash/halo2/blob/b131df0/halo2_proofs/benches/multiexp.rs#L25-L28

Thing is, time roughly doubles each time we double the input size, but MSM should scale as O(n/log n), while the Rust bench seems to grow linearly.

With 2¹⁵ inputs (32768), they take 1.7 s while Constantine takes 12 ms, a 141.7x ratio, which sounds crazy. Even the naive implementation in Constantine takes only 1.11 s.
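
As a sanity check on that scaling argument, here is a back-of-the-envelope Pippenger cost model (a generic sketch of the bucket method, not necessarily Constantine's exact windowing):

```rust
// Back-of-the-envelope Pippenger (bucket method) cost model.
// With window size c and b-bit scalars, the bucket method needs roughly
// (b / c) * (n + 2^c) group additions; picking c ≈ log2(n) gives ~ b*n / log2(n),
// i.e. sublinear growth in n.
fn pippenger_cost(n: u64, bits: u64) -> f64 {
    let c = (n as f64).log2().max(1.0);
    (bits as f64 / c) * (n as f64 + 2f64.powf(c))
}

fn main() {
    let bits = 255; // Pasta scalar fields are ~255-bit
    let ratio = pippenger_cost(1 << 15, bits) / pippenger_cost(1 << 8, bits);
    // Expected: ~68x more work for 128x more points, i.e. clearly sublinear,
    // whereas the halo2 bench numbers above grow roughly linearly with n.
    println!("expected cost ratio, 2^15 vs 2^8 inputs: {:.1}x", ratio);
}
```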

@mratsim (Owner, Author) commented May 31, 2023

Benching vs Supranational at https://github.com/supranational/pasta-msm

(screenshots: Constantine vs Supranational pasta-msm benchmark results)

Constantine is roughly 2x faster.

@mratsim (Owner, Author) commented Jun 4, 2023

On a watercooled overclocked 18-core i9-9980XE
(screenshots: benchmark results on the i9-9980XE)

@mratsim (Owner, Author) commented Jun 4, 2023

From Zcash repo: https://github.com/zcash/halo2/pull/619/files#diff-a07879d4aa4c95cfbfb03f5de33deee89d548aba465ff7bbdc5965d24463b0cb

(screenshot: Zcash halo2 multiexp benchmark results)

Similar perf issues on the Zcash side:

  • 26 ms for 256 inputs, while Constantine on my machine is 0.412 ms, a 63x ratio
  • 3.3932 s for 32768 inputs, while Constantine is 0.011 s, a 308x ratio

@mratsim (Owner, Author) commented Jun 4, 2023

Supranational: https://github.com/supranational/pasta-msm

(screenshot: Constantine vs Supranational pasta-msm benchmark results on the i9-9980XE)

There is a 2.22x ratio in favor of Constantine.

@mratsim mratsim marked this pull request as ready for review June 4, 2023 15:40
@mratsim mratsim merged commit 0eba593 into master Jun 4, 2023
12 checks passed
@mratsim mratsim deleted the pasta-bench branch June 4, 2023 15:42