BN254 block multiplier (assembly version) by xrvdg · Pull Request #30 · worldfnd/provekit

xrvdg · 2025-05-01T03:37:45Z

This PR adds inline assembly for the BN254 block multipliers. It is only a partial replacement for the Rust version, because this PR doesn’t include an optimised squaring routine. Even with the assembly in place, the Rust version remains useful as a fallback on non-NEON architectures and as a reference implementation for testing.
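As a rough illustration of the fallback arrangement, the sketch below picks a backend at compile time using `cfg` attributes. The function name `selected_backend` and the returned labels are hypothetical and are not taken from this PR; the actual crate may gate its assembly differently.

```rust
// Hypothetical sketch: none of these names come from this PR.

/// Compiled in when targeting AArch64 with NEON enabled.
#[cfg(all(target_arch = "aarch64", target_feature = "neon"))]
fn selected_backend() -> &'static str {
    "assembly"
}

/// Compiled in on every other target, where the Rust version is the fallback.
#[cfg(not(all(target_arch = "aarch64", target_feature = "neon")))]
fn selected_backend() -> &'static str {
    "portable-rust"
}

fn main() {
    println!("block multiplier backend: {}", selected_backend());
}
```

Because exactly one of the two definitions is compiled, callers never need a runtime check, and the Rust path stays available as a reference to test the assembly against.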

Motivation
The motivation for an assembly version is predictable performance. With the Rust version we’ve seen that small changes to the source can cause big performance differences, which makes it likely to be susceptible to changes in Rust/LLVM as well.

Concurrent multiplications
Three concurrent multiplications performed best on the Raspberry Pi 5 (130 ns for 3 multiplications vs 97 ns for a single multiplication). On Apple Silicon (M3) we saw an improvement of ~6% when running four concurrent multiplications compared to a single multiplication.
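To put the Raspberry Pi 5 figures on a per-multiplication basis, here is a small worked calculation. The helper `ns_per_multiplication` is purely illustrative and not part of this PR; only the 130 ns and 97 ns figures come from the measurements quoted above.

```rust
/// Amortised time per multiplication when `count` multiplications share one call.
fn ns_per_multiplication(total_ns: f64, count: u32) -> f64 {
    total_ns / f64::from(count)
}

fn main() {
    // Figures quoted above for the Raspberry Pi 5.
    let single = ns_per_multiplication(97.0, 1);
    let interleaved = ns_per_multiplication(130.0, 3);

    println!("single: {single:.1} ns, three-way: {interleaved:.1} ns per multiplication");
    println!("throughput gain: {:.2}x", single / interleaved);
}
```

With these numbers the three-way version costs roughly 43 ns per multiplication, about 2.2x the throughput of the single multiplication.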

Depends on #28
Tracking: #35

Round towards Zero

Support for more architectures and rounding modes.

xrvdg added 2 commits May 1, 2025 11:22

Add block-multiplier-sys

829d9aa

fixup! Add block-multiplier-sys

c98628d

xrvdg marked this pull request as draft May 1, 2025 03:37

xrvdg mentioned this pull request May 2, 2025

Round towards Zero #28

Merged

xrvdg changed the title ~~Assembly for montgomery multipliers of size 3 and 4~~ BN254 block multiplier (assembly version) May 2, 2025

xrvdg mentioned this pull request May 2, 2025

Tracking: BN254 montgomery multiplier #35

Closed

xrvdg marked this pull request as ready for review May 2, 2025 06:53

xrvdg requested review from Dzejkop, Quarky93 and recmo May 2, 2025 06:53

recmo added 8 commits May 6, 2025 12:59

Refactor to generalize over modes and archs

09f12aa

Fix clippies

302435a

Merge pull request #28 from worldfnd/xr/rtz

ca4a5ec

Round towards Zero

Merge branch 'main' into recmo/rounding

128045e

update to main

2a96bbd

Add tests

7b039d3

Merge pull request #38 from worldfnd/recmo/rounding

9bddb32

Support for more architectures and rounding modes.

Merge branch 'main' into xr/block-multiplier-sys

baa60f8

recmo approved these changes May 6, 2025

recmo merged commit 6db5c0e into xr/rtz May 6, 2025
0 of 2 checks passed

recmo deleted the xr/block-multiplier-sys branch May 6, 2025 18:04

recmo restored the xr/block-multiplier-sys branch May 6, 2025 18:17

recmo deleted the xr/block-multiplier-sys branch May 6, 2025 18:18

recmo merged 10 commits into xr/rtz from xr/block-multiplier-sys

2 participants
