math/bits: add extended precision Add, Sub, Mul, Div

This is a proposal to speed up crypto/poly1305 using [u]int128 and multiword arithmetic. Arm64 has instructions that let you multiply two 64-bit registers to two 64-bit registers.  It also has instructions for multiword addition. I have implemented some of these intrinsics and intrinsified them in https://go-review.googlesource.com/c/go/+/106376 which improved the performance of crypto/poly1305 by ~30% on arm64 (Amberwing). I have added these intrinsics for arm64 in poly1305 package but they might benefit other platforms as well. I am seeking advice on the design of this implementation. Is poly1305 package the right place to have these intrinsics or should they go in math/big or math/bit? This might also be a use case for #9455

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

math/bits: add extended precision Add, Sub, Mul, Div #24813

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

math/bits: add extended precision Add, Sub, Mul, Div #24813

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions