Add initial WASM SIMD128 support #8

ajakubowicz-canva · 2025-06-18T07:18:25Z

Note

The measurements in this PR were done very rough via a quick Chrome profile. Because the numbers are small, until we run a rigorous benchmark it'll be hard to validate the impact. Especially across browser engines, and with JS engines optimising WASM and fusing operators.

Overview

This PR adds initial WASM SIMD support to fearless_simd, implementing enough operations to enable WASM SIMD in linebender/vello#1053. Rather than implementing all operations in one large PR, this focuses on the essential subset needed for Vello and breaks up an otherwise huge change. There's also some tricky operations to implement using the small amount of WASM instructions 😅 .

Performance Impact

Tested with Ghost Tiger rendering in Vello:

Without fast kurbo (baseline):

Without SIMD: ~42.31ms per frame
With SIMD: ~39.91ms per frame
Improvement: ~5% faster

With fast kurbo (linebender/kurbo#427):

Without SIMD: ~6ms per frame
With SIMD: ~4ms per frame
Improvement: ~30% faster (Take this with a huge grain of salt)

Test methodology: I linked vello locally to fearless_simd via path reference, and modified vello_hybrid to use the WASM SIMD level.

Changes

New architecture: Added WASM SIMD128 support
Operations implemented: Core subset including:
- Binary ops: add, sub, mul, min, max
- Comparison ops: simd_eq, simd_ne, simd_lt, etc.
- Math ops: sqrt, madd
Testing: Added parity tests ensuring Fallback and WASM SIMD produce identical results
Bug fix: Fixed incorrect mask generation in Fallback comparison operations (was returning 0/1 instead of 0/-1)

Test Plan

Added test_wasm_simd_parity! macro that verifies operations produce identical results across Fallback and WASM SIMD implementations.

I only tested a small subset. Maybe in the future we code-gen the tests as well?

Next Steps

Future PRs will add more operations to achieve full WASM SIMD coverage.

fearless_simd_tests/tests/wasm.rs

LaurenzV · 2025-06-18T08:21:26Z

fearless_simd_tests/tests/wasm.rs

I agree that, in the long term, we will probably want to autogenerate the tests, so that you write a test once with the expected output, and then compare it across all implementations. But, happy to land this as an intermediate version so we have at least some test coverage.

Thank you!

I tried a couple different macro approaches, but it ended up being extremely confusing. Maybe there is a way to express the tests as a nice macro as well. I am also not very confident with macros.

I also don't yet have a good idea of how to test neon. Maybe the CI mac already supports it?

The CI Macs should support this, because they're physical M1 machines. In theory, you should be running the same code with different level enum values, I think.

.github/workflows/ci.yml

LaurenzV · 2025-06-18T08:27:58Z

fearless_simd_tests/tests/wasm.rs

+#[cfg(target_arch = "wasm32")]
+use wasm_bindgen_test::*;
+
+/// `test_wasm_simd_parity` enforces that the fallback level and +simd128 levels output the same


Ideally the reference result should be provided manually (since it's possible the fallback is wrong as well), but that's for the future when we implement a proper test suite

fearless_simd_gen/src/arch/wasm.rs

LaurenzV · 2025-06-18T08:55:37Z

fearless_simd_gen/src/mk_fallback.rs

                                let expr = Fallback.expr(method, vec_ty, &args);
                                let mask_ty = mask_type.scalar.rust(scalar_bits);
-                                quote! { #expr as #mask_ty }
+                                quote! { -(#expr as #mask_ty) }


Oh I see now, this is because in neon, the mask for true is all bits set to 1? I assumed it doesn't matter what the representation for true is as long as false is, but I guess it makes sense to keep it consistent. Do you know if all SIMD variants are guaranteed to have that representation?

From a quick look, SSE/AVX (x86) and neon both set all bits to 1 for true, and all to 0 for false. So this change will be consistent with them.

E.g. Neon, x86, fallback, and Wasm should all be identical.

fearless_simd_gen/src/mk_wasm.rs

ajakubowicz-canva · 2025-06-18T12:05:20Z

Validated that tests run on CI: https://github.com/raphlinus/fearless_simd/actions/runs/15732084678/job/44335357500#step:6:3110

ajakubowicz-canva added 3 commits June 18, 2025 17:06

initial boilerplate - enough to run linebender PR

393d7ed

small cleanup

72d7ab9

run fmt

6f45bb2

ajakubowicz-canva requested a review from LaurenzV June 18, 2025 07:18

ajakubowicz-canva changed the title ~~Start on wasm32 simd128 architecture~~ Add initial WASM SIMD128 support Jun 18, 2025

revert cfg change

15375de

DJMcNab reviewed Jun 18, 2025

View reviewed changes

fearless_simd_tests/tests/wasm.rs Outdated Show resolved Hide resolved

code review feedback - remove test prefix

230270f

LaurenzV approved these changes Jun 18, 2025

View reviewed changes

add ci

732454e

remove comments

96eed42

ajakubowicz-canva merged commit e46bcfd into main Jun 18, 2025
6 checks passed

ajakubowicz-canva deleted the ajakubowicz-wasm32-simd128 branch June 18, 2025 12:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add initial WASM SIMD128 support #8

Add initial WASM SIMD128 support #8

Uh oh!

ajakubowicz-canva commented Jun 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Uh oh!

ajakubowicz-canva Jun 18, 2025

Uh oh!

ajakubowicz-canva Jun 18, 2025

Uh oh!

DJMcNab Jun 18, 2025

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Uh oh!

ajakubowicz-canva Jun 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

ajakubowicz-canva commented Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

Add initial WASM SIMD128 support #8

Add initial WASM SIMD128 support #8

Uh oh!

Conversation

ajakubowicz-canva commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Performance Impact

Without fast kurbo (baseline):

With fast kurbo (linebender/kurbo#427):

Changes

Test Plan

Next Steps

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

DJMcNab Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

LaurenzV Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

ajakubowicz-canva Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ajakubowicz-canva commented Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

ajakubowicz-canva commented Jun 18, 2025 •

edited

Loading

ajakubowicz-canva Jun 18, 2025 •

edited

Loading