Add `interleave` method by Shnatsel · Pull Request #206 · linebender/fearless_simd

Shnatsel · 2026-04-12T11:46:02Z

Adds a new method with API matching std::simd's interleave method.

The primary motivation is performance: on AVX2, zip_low followed by zip_high requires 6 instructions, while a combined interleave function only needs 4 instructions. (With AVX-512 we'd be able to to do it in 2 instructions on 256-bit vectors, but that's not supported by fearless_simd yet and #201 seems stalled).

This also improves API compatibility with std::simd as a nice bonus.

AI use disclosure: this work was assisted by Claude Opus 4.5 for the initial commit and 4.6 for the rest. I have manually reviewed the code and take full responsibility for it.

…d by zip_high

Shnatsel · 2026-04-18T10:49:12Z

I'm seeing non-trivial improvements from this, up to 8% end-to-end for my FFT implementation, so I'd really appreciate this getting reviewed and published as a point release.

LaurenzV · 2026-04-18T10:58:05Z

Haven't forgotten about this, just haven't gotten to it yet. Will try to do soon, if no one else gets to it before me.

LaurenzV · 2026-04-18T15:45:26Z

+    pub(crate) fn handle_deinterleave(
+        &self,
+        method_sig: TokenStream,
+        vec_ty: &VecType,
+    ) -> TokenStream {
+        let unzip_low = generic_op_name("unzip_low", vec_ty);
+        let unzip_high = generic_op_name("unzip_high", vec_ty);
+        quote! {
+            #method_sig {
+                (self.#unzip_low(a, b), self.#unzip_high(a, b))
+            }
+        }
+    }
+


WHy canw e not apply the same optimization for deinterleave?

That's an oversight on my part. I was focused on interleaving, since that's the hot path in my code, and forgot to optimize deinterleave. I'll do so shortly.

It's not big deal, I was just wondering.

LaurenzV

Seems fine to me, since no one else has commented I presume no one has any objections. Thanks!

Shnatsel added 11 commits January 21, 2026 11:17

Add interleave function mimicking std::simd

9c754c6

Merge branch 'main' into interleave

f5dac39

Add an optimized AVX2 implementation of interleave

0f0f905

re-run the generator

f689305

cargo fmt

b8e4adc

Add basic tests for deinterleaving

2abf59e

Implement deinterleave() operation in the generator

5c87962

Re-run the generator

f058be3

Add more interleave/deinterleave tests

7b59999

Document that interleave/deinterleave are faster than zip_low followe…

c57f5e2

…d by zip_high

re-run the generator

7642be3

LaurenzV self-requested a review April 13, 2026 04:44

Shnatsel mentioned this pull request Apr 18, 2026

PoC: Make the codelets operate entirely in registers QuState/PhastFT#113

Merged

LaurenzV added 4 commits April 18, 2026 13:44

Reduce comments

b767998

Reduce comments

5889e19

Reformat

aaa0cac

Remove more comments

781c7fe

LaurenzV changed the title ~~Add interleave() matching std::simd API, faster than zip_low/zip_high on AVX2~~ Add interleave method Apr 18, 2026

LaurenzV reviewed Apr 18, 2026

View reviewed changes

LaurenzV approved these changes Apr 18, 2026

View reviewed changes

LaurenzV added this pull request to the merge queue Apr 18, 2026

Merged via the queue into linebender:main with commit 8fcafea Apr 18, 2026
22 checks passed

This was referenced Apr 18, 2026

Optimize deinterleave() for AVX2 #207

Open

Use interleave() instead of zip_low()/zip_high() for better perf on AVX2 QuState/PhastFT#116

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `interleave` method#206

Add `interleave` method#206
LaurenzV merged 15 commits intolinebender:mainfrom
Shnatsel:interleave

Shnatsel commented Apr 12, 2026 •

edited

Loading

Uh oh!

Shnatsel commented Apr 18, 2026

Uh oh!

LaurenzV commented Apr 18, 2026

Uh oh!

LaurenzV Apr 18, 2026

Uh oh!

Shnatsel Apr 18, 2026

Uh oh!

LaurenzV Apr 18, 2026

Uh oh!

LaurenzV left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Shnatsel commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Shnatsel commented Apr 18, 2026

Uh oh!

LaurenzV commented Apr 18, 2026

Uh oh!

LaurenzV Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

Shnatsel Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

LaurenzV Apr 18, 2026

Choose a reason for hiding this comment

Uh oh!

LaurenzV left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Shnatsel commented Apr 12, 2026 •

edited

Loading