
Conversation

@barronalex (Contributor) commented Feb 18, 2025

Add support for large Hadamard transforms on the GPU.

For $N=2^{24}$ the GPU version is about 50x faster than the CPU:

Timing hadamard_transform (GPU) ... 2.32494 msec
Timing hadamard_transform (CPU) ... 123.39948 msec
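
For context, a minimal sketch of how such a comparison could be reproduced from Python, assuming the public `mx.hadamard_transform` API and device switching via `mx.set_default_device` (the actual benchmark harness used for these numbers is not shown in the PR):

```python
import time
import mlx.core as mx

def time_hadamard(device, n=2**24, reps=10):
    # Hypothetical harness; mimics the "Timing ..." output format above.
    mx.set_default_device(device)
    x = mx.random.normal((n,))
    mx.eval(mx.hadamard_transform(x))  # warmup
    tic = time.perf_counter()
    for _ in range(reps):
        y = mx.hadamard_transform(x)
        mx.eval(y)
    msec = (time.perf_counter() - tic) / reps * 1e3
    print(f"Timing hadamard_transform ... {msec:.5f} msec")

time_hadamard(mx.gpu)
time_hadamard(mx.cpu)
```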

@angeloskath (Member) commented:

Looks great! As per offline discussion, let's move this into the primitive instead.

@angeloskath (Member) commented Apr 29, 2025

This should be fine to review and merge now.

The kernel is a bit faster than copying and calling the contiguous one, and it has the added benefit of being completely in-place, which the transpose-copy approach can't be.
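
As a sanity check on the kernel's output, one way to verify it against an explicit Sylvester-constructed Hadamard matrix (a sketch; it assumes `mx.hadamard_transform` defaults to a 1/sqrt(n) scale, so `scale=1.0` is passed to match the unnormalized matrix):

```python
import mlx.core as mx
import numpy as np

def sylvester(n):
    # Build the n x n Hadamard matrix via the Sylvester recursion
    # H_{2m} = [[H_m, H_m], [H_m, -H_m]]; n must be a power of two.
    H = np.array([[1.0]], dtype=np.float32)
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H

n = 1 << 10
x = mx.random.normal((8, n))
out = mx.hadamard_transform(x, scale=1.0)  # unnormalized transform
ref = np.asarray(x) @ sylvester(n)         # H is symmetric, so H == H.T
assert np.allclose(np.asarray(out), ref, atol=1e-2)
```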

[Plot: hadamard_transform benchmark, throughput in GB/s]

The 16K-element in-place transform was not launching correctly (max threads per threadgroup was 832), so I reduced the limit to 8K. @barronalex, if you remember encountering the same issue before, let me know how you fixed it 🤔.

Edit: The plot is in GB/s, not MB/s.
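
A back-of-the-envelope check of the launch failure described above (a sketch; the 16-elements-per-thread figure is an assumption for illustration, not the kernel's actual value):

```python
# If each thread owns a fixed chunk of the in-place transform, the required
# threadgroup size grows with N and can exceed the pipeline's reported limit.
ELEMS_PER_THREAD = 16   # hypothetical; the real kernel may differ
MAX_THREADS = 832       # maxTotalThreadsPerThreadgroup reported in the PR

for n in (1 << 13, 1 << 14):  # 8K and 16K elements
    threads = n // ELEMS_PER_THREAD
    status = "ok" if threads <= MAX_THREADS else "exceeds limit"
    print(f"N={n}: {threads} threads per threadgroup -> {status}")
```

Under these assumptions, 8K needs 512 threads (fine) while 16K needs 1024, which is over the 832-thread limit, consistent with reducing the in-place cutoff to 8K.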

@angeloskath requested a review from @awni on April 29, 2025 at 16:55.
@awni (Member) left a comment:

Awesome!!

@angeloskath force-pushed the big-gpu-hadamard branch 2 times, most recently from fb6f761 to 61ffdf0 on May 1, 2025 at 22:56.
@angeloskath added a commit that referenced this pull request on May 2, 2025.
@angeloskath merged commit 4813494 into main on May 2, 2025 (0 of 3 checks passed).
@angeloskath deleted the big-gpu-hadamard branch on May 2, 2025 at 00:19.
faisalmemon pushed a commit to faisalmemon/mlx that referenced this pull request on Oct 30, 2025.