[kernel] feat: vectorized kernels#1497

Merged: AlpinDale merged 1 commit into main from vectorized_kernels on Sep 5, 2025

AlpinDale (Member) commented on Sep 5, 2025:

So far, only RMSNorm is done. No decrease in accuracy on gsm8k (5-shot):

Before:

|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.6808|±  |0.0128|
|     |       |strict-match    |     5|exact_match|↑  |0.6740|±  |0.0129|

After:

|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.6816|±  |0.0128|
|     |       |strict-match    |     5|exact_match|↑  |0.6748|±  |0.0129|
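
To put the two tables side by side, the sketch below computes the after-minus-before delta for each filter and expresses it in units of the reported standard error. The scores are copied from the tables above; the helper name `delta_in_stderr_units` is hypothetical, and combining the two standard errors in quadrature assumes the runs are independent, which is a simplification since both use the same gsm8k questions.

```python
import math

# gsm8k scores copied from the Before/After tables: (value, stderr).
before = {"flexible-extract": (0.6808, 0.0128), "strict-match": (0.6740, 0.0129)}
after = {"flexible-extract": (0.6816, 0.0128), "strict-match": (0.6748, 0.0129)}


def delta_in_stderr_units(b, a):
    """Return (after - before, that difference divided by the combined stderr)."""
    diff = a[0] - b[0]
    # Assumption: treat the two runs as independent and add stderrs in quadrature.
    combined = math.sqrt(b[1] ** 2 + a[1] ** 2)
    return diff, diff / combined


results = {name: delta_in_stderr_units(before[name], after[name]) for name in before}
for name, (diff, z) in results.items():
    print(f"{name}: delta={diff:+.4f}, delta/stderr={z:+.3f}")
```

Both deltas come out at +0.0008, which is a small fraction of one standard error, so the change is well inside the noise of the benchmark.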

Performance (Olmoe-1B-7B):
[image: performance results, not preserved]

  • RMSNorm
  • Activations
  • RoPE
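
As background for the first item, here is a minimal pure-Python sketch of what RMSNorm computes, y_i = x_i * w_i / sqrt(mean(x^2) + eps), and of the access pattern a vectorized kernel typically uses: fixed-width packs followed by a scalar tail. It is not the code from this PR. The pack width `VEC_WIDTH`, the `eps` default, and both function names are assumptions made for illustration, and the real kernels are GPU code rather than Python.

```python
import math

VEC_WIDTH = 4  # hypothetical pack size, chosen only for illustration


def rms_norm_scalar(x, weight, eps=1e-6):
    """Reference RMSNorm: x * weight / sqrt(mean(x^2) + eps)."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def rms_norm_packed(x, weight, eps=1e-6, width=VEC_WIDTH):
    """Same math, walking the row in fixed-width packs plus a scalar tail."""
    n = len(x)
    n_vec = n - n % width  # largest prefix that divides evenly into packs

    # Pass 1: accumulate the sum of squares, one pack at a time.
    sum_sq = 0.0
    for i in range(0, n_vec, width):
        pack = x[i:i + width]  # stands in for a single wide load
        sum_sq += sum(v * v for v in pack)
    for i in range(n_vec, n):  # scalar tail for leftover elements
        sum_sq += x[i] * x[i]

    inv_rms = 1.0 / math.sqrt(sum_sq / n + eps)

    # Pass 2: scale and apply the weight, again pack by pack.
    out = [0.0] * n
    for i in range(0, n_vec, width):
        for j in range(i, i + width):
            out[j] = x[j] * inv_rms * weight[j]
    for i in range(n_vec, n):
        out[i] = x[i] * inv_rms * weight[i]
    return out


x = [0.5, -1.25, 2.0, 0.75, -0.1, 3.0, 1.5]  # length 7 exercises the tail path
w = [1.0] * len(x)
ref = rms_norm_scalar(x, w)
packed = rms_norm_packed(x, w)
max_abs_diff = max(abs(a - b) for a, b in zip(ref, packed))
print(f"max |scalar - packed| = {max_abs_diff:.3e}")
```

The packed version changes only the order in which elements are read and summed, so it should agree with the scalar reference up to floating-point rounding, which is consistent with the unchanged gsm8k scores reported above.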

@AlpinDale AlpinDale merged commit e6a618d into main Sep 5, 2025
0 of 4 checks passed