[RVV] add rvv f32 kernels for velu, vgelu, vapproxgelu#9963

Merged
copybara-service[bot] merged 3 commits into google:master from velonica0:rvv-elementwise
Apr 29, 2026
@velonica0
Contributor

Part of #9954

Add RVV kernels for three elementwise activation ops:

  • f32-velu (ELU) - rr2_p6 polynomial approximation
  • f32-vgelu (GELU) - rational 12/10 approximation (div + nr variants)
  • f32-vapproxgelu - rational 12/10 approximation (div + nr variants)
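
The "div + nr" split above can be illustrated with a scalar sketch. It assumes that `nr` means the final division of the rational approximation is replaced by a reciprocal estimate refined with Newton-Raphson steps, as that suffix is used elsewhere in XNNPACK kernel names. The helper names `nr_step` and `div_nr` are hypothetical and do not come from this PR, and the caller-supplied starting estimate `r0` stands in for whatever hardware estimate the real kernels use.

```c
#include <math.h>

// Hypothetical helper: one Newton-Raphson refinement of r ~= 1/d,
// r' = r * (2 - d * r). Each step roughly squares the relative error.
static float nr_step(float d, float r) {
  return r * (2.0f - d * r);
}

// Hypothetical helper: approximate n / d from a rough reciprocal estimate r0
// by applying `steps` refinements and then multiplying by the numerator.
static float div_nr(float n, float d, float r0, int steps) {
  float r = r0;
  for (int i = 0; i < steps; i++) {
    r = nr_step(d, r);
  }
  return n * r;
}
```

For example, `div_nr(1.5f, 3.0f, 0.33f, 2)` lands much closer to `1.5f / 3.0f` than a single step does. The trade-off is a true divide instruction against a few multiply-adds, and which variant is faster depends on the target core.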

Tested on SpacemiT K1 CPU and K3 CPU, both VLEN=256.

Move f32-velu, f32-vgelu, f32-vapproxgelu entries from end of
PROD/NON_PROD lists to alphabetical positions to avoid merge conflicts
with other RVV PRs.
@copybara-service copybara-service Bot merged commit bbc68d9 into google:master Apr 29, 2026
3 checks passed