Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

neon mlal qs8 rsum use addw instead of mlal #6438

Merged
merged 1 commit into from
May 18, 2024
Merged

Conversation

copybara-service[bot]
Copy link

@copybara-service copybara-service bot commented May 18, 2024

neon mlal qs8 rsum use addw instead of mlal

  • remove vone which was being lengthened with mlal and then multiplied by the input
  • use sliced accumulators for 16 bit accumulation
  • rename functions from neon_mlal to neon_addw
  • add const to mask variable in remainder handler for addw and neondot microkernels

@copybara-service copybara-service bot force-pushed the test_634595235 branch 3 times, most recently from 385e9f9 to 26b51c7 Compare May 18, 2024 11:05
- remove vone which was being lengthened with mlal and then multiplied by the input
- use sliced accumulators for 16 bit accumulation
- rename functions from neon_mlal to neon_addw
- add const to mask variable in remainder handler for addw and neondot microkernels

PiperOrigin-RevId: 635029345
@copybara-service copybara-service bot merged commit fcb3669 into master May 18, 2024
@copybara-service copybara-service bot deleted the test_634595235 branch May 18, 2024 11:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant