Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

Closed
wants to merge 1 commit into from

Conversation

ajtulloch
Copy link
Contributor

Summary: While this is technically a reference implementation, I'm sure it'll show up somewhere and this at least gives a ~2-3x improvement in perf for parameters I looked at.

Differential Revision: D38186856

…nference kernel

Summary: While this is technically a reference implementation, I'm sure it'll show up somewhere and this at least gives a ~2-3x improvement in perf for parameters I looked at.

Differential Revision: D38186856

fbshipit-source-id: 91c11c19d1320be18e1a4be0f138a7c00f92bf3f
@netlify
Copy link

netlify bot commented Jul 27, 2022

Deploy Preview for eclectic-stroopwafel-199537 canceled.

Name Link
🔨 Latest commit 1523425
🔍 Latest deploy log https://app.netlify.com/sites/eclectic-stroopwafel-199537/deploys/62e08b6316b05e000805877f

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D38186856

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants