Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

ajtulloch · 2022-07-27T00:48:34Z

Summary: While this is technically a reference implementation, I'm sure it'll show up somewhere and this at least gives a ~2-3x improvement in perf for parameters I looked at.

Differential Revision: D38186856

…nference kernel Summary: While this is technically a reference implementation, I'm sure it'll show up somewhere and this at least gives a ~2-3x improvement in perf for parameters I looked at. Differential Revision: D38186856 fbshipit-source-id: 91c11c19d1320be18e1a4be0f138a7c00f92bf3f

netlify · 2022-07-27T00:48:41Z

✅ Deploy Preview for eclectic-stroopwafel-199537 canceled.

Name	Link
🔨 Latest commit	`1523425`
🔍 Latest deploy log	https://app.netlify.com/sites/eclectic-stroopwafel-199537/deploys/62e08b6316b05e000805877f

facebook-github-bot · 2022-07-27T00:49:16Z

This pull request was exported from Phabricator. Differential Revision: D38186856

facebook-github-bot added the cla signed label Jul 27, 2022

facebook-github-bot added the fb-exported label Jul 27, 2022

facebook-github-bot closed this in 2e14df6 Jul 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

ajtulloch commented Jul 27, 2022

netlify bot commented Jul 27, 2022 •

edited

Loading

facebook-github-bot commented Jul 27, 2022

Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

Use non-reference blocking parameters for fp32 weights in quantized inference kernel #1224

Conversation

ajtulloch commented Jul 27, 2022

netlify bot commented Jul 27, 2022 • edited Loading

✅ Deploy Preview for eclectic-stroopwafel-199537 canceled.

facebook-github-bot commented Jul 27, 2022

netlify bot commented Jul 27, 2022 •

edited

Loading