
Conversation

@mingzhe09088
Contributor

Summary:
This diff implements the Linear operation with fp16 weights based on FBGEMM. At a high level, we want to perform the following operation:
Y = X * W + B, with dtypes (Y, X, W, B) = (fp32, fp32, fp16, fp32)

To do that, three steps are needed (a usage sketch follows the list):

  1. Quantize the weights from fp32 to fp16; this is done with `PackedGemmMatrixFP16` inside the `fbgemm_pack_gemm_matrix_fp16` op.
  2. Run the matrix multiplication against the packed fp16 weights using `cblas_gemm_compute` inside the `fbgemm_linear_fp16_weight` op.
  3. Add the bias to the result of step 2 and return the final Y.
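
For illustration, here is a minimal Python sketch of how the two new ops are expected to fit together. It assumes a PyTorch build with FBGEMM enabled, and assumes the ops are exposed as `torch.fbgemm_pack_gemm_matrix_fp16(weight)` and `torch.fbgemm_linear_fp16_weight(input, packed_weight, bias)`; see the native_functions.yaml changes in this PR for the exact signatures.

```python
import torch

# Hypothetical shapes: batch of 4, in_features=8, out_features=16.
X = torch.randn(4, 8)    # fp32 activations
W = torch.randn(16, 8)   # fp32 weights (out_features x in_features)
B = torch.randn(16)      # fp32 bias

# Step 1: pack the fp32 weights into an fp16 PackedGemmMatrixFP16
# (returned as an opaque tensor handle). Assumed op name/signature.
packed_W = torch.fbgemm_pack_gemm_matrix_fp16(W)

# Steps 2-3: fp32 activations x fp16 packed weights via cblas_gemm_compute,
# then the fp32 bias is added.
Y = torch.fbgemm_linear_fp16_weight(X, packed_W, B)

# Reference: the same math with the weights rounded to fp16 up front.
Y_ref = torch.nn.functional.linear(X, W.half().float(), B)
print(torch.allclose(Y, Y_ref, atol=1e-2))
```

Up to the fp16 rounding of the weights, the result should match a plain fp32 `torch.nn.functional.linear` call.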

Differential Revision: D15921768

@pytorchbot pytorchbot added the module: operators, module: nn (Related to torch.nn), and module: internals (Related to internal abstractions in c10 and ATen) labels on Jun 20, 2019
Summary:
Pull Request resolved: #22023

This diff implements the Linear operation with fp16 weights based on FBGEMM. At a high level, we want to perform the following operation:
Y = X * W + B, with dtypes (Y, X, W, B) = (fp32, fp32, fp16, fp32)

To do that, three steps are needed (see the sketch after this list):
1. Quantize the weights from fp32 to fp16; this is done with `PackedGemmMatrixFP16` inside the `fbgemm_pack_gemm_matrix_fp16` op.
2. Run the matrix multiplication against the packed fp16 weights using `cblas_gemm_compute` inside the `fbgemm_linear_fp16_weight` op.
3. Add the bias to the result of step 2 and return the final Y.
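
For reference, the numerics implied by the (fp32, fp32, fp16, fp32) dtype combination can be emulated in a few lines of plain PyTorch (an illustration of the math only, not the FBGEMM code path):

```python
import torch

X = torch.randn(4, 8)    # fp32 activations
W = torch.randn(16, 8)   # fp32 weights, to be stored as fp16
B = torch.randn(16)      # fp32 bias

# Step 1: weights are rounded from fp32 to fp16 (what packing stores).
W_fp16 = W.to(torch.float16)

# Step 2: the GEMM uses the fp16 weights but accumulates in fp32.
Y = X @ W_fp16.float().t()

# Step 3: the fp32 bias is added to produce the final fp32 output Y.
Y = Y + B
```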

Reviewed By: jianyuh

Differential Revision: D15921768

fbshipit-source-id: f48ed23c4a446e6454b1334ede492b7efec45260
@facebook-github-bot
Contributor

This pull request has been merged in 573d9e6.

zdevito pushed a commit to zdevito/ATen that referenced this pull request on Jul 13, 2019
Summary:
Pull Request resolved: pytorch/pytorch#22023

This diff implements the Linear operation with fp16 weights based on FBGEMM. At a high level, we want to perform the following operation:
Y = X * W + B, with dtypes (Y, X, W, B) = (fp32, fp32, fp16, fp32)

To do that, three steps are needed:
1. Quantize the weights from fp32 to fp16; this is done with `PackedGemmMatrixFP16` inside the `fbgemm_pack_gemm_matrix_fp16` op.
2. Run the matrix multiplication against the packed fp16 weights using `cblas_gemm_compute` inside the `fbgemm_linear_fp16_weight` op.
3. Add the bias to the result of step 2 and return the final Y.

Reviewed By: jianyuh

Differential Revision: D15921768

fbshipit-source-id: dc4e5b366f846ce9d58975876940a9b3372b8b8d