[reland][quant] QuantizedCUDA implementation #36936
Conversation
Summary: Closes #30813
1. Tensor quantization logic (quantize_*) is moved to aten/native/quantized. Previously, all tensor quantization logic lived in aten/quantized/Quantizer.cpp, which had become complicated and hard to read; that should be addressed in a follow-up refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized.
2. The requirements to run CUDA_tensor_apply* were relaxed so it can process any tensor that lives on a CUDA device (QuantizedCUDA included).
3. All quantized data types now have a default constructor; NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them.
4. Minor changes in many files to register the QuantizedCUDA backend.
5. test_quantized_tensor is extended to cover the QuantizedCUDA backend where possible.
[ghstack-poisoned]
lg
@jerryzh168 Sorry, I haven't visited GitHub for some time. Unfortunately, I don't have Windows machines at hand. I can set up everything from scratch and investigate the problem, but that would take a day or so. Is it worth it?
I think I have fixed it, I'll land again.
@jerryzh168 merged this pull request in 97d3a84.
Unlanding. This appears to have broken CUDA 10 and, in some cases, CUDA 10.1 builds of PyTorch. While breaking CUDA 10 may be acceptable, how that's done should likely go through review, with notice provided to engineers still using CUDA 10 that they'll need to upgrade. The CUDA 10.1 issue is reported as blocking development by some engineers using devfairs. The cause and scope of this build issue are unclear at this time.
Summary: Closes #30813
Relanding of #35463
1. Tensor quantization logic (quantize_*) is moved to aten/native/quantized. Previously, all tensor quantization logic lived in aten/quantized/Quantizer.cpp, which had become complicated and hard to read; that should be addressed in a follow-up refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized.
2. The requirements to run CUDA_tensor_apply* were relaxed so it can process any tensor that lives on a CUDA device (QuantizedCUDA included).
3. All quantized data types now have a default constructor; NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them.
4. Minor changes in many files to register the QuantizedCUDA backend.
5. test_quantized_tensor is extended to cover the QuantizedCUDA backend where possible.
[ghstack-poisoned]
Summary: Pull Request resolved: #37081
Closes #30813
Relanding of #35463
1. Tensor quantization logic (quantize_*) is moved to aten/native/quantized. Previously, all tensor quantization logic lived in aten/quantized/Quantizer.cpp, which had become complicated and hard to read; that should be addressed in a follow-up refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized.
2. The requirements to run CUDA_tensor_apply* were relaxed so it can process any tensor that lives on a CUDA device (QuantizedCUDA included).
3. All quantized data types now have a default constructor; NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them.
4. Minor changes in many files to register the QuantizedCUDA backend.
5. test_quantized_tensor is extended to cover the QuantizedCUDA backend where possible.
Test Plan: Imported from OSS
Differential Revision: D21206694
Pulled By: jerryzh168
fbshipit-source-id: c7433aad9c095a34c57e6dddd128b5c5d9292373
Stack from ghstack:
Summary:
Closes #30813
Relanding of #35463
Differential Revision: D21143025