Hi, I appreciate the code you provided, but could you tell me what method you used to show the model size after quantization? I couldn't find the corresponding part in your code. Thanks.
The quantization I used falls under fake quantization, meaning the 2/3/4-bit values are still stored inside 32-bit float torch tensors. Because of that, there is no real way to save the model at its quantized size. What I (and many others) do in this scenario is estimate the size: count the parameters, multiply by the target bit-width, and treat that as the size the model would have if sub-8-bit quantization were fully supported by ML frameworks. This estimate is more or less accurate.
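A minimal sketch of that estimate, assuming PyTorch and that only weight tensors are quantized while biases stay in full precision (the function name `estimate_quantized_size` and the bit-width choices are illustrative, not part of this repo's code):

```python
import torch.nn as nn

def estimate_quantized_size(model: nn.Module, weight_bits: int = 4, fp_bits: int = 32) -> float:
    """Estimate the size in MiB the model would occupy if weight tensors
    were actually stored at `weight_bits` bits instead of 32-bit floats."""
    total_bits = 0
    for name, param in model.named_parameters():
        if "weight" in name:
            # Assume weight tensors are quantized to `weight_bits`.
            total_bits += param.numel() * weight_bits
        else:
            # Assume biases and other parameters stay in full precision.
            total_bits += param.numel() * fp_bits
    return total_bits / 8 / 1024**2  # bits -> bytes -> MiB

# Example: compare the fp32 size with a 4-bit estimate for a toy model.
model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))
print(f"fp32 size:      {estimate_quantized_size(model, weight_bits=32):.2f} MiB")
print(f"4-bit estimate: {estimate_quantized_size(model, weight_bits=4):.2f} MiB")
```

You can refine this by also accounting for the per-tensor or per-channel scale/zero-point values the quantizer would need to store, but for sub-8-bit weights those are usually a negligible fraction of the total.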