Different quantization bit configurations for different layers #731

Open

mengna0707 opened this issue Nov 25, 2021 · 1 comment

@mengna0707

Hello, I would like to ask whether larq supports 8-bit quantization for the first and last layers, and binary quantization for the middle layers.

I have learned that TensorFlow's QAT toolkit supports 8-bit quantization-aware training. Can larq be used in combination with this toolkit?

@mengna0707 changed the title from "Combined with TensorFlow's QAT toolkit" to "Different quantization bit configurations for different layers" on Nov 25, 2021
@CNugteren
Contributor

Yes, that is supported. In fact, in many cases it is even recommended to include some int8 layers; typically your first layer can remain int8 for better accuracy. Here's an example where only the weights are quantized in the first layer, not the activations: https://docs.larq.dev/larq/tutorials/mnist/#create-the-model. In this example, you can replace the kernel_quantizer argument in the first layer with any quantizer adhering to the lq.quantizers.Quantizer abstract class (or just leave the argument out). You can also mix normal tf.keras layers into your network; you are not restricted to using only lq layers.
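
Not part of the original reply, but here is a minimal sketch of what such a mixed-precision model might look like, loosely following the structure of the linked MNIST tutorial. The first and last layers are plain tf.keras layers (so they are left at higher precision, e.g. int8 after a later TFLite conversion), while the middle layers are binarized larq layers. The layer sizes and the 28x28x1 input shape are illustrative assumptions, not taken from the thread:

```python
import tensorflow as tf
import larq as lq

# Shared kwargs for the binarized (middle) layers: 1-bit weights and activations.
binary_kwargs = dict(
    input_quantizer="ste_sign",
    kernel_quantizer="ste_sign",
    kernel_constraint="weight_clip",
    use_bias=False,
)

model = tf.keras.models.Sequential([
    # First layer: plain Keras conv, weights and activations are not binarized.
    tf.keras.layers.Conv2D(32, (3, 3), use_bias=False, input_shape=(28, 28, 1)),
    tf.keras.layers.BatchNormalization(scale=False),

    # Middle layers: fully binarized larq layers.
    lq.layers.QuantConv2D(64, (3, 3), **binary_kwargs),
    tf.keras.layers.MaxPooling2D((2, 2)),
    tf.keras.layers.BatchNormalization(scale=False),

    lq.layers.QuantConv2D(64, (3, 3), **binary_kwargs),
    tf.keras.layers.BatchNormalization(scale=False),
    tf.keras.layers.Flatten(),

    # Last layer: plain Keras dense, kept at higher precision.
    tf.keras.layers.Dense(10),
])

# Prints a per-layer overview including the bit width of each layer.
lq.models.summary(model)
```

The lq.models.summary output makes it easy to verify which layers ended up binarized and which remained full precision.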
