Calibration dataset sample size and its sequence length are always fixed. Why? #191

Sakusakumura · 2023-11-14T09:37:18Z

I've noticed in the implementation of AwqQuantizer that the number of samples and sequence length for the calibration dataset are fixed at 128 and 512, respectively. Is this a deliberate choice due to certain constraints or optimizations? I believe that making these values variable may have been overlooked when the change "Allow user to use custom calibration data for quantization #27" enabled the use of custom datasets.

AutoAWQ/awq/quantize/quantizer.py

Line 27 in 299c460

self.modules, self.module_kwargs, self.inps = self.init_quant()

AutoAWQ/awq/quantize/quantizer.py

Line 293 in 299c460

def init_quant(self, n_samples=128, seqlen=512):

casper-hansen · 2023-11-14T09:54:04Z

This is how the original AWQ was implemented. I allowed custom datasets but I can see there are some restrictions now that I should look into

Sakusakumura · 2023-11-14T10:06:21Z

Understood. Thank you!

casper-hansen closed this as completed Nov 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calibration dataset sample size and its sequence length are always fixed. Why? #191

Calibration dataset sample size and its sequence length are always fixed. Why? #191

Sakusakumura commented Nov 14, 2023

casper-hansen commented Nov 14, 2023

Sakusakumura commented Nov 14, 2023

Calibration dataset sample size and its sequence length are always fixed. Why? #191

Calibration dataset sample size and its sequence length are always fixed. Why? #191

Comments

Sakusakumura commented Nov 14, 2023

casper-hansen commented Nov 14, 2023

Sakusakumura commented Nov 14, 2023