Hello,
I tried to quantize an onnx model by int8 calibration. I used class BatchStream and Int8EntropyCalibrator2 and tried calibration data format of .batch and .ppm. In both cases, getBatchSize(), readCalibrationCache() and writeCalibrationCache() was called but getBatch() was not. So I can't get a right calibration table.
Could you please tell me what is probably going wrong? Thank you.
Hello,
I tried to quantize an onnx model by int8 calibration. I used class BatchStream and Int8EntropyCalibrator2 and tried calibration data format of .batch and .ppm. In both cases, getBatchSize(), readCalibrationCache() and writeCalibrationCache() was called but getBatch() was not. So I can't get a right calibration table.
Could you please tell me what is probably going wrong? Thank you.