[ONNX] Quantization documentation outdated

The documentation [here](https://github.com/llvm/torch-mlir/blob/4e2d0fd0fda51797f23a9fb50e8e1c5cedd5ffe5/docs/importers/onnx_importer.md?plain=1#L107) states the following:

> Quantization parameters are carried out of line in the ONNX protobuf
and will be repatriated upon import to torch. The exact mechanism is
not yet implemented.

This is outdated. The following simple conv2d in onnx:

![Image](https://github.com/user-attachments/assets/dc23c0ab-b395-4bb4-b0dd-b7a81d844b8d)

contains the quantization parameters in line. There are already test cases in the torch-mlir repo (e.g. [here](https://github.com/llvm/torch-mlir/pull/3917/files#diff-b584b152020af6d2e5dbf62a08b2f25ed5afc2c299228383b9651d22d44b5af4R144)) that can translate this successfully to torch. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ONNX] Quantization documentation outdated #4188

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[ONNX] Quantization documentation outdated #4188

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions