Update VitisAIQuantization to use Quark #1715

vortex-captain · 2025-03-27T03:22:35Z

Description

Example usage in Olive workflow json:

  "passes": {
    "conversion": {
      "device": "cpu",
      "type": "OnnxConversion",
      "target_opset": 17,
      "use_dynamo_exporter": false
    },
    "to_fixed_shape": {
      "type": "DynamicToFixedShape",
      "dim_param": ["batch_size", "sequence_length"],
      "dim_value": [1, 77]
    },
    "quantization": {
      "type": "QuarkQuantization",
      "data_config": "calib_data",
      "config_template": "XINT8",
      "enable_npu_transformer": true,
      "extra_options": {
        "OpTypesToExcludeOutputQuantization": ["MatMul", "Gemm"],
        "ActivationSymmetric": true
      },
      "debug_mode": true,
      "log_severity_level": 0,
      "ignore_warnings": false
    }
  }

Please refer to https://quark.docs.amd.com/latest/onnx/user_guide_config_description.html for the complete list of config_template options. All the other quantization options are listed in https://quark.docs.amd.com/latest/onnx/appendix_full_quant_config_features.html .

Examples

2 ResNet examples are added to examples/vai, which convert the models using Quark then evaluate on VitisAIExecutionProvider (run on NPU, RyzenAI 1.3.1, onnxruntime-vitisai 1.19).

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link

…sformer for VAI

olive/passes/onnx/vitis_ai_quantization.py

github-advanced-security

lintrunner found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

olive/passes/onnx/vitis_ai_quantization.py

olive/common/ort_inference.py

ChaoLi-AMD · 2025-04-01T04:58:55Z

Describe your changes

Example usage in Olive workflow json:
  "passes": {
    "conversion": {
      "device": "cpu",
      "type": "OnnxConversion",
      "target_opset": 17,
      "use_dynamo_exporter": false
    },
    "quantization": {
      "type": "VitisAIQuantization",
      "data_config": "calib_data",
      "config_template": "INT8_TRANSFORMER_ACCURATE",
      "extra_options": {
        "OpTypesToExcludeOutputQuantization": ["MatMul", "Gemm"],
        "ActivationSymmetric": true
      },
      "debug_mode": true,
      "log_severity_level": 0,
      "ignore_warnings": false
    }
  }
Checklist before requesting a review

Add unit tests for this change.

Make sure all tests can pass.

Update documents if necessary.

Lint and apply fixes to your code by running lintrunner -a

Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link

please improve the example to refer to Quark documentation: https://quark.docs.amd.com/latest/supported_accelerators/ryzenai/index.html

olive/passes/onnx/vitis_ai_quantization.py

xiaoyu-work · 2025-04-01T20:10:06Z

/azp run

azure-pipelines · 2025-04-01T20:10:25Z

Azure Pipelines successfully started running 1 pipeline(s).

olive/passes/onnx/vitis_ai_quantization.py

xiaoyu-work · 2025-04-07T19:48:58Z

olive/cache.py

@@ -40,6 +40,7 @@ class CacheSubDirs:
    evaluations: Path
    resources: Path
    mlflow: Path
+    vitis_ai: Path


Can you explain more about how will you use this folder? The cache folder is designed to be pass-agnostic so i want to double confirm the use case here.

The folder will be created at the beginning of the evaluation step, upon the creation of a VitisAIExecutionProvider inference session (used as model cache by EP). Is evaluation considered an Olive pass?

No, if VitisAIEP will need to cache a model for evaluation, can we create a temporal folder for it? (and it will be deleted after all. I assume this model cache is not needed when the workflow finish.). We can create a temporary folder in cache.evaluations like temp_model_cache or something.

shaahji · 2025-04-09T17:12:02Z

Update the entry in olive_config.json to point to the correct location of the pass implementation in the module. Many of the tests are failing because of the wrong entry.

vortex-captain · 2025-04-17T05:59:42Z

Describe your changes

Example usage in Olive workflow json:
  "passes": {
    "conversion": {
      "device": "cpu",
      "type": "OnnxConversion",
      "target_opset": 17,
      "use_dynamo_exporter": false
    },
    "quantization": {
      "type": "VitisAIQuantization",
      "data_config": "calib_data",
      "config_template": "INT8_TRANSFORMER_ACCURATE",
      "extra_options": {
        "OpTypesToExcludeOutputQuantization": ["MatMul", "Gemm"],
        "ActivationSymmetric": true
      },
      "debug_mode": true,
      "log_severity_level": 0,
      "ignore_warnings": false
    }
  }
Checklist before requesting a review

Add unit tests for this change.

Make sure all tests can pass.

Update documents if necessary.

Lint and apply fixes to your code by running lintrunner -a

Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link
please improve the example to refer to Quark documentation: https://quark.docs.amd.com/latest/supported_accelerators/ryzenai/index.html

Added links of Quark documentation on quantization configurations

olive/passes/onnx/vitis_ai_quantization.py

ChaoLi-AMD · 2025-04-17T12:45:31Z

Describe your changes

Example usage in Olive workflow json:
  "passes": {
    "conversion": {
      "device": "cpu",
      "type": "OnnxConversion",
      "target_opset": 17,
      "use_dynamo_exporter": false
    },
    "quantization": {
      "type": "VitisAIQuantization",
      "data_config": "calib_data",
      "config_template": "INT8_TRANSFORMER_ACCURATE",
      "extra_options": {
        "OpTypesToExcludeOutputQuantization": ["MatMul", "Gemm"],
        "ActivationSymmetric": true
      },
      "debug_mode": true,
      "log_severity_level": 0,
      "ignore_warnings": false
    }
  }
Checklist before requesting a review

Add unit tests for this change.

Make sure all tests can pass.

Update documents if necessary.

Lint and apply fixes to your code by running lintrunner -a

Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link
please improve the example to refer to Quark documentation: https://quark.docs.amd.com/latest/supported_accelerators/ryzenai/index.html
Added links of Quark documentation on quantization configurations

For a Ryzen AI example, please use XINT8 as the example instead of INT8_TRANSFORMER_ACCURATE. Just checking — does this example currently runnable on Olive?

olive/passes/onnx/vitis_ai/__init__.py

jambayk · 2025-04-17T17:10:32Z

@vortex-captain please create a copy of your branch directly in this repo and open a new PR to be able to run the CI without the login issue.

vortex-captain · 2025-04-18T02:19:34Z

Describe your changes

Example usage in Olive workflow json:
  "passes": {
    "conversion": {
      "device": "cpu",
      "type": "OnnxConversion",
      "target_opset": 17,
      "use_dynamo_exporter": false
    },
    "quantization": {
      "type": "VitisAIQuantization",
      "data_config": "calib_data",
      "config_template": "INT8_TRANSFORMER_ACCURATE",
      "extra_options": {
        "OpTypesToExcludeOutputQuantization": ["MatMul", "Gemm"],
        "ActivationSymmetric": true
      },
      "debug_mode": true,
      "log_severity_level": 0,
      "ignore_warnings": false
    }
  }
Checklist before requesting a review

Add unit tests for this change.

Make sure all tests can pass.

Update documents if necessary.

Lint and apply fixes to your code by running lintrunner -a

Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link
please improve the example to refer to Quark documentation: https://quark.docs.amd.com/latest/supported_accelerators/ryzenai/index.html
Added links of Quark documentation on quantization configurations
For a Ryzen AI example, please use XINT8 as the example instead of INT8_TRANSFORMER_ACCURATE. Just checking — does this example currently runnable on Olive?

Updated example in description. And yes, such an example (BERT text model) is runnable on Olive, but in evaluation, the output model cannot run on NPU (all nodes assigned to CPU), unlike the ResNet examples. Any insights?

olive/passes/onnx/quark_quantization.py

examples/vai/image.py

examples/vai/ms_resnet_50_vitis_ai_ptq_npu.json

olive/passes/onnx/quark_quantization.py

docs/source/features/quantization.md

docs/source/reference/options.md

docs/source/features/quantization.md

examples/resnet/resnet_vitis_ai_ptq_cpu.json

examples/vai/image.py

olive/passes/onnx/quark_quantization.py

examples/vai/image.py

olive/passes/onnx/quark_quantization.py

examples/resnet/image.py

examples/resnet/resnet_50_vitis_ai_ptq_npu.json

examples/resnet/resnet_vitis_ai_ptq_npu.json

VishalX · 2025-05-07T03:34:43Z

olive/common/ort_inference.py

+        elif provider == "VitisAIExecutionProvider":
+            import os
+
+            apu_type = get_vai_apu_type()
+            set_vai_environment_variable(apu_type)
+            install_dir = Path(os.environ["RYZEN_AI_INSTALLATION_PATH"])
+            provider_options[idx]["config_file"] = str(install_dir / "voe-4.0-win_amd64" / "vaip_config.json")


This is adding dependency to a specific version. I don't think we should add this here.

Yi Ren added 2 commits March 27, 2025 11:16

replace deprecated enable_dpu with enable_npu_cnn and enable_npu_tran…

56b873d

…sformer for VAI

use quark in VitisAIQuantization; expose params

469e225

github-advanced-security bot found potential problems Mar 27, 2025

View reviewed changes

configure VitisAIExecutionProvider

1646eb7

vortex-captain marked this pull request as ready for review March 27, 2025 05:56

github-advanced-security bot found potential problems Mar 27, 2025

View reviewed changes

fix linter issues

32f22ae

jambayk reviewed Mar 27, 2025

View reviewed changes

olive/passes/onnx/vitis_ai_quantization.py Show resolved Hide resolved

jambayk reviewed Mar 27, 2025

View reviewed changes

olive/common/ort_inference.py Show resolved Hide resolved

jambayk reviewed Mar 27, 2025

View reviewed changes

olive/common/ort_inference.py Outdated Show resolved Hide resolved

jambayk reviewed Mar 27, 2025

View reviewed changes

olive/common/ort_inference.py Outdated Show resolved Hide resolved

use pathlib.Path; fix linter issues

e5cb11d

github-advanced-security bot found potential problems Mar 28, 2025

View reviewed changes

Yi Ren added 3 commits March 28, 2025 10:41

remove old vitis_ai code

c569355

use Olive cache for VitisAI EP

c4d1543

fix linter

9c36257

github-advanced-security bot found potential problems Mar 28, 2025

View reviewed changes

olive/common/ort_inference.py Fixed Show fixed Hide fixed

olive/common/ort_inference.py Fixed Show fixed Hide fixed

olive/common/ort_inference.py Fixed Show fixed Hide fixed

olive/common/ort_inference.py Fixed Show fixed Hide fixed

fix linter

9844278

vortex-captain requested a review from jambayk March 28, 2025 03:55

ChaoLi-AMD reviewed Apr 1, 2025

View reviewed changes

olive/passes/onnx/vitis_ai_quantization.py Show resolved Hide resolved

xiaoyu-work reviewed Apr 7, 2025

View reviewed changes

olive/passes/onnx/vitis_ai_quantization.py Outdated Show resolved Hide resolved

xiaoyu-work reviewed Apr 7, 2025

View reviewed changes

Yi Ren added 3 commits April 17, 2025 13:14

Merge branch 'main' into reny/add_quark

36c5ea9

use optional as input types

b7fc958

vai_q_onnx -> quark

c426989

vortex-captain requested review from shaahji and ChaoLi-AMD and removed request for ChaoLi-AMD April 17, 2025 07:31

ChaoLi-AMD reviewed Apr 17, 2025

View reviewed changes

olive/passes/onnx/vitis_ai_quantization.py Outdated Show resolved Hide resolved

jambayk reviewed Apr 17, 2025

View reviewed changes

olive/passes/onnx/vitis_ai/__init__.py Outdated Show resolved Hide resolved

xiaoyu-work mentioned this pull request Apr 17, 2025

Copy of #1715 #1763

Closed

6 tasks

remove examples/resnet/resnet_vitis_ai_ptq_cpu.json

40ad7d5

Yi Ren added 3 commits April 22, 2025 17:55

vitisai -> Quark

498b59a

rename jsons

7c62a0c

Merge branch 'main' into reny/add_quark

6645663

github-advanced-security bot found potential problems Apr 22, 2025

View reviewed changes

olive/passes/onnx/quark_quantization.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Apr 22, 2025

View reviewed changes

ChaoLi-AMD reviewed Apr 22, 2025

View reviewed changes

docs/source/features/quantization.md Outdated Show resolved Hide resolved

ChaoLi-AMD reviewed Apr 22, 2025

View reviewed changes

docs/source/reference/options.md Outdated Show resolved Hide resolved

gengxinwu reviewed Apr 23, 2025

View reviewed changes

docs/source/features/quantization.md Show resolved Hide resolved

examples/resnet/resnet_vitis_ai_ptq_cpu.json Outdated Show resolved Hide resolved

examples/vai/image.py Outdated Show resolved Hide resolved

olive/passes/onnx/quark_quantization.py Show resolved Hide resolved

chinazhangchao and others added 2 commits April 27, 2025 10:20

Merge branch 'microsoft:main' into reny/add_quark

40541c8

fix lint

d1ebcca

github-advanced-security bot found potential problems Apr 27, 2025

View reviewed changes

examples/vai/image.py Fixed Show fixed Hide fixed

olive/passes/onnx/quark_quantization.py Fixed Show fixed Hide fixed

fix comments

42c8620

github-advanced-security bot found potential problems Apr 27, 2025

View reviewed changes

examples/resnet/image.py Fixed Show fixed Hide fixed

examples/resnet/image.py Fixed Show fixed Hide fixed

examples/resnet/resnet_50_vitis_ai_ptq_npu.json Fixed Show fixed Hide fixed

fix comments

9763389

github-advanced-security bot found potential problems Apr 27, 2025

View reviewed changes

examples/resnet/resnet_vitis_ai_ptq_npu.json Fixed Show fixed Hide fixed

chinazhangchao and others added 4 commits April 27, 2025 17:49

fix cache folder comments

042c553

remove model cache

a80f9e1

fix lint

0a4b9ab

remove ununsed test

1b5d4d9

VishalX suggested changes May 7, 2025

View reviewed changes

Update VitisAIQuantization to use Quark #1715

Are you sure you want to change the base?

Update VitisAIQuantization to use Quark #1715

Uh oh!

Conversation

vortex-captain commented Mar 27, 2025 • edited by chinazhangchao Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Examples

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-advanced-security bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ChaoLi-AMD commented Apr 1, 2025

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

xiaoyu-work commented Apr 1, 2025

Uh oh!

azure-pipelines bot commented Apr 1, 2025

Uh oh!

Uh oh!

xiaoyu-work Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

vortex-captain Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

xiaoyu-work Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

shaahji commented Apr 9, 2025

Uh oh!

vortex-captain commented Apr 17, 2025

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

ChaoLi-AMD commented Apr 17, 2025

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

jambayk commented Apr 17, 2025

Uh oh!

vortex-captain commented Apr 18, 2025

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vortex-captain commented Mar 27, 2025 •

edited by chinazhangchao

Loading

VishalX May 7, 2025 •

edited

Loading