Skip to content

fix test_evaluation and AutoTuner new issue#1417

Merged
xin3he merged 3 commits intomainfrom
xinhe/2-9
Feb 9, 2026
Merged

fix test_evaluation and AutoTuner new issue#1417
xin3he merged 3 commits intomainfrom
xinhe/2-9

Conversation

@xin3he
Copy link
Contributor

@xin3he xin3he commented Feb 9, 2026

Description

Please briefly describe your main changes, the motivation.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #1227

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

root and others added 2 commits February 9, 2026 02:16
Copilot AI review requested due to automatic review settings February 9, 2026 03:05
@xin3he xin3he mentioned this pull request Feb 9, 2026
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adjusts CUDA evaluation tests and centralizes a Triton Autotuner compatibility patch to address issues seen during evaluation and autotuning.

Changes:

  • Set NCCL_ASYNC_ERROR_HANDLING=1 in CUDA evaluation tests that use vLLM.
  • Remove per-module Autotuner patching from Triton dequant modules.
  • Add a centralized Autotuner compatibility patch in auto_round_extension.triton package init.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File Description
test/test_cuda/advanced/test_evaluation.py Adds NCCL async error handling env var to stabilize vLLM-based eval runs.
auto_round_extension/triton/triton_utils_zp/dequant.py Removes inline Autotuner patching side effect from the dequant module.
auto_round_extension/triton/triton_utils/dequant.py Removes inline Autotuner patching side effect from the dequant module.
auto_round_extension/triton/init.py Introduces centralized Autotuner patching on package import for older Triton versions.

Signed-off-by: He, Xin3 <xin3.he@intel.com>
@xin3he xin3he merged commit 520214a into main Feb 9, 2026
29 checks passed
@xin3he xin3he deleted the xinhe/2-9 branch February 9, 2026 05:04
@chensuyue chensuyue added this to the 0.10.0 milestone Feb 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants