Skip to content

Update gaudi-docker to v1.24.0 & fix CUDA UT#1708

Merged
chensuyue merged 2 commits into
mainfrom
xuehao/habana
Apr 21, 2026
Merged

Update gaudi-docker to v1.24.0 & fix CUDA UT#1708
chensuyue merged 2 commits into
mainfrom
xuehao/habana

Conversation

@XuehaoSun
Copy link
Copy Markdown
Contributor

@XuehaoSun XuehaoSun commented Apr 20, 2026

Description

This pull request updates dependencies and improves test script robustness in the CI pipeline. The most significant changes are upgrades to key package versions, improved handling of test duration extraction, and changes to how pre-release dependencies are installed.

Dependency and environment updates:

  • Upgraded gptqmodel and its requirements from version 5.8.0 to 6.0.3 in .azure-pipelines/scripts/cuda_unit_test/run_cuda_ut.sh to use the latest release from the ModelCloud/GPTQModel repository.
  • Updated the sglang dependency in test/test_cuda/requirements_sglang.txt from a direct GitHub reference to a PyPI version requirement (sglang >= 0.5.10).
  • Updated the Habana docker image in .azure-pipelines/template/docker-template.yml from 1.23.0/pytorch-installer-2.9.0 to 1.24.0/pytorch-installer-2.10.0 for both pull and run steps. [1] [2]

Testing and installation improvements:

  • Added the --prerelease=allow flag to uv pip install commands for requirements_sglang.txt in both CUDA unit test scripts to allow installation of pre-release package versions. [1] [2]

Test result parsing:

  • Improved duration extraction in .azure-pipelines/scripts/ut/collect_result.py to use the last matched time pattern in the test output, ensuring the most accurate test duration is reported.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com>
Copilot AI review requested due to automatic review settings April 20, 2026 07:29
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the CI pipeline dependencies used for Habana Gaudi and CUDA unit test runs, and improves duration parsing for test log summaries.

Changes:

  • Bump Habana Gaudi Docker image reference from gaudi-docker/1.23.0 (PyTorch installer 2.9.0) to gaudi-docker/1.24.0 (PyTorch installer 2.10.0) in the Azure Pipelines docker template.
  • Adjust log duration extraction to use the last reported in <seconds>s occurrence in log tail content.
  • Update CUDA unit test dependency install to GPTQModel v6.0.3 (wheel + requirements).

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File Description
.azure-pipelines/template/docker-template.yml Updates the pulled/run Habana Gaudi container image version used in CI.
.azure-pipelines/scripts/ut/collect_result.py Improves duration parsing robustness by selecting the last time match in logs.
.azure-pipelines/scripts/cuda_unit_test/run_cuda_ut.sh Bumps GPTQModel dependency version used in CUDA unit test setup.

Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com>
@XuehaoSun
Copy link
Copy Markdown
Contributor Author

/azp run Unit-Test-CUDA-AutoRound

@azure-pipelines
Copy link
Copy Markdown

Azure Pipelines successfully started running 1 pipeline(s).

@XuehaoSun XuehaoSun changed the title Update gaudi-docker to v1.24.0 Update gaudi-docker to v1.24.0 & fix CUDA UT Apr 21, 2026
@XuehaoSun XuehaoSun requested a review from chensuyue April 21, 2026 02:49
@chensuyue chensuyue merged commit 8d78450 into main Apr 21, 2026
26 checks passed
@chensuyue chensuyue deleted the xuehao/habana branch April 21, 2026 02:55
lvliang-intel pushed a commit that referenced this pull request May 12, 2026
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants