[AI Generated] BugFix: Fix verify_sriov_reload_modules failure on HPC images with broken mana_ib#4449

Merged
LiliDeng merged 1 commit into microsoft:main from Gnandeep99:bugfix/sriov-reload-skip-unloaded-modules_290426_113910
May 11, 2026
@Gnandeep99
Collaborator

Summary

Fix verify_sriov_reload_modules failing on HPC images where mana_ib.ko exists but cannot load due to kernel symbol mismatches. Changed unload_module() to use is_module_loaded() instead of module_exists() so only actually-loaded modules are included in the unload/reload list. Also fixed module_name_list not being reset per node.

Validation Results

| Image | Result |
| --- | --- |
| almalinux almalinux-hpc 9-hpc-gen2 9.7.2026010601 | PASSED |

… images with broken mana_ib

unload_module() used module_exists() (modprobe --dry-run) to filter the
driver list before unloading. On AlmaLinux 9 HPC, mana_ib.ko exists on
disk but was never loaded at boot due to symbol version mismatches with
IB core. The test then tried to reload mana_ib, failing with EINVAL.

Change the filter from module_exists() to is_module_loaded() so only
modules that are actually loaded get unloaded and subsequently reloaded.
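
The difference between the two filters can be sketched as follows. This is a minimal, self-contained illustration and not the LISA implementation: `parse_loaded_modules` and `select_modules_to_reload` are hypothetical helpers, and the sample text only imitates the `/proc/modules` layout, where the module name is the first field on each line.

```python
from typing import Iterable, List, Set


def parse_loaded_modules(proc_modules_text: str) -> Set[str]:
    """Collect module names from /proc/modules-style text.

    Hypothetical helper: the module name is the first whitespace-separated
    field on each line.
    """
    loaded: Set[str] = set()
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields:
            loaded.add(fields[0])
    return loaded


def select_modules_to_reload(candidates: Iterable[str], loaded: Set[str]) -> List[str]:
    """Keep only candidates that are currently loaded, preserving order."""
    return [name for name in candidates if name in loaded]


# Illustrative sample: mana_ib is absent because it never loaded at boot,
# even though its .ko file may exist on disk.
SAMPLE_PROC_MODULES = """\
mana 180224 0 - Live 0x0000000000000000
hv_netvsc 122880 0 - Live 0x0000000000000000
"""

candidates = ["mana", "mana_ib", "hv_netvsc"]
to_reload = select_modules_to_reload(
    candidates, parse_loaded_modules(SAMPLE_PROC_MODULES)
)
print(to_reload)  # ['mana', 'hv_netvsc']
```

A filter based on whether the module file exists would have kept `mana_ib` in the list, and the later reload attempt is what fails on the affected image.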

Also fix a secondary bug: module_name_list was declared outside the node
loop, causing all nodes to share the same growing list reference.
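
A small sketch of the secondary bug, under stated assumptions: the function names, node names, and module lists below are hypothetical and only show how a list created outside the loop ends up shared by every node, while a list created inside the loop does not.

```python
from typing import Dict, List


def collect_shared(nodes: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Buggy pattern: one list is created before the loop and reused."""
    module_name_list: List[str] = []
    per_node: Dict[str, List[str]] = {}
    for node, modules in nodes.items():
        module_name_list.extend(modules)
        per_node[node] = module_name_list  # every node points at the same list
    return per_node


def collect_per_node(nodes: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Fixed pattern: a fresh list is created for each node."""
    per_node: Dict[str, List[str]] = {}
    for node, modules in nodes.items():
        module_name_list: List[str] = []  # reset on every iteration
        module_name_list.extend(modules)
        per_node[node] = module_name_list
    return per_node


# Illustrative input only.
nodes = {"node0": ["mana"], "node1": ["mlx5_core"]}

shared = collect_shared(nodes)
fresh = collect_per_node(nodes)

print(shared["node0"])  # ['mana', 'mlx5_core']
print(fresh["node0"])   # ['mana']
```

In the shared variant, the first node's entry also contains modules that were only found on later nodes, which is the growing-list behavior described above.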
@Gnandeep99 Gnandeep99 requested a review from LiliDeng as a code owner April 29, 2026 18:40
Copilot AI review requested due to automatic review settings April 29, 2026 18:40
Contributor

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@LiliDeng LiliDeng merged commit 0c4914f into microsoft:main May 11, 2026
34 checks passed