You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Due to a known error introduced in kernel 5.17
some instances that use MSI IRQ can have the list
of IRQs empty in sysfs which makes the tuner fail
and make hwloc seg fault (impossible to tune).
This change only prints a warning instead of an
error and links to the issue where explains
what is the problem and the upstream reports.
Fixesredpanda-data#10838
Version & Environment
Redpanda version: 22.3.1 (and others)
What went wrong?
disk_irq
tuner may fail with a message like so:What should have happened instead?
The problem as pointed out by @r-vasquez is that the
msi_irqs
file is empty for this PCI-E device despite using msi IRQs:This causes us to pass 0 to hwloc-distrib:
which crashes with an assert failure.
How to reproduce the issue?
As above.
Additional information
This seems to be a bug introduced somewhere in kernel version 5.16 to 5.19 as described here:
amzn/amzn-drivers#268
It is patched upstream and will presumably show up in some later kernel:
amzn/amzn-drivers#268 (comment)
In the meantime we could just fail the irq_tuner when we detect this with a better error message?
The text was updated successfully, but these errors were encountered: