v0.5.48
What's Changed
- fix(gpu-reporter): bypass cache when getting pod metadata by @luomingmeng in #1143
- fix: correct 17 klog format-string bugs across the codebase by @pierluigilenoci in #1134
- refactor(irq-tuning): handle known failure of GetNicQueue2Irq by @zhanghaoyu1986 in #1122
- feat(gpu): ignore machine state of pod if generator doesn't exist by @JustinChengLZ in #1153
- feat: implement hugepages resource allocation by @JustinChengLZ in #1084
- fix: adjust hints to avoid affinity error if not restricted by @junyu-peng in #1156
- feat(device affinity): generate priority dimensions from calculating device dimensions by @JustinChengLZ in #1146
- feat(sysadvisor): add disk metrics by @syc4704413 in #1154
- refactor(memory-qrm): refactor huge page adjustment for host watermark and test by @luomingmeng in #1161
- feat(eviction): add system CPU pressure eviction plugin by @luomingmeng in #1141
- feat(qrm): add cpu threshold check for shared_cores with numabinding GetTopologyHint by @liunxaa in #1155
- fix(qrm): qrm-mb-plugin exits start if resctrl FS not mounted yet by @h-w-chen in #1164
- chore: watch events of cgroup file CREATE and REMOVE by @luomingmeng in #1168
- metrics(qrm): mbm throttle metrics by @h-w-chen in #1159
- fix(sysadvisor): fix mbm calculation logic by @syc4704413 in #1171
- fix(gpu): fix reporter to trigger periodically and filter out inactive pods by @JustinChengLZ in #1173
New Contributors
- @pierluigilenoci made their first contribution in #1134
- @liunxaa made their first contribution in #1155
Full Changelog: v0.5.47...v0.5.48