Main branch v2025-09-29
What's Changed
- Improve Slurm monitoring dashboards by @theyoprst in #531
- Merge to main: Improve Slurm monitoring dashboards by @github-actions[bot] in #533
- Add "All Jobs by User" and "All Jobs by State" graphs by @theyoprst in #534
- Merge to main: Add "All Jobs by User" and "All Jobs by State" graphs by @github-actions[bot] in #536
- Support DCGM Exporter on driverful by @ali-sattari in #535
- Merge to main: Support DCGM Exporter on driverful by @github-actions[bot] in #537
- Fix variable reference by @ali-sattari in #538
- Merge to main: Fix variable reference by @github-actions[bot] in #539
- Ubuntu 24 by default for driver-less and driver-full by @asteny in #540
- Merge to main: Ubuntu 24 by default for driver-less and driver-full by @github-actions[bot] in #542
- parametrize ssh check by @itechdima in #543
- Merge to main: parametrize ssh check by @github-actions[bot] in #544
- DCGM Exporter fix for toolkit validation by @ali-sattari in #541
- Merge to main: DCGM Exporter does not need to know about drivers by @github-actions[bot] in #545
- Overwrite Enroot's custom dirs if image disks are used by @dstaroff in #547
- reference module path for helm chart by @cyril-k in #549
- Merge to main: Overwrite Enroot's custom dirs if image disks are used by @github-actions[bot] in #548
- Bump Soperator Terraform version 1.22.0-1 by @rdjjke in #550
- del k8up values null by @Uburro in #553
- Merge to main: del k8up values null by @github-actions[bot] in #554
- Merge to main: Bump Soperator Terraform version 1.22.0-1 by @github-actions[bot] in #551
- fix invalid stable git ref by @itechdima in #556
- Merge to main: fix invalid stable git ref by @github-actions[bot] in #557
- soperator-tf release 1.22.0-2 by @itechdima in #558
- soperator-tf release 1.22.0-2 by @itechdima in #559
- Merge to main: soperator-tf release 1.22.0-2 by @github-actions[bot] in #560
- add more aws variables for proper cleanup by @itechdima in #562
- deploy checks based on the environment by @itechdima in #529
- add nfs-system to the allowed namespaces for cleanup by @itechdima in #563
- GPU slicing guides by @brianlechthaler in #525
- disk sizes normalized by @pbutler in #566
- create bucket keypair in tf instead of CLI by @cyril-k in #565
- Preemptible instances and K8s by @rene-tech in #506
- Change soperator release workflow to manual triggering only by @theyoprst in #570
- Merge to main: Change soperator release workflow to manual triggering only by @github-actions[bot] in #571
New Contributors
Full Changelog: main-v2025-09-15...main-v2025-09-29