v0.3.1

github-actions released this 12 Feb 22:08
· 109 commits to main since this release

NVIDIA GPU Health Agent v0.3.1

This release includes GPU health monitoring and reporting capabilities for NVIDIA GPU infrastructure.

Available Packages

  • DEB packages: amd64 and arm64 for Debian/Ubuntu systems
  • RPM packages: x86_64 and aarch64 for RHEL/CentOS/Fedora systems
  • Binary archives: standalone binaries for both architectures (amd64/x86_64 and arm64/aarch64)

Installation

Download the appropriate package for your system from the assets below. For installation instructions, see the project README.
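
The DEB and RPM packages label the same two architectures differently, so it can help to map the output of `uname -m` to the label used by each package family. The sketch below is one way to do that; `pkg_arch` is a hypothetical helper written for illustration and is not part of the agent or its installer. It makes no assumption about the actual asset file names.

```shell
# Map a kernel machine name (as printed by `uname -m`) to the architecture
# label used by each package family listed under "Available Packages".
# pkg_arch is a hypothetical helper, not something shipped with the agent.
pkg_arch() {
  family="$1"                   # "deb" or "rpm"
  machine="${2:-$(uname -m)}"   # defaults to the current host
  case "$family:$machine" in
    deb:x86_64|deb:amd64)   echo amd64 ;;
    deb:aarch64|deb:arm64)  echo arm64 ;;
    rpm:x86_64|rpm:amd64)   echo x86_64 ;;
    rpm:aarch64|rpm:arm64)  echo aarch64 ;;
    *) echo "unsupported: $family on $machine" >&2; return 1 ;;
  esac
}
```

Calling `pkg_arch deb` or `pkg_arch rpm` with no second argument uses the current host. On Debian and Ubuntu, `dpkg --print-architecture` reports the DEB label directly.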

Changelog

New Features

  • ee05ebf feat: add configurable env vars through helm values

Bug Fixes

  • fcc9ad3 fix: Attestation failed after first enrollment for 24h
  • 3e52f83 fix: dev mount directly to avoid nvidiactl issue writing file
  • 6a61296 fix: enroll failure doesn't stop the agent start
  • 22721db fix: pkgdb CVE fix

Documentation Updates

  • 57278e8 docs: improve configuration docs by adding the helm configuration
  • ca6c3d9 docs: separate install docs for different platforms

Other Changes

  • 1431987 [GPUHEALTH-1297] chore: helm installer improvement
  • 592a225 refactor: remove skyhook install path for the agent

Verification

All release artifacts have SHA-256 checksums listed in checksums.txt. To verify downloads, run the following in the directory that contains them:

sha256sum -c checksums.txt
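
If only one or two assets were downloaded, the command above is likely to report errors for every other file listed in checksums.txt. A minimal sketch of a workaround, assuming GNU coreutils `sha256sum` (version 8.25 or later, which provides `--ignore-missing`); `verify_downloaded` is a hypothetical wrapper name chosen for illustration:

```shell
# verify_downloaded is a hypothetical wrapper around GNU sha256sum.
# --ignore-missing skips entries in the checksum list whose files are not
# present, so only the assets that were actually downloaded are checked.
# The checksum file defaults to checksums.txt in the current directory.
verify_downloaded() {
  sha256sum --check --ignore-missing "${1:-checksums.txt}"
}
```

Run `verify_downloaded` from the download directory. It exits non-zero if any present file fails its checksum, which makes it suitable as a guard before installing a package.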

Support

For issues and questions, please visit: https://github.com/NVIDIA/gpuhealth/issues