Hopper/Blackwell support by espeer · Pull Request #25 · espeer/linux

espeer · 2025-12-12T19:25:11Z

Applying John's patches for test

Using the Ubuntu configuration on the test branch is a known good general starting point. This configuration will be pared down and refined in future commits in order to reduce build time while still retaining some degree of hardware flexibility for the test cluster. Signed-off-by: Edwin Peer <epeer@nvidia.com>

Signed-off-by: Edwin Peer <epeer@nvidia.com>

For convenience of the reader: now you can directly see the sizes of each range. It is suprising just how much this helps. Sample output (using an Ampere GA104): NovaCore 0000:e1:00.0: FbLayout { fb: 0x0..0x3ff800000 (16376 MB), vga_workspace: 0x3ff700000..0x3ff800000 (1 MB), frts: 0x3ff600000..0x3ff700000 (1 MB), boot: 0x3ff5fa000..0x3ff600000 (0 MB), elf: 0x3fb960000..0x3ff5f9000 (60 MB), wpr2_heap: 0x3f3900000..0x3fb900000 (128 MB), wpr2: 0x3f3800000..0x3ff700000 (191 MB), heap: 0x3f3700000..0x3f3800000 (1 MB), vf_partition_count: 0x0, } Signed-off-by: John Hubbard <jhubbard@nvidia.com>

A tiny simplification: now that FbLayout uses its own specific FbRange type, add an FbRange.len() method, and use that to (very slightly) simplify the calculation of Frts::frts_size initialization. Suggested-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Hopper (GH100) and Blackwell identification, including ELF .fwsignature_* items. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

…tion() Keep Gsp::new() from getting too cluttered, by factoring out the selection of .fwsignature* items. This will continue to grow as we add GPUs. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Instead of long, exhaustive lists of GPUs ("Chipsets"), use entire GPU Architectures, such as "Blackwell" or "Turing", to make HAL choices. A tiny side effect: moved a "use" statement out of function scope, in each file, up to the top of the file, as per Rust for Linux conventions. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

As per [1], we need one "use" item per line, in order to reduce merge conflicts. Furthermore, we need a trailing ", //" in order to tell rustfmt(1) to leave it alone. This does that for commands.rs, which is the only file in nova-core that has any remaining instances of the old style. [1] https://docs.kernel.org/rust/coding-guidelines.html#imports Signed-off-by: John Hubbard <jhubbard@nvidia.com>

This removes a "TODO" item in the code, which was hardcoded to work on Ampere and Ada GPUs. Hopper/Blackwell+ have a larger width, so do an early read of boot42, in order to pick the correct value. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Up until now, only the GSP required parsing of its firmware headers. However, upcoming support for Hopper/Blackwell+ adds another firmware image (FMC), along with another format (ELF32). Therefore, the current ELF64 section parsing support needs to be moved up a level, so that both of the above can use it. There are no functional changes. This is pure code movement. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Factor out a chunk of complexity into a new subroutine. This is an incremental step in adding ELF32 support to the existing ELF64 section support, for handling GPU firmware. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add ElfHeader and ElfSectionHeader traits to abstract out differences between ELF32 and ELF64. Implement these for ELF64. This is in preparation for upcoming ELF32 section support, and for auto-selecting ELF32 or ELF64. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add Elf32Hdr and Elf32SHdr newtypes, implement the ElfHeader and ElfSectionHeader traits for them, and add elf32_section(). This mirrors the existing ELF64 support, using the same generic infrastructure. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add elf_section() which checks the ELF magic and class byte to automatically dispatch to elf32_section() or elf64_section(). Update existing callers to use elf_section() instead of calling elf64_section() directly. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

…of FSP FSP is a hardware unit that runs FMC firmware. Co-developed-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add the FSP (Firmware System Processor) falcon engine type that will handle secure boot and Chain of Trust operations on Hopper and Blackwell architectures. The FSP falcon replaces SEC2's role in the boot sequence for these newer architectures. This initial stub just defines the falcon type and its base address. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add external memory (EMEM) read/write operations to the GPU's FSP falcon engine. These operations use Falcon PIO (Programmed I/O) to communicate with the FSP through indirect memory access. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add the FSP messaging infrastructure needed for Chain of Trust communication on Hopper/Blackwell GPUs. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Various "reserved" areas of FB (frame buffer: vidmem) have to be calculated, because the GSP booting process needs this information. The calculations are const, so a new const-compatible alignment function is also added to num.rs, in order to align the reserved areas. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Hopper, Blackwell and later need to reserve more memory than previous GPUs did. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add the FSP (Firmware System Processor) module for Hopper/Blackwell GPUs. These architectures use a simplified firmware boot sequence: FMC --> FSP --> GSP, with no SEC2 involvement. This commit adds the ability to wait for FSP secure boot completion by polling the I2CS thermal scratch register until FSP signals success. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add the data structures for FSP Chain of Trust communication. These include the FMC signature container (hash, public key, signature) and the NVDM payload structures for sending COT messages and receiving responses. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add extract_fmc_signatures_static() to parse cryptographic signatures from FMC ELF firmware sections. This extracts the SHA-384 hash, RSA public key, and signature needed for Chain of Trust verification. Also exposes the elf_section() helper from firmware.rs for use by FSP. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add send_sync_fsp() which sends an MCTP/NVDM message to FSP and waits for the response. This handles the low-level protocol details including header validation, error checking, and timeout handling. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Add the boot functions that construct FMC boot parameters and send the Chain of Trust message to FSP. This completes the FSP communication infrastructure needed to boot GSP firmware on Hopper/Blackwell GPUs. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Hopper, Blackwell and later require more space for the non-WPR heap. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Hopper, Blackwell and later GPUs require a larger heap for WPR2. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Extract the SEC2 booter loading sequence into a dedicated helper function. This is a pure refactoring with no behavior change, done in preparation for adding an alternative FSP boot path. Co-developed-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: John Hubbard <jhubbard@nvidia.com>

Hopper and Blackwell GPUs use FSP-based secure boot and do not require waiting for GFW_BOOT completion. Skip this step for these architectures. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

espeer · 2026-03-18T19:01:23Z

/build

espeer · 2026-03-18T20:53:49Z

/build

espeer and others added 30 commits December 1, 2025 19:42

Reduce kernel build configuration

f4b9d01

Signed-off-by: Edwin Peer <epeer@nvidia.com>

Create GH workflow

91df4f3

gpu: nova-core: Hopper/Blackwell: basic GPU identification

108a983

Hopper (GH100) and Blackwell identification, including ELF .fwsignature_* items. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

nova-core: factor .fwsignature* selection into a new get_gsp_sigs_sec…

e0bcbba

…tion() Keep Gsp::new() from getting too cluttered, by factoring out the selection of .fwsignature* items. This will continue to grow as we add GPUs. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: factor out a section_name_eq() function

ef0857a

Factor out a chunk of complexity into a new subroutine. This is an incremental step in adding ELF32 support to the existing ELF64 section support, for handling GPU firmware. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: Hopper/Blackwell: add FSP message infrastructure

0a5c797

Add the FSP messaging infrastructure needed for Chain of Trust communication on Hopper/Blackwell GPUs. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: Hopper/Blackwell: add needs_large_reserved_mem()

96685f3

Hopper, Blackwell and later need to reserve more memory than previous GPUs did. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: Hopper/Blackwell: larger non-WPR heap

96ca4c7

Hopper, Blackwell and later require more space for the non-WPR heap. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: Hopper/Blackwell: larger WPR2 (GSP) heap

437d773

Hopper, Blackwell and later GPUs require a larger heap for WPR2. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

gpu: nova-core: Hopper/Blackwell: skip GFW boot waiting

7597bda

Hopper and Blackwell GPUs use FSP-based secure boot and do not require waiting for GFW_BOOT completion. Skip this step for these architectures. Signed-off-by: John Hubbard <jhubbard@nvidia.com>

github-actions bot force-pushed the nova-test branch 8 times, most recently from ec59f53 to fdea910 Compare February 5, 2026 04:51

github-actions bot force-pushed the nova-test branch 7 times, most recently from 4be6b4c to 5fddb26 Compare February 12, 2026 05:00

github-actions bot force-pushed the nova-test branch 8 times, most recently from fc0344e to b56cdcc Compare February 20, 2026 04:38

github-actions bot force-pushed the nova-test branch 5 times, most recently from be6932a to 5ba69f3 Compare February 25, 2026 04:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hopper/Blackwell support#25

Hopper/Blackwell support#25
espeer wants to merge 34 commits intonova-testfrom
john-blackwell-hopper

espeer commented Dec 12, 2025

Uh oh!

espeer commented Mar 18, 2026

Uh oh!

espeer commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

espeer commented Dec 12, 2025

Uh oh!

espeer commented Mar 18, 2026

Uh oh!

espeer commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants