Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Grub "error: out of memory" with nvidia extensions #272

Closed
mdallaire opened this issue Nov 23, 2023 · 8 comments · Fixed by siderolabs/pkgs#836
Closed

Grub "error: out of memory" with nvidia extensions #272

mdallaire opened this issue Nov 23, 2023 · 8 comments · Fixed by siderolabs/pkgs#836

Comments

@mdallaire
Copy link
Contributor

Talos v1.5.5
nonfree-kmod-nvidia:535.54.03-v1.5.5
nvidia-container-toolkit:535.54.03-v1.13.5

Hardware: NUC11PHKi7CAA (UEFI boot)
64g RAM

After performing the initial installation with the vanilla metal-amd64.iso I apply the configuration with the nvidia extensions and kernel modules. All of which work well. When I perform a reboot with the -m powercycle option the machine won't boot with the following error:
image

Discussing with frezbo he mentionned this grub issue as a possible cause : https://bugs.launchpad.net/oem-priority/+bug/1842320

I performed other tests in proxmox running on the same hardware model by switching between UEFI and BIOS on VMs with the NVIDIA card in PCI passthrough but I was not able to reproduce the issue.

Machine Boot Mode Result
Proxmox VM BIOS Works
Proxmox VM UEFI Works
Physical UEFI grub OOM
Physical BIOS Not available for this hw
@mdallaire
Copy link
Contributor Author

The following schema (e6584563021316c06939b85c8060bd4535499279ae9a6323d9356d7a2fcd7fb4) and ISO also result in the same behavior.

customization:
    extraKernelArgs:
        - nomodeset
    systemExtensions:
        officialExtensions:
            - siderolabs/i915-ucode
            - siderolabs/intel-ucode
            - siderolabs/nvidia-container-toolkit
            - siderolabs/nonfree-kmod-nvidia

@frezbo
Copy link
Member

frezbo commented Nov 24, 2023

@mdallaire could you try this installer image: ghcr.io/frezbo/installer/e6584563021316c06939b85c8060bd4535499279ae9a6323d9356d7a2fcd7fb4:v1.6.0-alpha.2-4-g8e2307466-dirty, also add the nomodeset kernel arg in machine.install.extraKernelArgs, the same extension list is there in the installer

@mdallaire
Copy link
Contributor Author

I don't have access to the image:
Error response from daemon: Head "https://ghcr.io/v2/frezbo/installer/e6584563021316c06939b85c8060bd4535499279ae9a6323d9356d7a2fcd7fb4/manifests/v1.6.0-alpha.2-4-g8e2307466-dirty": unauthorized

@frezbo
Copy link
Member

frezbo commented Nov 24, 2023

I don't have access to the image: Error response from daemon: Head "https://ghcr.io/v2/frezbo/installer/e6584563021316c06939b85c8060bd4535499279ae9a6323d9356d7a2fcd7fb4/manifests/v1.6.0-alpha.2-4-g8e2307466-dirty": unauthorized

sory, should be fixed now

@mdallaire
Copy link
Contributor Author

That installation image with the new grub version works perfectly on my machine now.

@frezbo
Copy link
Member

frezbo commented Nov 24, 2023

That installation image with the new grub version works perfectly on my machine now.

cool, i'll get this tested with another setup also

frezbo added a commit to frezbo/pkgs that referenced this issue Dec 6, 2023
This reverts commit 70919d8.

Newer grub has issues with loading from XFS filesystems.
See siderolabs/talos#8023, and the fix in
`70919d82` cannot be backported to older version. So the safe option is
to revert while siderolabs/extensions#272
remains open.
frezbo added a commit to frezbo/pkgs that referenced this issue Dec 6, 2023
This reverts commit 70919d8.

Newer grub has issues with loading from XFS filesystems.
See siderolabs/talos#8023, and the fix in
`70919d82` cannot be backported to older version. So the safe option is
to revert while siderolabs/extensions#272
remains open.

Signed-off-by: Noel Georgi <git@frezbo.dev>
@frezbo
Copy link
Member

frezbo commented Dec 6, 2023

Re-opening since new grub has other bugs

@frezbo frezbo reopened this Dec 6, 2023
@frezbo
Copy link
Member

frezbo commented Apr 3, 2024

Talos 1.7 should have grub 2.12

@frezbo frezbo closed this as completed Apr 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants