Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bridge Network Module Fails to Load Automatically #23583

Closed
mr-karan opened this issue Jul 12, 2024 · 4 comments · Fixed by #23707
Closed

Bridge Network Module Fails to Load Automatically #23583

mr-karan opened this issue Jul 12, 2024 · 4 comments · Fixed by #23707
Assignees
Labels
theme/docs Documentation issues and enhancements type/bug

Comments

@mr-karan
Copy link
Contributor

Nomad version

Output from nomad version: Nomad v1.7.7

Operating system and Environment details

OS: Linux (AWS EC2)
Kernel version: 6.8.0-1008-aws
Environment: AWS EC2 instance

Issue

After upgrading Nomad from 1.6.1 to 1.7.7, the bridge network is not functioning. The Nomad agent logs indicate that the bridge module is not found or loaded in the system kernel.

Reproduction steps

  1. Upgrade Nomad from version 1.6.1 to 1.7.7.
  2. Restart Nomad services.
  3. Check Nomad client logs for network-related errors.

Expected Result

The bridge network functions correctly without any errors, similar to the behavior seen in version 1.6.1.

Actual Result

The following errors are observed in Nomad client logs indicating issues with the bridge module:

  • failed to find /sys/module/bridge: no such file or directory
  • module bridge not in /proc/modules
  • module bridge not in /lib/modules/6.8.0-1008-aws/modules.builtin
  • module bridge not in /lib/modules/6.8.0-1008-aws/modules.dep

Nomad Client logs (if appropriate)

Jul 13 02:24:33 app nomad[4191]:   error=
Jul 13 02:24:33 app nomad[4191]:   | 4 errors occurred:
Jul 13 02:24:33 app nomad[4191]:   | \t* failed to find /sys/module/bridge: stat /sys/module/bridge: no su>
Jul 13 02:24:33 app nomad[4191]:   | \t* module bridge not in /proc/modules
Jul 13 02:24:33 app nomad[4191]:   | \t* module bridge not in /lib/modules/6.8.0-1008-aws/modules.builtin
Jul 13 02:24:33 app nomad[4191]:   | \t* module bridge not in /lib/modules/6.8.0-1008-aws/modules.dep
Jul 13 02:24:33 app nomad[4191]:   |
Jul 13 02:24:33 app nomad[4191]:

Note

I fixed it by manually loading the module with sudo modprobe bridge. After doing this, I restarted Nomad and spawned my job which had network.mode="bridge" and it worked fine. I think this is a regression from 1.6->1.7 upgrade.

@tgross
Copy link
Member

tgross commented Jul 15, 2024

Hi @mr-karan! It's a little strange that you're finding a regression here, as Nomad doesn't load the kernel module on its own. That's actually a known issue #10902 (and also #17311, sort of). But we also have #23523 open recently where someone wasn't seeing network fingerprinting happen correctly but seems to think it's Docker-related.

Is there any chance you've updated other infrastructure components other than Nomad when seeing this regression, @mr-karan? Specifically kernel, distro, Docker version, etc.?

@mr-karan
Copy link
Contributor Author

Ah yes, indeed we upgraded to Ubuntu Minimal 24.04 (we were on 22.04 Minimal in 1.6.1). I'll try to reproduce this in a VM and get back. Thanks for taking a look!

@tgross
Copy link
Member

tgross commented Jul 29, 2024

Similar issue reported here: #23700

We'll need to update install documentation on this, as lots of folks are going to get caught out by it unfortunately.

@tgross tgross added theme/docs Documentation issues and enhancements and removed stage/waiting-reply labels Jul 29, 2024
@tgross tgross self-assigned this Jul 29, 2024
@tgross
Copy link
Member

tgross commented Aug 6, 2024

Docs are being updated in #23707

@tgross tgross closed this as completed in 021692e Aug 6, 2024
tgross pushed a commit that referenced this issue Aug 6, 2024
- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583
tgross pushed a commit that referenced this issue Aug 6, 2024
- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583
tgross pushed a commit that referenced this issue Aug 6, 2024
- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583
tgross added a commit that referenced this issue Aug 6, 2024
)

- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583

Co-authored-by: Aimee Ukasick <aimee.ukasick@hashicorp.com>
tgross added a commit that referenced this issue Aug 6, 2024
)

- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583

Co-authored-by: Aimee Ukasick <aimee.ukasick@hashicorp.com>
tgross pushed a commit that referenced this issue Aug 6, 2024
- Pulled common content from multiple pages into new partials
- Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected.
- Moved CNI overview-type content to networking/index
- Refactored networking/cni to include install CNI plugins and configuration content (from install/index).
- Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan.

Ref: https://hashicorp.atlassian.net/browse/CE-661
Fixes: #23229
Fixes: #23583

Co-authored-by: Aimee Ukasick <aimee.ukasick@hashicorp.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
theme/docs Documentation issues and enhancements type/bug
Projects
Development

Successfully merging a pull request may close this issue.

2 participants