www.podman.io flaking - netavark bug? #17044

Closed
edsantiago opened this issue Jan 9, 2023 · 11 comments · Fixed by #17053
Labels
flakes Flakes from Continuous Integration kind/bug Categorizes issue or PR as related to a bug. locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments.

Comments

@edsantiago
Member

Tests trying to fetch from www.podman.io are flaking. Sometimes a curl timeout, sometimes a DNS name resolution failure:

curl: (6) Could not resolve host: www.podman.io

I can't tell if it's a real problem with podman.io and/or its nameservers, or a connectivity hiccup, or if it's a netavark/aardvark-dns problem. The latter seems unlikely because I've only seen this with podman.io.

There are many more but I'm getting tired of scanning through flake logs.
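
The two failure modes described above (a curl timeout versus a name resolution failure) map to distinct curl exit codes, so flake logs can be bucketed mechanically. The following is only a minimal sketch: the function names `classify_curl_exit` and `summarize` and the label strings are made up for illustration and are not part of podman's CI tooling. The exit codes themselves (6, 7, 28) are the ones documented in the curl man page.

```python
# curl exit codes relevant to this flake, as documented in the curl man page.
# The label strings on the right are illustrative names, not curl terminology.
CURL_EXIT_MEANING = {
    0: "ok",
    6: "dns-failure",      # "Could not resolve host"
    7: "connect-failure",  # failed to connect to host
    28: "timeout",         # operation timed out
}


def classify_curl_exit(code: int) -> str:
    """Map a curl exit code to a coarse failure class (hypothetical helper)."""
    return CURL_EXIT_MEANING.get(code, "other")


def summarize(exit_codes):
    """Count how often each failure class appears in a list of exit codes."""
    summary = {}
    for code in exit_codes:
        label = classify_curl_exit(code)
        summary[label] = summary.get(label, 0) + 1
    return summary
```

A mix of `dns-failure` and `timeout` buckets would be consistent with either a flaky resolver or a general connectivity problem, so a tally like this narrows the symptom but does not by itself identify the cause.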

@edsantiago edsantiago added the flakes Flakes from Continuous Integration label Jan 9, 2023
@Luap99
Member

Luap99 commented Jan 9, 2023

podman.io is hosted on GitHub infrastructure. I think one easy change would be to switch the tests to something like google.com and hope that this is a server-side problem and not a client-side bug.

@rhatdan
Member

rhatdan commented Jan 9, 2023

Should do redhat.com before google.

@edsantiago
Member Author

Yesterday, remote f37 root

edsantiago added a commit to edsantiago/libpod that referenced this issue Jan 10, 2023
Much as we'd love to eat our dogfood, podman.io is not hosted
on reliable infrastructure; redhat.com is. Let's see if this
gets rid of CI flakes.

Closes: containers#17044

Signed-off-by: Ed Santiago <santiago@redhat.com>
@edsantiago
Member Author

Well, shucks. It's not a podman.io problem

@edsantiago
Member Author

@containers/netavark-maintainers PTAL. It's possible that this is a network-infrastructure problem in Cirrus or gcloud... but it's also possible that our networking is broken.

@Luap99
Member

Luap99 commented Jan 10, 2023

Do you know when this started? I know there was a bug in netavark v1.3 which could cause this. VM image updates should include netavark v1.4 but the automation_image CI is still broken so I cannot update it at the moment.
Edit: Updates should be working again so I will try to update the VM images: #16525

@edsantiago
Member Author

First instance of "Could not resolve host: www.podman.io" is Dec 5

First instance of "command timed out ... wget www.podman.io" is Nov 2

@edsantiago
Member Author

Reopening, because #17053 is unlikely to fix anything

@edsantiago edsantiago reopened this Jan 10, 2023
@Luap99
Member

Luap99 commented Jan 11, 2023

I agree. The common theme here is that it fails with --network private or bridge, which means it uses netavark on f37. f36 uses CNI and, at least from the instances listed here, it is not affected. Therefore it is most likely a netavark bug.
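
The reasoning in this comment amounts to grouping each flake by the network backend that handled it. A rough sketch of that grouping follows, assuming each flake is recorded as a `(distro, network_mode)` pair; that record shape and the names `backend_for` and `tally_by_backend` are hypothetical. The distro-to-backend mapping (f37 uses netavark, f36 uses CNI) is taken from the comment itself.

```python
from collections import Counter

# Mapping as stated in the comment: f37 uses netavark, f36 uses CNI.
BACKEND_BY_DISTRO = {"f37": "netavark", "f36": "cni"}

# Network modes that the comment identifies as going through the backend.
BACKEND_NETWORK_MODES = {"private", "bridge"}


def backend_for(distro: str, network_mode: str) -> str:
    """Return the backend that would handle a container's network setup."""
    if network_mode not in BACKEND_NETWORK_MODES:
        # Any other mode is treated here as not involving the backend
        # (a simplifying assumption of this sketch).
        return "no-backend"
    return BACKEND_BY_DISTRO.get(distro, "unknown")


def tally_by_backend(flakes):
    """Count flakes per backend from (distro, network_mode) pairs."""
    return Counter(backend_for(distro, mode) for distro, mode in flakes)
```

If every recorded flake lands in the `netavark` bucket and none in `cni`, that supports the conclusion above, although it is only as strong as the number of instances examined.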

@vrothberg vrothberg changed the title from "www.podman.io flaking" to "www.podman.io flaking - netavark bug?" Jan 19, 2023
@vrothberg vrothberg added the kind/bug Categorizes issue or PR as related to a bug. label Jan 19, 2023
@Luap99
Member

Luap99 commented Jan 25, 2023

@edsantiago New images with a newer netavark were merged last week. Does it still happen?
I don't remember seeing it this week so far.

@edsantiago
Member Author

Sorry for the late reply: we're having a LOT of flakes this week, and I've had to categorize them one by one, which takes time.

The last two instances of this flake were on January 16, both on a PR that did not include #16525 (the update-images PR). I feel comfortable closing. Thank you!

@github-actions github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 3, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 3, 2023
4 participants