-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update subdomain tests #194
update subdomain tests #194
Conversation
Hi @vsoch. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/ok-to-test |
Problem: there are duplicated attributes for subdomain/networksubdomain, and setting the subdomain on the result should be done by the JobSet. Solution: remove the duplicate (subdomain) and manual setting. Signed-off-by: vsoch <vsoch@users.noreply.github.com>
3620766
to
f6ae87a
Compare
/retest-required |
/test pull-jobset-test-e2e-main-1-24 |
/test all rerunning tests to check flakiness |
That's weird it is based on the container build (and finding packages). |
/retest Yeah I'm hoping #191 will help resolve the unit test flakiness (for some reason |
I don't know the details of the differences, but fwiw I usually do ENV DEBIAN_FRONTEND=noninteractive
RUN apt-get update && apt-get install -y <package> Or I may just be an old crusty linux person, that is very likely too! 😆 |
/retest |
/retest all |
@ahg-g: The
Use In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test all |
I think we might try using apt-get in Dockerfile and scripts, e.g.,:
That's the warning I was mentioning earlier. I'm going to try running this manually to see if I can reproduce. |
okay looks like you already did that! https://github.com/kubernetes-sigs/jobset/pull/192/files |
What is / where is the base image that this testing setup called prow is using? |
|
Thank you - perfect! And I actually I saw (I think?) the same populated one here: https://prow.k8s.io/prowjob?prowjob=9c8ca3c0-37c3-4ba5-b35e-76b15bf25415 (albeit it's not as nice to look at!). Okay so if we are using the golang 1.20 base, I can reproduce:
I tried explicitly adding the repository - it didn't seem to work (but here are the steps if anyone is interested) apt-get update && apt-get install -y software-properties-common python3-launchpadlib To add the repository: add-apt-repository ppa:openjdk-r/ppa ( And it seems opendjk-11-jdk is going EOL this year - it's a long term support version but seems to end in 3 months: Perhaps we can use openjdk-17-jdk instead? That works for me in a fresh image: apt-get install -y openjdk-17-jdk I'm going to poke around and see if there are any notices for why the package isn't found. But I think we might do well to update to a version that isn't going EOL soon anyway. |
okay more debugging - this shows a lot of results (for version 17) $ apt search openjdk This shows no results: $ apt search openjdk-11 Sorting... Done
Full Text Search... Done And I don't see the package under "bookworm" https://packages.debian.org/bookworm/openjdk-11-jdk. It's definitely there for buster! https://packages.debian.org/buster/openjdk-11-jdk And maybe this is the underlying issue - the golang base images for 19 and 20 were updated to have bookworm and not buster. docker-library/golang@db757a0 So I don't think we did anything here - we just need to update the libraries we used based on what is available. And note there are buster variants available: https://hub.docker.com/layers/library/golang/1.20-buster/images/sha256-f86220e98c9d858143a68158dde675a0178854549511d9fee469767edfefc0d7?context=explore if you absolutely cannot change, but based on the EOL of jdk 11 I think it's probably an OK time. Hope that helps! |
A suggestion from the Chainguard Dev CEO is to try their wolfi images: https://twitter.com/lorenc_dan/status/1670535938240389121?s=20. They are fairly minimalist, and not sure if it would greatly help with the EOL, but I thought I'd bring it up for discussion in case there is interest! |
What I am confused about is how an error like "Unable to locate package openjdk-11-jdk" causes a flake? why would the test ever succeed if the package doesn't exist? |
Because the golang images when they were buster did have the package. You likely were hitting a cache with the older images if it worked. |
btw, in bookworm it is openjdk-17: https://packages.debian.org/bookworm/openjdk-17-jdk, so we just need to change the script at https://github.com/kubernetes-sigs/jobset/blob/main/hack/python-sdk/gen-sdk.sh#L48 to point to 17, right? |
Yes! That's what I suggested we do here: #194 (comment)
And the follow-up thread was just an exercise to understand why version 11 was not around! |
Would you like me to put in a quick PR to do that update, or is that planned for https://github.com/kubernetes-sigs/jobset/pull/192/files? |
yeah, to make it a bit more robust during this transition period, perhaps the script can try 11 first then if not successful try 17? |
Sure, but I can almost guarantee since the images have been deployed (golang 19 and 20, specifically) to use bookworm, it will be unlikely. But I could see someone keeping a cache somewhere, so +1 to try 11 and fall back to 17. |
👉 #201 👈 Can't wait to squash this bug! 🐛 🐞 |
/retest |
I likely need to do a rebase here, but I can wait for the recently triggered tests to run. |
it should automatically rebase |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: ahg-g, vsoch The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Thank you! |
Problem: there are duplicated attributes for subdomain/networksubdomain, and setting the subdomain on the result should be done by the JobSet.
Solution: remove the duplicate (subdomain) and manual setting.
This should hopefully help with #181 if the issue is related!