-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade all VMs to AMD64 Ubuntu 22.04 #86194
Conversation
Tagging subscribers to this area: @dotnet/runtime-infrastructure Issue DetailsAll VMs need to move to 22.04 unless there is a specific reason. This PR is a bulk upgrade of the VMs. Nothing has been tested nor do I know how to (other than making the PR and running CI). This PR is specific to AMD64 to make the change smaller. Other changes need to follow. If this change is still too big, it can be further reduced.
|
If we're moving forward machine versions by policy, do we want to extract the name of the "current standard linux testing VM" into a general variable like "linux-latest"? |
I like the idea in theory, however a scheme like that forces us to make everything compatible in one PR. The current scheme seems better. |
/azp run runtime-extra-platforms |
Azure Pipelines successfully started running 1 pipeline(s). |
Nice @carlossanlop -- It is ironic (and a little embarrassing) that I was changing these pipelines with all these conditions for the various run types and didn't realize that this PR requires running the full matrix of runs. Oops! |
No worries @richlander, the runtime test matrix is super complex. Thanks for sending the change. |
Co-authored-by: Jeremy Koritzinsky <jkoritzinsky@gmail.com>
Regarding the Android Helix queues: those only run on 18.04 because that was the common version when the queues were created, we can ask core-eng to create 22.04-based ones. |
We also want to backport this to 7.0 and 6.0. Right? |
@carlossanlop -- yes, we want to backport. Might we wait until we have the .NET 8 project done (so that we understand the full scope, including where we needed to make extra changes)? If the best pattern is to just port as we go, that's fine, too. Yes @akoeplinger -- please ask for that. FYI @ilyas1974 |
/azp run runtime-extra-platforms |
Azure Pipelines successfully started running 1 pipeline(s). |
/azp run runtime-wasm |
Azure Pipelines successfully started running 1 pipeline(s). |
may be related to #89788 |
Just re-ran the failed checks. Looks like a repeat. Should we disable the test again? |
The HTTP test failures seems environmental. I set up helix-repro machine and I could not reproduce the failure. However, looking at one the test failure:
there are more events than excepted and it really looks like there is extra request e.g. like multiple tests interfering. It is possible that something changed in the new Kernel and the port conflict are more likely. While 18.04 is out of support we can still move to 20.04, right @richlander? It should be supported for another two years. If this turns out the be interaction between kernel and test infrastructure there may not be any quick and easy fix. cc: @karelz |
This doesn't make sense as a strategy. We support 22.04. If our test machines cannot successfully run on it, that's a problem. For the life of .NET 8, 22.04 will be much more important than 20.04. We should focus on it. |
yes, it is. But as I mentioned I don't see easy fix at the moment e.g. it will take some time to get stable runs IMHO. |
Is this mainly a test issue (apparently customers haven't reported this since 2204 was released)? In which case, that's small beans compared to we have an impactful bug when on 2204. |
So, what should be do? I see no reason to give up on 22.04. |
We should investigate IMHO. I saw some metric failures on other platforms -> may be just flaky test. I also saw random HTTP/2 failures in my local run. Everything seem happier after reverting #89788 -> may (or may not) be very recent failure. Both @MihaZupan and @antonfirsov are out this week. I can keep looking into since RC1 checkpoint is reached as of today. I'll discuss next steps with @karelz tomorrow and we should be able to come up with some action plan. |
We should also update http and ssl stress (here or in a separate PR):
@wfurt which test does #86194 (comment) refer to? Now I only see |
I'm trying to upgrade the vm images used by wasm to 22.04 also. For that I opened dotnet/dotnet-buildtools-prereqs-docker#911 . But I'm wondering whether there is any way to use the image produced on that CI for the PR, in a The alternative would be to merge a new PR there for any change/debugging that needs to be done with the image. |
One way to do it is to build the container locally, and push to your own container registry. Then you can make a runtime PR which uses that image. I used our Azure credit to create a container registry last time I did this. Here's an example of a commit where I pointed some of the legs to my own container registry: ce31e13 |
We talk about this with team @richlander and you should probably just go ahead and merge this. Some of the test that failed here in the past failed elsewhere - with low failure rate so it does not seems unique to Ubuntu 22. We have ongoing monitoring and we are ready to react if we increased number of failures. In the mean time, investigation how to improve stability is going on but that IMHO should not block this. And the linux_musl failures is clearly unrelated. |
Thanks. The musl failure is also an HTTP test, yes? Do you have thoughts on it, or do I misunderstand? |
yes, Http2 / Multiple connections. |
Is there an open issue to include here? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Taking into account what @wfurt explained here #86194 (comment), we are good to go to merge this PR. Approving right now to unblock it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. We can deal with tests later if we see spike of failures.
Thanks! Glad to have this one merged. |
In case it become important, this is the relevant issue that was still active. |
Ubuntu 18.04 has transitioned from standard to LTS support -> https://wiki.ubuntu.com/Releases. Our VM hosts should always run on standard support Ubuntu versions.
All VMs need to move to 22.04 unless there is a specific reason. This PR is a bulk upgrade of the VMs. Nothing has been tested nor do I know how to (other than making the PR and running CI).
This PR is specific to AMD64 to make the change smaller. Other changes need to follow. If this change is still too big, it can be further reduced.