-
Notifications
You must be signed in to change notification settings - Fork 137
FreeBSD CI-builds are failing #20
Comments
Yeah I was looking at this yesterday. I went to go create a new node, so I captured the old one and created two new versions. Either it wasn't really working that well before and we just got lucky, or something happened in the deprovisioning and capture, because the temporary drive stopped working within a few minutes.and started giving IO errors. |
I've updated to the latest version of waagent, which has correct provisioning for the temporary disk. Should be up and running quite soon. |
Looking at the latest build they still don't seem entirely stable:
Odd that they would chug along nicely to begin with and then suddenly stop working... |
@josteink I think we might actually be good here. The last few failures on the 30th were real failures, but @ajensenwaud fixed this last night. Woohoo! |
Ooh right. Nice of him then :) |
Actually looks like you're right about the debug one. It appears that for whatever reason, it didn't update it's workspace directory when I updated the value in Jenkins. It was still using the primary drive, which has very little space. I'll figure out what's going on. |
Done any changes since last time? They're all looking good now. Feel free to close this issue at your own discretion. |
We are all good now, and the build will fail on FreeBSD failures now! |
@mmitche, @josteink, now that dotnet/coreclr#1030 is merged, should the tests be enabled for FreeBSD as well? |
@jasonwilliams200OK Yep will do so today |
Currently 4 of the tests are failing. Will enabling the tests on the CI-build cause all PRs from now to turn red until we get those fixed? If so I would advice to delay enabling the tests until we have the 4 failing tests fixed. (My 2 norwegian øre) |
@josteink Yes. we should have them all passing before we enable them in the CI |
👍 |
@mmitche I looked slightly deeper into this, and it may be that enabling tests are safeish. Can you take a look at the findings and weight in your opinion? https://github.com/dotnet/coreclr/issues/1010#issuecomment-104756763 |
@josteink I got two failures on an exploratory run: http://dotnet-ci.cloudapp.net/job/dotnet_coreclr_freebsd_debug/lastFailedBuild/console The following test(s) failed: PAL Test Results: |
Ok. If so I still advice against enabling them. Was worth a shot though. :) |
How about we exclude these two tests for FreeBSD target something what these guys have done: dotnet/llilc#598? I mean only 0.2% of tests are failing. I think it is a bad idea to hold back the majority. |
Currently we're investigating the cause for those performing badly in #1044, but if those investigations turn up inconclusive or without any resolution, I'm inclined to agree with @jasonwilliams200OK . |
@jasonwilliams200OK Yeah that seems reasonable. If it seems like the values in the tests are unreasonable (they were probably chosen somewhat arbitrarily in the ancient past) then we could also alter them. |
@mmitche, 👍 A very valuable and yet interesting discussion on this subject is going on here https://github.com/dotnet/coreclr/issues/1044 between @josteink and @bsdjhb (FreeBSD kernel dev). It seems like there is more to just the timer values which need adjustment. Hopefully, the root issue will be identified soon. |
As can be seen on via Jenkins: http://dotnet-ci.cloudapp.net/job/dotnet_coreclr_freebsd_debug/
Looking into the first failing build, we have this log:
Failure to clone the repository could hint at the disks being full. But subsequent builds are failing for other reasons. That latest build gets to 16%, then suddenly Jenkins seems to disconnect and call it a day, so obviously something else is going on too.
@mmitche Looking at the build log It seem's you're already on this? :)
The text was updated successfully, but these errors were encountered: