Jenkins plugin e2e hangs and runs forever #7277
Comments
@bparees some preliminary research and thoughts on changing this test ... PTAL:
If I interpret the original error msg correctly, the test ran for 1h and 1 min and then was kicked out, so the hang(s) could have been in 1 spot or potentially spread out across a few:
There still is a bit of a time discrepancy, of course (the above 4 points add up to 48 minutes). Perhaps the wait for input on the channel is taking longer than we expect? Initial thoughts on elements of a change
Any thoughts?
We'd have more info if @smarterclayton had included the full logs of his run, since they would show which steps were completed (and possibly timestamps for each step?). My first guess would be a bug in WaitForADeployment that causes it to hang somewhere inside the loop, so the 15 minute interval never gets checked. For example, if the watch hangs (no events show up, but the watch doesn't get closed), I think you could end up sitting here indefinitely: Given that no new stuff is likely being added to etcd, your watch will probably not expire frequently (based on my current poor understanding of etcd and watches). So we probably need a separate goroutine that babysits WaitForADeployment, or use a channel switch:
@bparees thx for the input (including the channel switch stuff) and the +1 on the theory that the hang centers around the channel processing. As an FYI, I may have reproduced the hang locally ... at least, running the extended test locally hangs. I'll see if it is a general env issue by running some other tests. And looking at the jenkins logs, it says the test job failed. So perhaps a fix for the job failure itself, as well as better error reaction, is in the offing.
OK ... I did reproduce the test hang locally. Recent plugin usability changes necessitated an update to the test jenkins job config associated with the jenkins extended test. I have a fix in hand. That said, I'll stash the job config fix temporarily, and work on better error detection and test exit.
Looks like it didn't detect failure and didn't exit?