Bug 577508 - Fixing regression #38
Hi Jay, you seem to be hitting the same freeze that has been blocking me for some days already in #28. I added a bunch of sysouts to debug this (it doesn't happen in the IDE); see the result in https://ci.eclipse.org/jdt/job/eclipse.jdt.core-Github/view/change-requests/job/PR-28/10/consoleText All I know by now is: the execution stalls when @iloveeclipse , @gayanper I believe the two of you know best what might have changed in indexing / searching. Can you help? |
Thanks, Stephan, for pointing that out. I was wondering if that was something about publishing the test results, since I assumed that the model tests had completed. |
Stephan, I did a comparison between one of the passed runs [1] and the last build for this PR [2], and I don't see the tests failing or freezing things. I suspect things go wrong after an attempt at "Recording test results". I also see this printed in the failed run: "script returned exit code 143". I wonder if the test result is too big and that causes something to blow up. I have seen something like this in the past with Gerrit when I had forgotten to remove the DEBUG AUTOMATON flag in the parser. But that's not the case here. [1] https://ci.eclipse.org/jdt/job/eclipse.jdt.core-Github/view/change-requests/job/PR-34/5/console |
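A side note on the "exit code 143" message: by common shell convention, an exit code above 128 means the process was ended by a signal, with the signal number being the exit code minus 128. 143 therefore maps to signal 15, which is SIGTERM on Linux. That would be consistent with the build being terminated from outside (for example by a timeout) rather than failing by itself, though the log alone does not prove this. A minimal sketch of the arithmetic; the class and method names are hypothetical:

```java
/** Hypothetical helper that decodes shell-style exit codes; not part of JDT or Jenkins. */
class ExitCodes {

    /**
     * Returns the signal number if the exit code follows the
     * "128 + signal" shell convention, or -1 otherwise.
     */
    static int signalFromExitCode(int exitCode) {
        return (exitCode > 128 && exitCode < 256) ? exitCode - 128 : -1;
    }

    public static void main(String[] args) {
        // 143 - 128 = 15, which is SIGTERM on Linux
        System.out.println("signal: " + signalFromExitCode(143));
    }
}
```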
Can you try running with Tycho 2.7.0? You will need to add "-Dtycho.version=2.7.0" to the Maven command in the Jenkinsfile at the repo root. |
Please see that I already narrowed it down to freezing within |
Stephan, sorry, I have had no time yet to look into it, but don't we use a timer thread in JDT to report thread dumps on deadlocks/hangs? I'm pretty sure we do this in JDT or Platform Debug, so if JDT Core misses that, we should add it ASAP so we don't need to guess why the build is hanging. |
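The idea of a timer thread that reports thread dumps on a hang can be sketched as follows. This is only an illustration under stated assumptions, not the mechanism JDT or Platform Debug actually uses: `HangWatchdog` is a hypothetical name, and the sketch relies solely on the standard `java.util.Timer` and `java.lang.management.ThreadMXBean` APIs.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.Timer;
import java.util.TimerTask;

/** Hypothetical helper, not part of JDT: dumps all threads if a run exceeds a time limit. */
class HangWatchdog implements AutoCloseable {

    // Daemon timer, so the watchdog itself never keeps the JVM alive.
    private final Timer timer = new Timer("hang-watchdog", true);

    HangWatchdog(long timeoutMillis) {
        timer.schedule(new TimerTask() {
            @Override
            public void run() {
                System.err.println("Possible hang after " + timeoutMillis + " ms, thread dump:");
                System.err.print(dumpAllThreads());
            }
        }, timeoutMillis);
    }

    /** Builds a jstack-like dump of every live thread with its full stack trace. */
    static String dumpAllThreads() {
        ThreadMXBean bean = ManagementFactory.getThreadMXBean();
        ThreadInfo[] infos = bean.dumpAllThreads(
                bean.isObjectMonitorUsageSupported(),
                bean.isSynchronizerUsageSupported());
        StringBuilder sb = new StringBuilder();
        for (ThreadInfo info : infos) {
            sb.append('"').append(info.getThreadName()).append("\" ")
              .append(info.getThreadState()).append('\n');
            for (StackTraceElement frame : info.getStackTrace()) {
                sb.append("\tat ").append(frame).append('\n');
            }
            sb.append('\n');
        }
        return sb.toString();
    }

    /** Cancels the pending dump; call this when the guarded work finished in time. */
    @Override
    public void close() {
        timer.cancel();
    }

    public static void main(String[] args) throws Exception {
        try (HangWatchdog watchdog = new HangWatchdog(200)) {
            Thread.sleep(500); // stands in for work that hangs longer than the limit
        }
    }
}
```

Wrapping the guarded work in try-with-resources means the dump is only printed when the work is still running at the deadline; a run that finishes in time cancels the timer silently.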
After jumping through some more Maven hoops, I managed to get stack traces from the freeze (freeze-jstack.txt). Two interesting portions are extracted here:
I read this as: search is waiting for a background job, but no thread has any matching activity. |
I don't know how or where these timers are implemented. But I can see that a timeout during an individual test execution produces stack traces, whereas a timeout during suite set-up does not. |
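To illustrate the gap described here, one way to get stack traces for a stalled suite set-up is to run the set-up under an explicit time limit and print all thread stacks when the limit is exceeded. The following is a sketch only; `SetUpTimeout`, `runWithTimeout` and the thread name `suite-setup` are hypothetical and do not reflect how the JDT test framework implements its timeouts.

```java
import java.util.Map;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical helper, not part of the JDT test framework. */
class SetUpTimeout {

    /** Runs setUp on a worker thread; returns false (after dumping stacks) if it overruns. */
    static boolean runWithTimeout(Runnable setUp, long timeoutMillis) {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "suite-setup");
            t.setDaemon(true); // a stuck set-up must not block JVM exit
            return t;
        });
        Future<?> future = executor.submit(setUp);
        try {
            future.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException e) {
            System.err.println("Set-up exceeded " + timeoutMillis + " ms; live threads:");
            for (Map.Entry<Thread, StackTraceElement[]> entry : Thread.getAllStackTraces().entrySet()) {
                Thread thread = entry.getKey();
                System.err.println(thread.getName() + " (" + thread.getState() + ")");
                for (StackTraceElement frame : entry.getValue()) {
                    System.err.println("\tat " + frame);
                }
            }
            return false;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt status
            return false;
        } catch (ExecutionException e) {
            throw new IllegalStateException("Set-up failed", e.getCause());
        } finally {
            executor.shutdownNow(); // interrupts the worker if it is still running
        }
    }

    public static void main(String[] args) {
        boolean finished = runWithTimeout(() -> {
            try {
                Thread.sleep(5_000); // stands in for a set-up that stalls
            } catch (InterruptedException ignored) {
                // cancelled by shutdownNow()
            }
        }, 200);
        System.out.println("finished in time: " + finished);
    }
}
```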
OK, found it: |
Thanks for the hint. Tried this locally. No difference. At this point my time for investigation is up. |
Tried, but didn't help. I can still see this being stuck. |
I don't know what is wrong with this and the other PRs that hang, but my PR was done in 30 minutes, see #40 and https://ci.eclipse.org/jdt/job/eclipse.jdt.core-Github/job/PR-40/1/ I will merge #40 now, so please rebase this change to see if it properly reports the freeze (assuming the freeze is in the set-up of one of the model test suites). |
So with the new thread dump reported in https://ci.eclipse.org/jdt/job/eclipse.jdt.core-Github/job/PR-38/6/console
But AttachedJavadocTests code is still running too ???
|
I believe all the trouble here could be caused either by an unfortunate Jenkins setup or by the actual code changes. I've also tried #46, which enables JobManager.VERBOSE mode for tests. All tests were green there, plus one gets an idea of what the indexer does (but beware: the console output is about 60 MB). I do not plan to merge that, because the console will explode, but feel free to try that change too, in case the hang isn't solved after a rebase on master here. |
That's a very interesting observation! Here's what AttachedJavadocTests$1 is doing (in a thread of its own):
Interestingly, the enclosing test |
@jarthana you may try again after a rebase, since #48 fixed this issue for me, and thus hopefully also for this PR (remember to revert the change of the Tycho version :) ). I briefly tried to retrigger builds, but (a) Jenkins did not find the required artifacts, and (b) I got confused by the difference between ".../jenkins/pr-head" and ".../jenkins/pr-merge"; perhaps a recent configuration change by @iloveeclipse ? |
Signed-off-by: Jay Arthanareeswaran <jarthana@in.ibm.com>
I still run into this. I have no clue what's going on. |
@jarthana : you have to rebase your branch on top of master & force push again.
|
OK, I see that the branch was rebased already, but now we have infrastructure errors, not a hang:
I assume it is something like https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/issues/1223, reported today. |