Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workspaces stuck consuming 100% CPU since 5.17 #6114

Closed
davidwindell opened this issue Sep 1, 2017 · 6 comments · Fixed by #6175
Closed

Workspaces stuck consuming 100% CPU since 5.17 #6114

davidwindell opened this issue Sep 1, 2017 · 6 comments · Fixed by #6175
Assignees
Labels
kind/bug Outline of a bug - must adhere to the bug report template. severity/P1 Has a major impact to usage or development of the system. sprint/current status/code-review This issue has a pull request posted for it and is awaiting code review completion by the community.

Comments

@davidwindell
Copy link
Contributor

davidwindell commented Sep 1, 2017

We are experimenting with v5.17.0. It seems a bug has been introduced that causes che to get stuck (indefinitely) consuming 100% of the CPU when retrieving the project file tree.

I can reliably reproduce this by switching between v5.16.0 (where it doesn't happen) to v5.17.0 where it does.

The process in question is the main workspace process:

/usr/lib/jvm/java-1.8.0-openjdk-amd64/bin/java -Dnop -Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager -Xms256m -Xmx2048m -Djava.security.egd=file:/dev/./urandom -Dche.logs.dir=/home/user/che/ws-agent/logs -Dche.logs.level=INFO -Djuli-logback.configurationFile=file:/home/user/che/ws-agent/conf/tomcat-logger.xml -Djdk.tls.ephemeralDHKeySize=2048 -Djava.protocol.handler.pkgs=org.apache.catalina.webresources -agentlib:jdwp=transport=dt_socket,address=4403,server=y,suspend=n -Dcom.sun.management.jmxremote -Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.authenticate=false -Dche.local.conf.dir=/mnt/che/conf -classpath /home/user/che/ws-agent/conf/:/usr/lib/jvm/java-1.8.0-openjdk-amd64/lib/tools.jar:/home/user/che/ws-agent/bin/bootstrap.jar:/home/user/che/ws-agent/bin/tomcat-juli.jar -Dcatalina.base=/home/user/che/ws-agent -Dcatalina.home=/home/user/che/ws-agent -Djava.io.tmpdir=/home/user/che/ws-agent/temp org.apache.catalina.startup.Bootstrap start

I'm happy to provide access to che devs to the workspace in question if that helps debug the issue.

Large project trees seem to be the most affected.

@davidwindell davidwindell changed the title Workspaces consuming 100% CPU since 5.17 Workspaces stuck consuming 100% CPU since 5.17 Sep 1, 2017
@ghost
Copy link

ghost commented Sep 2, 2017

@davidwindell to reproduce it I need to start Che v 5.17, import a big project (btw, can you suggest one?) and expand it in the project tree?

@ghost ghost added the kind/question Questions that haven't been identified as being feature requests or bugs. label Sep 2, 2017
@davidwindell
Copy link
Contributor Author

Here's a screenshot from Che v5.17 (note the XHR request takes 1 minute just to get the project tree).

screenshot from 2017-09-04 14-20-51

And here's a screenshot from Che v5.16 (exactly the same project, folder, network, etc)

screenshot from 2017-09-04 14-27-35

Now only a few ms.

@ghost ghost added kind/bug Outline of a bug - must adhere to the bug report template. severity/P1 Has a major impact to usage or development of the system. team/ide and removed kind/question Questions that haven't been identified as being feature requests or bugs. labels Sep 4, 2017
@ghost
Copy link

ghost commented Sep 4, 2017

@davidwindell thanks. I can confirm that the request to project api in 5.16 and 5.16 is completed differently...5 seconds vs milliseconds.

@ghost
Copy link

ghost commented Sep 4, 2017

Btw, deleting git repo fixes the problem, but that's, of course, not a solution.

@vparfonov
Copy link
Contributor

Possibly related to #5722
@vinokurig can you check?

@vinokurig vinokurig self-assigned this Sep 5, 2017
@vinokurig vinokurig added the status/in-progress This issue has been taken by an engineer and is under active development. label Sep 5, 2017
@vinokurig
Copy link
Contributor

vinokurig commented Sep 5, 2017

analyzing

@vinokurig vinokurig added status/code-review This issue has a pull request posted for it and is awaiting code review completion by the community. and removed status/in-progress This issue has been taken by an engineer and is under active development. labels Sep 7, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Outline of a bug - must adhere to the bug report template. severity/P1 Has a major impact to usage or development of the system. sprint/current status/code-review This issue has a pull request posted for it and is awaiting code review completion by the community.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants