New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with RethinkDB hanging #4299
Comments
You can enter slow growth mode, it's doesn't always mean that RethinkDB is dead/hanging. I tried running |
After running the tests about 15-20 times in loop the db stops responding and all tests always fail. |
Thanks for the bug report. Is there any error output from RethinkDB before it hangs? It might be that you're hitting the maximum open file limit on OS X. Do you still see the hang if you run |
@mlucy still happenning, here is my
|
@mlucy I get on server logs:
I ran
Web interface does not respond. |
Thanks for the update. @danielmewes, any idea what might be going on here? |
Very odd. I wonder if the log messages are relevant. They might be a red herring, since the issue appears to only happen after a while and then goes away once @Ijmir when RethinkDB hangs, could you check two things for us and post the results here?
This data might help narrow down the possible causes of this. |
Hmm the same "Protocol wrong type" error appeared here #4327 (comment) for a different user. It might be unrelated. Still curious what it means. |
@danielmewes |
Thanks for the data and information @Ijmir ! Will try to reproduce. |
@Ijmir I think I'm not running the tests correctly. All I get when I run
It doesn't seem to run any RethinkDB tests. Any idea what I'm doing wrong? (I'm not that familiar with node) |
Oh never mind. Got it working by running
instead I think. |
On Linux I run into #4300 pretty quickly, so I'm going to test this on Mac next. Maybe we're just not catching the stack overflow properly on Mac. |
@danielmewes to run the test you have to execute
|
Thanks, I can reproduce it on OS X. |
The weird thing is that once RethinkDB gets into that state, htop shows that it spends ~50% CPU in kernel code. However according to dtrace there are virtually no syscalls going on. I also tried attaching instruments timing-based profiler and it didn't show anything interesting either. However I found that increasing the coroutine stack size avoids the issue, so I think the initial theory that this is the same as #4300 , except that something is broken about our coroutine stack overflow detection on OS X. |
This will probably go away with #4024 (comment) , though we also need to find out why the stack overflow protection isn't working on OS X. |
Ah, on OS X accessing a protected page can trigger SIGSEGV or SIGBUS. On Linux it always causes SIGSEGV. We are handling SIGSEGV but not SIGBUS. |
A fix for us not catching the stack overflow on OS X is now in code review 2954 by @AtnNn . @thaiat this fix will stop RethinkDB from hanging, but instead it is going to crash just like described by @neumino in #4300 . In the near future, we plan to implement a protection for such queries such that the query itself will fail, but the server won't crash (our current behavior is obviously bad). Sorry you had to run into this. |
I believe I'm still experiencing an issue related to this: neumino/thinky#323 |
@primitive-type That failure looks unrelated to this issue. The "too many open files" message usually means that there's still a configuration problem on the OS side. Can you make sure that your increased maxfiles settings are correctly applied to the user / shell that you run RethinkDB from? |
@danielmewes Sure, here is the result of
I closely followed the instructions at http://docs.basho.com/riak/latest/ops/tuning/open-files-limit/#Mac-OS-X RethinkDB was using about 25% CPU at the time of the crash. (Mac OS X 10.10.4) |
Recently running a Rethinkdb 2.2.5 cluster goes on my local machine into slow growth mode a lot. I am using Docker for Mac Version 1.12.1-rc1-beta23. I am not really using volumes or anything but still it happens. Any ideas? |
@ambodi It's likely a different issue from what this was about originally. Check out @neumino's comment here #4299 (comment) about what slow-growth mode means. I recommend checking the ulimits, specifically the open file limit by running |
Hi all,
We are in the process of writing a loopback connector for rethinkdb that uses the amazing
rethinkdbdash
The code is available here : https://github.com/yoobic/loopback-connector-rethinkdbdash
After running test (
npm test
) for about 15 to 20 times, rethinkdb will hang.I can see this message : 'Entering slow growth mode'
The only way to come back to normal is to execute:
I'm on mac, lots of RAM and disk space available.
I can reproduce on different laptop.
Using latest of rethinkdb and rethinkdbdash
The text was updated successfully, but these errors were encountered: