New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
changefeed.cc:1799 Guarantee failed: [num_subs == 0] #5708
Comments
Oh, and there is also a small one-liner in
too cryptic for me, but maybe it's relevant |
Ouch. Thanks for the report. Pinging @mlucy. |
@mlucy Looking at the code, I don't understand how we can get to that guarantee, since there's an identical guarantee just before we start destructing |
(also I'd like to still sneak this into 2.3.1 if possible) |
My theory about the bug turned out to be wrong, and we currently don't have an explanation for this. @haraldschilly Did rethinkdb leave a core file behind by any chance? |
sorry, we don't have one, but I've enabled it now... |
ok well, apport might have collected something despite core dumps not being enabled via those process limits. I'll send you the file via email. |
We hit this bug almost any time a server is restarted. Now that the apport stuff is enabled, it's even worse, since that spins at 100% forever, instead of rethinkdb getting restarted. This is really so bad that it makes using rethinkdb as a cluster worse regarding "single point of failure" than it would be to just use a single big rethinkdb node... How can we use automatic failover, if failure of one node (even on purpose via "service rethinkdb stop") leads to other nodes segfaulting and exiting? Sorry, not happy right now. |
@williamstein @haraldschilly Very sorry. :-( Did you get a core file from apport? |
Unfortunately, one of our rethinkdb servers in our 6 node cluster crashed. I didn't found a similar error report here, so please excuse a possible duplicate. (github search isn't good)
running on linux, 2.3.0~0wily, ...
The text was updated successfully, but these errors were encountered: