emergency_repair="force" option #4720

danielmewes · 2015-08-18T23:34:58Z

We've had some situations where enough servers were up, but the configuration didn't propagate properly and thus the cluster got stuck.

Our current emergency_repair options only work if servers are actually disconnected. We should strongly consider adding some sort of emergency_repair="force" mode. I'm not sure if this could actually be an option to reconfigure, and we have to think about how and whether it should actually change the configuration.
Even if it just kept the current configuration, it could be useful if it forced a new epoch for the table configuration.

The text was updated successfully, but these errors were encountered:

VeXocide · 2015-08-31T18:24:19Z

In discussing this offline with @danielmewes we came to the conclusion that emergency_repair="force" may do more harm than good if it could overwrite the existing configuration with a completely new one. Instead we're going to add an emergency_repair mode that recommits the existing configuration with a new epoch.

Given that this is to repair problems that shouldn't normally occur we've decided to name it emergency_repair="_debug_recommit".

VeXocide · 2015-09-01T18:40:13Z

In CR 3199 by @danielmewes.

VeXocide · 2015-09-01T23:20:00Z

Merged into next via commit 072575e.

VeXocide · 2015-09-03T18:02:17Z

Merged into v2.1.x via commit ddf6c84 as well.

danielmewes added the cp:clustering label Aug 18, 2015

danielmewes added this to the 2.2 milestone Aug 18, 2015

danielmewes assigned VeXocide Aug 31, 2015

VeXocide closed this as completed Sep 1, 2015

danielmewes modified the milestones: 2.1.x, 2.2 Sep 3, 2015

danielmewes modified the milestones: 2.1.x, 2.1.3 Sep 4, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

emergency_repair="force" option #4720

emergency_repair="force" option #4720

danielmewes commented Aug 18, 2015

VeXocide commented Aug 31, 2015

VeXocide commented Sep 1, 2015

VeXocide commented Sep 1, 2015

VeXocide commented Sep 3, 2015

emergency_repair="force" option #4720

emergency_repair="force" option #4720

Comments

danielmewes commented Aug 18, 2015

VeXocide commented Aug 31, 2015

VeXocide commented Sep 1, 2015

VeXocide commented Sep 1, 2015

VeXocide commented Sep 3, 2015