CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] #153

angrycub · 2015-04-30T14:33:55Z

Re: Zendesk Ticket #10730

After a restart of a node, the riak vnodes wouldn't start because the CURRENT file was pointing to an non-existent version of the MANIFEST. How did these get out of sync?

engelsanchez · 2015-04-30T15:13:40Z

It is possible that we are seeing the problem described in the All filesystems are not created equal paper. Basically, doing a sync on data files and atomic renames is not enough to ensure consistency of the set of files in the directory. The directory entry itself may need a sync in Linux systems to ensure that the file operations are executed in the expected order and the new file mods survive a crash.

matthewvon · 2015-06-01T16:33:28Z

@angrycub ... define "restart of a node" ... riak restart, linux restart, machine died and was restarted, etc.

angrycub · 2015-06-01T17:10:37Z

From the ticket narrative:

"At the same time this node was discovered down, two other nodes were being reported by ring-status as being down, but their beam processes were still running. Since the cluster already believed them unavailable, I killed the beam process on each of these nodes and restarted riak successfully. Currently all but one of the cluster members are up."

So I'm going to say, killed after being in an "indeterminate" state and then Riak was restarted. No indication that the physical nodes were rebooted; however, I can contact the user in question for more details if you'd like.

matthewvon · 2015-06-01T17:24:41Z

let me ponder this ... however, a vnode repair would fix. You already do that?

angrycub · 2015-06-01T17:32:16Z

That was how we addressed that particular ticket. Dumb shell script to look where the filename in the CURRENT file did not exist in that partitions folder and echo out the partition IDs. Ran eleveldb:repair on all of them and all was well.

Basho-JIRA changed the title ~~CURRENT file pointing to missing MANIFEST file.~~ CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] Apr 30, 2015

Basho-JIRA added the JIRA: To Do label Apr 30, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] #153

CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] #153

angrycub commented Apr 30, 2015

engelsanchez commented Apr 30, 2015

matthewvon commented Jun 1, 2015

angrycub commented Jun 1, 2015

matthewvon commented Jun 1, 2015

angrycub commented Jun 1, 2015

CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] #153

CURRENT file pointing to missing MANIFEST file. [JIRA: RIAK-1789] #153

Comments

angrycub commented Apr 30, 2015

engelsanchez commented Apr 30, 2015

matthewvon commented Jun 1, 2015

angrycub commented Jun 1, 2015

matthewvon commented Jun 1, 2015

angrycub commented Jun 1, 2015