Sync Issue in PersistentQueue on I/O Exception? #21

ebarlas · 2010-08-04T16:05:51Z

Upon close inspection of the PersistentQueue class, it occurred to me that if an I/O Exception is raised at certain points, the in-memory queue may become out of sync with the journal. For example, this can occur in add if an I/O Exception occurs on journal.add after the item has been added to the in-memory queue. Similar behavior exists in remove. Is this an accurate reading of the code? If so, what is the reasoning behind it? Thanks.

robey · 2010-08-05T04:28:08Z

it looks like an i/o exception would bounce out to the handler and possibly disconnect the client. i guess we should catch exceptions when writing the journal, and kill the server if they happen, so that queues don't get into this weird state if the disk fills. does that sound okay?

ebarlas · 2010-08-05T14:52:53Z

Hmm, possibly. The best approach, I suppose, would be to rollback journal operations, but it seems to me that simply isn't possible with the current system. Another approach is to place I/O operations ahead of in-memory data structure operations to raise I/O exceptions before modifying the queue, transaction table, or other PersistentQueue data. That should keep the PersistentQueue in a consistent state. Yet another approach is to close and reopen the queue on I/O Exceptions, however this may seemingly result in a huge number of journal reads as the journals are replayed. Perhaps this is just something to be aware of and need not be addressed?

ebarlas · 2010-08-09T15:13:48Z

Thoughts?

robey · 2010-08-10T04:42:02Z

i think you're right that it shouldn't try to continue as if nothing happened.

i'm leaning toward catching i/o exceptions inside the journal code, and writing a fatal log message and calling system.exit. it would be an unambiguous signal that something has gone wrong with the machine, and i think if the machine is hosed, kestrel shouldn't try to paste over it.

ebarlas · 2010-08-11T16:31:26Z

Okay, that does seem reasonable. One problem is that it might adversely affect folks using Kestrel as a library since the proposed fix would shutdown the JVM.

robey closed this as completed Apr 10, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync Issue in PersistentQueue on I/O Exception? #21

Sync Issue in PersistentQueue on I/O Exception? #21

ebarlas commented Aug 4, 2010

robey commented Aug 5, 2010

ebarlas commented Aug 5, 2010

ebarlas commented Aug 9, 2010

robey commented Aug 10, 2010

ebarlas commented Aug 11, 2010

Sync Issue in PersistentQueue on I/O Exception? #21

Sync Issue in PersistentQueue on I/O Exception? #21

Comments

ebarlas commented Aug 4, 2010

robey commented Aug 5, 2010

ebarlas commented Aug 5, 2010

ebarlas commented Aug 9, 2010

robey commented Aug 10, 2010

ebarlas commented Aug 11, 2010