[FLINK-5218] [state backends] Eagerly close checkpoint streams on cancellation #2920

StephanEwen · 2016-12-01T18:41:30Z

When a task is canceled during a checkpoint operation, the operation needs to cancel fast.

This is a forward fix from version 1.1, where checkpoints could get stuck when the state output streams did not handle interruptions correctly (HDFS has that problem).

Most of this is already handled in version 1.2 via the CloseableRegistry.
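For readers unfamiliar with the pattern, here is a minimal sketch of how a closeable registry can make cancellation fast: every open stream is registered, and closing the registry closes all of them, so a blocked write fails right away instead of waiting for an interrupt to be honored. The class `SimpleCloseableRegistry` and its method names are hypothetical and only illustrate the idea; this is not Flink's CloseableRegistry code.

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch of a registry that eagerly closes everything registered with it. */
class SimpleCloseableRegistry implements Closeable {

    private final Set<Closeable> registered = new HashSet<>();
    private boolean closed;

    /** Registers a resource; if the registry is already closed, the resource is closed at once. */
    synchronized void register(Closeable c) throws IOException {
        if (closed) {
            c.close();
            throw new IOException("Registry is already closed");
        }
        registered.add(c);
    }

    /** Removes a resource that finished normally, so it is not closed on cancellation. */
    synchronized void unregister(Closeable c) {
        registered.remove(c);
    }

    /** Called on cancellation: closes every resource that is still registered. */
    @Override
    public void close() {
        List<Closeable> toClose;
        synchronized (this) {
            closed = true;
            toClose = new ArrayList<>(registered);
            registered.clear();
        }
        for (Closeable c : toClose) {
            try {
                c.close();
            } catch (IOException ignored) {
                // best effort: keep closing the remaining resources
            }
        }
    }
}
```

In this sketch the canceling thread calls `close()` on the registry, while the checkpointing thread registers each stream before writing and unregisters it after a successful close.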

This adds a test to validate this case is handled correctly and adds minor changes to make it work reliably, like:

fail fast on write() on closed checkpoint streams
fail fast on flush() on closed checkpoint streams
slight optimization to save a flag in the checkpoint streams
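The fail-fast behavior in the first two items can be sketched roughly as follows. This is an illustration under assumptions, not the code in this PR: `FailFastCheckpointStream` is a hypothetical name, and it buffers in memory rather than writing to a file system.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;

/** Hypothetical sketch: a stream whose write() and flush() fail once it has been closed. */
class FailFastCheckpointStream extends OutputStream {

    // Stand-in for the real target (for example a file system stream).
    private final ByteArrayOutputStream target = new ByteArrayOutputStream();

    // volatile so a close() from the canceling thread is visible to the writing thread
    private volatile boolean closed;

    @Override
    public void write(int b) throws IOException {
        ensureOpen();
        target.write(b);
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        ensureOpen();
        target.write(b, off, len);
    }

    @Override
    public void flush() throws IOException {
        ensureOpen();
        target.flush();
    }

    @Override
    public void close() {
        closed = true;
    }

    private void ensureOpen() throws IOException {
        if (closed) {
            throw new IOException("Stream is closed");
        }
    }
}
```

With this shape, a checkpoint thread that keeps writing after cancellation gets an IOException on its next write() or flush() instead of continuing to push data.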

StephanEwen · 2016-12-01T18:43:15Z

@StefanRRichter There is one change of semantics that would be good to get your input on: A checkpoint stream to which a byte[0] array was written is now actually empty and returns a null state handle in the same way as if nothing was ever written. Before this change, it would have created a state of zero bytes.
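To make the semantics change concrete, here is a small sketch under assumptions: `InMemoryCheckpointStream` and `closeAndGetHandle()` are hypothetical names, and a plain byte array stands in for the state handle.

```java
import java.io.ByteArrayOutputStream;

/** Hypothetical sketch: a stream that yields no handle when zero bytes were written. */
class InMemoryCheckpointStream {

    private final ByteArrayOutputStream buffer = new ByteArrayOutputStream();

    void write(byte[] data) {
        buffer.write(data, 0, data.length);
    }

    /** Returns the written bytes, or null if the stream holds zero bytes. */
    byte[] closeAndGetHandle() {
        // Writing byte[0] leaves the size at 0, so it is treated like "nothing written".
        return buffer.size() == 0 ? null : buffer.toByteArray();
    }
}
```

In this sketch, writing `new byte[0]` and never writing at all both produce null, whereas the behavior before the change would correspond to returning an empty array.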

StefanRRichter · 2016-12-02T09:33:30Z

I think the changed semantics actually make more sense. It should also be fine for all callers, as returning null to them was also previously possible and, IIRC, there should be no special meaning to an empty handle in 1.1.

StephanEwen · 2016-12-02T11:27:39Z

Thanks for the review, merging this...

StephanEwen · 2016-12-02T12:21:59Z

Manually merged in cc006ff

[FLINK-5218] [state backends] Add test that validates that Checkpoint…

e592c09

… Streams are eagerly closed on cancellation. This is important for some stream implementations (such as HDFS) that do not properly handle thread interruption.

StephanEwen closed this Dec 2, 2016

rmetzger added the component=Runtime/StateBackends label Mar 14, 2019
