
[FLINK-6719] [docs] Add details about fault-tolerance of timers to ProcessFunction docs #5887

Closed

bowenli86 wants to merge 2 commits into apache:master from bowenli86:FLINK-6719

Conversation

@bowenli86
Member

What is the purpose of the change

The fault-tolerance of timers is a frequently asked question on the mailing lists. We should add details about the topic to the ProcessFunction docs.

Brief change log

Added details about the topic in the ProcessFunction docs.

Verifying this change

This change is a trivial rework / code cleanup without any test coverage.

Does this pull request potentially affect one of the following parts:

none

Documentation

none

@rice668

rice668 commented Apr 22, 2018

Looks good @bowenli86

@bowenli86 bowenli86 changed the title [FLINK-6719] Add details about fault-tolerance of timers to ProcessFunction docs [FLINK-6719] [docs] Add details about fault-tolerance of timers to ProcessFunction docs Apr 23, 2018
Contributor

@fhueske fhueske left a comment


Thanks for extending and improving the documentation about timers @bowenli86!

I've made a few comments and suggestions.
Best, Fabian

### Timer Coalescing
### Optimizations - Timer Coalescing

Every timer registered at the `TimerService` via `registerEventTimeTimer()` or
Contributor

@fhueske fhueske Apr 24, 2018


Move the first paragraph under the ## Timer section

Contributor


Also it would be great if you could find a good spot to add a note that calls to processElement() and onTimer() are always synchronized, i.e., users do not have to worry about concurrent modification of state.
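To make that note concrete, here is a minimal sketch (the class and state names are made up for illustration) of a keyed `ProcessFunction` that touches the same state from both callbacks without any explicit locking, assuming the stream is keyed and event-time timestamps/watermarks are assigned upstream:

```java
import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.configuration.Configuration;
import org.apache.flink.streaming.api.functions.ProcessFunction;
import org.apache.flink.util.Collector;

// Hypothetical example: counts elements per key and emits the count one
// minute (event time) after an element arrived.
public class CountThenEmit extends ProcessFunction<String, String> {

    private transient ValueState<Long> count;

    @Override
    public void open(Configuration parameters) {
        count = getRuntimeContext().getState(
                new ValueStateDescriptor<>("count", Long.class));
    }

    @Override
    public void processElement(String value, Context ctx, Collector<String> out) throws Exception {
        Long current = count.value();
        count.update(current == null ? 1L : current + 1);
        // Register a timer 60 seconds (event time) after this element's timestamp.
        ctx.timerService().registerEventTimeTimer(ctx.timestamp() + 60_000L);
    }

    @Override
    public void onTimer(long timestamp, OnTimerContext ctx, Collector<String> out) throws Exception {
        // No locking needed: Flink never calls onTimer() concurrently with
        // processElement() for the same key, so reading and clearing `count`
        // here is safe.
        out.collect("count at " + timestamp + ": " + count.value());
        count.clear();
    }
}
```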


### Fault Tolerance
Contributor


Move the ### Fault Tolerance section above the ### Optimizations section

Timers registered within `ProcessFunction` are fault tolerant.

Timers registered within `ProcessFunction` will be checkpointed by Flink. Upon restoring, timers that are checkpointed
from the previous job will be restored on whatever new instance is responsible for that key.
Contributor


Add a note that timers are synchronously checkpointed (regardless of the configuration of the state backend). Hence, a large number of timers can significantly increase checkpointing time. See optimizations section for advice to reduce the number of timers.
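The optimization the comment refers to is timer coalescing: rounding the target timestamp down so that at most one timer per key and resolution unit is registered. A rough sketch of the idea, assuming a hypothetical 60-second timeout inside `processElement()`:

```java
// Inside processElement() of a keyed ProcessFunction.
// `timeoutMs` is a hypothetical delay chosen for illustration.
long timeoutMs = 60_000L;
long target = ctx.timerService().currentProcessingTime() + timeoutMs;

// Round down to full seconds: timers registered for the same key and the
// same (rounded) timestamp are deduplicated, so fewer timers are kept in
// state and checkpointed.
long coalescedTime = (target / 1000) * 1000;
ctx.timerService().registerProcessingTimeTimer(coalescedTime);
```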


For processing time timers, note that the firing time of a timer is an absolute value of when to fire.

What this means is that if a checkpointed timer’s firing processing timestamp is t (which is basically the registering
Contributor


(which is basically the registering time + configured trigger time)

This is often the case, but not necessarily true. Esp. for processing time, the timer can also be set to something completely different. I'd remove this to avoid confusion.

For processing time timers, note that the firing time of a timer is an absolute value of when to fire.

What this means is that if a checkpointed timer’s firing processing timestamp is t (which is basically the registering
time + configured trigger time), then it will also fire at processing timestamp t on the new instance. Therefore, you
Contributor


What do you mean by new instance? Are you discussing the scenario when a task is recovered on a different machine? I don't think we need to mention this. It should be quite clear that clock synchronization is an issue in processing time.

The info that a pt-timer fires on restore if its time passed while the job was down is important. Also mention that this holds for savepoints, which is even more critical because more time may pass between taking a savepoint and restoring from it.
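As a hedged illustration of what "absolute firing time" implies for processing-time timers (the 60-second delay is arbitrary):

```java
// The argument to registerProcessingTimeTimer() is an absolute wall-clock
// timestamp, not a relative delay.
long fireAt = ctx.timerService().currentProcessingTime() + 60_000L;
ctx.timerService().registerProcessingTimeTimer(fireAt);

// If the job is checkpointed (or savepointed) and only restored after
// `fireAt` has already passed, the restored timer fires immediately on
// the recovered task.
```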


#### Event Time Timers

For event time timers, given that Flink does not checkpoint watermarks, a restored event time timer will fire when the
Contributor


The fact that Flink doesn't checkpoint watermarks is not really related to and does not affect the behavior of timers. It is useful information but I don't think we need to mention it here.

It's sufficient to mention that et-timers fire when the wm passes them.
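For comparison, a small event-time sketch (assuming timestamps and watermarks are assigned upstream; the 10-second offset is arbitrary):

```java
// Inside processElement(): register an event-time timer relative to the
// element's timestamp.
ctx.timerService().registerEventTimeTimer(ctx.timestamp() + 10_000L);

// onTimer() is invoked once the watermark passes the registered timestamp.
// After a restore, the timer simply fires when the incoming watermark
// catches up; no watermark needs to be checkpointed for this to work.
```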

@bowenli86
Member Author

@fhueske updated! let me know how it looks now


Timers registered within `ProcessFunction` are fault tolerant. They are synchronously checkpointed by Flink, regardless of
configurations of state backends. (Therefore, a large number of timers can significantly increase checkpointing time. See optimizations
section for advice to reduce the number of timers.)
Contributor


See the optimizations section for advice on how to reduce the number of timers.

@fhueske
Contributor

fhueske commented May 2, 2018

Thanks for the update @bowenli86.

I'll merge the PR later.

