[FLINK-26600][tests] Wait with savepoint until job is running #20162

Merged
zentol merged 1 commit into apache:master from the 26600 branch
Jul 6, 2022

Conversation

zentol
Contributor

@zentol zentol commented Jul 5, 2022

It only makes sense to trigger savepoints when the job is actually RUNNING. Triggering a savepoint while the job is in the CREATED state is tolerated in the default scheduler (even though the checkpoint will immediately fail), but the adaptive scheduler rejects it eagerly.

Ideally we'd align the behavior in a follow-up.

Ideally ideally the clients would take care of that, but there are quite a few of them and so far they are pretty dumb in that they just forward the request as is. This is possible to change, but it's a larger effort.
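
For illustration, the "wait until RUNNING, then trigger the savepoint" idea can be sketched as a plain polling loop. This is a minimal, self-contained sketch and not the code from this PR: the `Status` enum, `waitUntilRunning`, and `simulatedJob` are hypothetical names, and the simulated job stands in for whatever status query the test's client exposes. The 50 ms interval matches the polling interval mentioned in the review.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

public class WaitForRunningSketch {

    /** Simplified stand-in for a job's lifecycle state (hypothetical, not Flink's own type). */
    enum Status { CREATED, RUNNING }

    /**
     * Polls the supplier until it reports RUNNING, sleeping between checks.
     * There is deliberately no timeout: the method only returns once the
     * condition holds. Returns the number of status checks performed.
     */
    static int waitUntilRunning(Supplier<Status> statusSupplier, long pollIntervalMillis) {
        int polls = 1;
        while (statusSupplier.get() != Status.RUNNING) {
            try {
                Thread.sleep(pollIntervalMillis);
            } catch (InterruptedException e) {
                // Restore the interrupt flag and give up waiting.
                Thread.currentThread().interrupt();
                throw new IllegalStateException("Interrupted while waiting for RUNNING", e);
            }
            polls++;
        }
        return polls;
    }

    /** Simulated job: reports CREATED for the first createdPolls queries, RUNNING afterwards. */
    static Supplier<Status> simulatedJob(int createdPolls) {
        AtomicInteger queries = new AtomicInteger();
        return () -> queries.getAndIncrement() < createdPolls ? Status.CREATED : Status.RUNNING;
    }

    public static void main(String[] args) {
        Supplier<Status> job = simulatedJob(3);
        int polls = waitUntilRunning(job, 50L);
        // Only at this point would a test go on to trigger the savepoint.
        System.out.println("RUNNING after " + polls + " status checks");
    }
}
```

Between checks the waiting thread is asleep, so the loop costs one status query per interval rather than a busy spin.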

@zentol zentol requested a review from alpinegizmo July 5, 2022 08:13
@flinkbot
Collaborator

flinkbot commented Jul 5, 2022

CI report:

Bot commands
The @flinkbot bot supports the following commands:
  • @flinkbot run azure: re-run the last Azure build

Contributor

@MartijnVisser MartijnVisser left a comment

Is this time long enough to wait? Otherwise looks good to me

@zentol
Contributor Author

zentol commented Jul 6, 2022

WDYM? It waits indefinitely until the condition is fulfilled.

@MartijnVisser
Contributor

> WDYM? It waits indefinitely until the condition is fulfilled.

I was more worried about it waiting/validating every 50 milliseconds with impact on resource usage, but if it's fine, let's get this merged :)

@zentol zentol merged commit b42c70d into apache:master Jul 6, 2022
@zentol zentol deleted the 26600 branch July 6, 2022 09:40
@zentol
Contributor Author

zentol commented Jul 6, 2022

I see; I don't think we need to worry about that; 50ms is a surprisingly long time.

3 participants