Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FLINK-5476] Fail fast if trying to submit a job to a non-existing Fl… #3753

Closed
wants to merge 0 commits into from

Conversation

DmytroShkvyra
Copy link
Contributor

@DmytroShkvyra DmytroShkvyra commented Apr 21, 2017

In case of entering the wrong job manager address when submitting a job via flink run, the JobClientActor waits per default 60 s until a JobClientActorConnectionException, indicating that the JobManager is no longer reachable, is thrown. In order to fail fast in case of wrong connection information, we could change it such that it uses initially a much lower timeout and only increases the timeout if it had at least once successfully connected to a JobManager before.

  • General

    • The pull request references the related JIRA issue ("[FLINK-5476] Fail fast if trying to submit a job to a non-existing Flink cluster")
  • Documentation

  • Tests & Build

@tillrohrmann
Copy link
Contributor

The code does not seem to match the PR description @DmytroShkvyra. I don't see where you use different timeouts for the connection. The only thing which seems to be modified is that the JobManager now understands the GetClusterStatus message. Could you please clarify what this PR tries to achieve?

@DmytroShkvyra
Copy link
Contributor Author

DmytroShkvyra commented May 3, 2017

Hi @tillrohrmann,
The main idea of this PR is propagation of exception from StandaloneClusterClient.getClusterStatus, so if cluster non-existing it will throw RuntimeExeption with clear message and will not waiting for timeouts.
Is it clear?

@DmytroShkvyra
Copy link
Contributor Author

Hi @tillrohrmann could you review this PR

@tillrohrmann
Copy link
Contributor

Sorry for not coming back to you earlier. I will try to give it another round of review in the next days.

@DmytroShkvyra
Copy link
Contributor Author

Was closed occasionally.

@DmytroShkvyra DmytroShkvyra deleted the FLINK-5476 branch May 27, 2017 12:19
@DmytroShkvyra
Copy link
Contributor Author

New PR is #4001

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants