[FLINK-8643] [flip6] Use JobManagerOptions#SLOT_REQUEST_TIMEOUT in ExecutionGraph #5475

Merged
merged 14 commits into from Feb 15, 2018

tillrohrmann
Contributor

What is the purpose of the change

This commit changes the initialization of the ExecutionGraph to use the
JobManagerOptions#SLOT_REQUEST_TIMEOUT for the slot allocation. Furthermore,
it changes the behaviour of the SlotPool#ProviderAndOwner implementation such
that the timeout is given to it via the SlotProvider#allocateSlot call.

Brief change log

  • Don't start SlotPool#ProviderAndOwner with slot allocation timeout. Instead require it as a parameter of SlotProvider#allocateSlot
  • Pass in JobManagerOptions#SLOT_REQUEST_TIMEOUT to ExecutionGraph as allocation timeout
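
The change log above moves the allocation timeout from construction time to call time. A minimal sketch of that shape follows. All type names here (`SketchSlotProvider`, `RecordingPool`, `SketchExecutionGraph`) are hypothetical stand-ins, not Flink's actual `SlotProvider` or `SlotPool` signatures, and the 5-second value is an assumed placeholder for whatever `JobManagerOptions#SLOT_REQUEST_TIMEOUT` resolves to.

```java
import java.time.Duration;
import java.util.concurrent.CompletableFuture;

public class SlotTimeoutSketch {

    /** Hypothetical stand-in for a slot provider; not Flink's real interface. */
    interface SketchSlotProvider {
        // The timeout arrives with every call instead of being fixed in the constructor.
        CompletableFuture<String> allocateSlot(String requestId, Duration timeout);
    }

    /** Hypothetical pool that records the timeout it was handed and answers immediately. */
    static class RecordingPool implements SketchSlotProvider {
        Duration lastTimeout;

        @Override
        public CompletableFuture<String> allocateSlot(String requestId, Duration timeout) {
            lastTimeout = timeout;
            return CompletableFuture.completedFuture("slot-for-" + requestId);
        }
    }

    /** Hypothetical caller that owns the configured timeout and forwards it per request. */
    static class SketchExecutionGraph {
        private final SketchSlotProvider provider;
        private final Duration allocationTimeout;

        SketchExecutionGraph(SketchSlotProvider provider, Duration allocationTimeout) {
            this.provider = provider;
            this.allocationTimeout = allocationTimeout;
        }

        CompletableFuture<String> schedule(String taskName) {
            return provider.allocateSlot(taskName, allocationTimeout);
        }
    }

    public static void main(String[] args) {
        Duration configuredTimeout = Duration.ofSeconds(5); // assumed value, for illustration only
        RecordingPool pool = new RecordingPool();
        SketchExecutionGraph graph = new SketchExecutionGraph(pool, configuredTimeout);

        String slot = graph.schedule("map-1").join();
        System.out.println(slot + " requested with timeout " + pool.lastTimeout);
    }
}
```

With this shape the provider no longer needs to know the timeout when it is started, and the caller stays the single place where the configured value is read.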

Verifying this change

This change is already covered by existing tests, such as SlotPoolTest.

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): (no)
  • The public API, i.e., is any changed class annotated with @Public(Evolving): (no)
  • The serializers: (no)
  • The runtime per-record code paths (performance sensitive): (no)
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (no)
  • The S3 file system connector: (no)

Documentation

  • Does this pull request introduce a new feature? (no)
  • If yes, how is the feature documented? (not applicable)

@tillrohrmann
Contributor Author

Merging this PR.

tillrohrmann added a commit to tillrohrmann/flink that referenced this pull request Feb 14, 2018
…ecutionGraph

This commit changes the initialization of the ExecutionGraph to use the
JobManagerOptions#SLOT_REQUEST_TIMEOUT for the slot allocation. Furthermore,
it changes the behaviour of the SlotPool#ProviderAndOwner implementation such
that the timeout is given to it via the SlotProvider#allocateSlot call.

This closes apache#5475.

tillrohrmann and others added 14 commits February 15, 2018 08:44
Let all Yarn entry points use the YarnConfigOptions.APPLICATION_MASTER_PORT option
to specify the valid port range for the common RpcService.

This closes apache#5388.

…n result retrieval

Split RestClusterClient#submitJob into submitJob and requestJobResult which can
be called individually.

This closes apache#5428.
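
The split described in this commit message can be pictured with a small sketch. `SketchClient` is a hypothetical in-memory stand-in: the two method names come from the commit message, but their signatures, the `finishJob` hook, and the string results are assumptions made for illustration and do not reflect the real `RestClusterClient`.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;

public class SplitSubmitSketch {

    /** Hypothetical client; signatures are assumed, not taken from Flink. */
    static class SketchClient {
        private final Map<String, CompletableFuture<String>> results = new ConcurrentHashMap<>();

        // Step 1: submit only. Completes once the job is registered, not when it finishes.
        CompletableFuture<String> submitJob(String jobId) {
            results.put(jobId, new CompletableFuture<>());
            return CompletableFuture.completedFuture(jobId);
        }

        // Step 2: request the result separately; callers may do this later or not at all.
        CompletableFuture<String> requestJobResult(String jobId) {
            CompletableFuture<String> result = results.get(jobId);
            if (result == null) {
                CompletableFuture<String> failed = new CompletableFuture<>();
                failed.completeExceptionally(new IllegalArgumentException("unknown job " + jobId));
                return failed;
            }
            return result;
        }

        // Illustration-only hook that simulates the cluster finishing the job.
        void finishJob(String jobId, String result) {
            results.get(jobId).complete(result);
        }
    }

    public static void main(String[] args) {
        SketchClient client = new SketchClient();

        String jobId = client.submitJob("job-1").join();
        CompletableFuture<String> result = client.requestJobResult(jobId);

        client.finishJob(jobId, "FINISHED");
        System.out.println(jobId + " -> " + result.join());
    }
}
```

Because the two calls are independent, a caller that only needs the submission acknowledgement can stop after the first one.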

Register the JobTerminationHandler at the WebMonitorEndpoint to make it accessible to
all REST endpoints.

This closes apache#5429.

In order to support the job cancellation from the web UI, including when using
Yarn, we have to register the JobTerminationHandler under /jobs/:jobid/yarn-cancel
and /jobs/:jobid/yarn-stop. This is just a temporary fix until we can send
arbitrary REST verbs through the Yarn proxy.

This closes apache#5430.

The MiniDispatcher is responsible for submitting the single job with which
a job mode cluster is started. Once the job has completed and if the cluster
has been started in detached mode, the MiniDispatcher will terminate.

In order to reduce code duplication, the MiniDispatcher is a subclass of the
Dispatcher which is started with a single-job submitted job graph store.

This closes apache#5431.

This commit allows deploying detached job mode clusters via the
CliFrontend. In order to do that, it first extracts the JobGraph
from the PackagedProgram and then uses the ClusterDescriptor to
deploy the job mode cluster.

This closes apache#5432.

Upon notification of newly allocated containers, the YarnResourceManager
will only accept as many containers as there are pending container requests.
All excess containers will be returned.

This closes apache#5436.
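
The acceptance rule in this commit message (take only as many containers as there are pending requests, return the rest) can be sketched as follows. `SketchResourceManager` and `Decision` are hypothetical names, and containers are reduced to plain strings; this is not the `YarnResourceManager` implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ContainerAcceptanceSketch {

    /** Outcome of handling one allocation notification. */
    static class Decision {
        final List<String> accepted = new ArrayList<>();
        final List<String> returned = new ArrayList<>();
    }

    /** Hypothetical stand-in for the resource manager's request bookkeeping. */
    static class SketchResourceManager {
        private int pendingRequests;

        SketchResourceManager(int pendingRequests) {
            this.pendingRequests = pendingRequests;
        }

        Decision onContainersAllocated(List<String> containers) {
            Decision decision = new Decision();
            for (String container : containers) {
                if (pendingRequests > 0) {
                    // A pending request is satisfied by this container.
                    pendingRequests--;
                    decision.accepted.add(container);
                } else {
                    // Nothing is waiting for this container, so hand it back.
                    decision.returned.add(container);
                }
            }
            return decision;
        }

        int pendingRequests() {
            return pendingRequests;
        }
    }

    public static void main(String[] args) {
        SketchResourceManager rm = new SketchResourceManager(2);
        Decision decision = rm.onContainersAllocated(Arrays.asList("c1", "c2", "c3"));
        System.out.println("accepted=" + decision.accepted + " returned=" + decision.returned);
    }
}
```

Here two requests are pending and three containers arrive, so the third container ends up in the returned list.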

…luster

This commit makes the RestClusterClient aware of whether the user wishes to submit
a job in detached or non-detached mode. If it is detached, then the RestClusterClient
won't poll for the execution result.

This closes apache#5466.

Shutting AkkaRpcActors down with Kill can result in uncompleted futures if
the mailbox contains requests which will be dropped. Therefore, it is better
to stop AkkaRpcActors with a PoisonPill, which ensures that all messages before
the pill will be processed.

This closes apache#5476.
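
The reasoning in this commit message can be illustrated without Akka. The sketch below is a plain-Java analogy, not Akka code: `MailboxWorker`, `Request`, and `POISON_PILL` are hypothetical names, and a `BlockingQueue` plays the role of the actor mailbox. Because the pill is enqueued behind the pending requests, every request ahead of it is handled and its future completed before the worker stops.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.LinkedBlockingQueue;

public class PoisonPillSketch {

    /** A request carrying the future that its sender is waiting on. */
    static class Request {
        final String payload;
        final CompletableFuture<String> reply = new CompletableFuture<>();

        Request(String payload) {
            this.payload = payload;
        }
    }

    // Sentinel message: the worker stops when it reaches this entry in the queue.
    static final Request POISON_PILL = new Request("<poison-pill>");

    /** Hypothetical worker that drains its mailbox in order until it sees the pill. */
    static class MailboxWorker implements Runnable {
        final BlockingQueue<Request> mailbox = new LinkedBlockingQueue<>();

        @Override
        public void run() {
            try {
                while (true) {
                    Request next = mailbox.take();
                    if (next == POISON_PILL) {
                        return; // everything enqueued before the pill has been handled
                    }
                    next.reply.complete("handled " + next.payload);
                }
            } catch (InterruptedException e) {
                // An abrupt stop lands here and leaves queued replies uncompleted.
                Thread.currentThread().interrupt();
            }
        }
    }

    public static void main(String[] args) throws InterruptedException {
        MailboxWorker worker = new MailboxWorker();
        List<Request> sent = new ArrayList<>();
        for (int i = 0; i < 3; i++) {
            Request request = new Request("msg-" + i);
            sent.add(request);
            worker.mailbox.put(request);
        }
        worker.mailbox.put(POISON_PILL);

        Thread thread = new Thread(worker);
        thread.start();
        thread.join();

        for (Request request : sent) {
            System.out.println(request.reply.join());
        }
    }
}
```

Stopping the worker abruptly instead (the analogue of Kill) would leave any requests still in the queue with futures that never complete.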