Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FLINK-930] Netty Initialization is sometimes very slow #19

Closed
wants to merge 2 commits into from

Conversation

uce
Copy link
Contributor

@uce uce commented Jun 15, 2014

This is FLINK-930.

The issue with the Netty warning was reported in netty/netty#2412 and has been fixed in 4.0.19.Final. I couldn't reproduce the reported problem (with and without the changes in this PR).

This PR contains the following changes:

  1. Update Netty version from 4.0.19.Final to 4.0.20.Final and
  2. Introduce ExecutionMode for TaskManagers, where in LOCAL mode the NetworkConnectionManager is not started (e.g. no ServerSocket binding etc.)

The 2. change speeds up the startup and shutdown time of the TaskManagers noticeably and solves/circumvents the reported problem with each test taking ~ 3 seconds for startup/shutdown.

@uce
Copy link
Contributor Author

uce commented Jun 16, 2014

Can someone please review and merge? Travis is running consistently out of time atm and this PR should "fix it". We can then further monitor the Netty ThreadLocalRandom warning.

try {
this.networkConnectionManager.shutdown();
} catch (IOException e) {
LOG.warn("NetworkConnectionManager did not shutdown properly.");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should log the exception as well here (I guess the error is unlikely and needs review by the user)

@uce
Copy link
Contributor Author

uce commented Jun 17, 2014

I agree with both your points and have addressed them. The thrown exceptions are rethrown as IOExceptions up to the TaskManager's shutdown method, where it is logged. I decided against changing the signature of the TaskManager's shutdown method to rethrow the Exception as I think this should go hand in hand with changes to the other component shutdowns as well.

@uce
Copy link
Contributor Author

uce commented Jun 17, 2014

Nice to see it green again: https://travis-ci.org/uce/incubator-flink/builds/27754618

@StephanEwen
Copy link
Contributor

Looks good to me. Will merge.

@asfgit asfgit closed this in a70de7e Jun 17, 2014
uce pushed a commit to uce/flink that referenced this pull request Jun 21, 2014
@uce uce deleted the FLINK-930 branch December 18, 2014 00:00
tweise pushed a commit to tweise/flink that referenced this pull request Oct 12, 2018
zhijiangW pushed a commit to zhijiangW/flink that referenced this pull request Jul 23, 2019
uce pushed a commit to uce/flink that referenced this pull request Aug 26, 2020
uce pushed a commit to uce/flink that referenced this pull request Aug 26, 2020
qinghui-xu pushed a commit to qinghui-xu/flink that referenced this pull request Nov 3, 2021
HuangZhenQiu pushed a commit to HuangZhenQiu/flink that referenced this pull request Sep 20, 2022
jnh5y pushed a commit to jnh5y/flink that referenced this pull request Dec 18, 2023
…or so that checkpoints can correctly fail

This closes apache#19.
XComp added a commit to XComp/flink that referenced this pull request Mar 22, 2024
morozov pushed a commit to morozov/flink that referenced this pull request Apr 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants