-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1901208: Nodes goes into NotReady state (VMware) #466
Bug 1901208: Nodes goes into NotReady state (VMware) #466
Conversation
it allows the client to detect broken connections to the api server more quickly than the default TCP timeout of 15 minutes. note that it will be replaced by an HTTP ping mechanism in the future releases.
@p0lyn0mial: No Bugzilla bug is referenced in the title of this pull request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/assign @sttts |
/test integration |
/retest |
Both integration test ( |
/retest |
@@ -16,28 +15,18 @@ func dialerWithDefaultOptions() DialContext { | |||
nd := &net.Dialer{ | |||
// TCP_USER_TIMEOUT does affect the behaviour of connect() which is controlled by this field so we set it to the same value | |||
Timeout: 25 * time.Second, | |||
// KeepAlive must to be set to a negative value to stop std library from applying the default values | |||
// by doing so we ensure that the options we are interested in won't be overwritten | |||
KeepAlive: time.Duration(-1), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it gets overwritten here https://github.com/golang/go/blob/master/src/net/dial.go#L431
KeepAlive: time.Duration(-1), | ||
Control: func(network, address string, con syscall.RawConn) error { | ||
var err error | ||
err = con.Control(func(fd uintptr) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
now we are using the original file descriptor not a copy
the last commit requires performing manual tests - will do that on Monday |
/retest |
/test e2e-agnostic-cmd |
1 similar comment
/test e2e-agnostic-cmd |
2782777
to
a5615fa
Compare
@p0lyn0mial: No Bugzilla bug is referenced in the title of this pull request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@rphillips I did manual tests this morning and haven't found any issues (openshift/library-go#944 (comment)). Note that I tested it against my custom applications not kubelet. @rphillips feel free to retitle so that it points to the BZ you want. |
/assign @rphillips |
/retest |
1 similar comment
/retest |
/lgtm |
@p0lyn0mial: This pull request references Bugzilla bug 1873114, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@nee1esh: This pull request references Bugzilla bug 1901208, which is valid. The bug has been updated to refer to the pull request using the external bug tracker. 6 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/retest |
2 similar comments
/retest |
/retest |
/retest Please review the full test history for this PR and help us cut down flakes. |
8 similar comments
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest |
1 similar comment
/retest |
/retest Please review the full test history for this PR and help us cut down flakes. |
6 similar comments
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@p0lyn0mial: All pull requests linked via external trackers have merged: Bugzilla bug 1901208 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-4.5 |
@cuppett: cannot checkout release-4.5: error checking out release-4.5: exit status 1. output: error: pathspec 'release-4.5' did not match any file(s) known to git In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@cuppett for 4.5 you have to bring the change into openshift/origin. |
we will carry a patch that allows the clients (Kubelet, KCM, etc.) to detect broken connections to the api server more quickly than the default TCP timeout of 15 minutes.
note: this patch will be dropped in future releases once http2/ping is introduced.
the first commit uses openshift/library-go#937
the second commit was created as follows:
manually changed go.mod to pin to github.com/openshift/library-go v0.0.0-20201123125610-83d6d67a1e98
followed by./hack/update-vendor.sh