-
Notifications
You must be signed in to change notification settings - Fork 5.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test/run-rbd-unit-tests.sh: increase ip-local-port-range #48018
Conversation
jenkins test make check |
This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved |
https://jenkins.ceph.com/job/ceph-pull-requests/103655/ the output of nestat is way too large, so i pasted it as a gist: https://gist.github.com/tchaikov/b3858da3afcba00eb71394dffb5899d1 |
to understand the root cause better See-also: https://tracker.ceph.com/issues/57116 Signed-off-by: Kefu Chai <tchaikov@gmail.com>
in case we are running out of available ports while performing tests. See-also: https://tracker.ceph.com/issues/57116 Signed-off-by: Kefu Chai <tchaikov@gmail.com>
less repeatings this way. Signed-off-by: Kefu Chai <tchaikov@gmail.com>
@tchaikov made this Tracker to document your additional fix: https://tracker.ceph.com/issues/57491 Can you update the commit? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't understand the purpose of this change. https://gist.github.com/tchaikov/b3858da3afcba00eb71394dffb5899d1 shows over 28000 TIME_WAIT
sockets to 172.21.2.2:6810. I don't think unittest_librbd
is involved in that in any way -- I would expect it to generate ~20 TIME_WAIT
sockets to localhost -- so I don't see any reason to touch run-rbd-unit-tests.sh
.
And even if this wasn't done from run-rbd-unit-tests.sh
but in some other file, why are we attempting to increase ip_local_port_range
instead of tracking down what is responsible for tens of thousands of locally closed sockets?
i created this PR in the same spirit of #47962. why? because i think i understand the issue better, and want to get the "make check" completed sooner with less number of i admit that i didn't understand the root cause of "what is responsible for tens of thousands of locally closed sockets" back then, neither do i now. why did i created these PRs without understanding the root case? because i want to "workaround" this issue to alleviate the pain we are suffering in the past weeks. and because i don't have enough bandwidth working on this issue. if any of us had come up with a better fix i would not do these. i noticed that the #48014 was created claiming that PR was approved and backported. that change didn't come with a root cause analysis explaining why we had this issue. and it reverted some changes proposed by the author made in #47962. again, i admit that #47962 also failed to root cause the issue. that's why i didn't claim that i fixed the issue. i thought we reviewed the PRs with a consistent standard. the comment is confusing. anyway, if this change is not following our guideline or best practice. i am closing this change for good. |
hi Laura, thanks. i don't intend to continue working on this change or this issue anymore. so i reset the related fields in https://tracker.ceph.com/issues/57491. sorry for the noise. |
No worries, Kefu! |
Hi Kefu,
This wasn't noted anywere -- neither in the PR description nor in any of the commit messages. "want to get the "make check" completed sooner with less number of The other thing that confused me was that you added this increase just in
I didn't see a single occurrence of this issue on runs that included both #47962 and #48014 and therefore thought that the workaround was good enough and that the pain was gone. I didn't realize that you had another improvement to the workaround in mind.
The
I think our approach to |
Signed-off-by: Kefu Chai tchaikov@gmail.com
Contribution Guidelines
To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "pacific"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
Checklist
Show available Jenkins commands
jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test dashboard cephadm
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox
jenkins test windows