New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mgr/orchestrator: use deepcopy for copying exceptions #32881
Conversation
@liewegas that's a different issue. i update the PR to address it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Doesn't work. See my comment to https://tracker.ceph.com/issues/43913
see also https://tracker.ceph.com/issues/43913 ? |
i think you are talking about a different issue which is not directly related to my fix. |
@liewegas i replied in the tracker. |
mgr.x was not able establish an ssh connection to smithi042. see http://pulpito.ceph.com/kchai-2020-01-31_03:20:11-rados-wip-sage-testing-2020-01-29-1034-distro-basic-smithi/ |
so one cannot jump to another test node using "root" as "ubuntu".
one way to fix this is to push the pub key to the host to be added by "ceph orchestrator host add". |
So, this seems to be the root cause for the |
yes. the default user of cephadm is root. and actually |
abb64ea
to
988fc9a
Compare
My impression is that the deepcopy change doesn't actually fix anything (mainly because this code is executed only in the error case). What about extracting the other two commits into a new PR and then run this through QA? |
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the failed test has been removed in the latest changeset.
yay!
@sebastian-philipp updated the commit message. |
jenkins test make check |
since rexec module has been removed in python3, we cannot use it anymore. Fixes: https://tracker.ceph.com/issues/43657 Signed-off-by: Kefu Chai <kchai@redhat.com>
this test will end with a failure like ``` 2020-01-30T18:15:15.870 INFO:tasks.ceph.mgr.x.smithi042.stderr:Warning: Permanently added 'smithi042.front.sepia.ceph.com,172.21.15.42' (ECDSA) to the list of known hosts. 2020-01-30T18:15:15.925 INFO:tasks.ceph.mgr.x.smithi042.stderr:Permission denied, please try again. 2020-01-30T18:15:15.932 INFO:tasks.ceph.mgr.x.smithi042.stderr:Permission denied, please try again. 2020-01-30T18:15:15.939 INFO:tasks.ceph.mgr.x.smithi042.stderr:root@smithi042.front.sepia.ceph.com: Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password). ``` because mgr is not able to establish an ssh connection to that host with "root". please note, the teuthology worker is acting using the "ubuntu" account on the test node, and by default, "root" does not have its pubkey. and actually `qa/tasks/cephadm.py` does push the pubkey to all the managed hosts before testing cephadm. since `qa/tasks/cephadm.py` is a better test for cephadm, let's just drop this one. as suites/rados/cephadm already covers cephadm Signed-off-by: Kefu Chai <kchai@redhat.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
mgr/orchestrator: use deepcopy for copying exceptions
since rexec module has been removed in python3, we cannot use it anymore.
Fixes: https://tracker.ceph.com/issues/43657
Signed-off-by: Kefu Chai kchai@redhat.com
Checklist
Show available Jenkins commands
jenkins retest this please
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test dashboard backend
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox