New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OADP-1801: Include restic and velero binaries in must-gather image, and gather various pieces of version information. #994
Conversation
Signed-off-by: Matthew Arnold <marnold@redhat.com>
@mrnold: This pull request references OADP-1801 which is a valid jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Skipping CI for Draft Pull Request. |
@mrnold: This pull request references OADP-1801 which is a valid jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
The build failed with "No space left on device", so I added some cleanup steps as an avoidance. |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
Signed-off-by: Matthew Arnold <marnold@redhat.com>
@mrnold: This pull request references OADP-1801 which is a valid jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
Since we're adding version collections, we should add velero deployments too? that's one of the parameters that can break which is more than one velero versions installed. So oc get deployment -A | grep velero |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
Signed-off-by: Matthew Arnold <marnold@redhat.com>
I added this and the restic client version too, do you specifically want this to "oc exec" to a particular pod to get the velero version from there? Or is the one I have built-in to the must-gather image good enough? |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
Signed-off-by: Matthew Arnold <marnold@redhat.com>
That is probably good enough.. we just want to know the server version. I doubt that breaks.. could fall back to oc exec on error tho. |
DPAs are already logged by gather_crs. Signed-off-by: Matthew Arnold <marnold@redhat.com>
@mrnold: This pull request references OADP-1801 which is a valid jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
if [ -z "${storageclasses}" -o "${storageclasses}" == " " ]; then | ||
log_command "StorageClass" "echo No StorageClasses found in cluster" | ||
else | ||
log_command "StorageClasses" "oc get storageclasses -o yaml" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
great!
else | ||
while read dpa namespace; do | ||
log_command "DataProtectionApplication ${namespace}/${dpa}" "oc get dpa -n ${namespace} ${dpa} -o yaml" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
woot!
Signed-off-by: Matthew Arnold <marnold@redhat.com>
Signed-off-by: Matthew Arnold <marnold@redhat.com>
@@ -0,0 +1,8 @@ | |||
#!/bin/bash | |||
|
|||
for pod in $(oc get pods -n openshift-adp -o jsonpath='{.items[?(.status.containerStatuses[0].lastState.terminated.reason=="Error")].metadata.name}'); do |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need insecure_tls here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
for pod in $(oc get pods -n openshift-adp -o jsonpath='{.items[?(.status.containerStatuses[0].lastState.terminated.reason=="Error")].metadata.name}'); do | |
for pod in $(oc get pods -n openshift-adp -o jsonpath='{.items[?(.status.containerStatuses[0].lastState.terminated.reason=="Error")].metadata.name}' --insecure-skip-tls-verify=${skip_tls}); do |
if [ -z "${storageclasses}" -o "${storageclasses}" == " " ]; then | ||
log_command "StorageClass" "echo No StorageClasses found in cluster" | ||
else | ||
log_command "StorageClasses" "oc get storageclasses -o yaml" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
insecure_tls?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
log_command "StorageClasses" "oc get storageclasses -o yaml" | |
log_command "StorageClasses" "oc get storageclasses -o yaml --insecure-skip-tls-verify=${skip_tls}" |
log_command "DataProtectionApplication CRs" "echo No DPAs found in cluster" | ||
else | ||
while read dpa namespace; do | ||
log_command "DataProtectionApplication ${namespace}/${dpa}" "oc get dpa -n ${namespace} ${dpa} -o yaml" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
insecure tls?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
log_command "DataProtectionApplication ${namespace}/${dpa}" "oc get dpa -n ${namespace} ${dpa} -o yaml" | |
log_command "DataProtectionApplication ${namespace}/${dpa}" "oc get dpa -n ${namespace} ${dpa} -o yaml --insecure-skip-tls-verify=${skip_tls}" |
/hold |
Signed-off-by: Matthew Arnold <marnold@redhat.com>
for pod in $(oc get pods --insecure-skip-tls-verify=${skip_tls} -n openshift-adp -o jsonpath='{.items[?(.status.containerStatuses[0].lastState.terminated.reason=="Error")].metadata.name}'); do | ||
echo "***" | ||
echo "* Last logs from failed pod openshift-adp/${pod}:" | ||
oc logs -n openshift-adp $pod --tail=10 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do you think we need insecure here too?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
At some point probably all of this must-gather should be reviewed for this, there are a bunch of places that don't look at skip_tls at all.
Signed-off-by: Matthew Arnold <marnold@redhat.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/unhold
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/unhold
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: kaovilai, mrnold, weshayutin The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
1 similar comment
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: kaovilai, mrnold, weshayutin The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test 4.11-operator-e2e-gcp |
@mrnold: all tests passed! Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/cherry-pick oadp-1.2 |
@mrnold: new pull request created: #1004 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Addresses OADP-1801, download and build restic and velero binaries and copy them to the must-gather image, and save various pieces of version information.
A backport should change the must-gather Dockerfile branch pointers for restic and velero to (for example) oadp-1.2.