Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

USHIFT-2286: Use timeout command for interrupting failed VM installations #3088

Merged
merged 1 commit into from
Mar 2, 2024

Conversation

ggiguash
Copy link
Contributor

@ggiguash ggiguash commented Feb 29, 2024

The fix comes to work around stuck VM installations when the ostree payload cannot be pulled.
The --wait <time> option of virt-install does not terminate the tclsh process used by the unbuffer command.

When timeout is used, all the process chain of unbuffer->sudo->virt-install is terminated.

Note: this problem is reproducing itself on ARM platform (not on x86) as noted in the JIRA issue comments.

@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Feb 29, 2024
@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 29, 2024

@ggiguash: This pull request references USHIFT-2286 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the bug to target the "4.16.0" version, but no target version was set.

In response to this:

The fix comes to work around stuck VM installations when the ostree payload cannot be pulled.
The --wait <time> option of virt-install does not terminate the tclsh process due to the usage of unbuffer command.

When timeout is used, all the process chain of unbuffer->sudo->virt-install is terminated.

Note: this problem is reproducing itself on ARM platform (not on x86) as noted in the JIRA issue comments.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 29, 2024
@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 29, 2024

@ggiguash: This pull request references USHIFT-2286 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the bug to target the "4.16.0" version, but no target version was set.

In response to this:

The fix comes to work around stuck VM installations when the ostree payload cannot be pulled.
The --wait <time> option of virt-install does not terminate the tclsh process used by the unbuffer command.

When timeout is used, all the process chain of unbuffer->sudo->virt-install is terminated.

Note: this problem is reproducing itself on ARM platform (not on x86) as noted in the JIRA issue comments.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Copy link
Contributor

@dhellmann dhellmann left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

/hold

This looks good, but I don't want to merge a CI change on Friday evening. Feel free to remove the hold when you're back online next week.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 1, 2024
@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Mar 1, 2024
Copy link
Contributor

openshift-ci bot commented Mar 1, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dhellmann, ggiguash

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ggiguash
Copy link
Contributor Author

ggiguash commented Mar 2, 2024

/unhold
/cherry-pick release-4.15

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 2, 2024
@openshift-cherrypick-robot

@ggiguash: once the present PR merges, I will cherry-pick it on top of release-4.15 in a new PR and assign it to you.

In response to this:

/unhold
/cherry-pick release-4.15

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Copy link
Contributor

openshift-ci bot commented Mar 2, 2024

@ggiguash: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-bot openshift-merge-bot bot merged commit e20cf8c into openshift:main Mar 2, 2024
8 checks passed
@openshift-cherrypick-robot

@ggiguash: #3088 failed to apply on top of branch "release-4.15":

Applying: Use timeout command for interrupting VM installations
Using index info to reconstruct a base tree...
M	test/bin/scenario.sh
Falling back to patching base and 3-way merge...
Auto-merging test/bin/scenario.sh
CONFLICT (content): Merge conflict in test/bin/scenario.sh
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0001 Use timeout command for interrupting VM installations
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/unhold
/cherry-pick release-4.15

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants