Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

kvm: ITCO watchdog added #8282

Merged
merged 3 commits into from
Feb 12, 2024
Merged

kvm: ITCO watchdog added #8282

merged 3 commits into from
Feb 12, 2024

Conversation

DaanHoogland
Copy link
Contributor

Description

This PR, ...

Fixes: #8280

Types of changes

  • Breaking change (fix or feature that would cause existing functionality to change)
  • New feature (non-breaking change which adds functionality)
  • Bug fix (non-breaking change which fixes an issue)
  • Enhancement (improves an existing feature and functionality)
  • Cleanup (Code refactoring and cleanup, that may add test cases)
  • build/CI

Feature/Enhancement Scale or Bug Severity

Feature/Enhancement Scale

  • Major
  • Minor

Bug Severity

  • BLOCKER
  • Critical
  • Major
  • Minor
  • Trivial

Screenshots (if appropriate):

How Has This Been Tested?

How did you try to break this feature and the system with this change?

@DaanHoogland
Copy link
Contributor Author

come to think of it, this is probably not the only thing needed. If anyone knows... ?

@weizhouapache
Copy link
Member

come to think of it, this is probably not the only thing needed. If anyone knows... ?

@DaanHoogland
create a RHEL 9 env and test it ?

@blueorangutan
Copy link

@DaanHoogland [SL] unsupported parameters provided. Supported mgmt server os are: centos7, centos6, suse15, alma8, ubuntu18, ubuntu22, ubuntu20, rocky8, alma9. Supported hypervisors are: kvm-centos6, kvm-centos7, kvm-rocky8, kvm-alma8, kvm-alma9, kvm-ubuntu18, kvm-ubuntu20, kvm-ubuntu22, kvm-suse15, vmware-55u3, vmware-60u2, vmware-65u2, vmware-67u3, vmware-70u1, vmware-70u2, vmware-70u3, vmware-80, vmware-80u1, xenserver-65sp1, xenserver-71, xenserver-74, xcpng74, xcpng76, xcpng80, xcpng81, xcpng82

@DaanHoogland
Copy link
Contributor Author

$%#
@blueorangutan test alma9 kvm-alma9 keepEnv

Copy link
Member

@weizhouapache weizhouapache left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

code lgtm

need testing

@weizhouapache
Copy link
Member

Assigned to myself. I will test it @DaanHoogland @shwstppr

@weizhouapache
Copy link
Member

tried to configure watchdog on kvm host, but no success.

anyway, I think this is good to go. According to libvirt, itco is a valid model for watchdog.
https://libvirt.org/formatdomain.html#watchdog-devices

@DaanHoogland
maybe we could add new action inject-nmi as well.

cc @shwstppr

@DaanHoogland
Copy link
Contributor Author

@DaanHoogland maybe we could add new action inject-nmi as well.

seems like a new PR, would you suggest a API/UI enhancement for that? (creating #8368)

@weizhouapache
Copy link
Member

weizhouapache commented Dec 18, 2023

@DaanHoogland maybe we could add new action inject-nmi as well.

seems like a new PR, would you suggest a API/UI enhancement for that? (creating #8368)

no, just a new option for action (which is set by vm.watchdog.action in agent.properties).
you can update this PR to support it as well.

Copy link

codecov bot commented Jan 17, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (cb2b6ac) 13.11% compared to head (35e3b11) 13.16%.
Report is 27 commits behind head on 4.18.

Additional details and impacted files
@@             Coverage Diff              @@
##               4.18    #8282      +/-   ##
============================================
+ Coverage     13.11%   13.16%   +0.04%     
- Complexity     9133     9201      +68     
============================================
  Files          2720     2724       +4     
  Lines        257659   258087     +428     
  Branches      40171    40223      +52     
============================================
+ Hits          33802    33984     +182     
- Misses       219566   219797     +231     
- Partials       4291     4306      +15     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@apache apache deleted a comment from blueorangutan Jan 17, 2024
@apache apache deleted a comment from blueorangutan Jan 17, 2024
@apache apache deleted a comment from blueorangutan Jan 17, 2024
@apache apache deleted a comment from blueorangutan Jan 17, 2024
@DaanHoogland
Copy link
Contributor Author

@blueorangutan package

@blueorangutan
Copy link

@DaanHoogland a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.

@weizhouapache weizhouapache changed the title ITCO watchdog added kvm: ITCO watchdog added Jan 17, 2024
Copy link
Member

@weizhouapache weizhouapache left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

code lgtm

@DaanHoogland DaanHoogland linked an issue Jan 22, 2024 that may be closed by this pull request
@DaanHoogland DaanHoogland added this to the 4.18.2.0 milestone Jan 22, 2024
…/resource/LibvirtVMDef.java

Co-authored-by: Wei Zhou <weizhou@apache.org>
@rohityadavcloud
Copy link
Member

@blueorangutan package

@blueorangutan
Copy link

@rohityadavcloud a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.

@blueorangutan
Copy link

Packaging result [SF]: ✔️ el7 ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 8519

@DaanHoogland
Copy link
Contributor Author

@blueorangutan test

@blueorangutan
Copy link

@DaanHoogland a [SL] Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests

@weizhouapache
Copy link
Member

I was not able to test it. (ITCO is unsupported on my kvm hosts)

I do not see any error or risk to merge this PR.
what do you think ? @DaanHoogland @rohityadavcloud @shwstppr @JoaoJandre

@DaanHoogland
Copy link
Contributor Author

just to be sure
@blueorangutan test

@blueorangutan
Copy link

@DaanHoogland a [SL] Trillian-Jenkins test job (centos7 mgmt + kvm-centos7) has been kicked to run smoke tests

@blueorangutan
Copy link

[SF] Trillian test result (tid-9146)
Environment: kvm-centos7 (x2), Advanced Networking with Mgmt server 7
Total time taken: 46187 seconds
Marvin logs: https://github.com/blueorangutan/acs-prs/releases/download/trillian/pr8282-t9146-kvm-centos7.zip
Smoke tests completed. 110 look OK, 0 have errors, 0 did not run
Only failed and skipped tests results shown below:

Test Result Time (s) Test File

@weizhouapache weizhouapache merged commit 672206c into apache:4.18 Feb 12, 2024
27 checks passed
@DaanHoogland DaanHoogland deleted the watchDogDef branch February 12, 2024 10:49
dhslove pushed a commit to ablecloud-team/ablestack-cloud that referenced this pull request Feb 20, 2024
* ITCO watchdog added

* add inject-nmi action

* Update plugins/hypervisors/kvm/src/main/java/com/cloud/hypervisor/kvm/resource/LibvirtVMDef.java

Co-authored-by: Wei Zhou <weizhou@apache.org>

---------

Co-authored-by: Wei Zhou <weizhou@apache.org>
@rahultolearn59
Copy link

hi @DaanHoogland, I have experienced the same error with RHEL9 KVM as well with ACS 4.19. Can we get this merged to 4.19 as well?

@weizhouapache
Copy link
Member

@rahultolearn59

All commits in 4.18 will be merged into 4.19 and main branch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

KVM: add new action inject-nmi
5 participants