Skip to content

Conversation

markgoddard
Copy link
Contributor

  • Fix tox whitespace warning
  • Add retries to overcloud host image pulp tasks
  • Raise alert on degraded network bonds
  • Add alert to detect bonds with a single link
  • Correct backup for seed images in RL9 migration

markgoddard and others added 13 commits April 12, 2024 10:09
Retries have been added to the stackhpc.pulp collection to improve
reliability. Adding the same here.
Add retries to overcloud host image pulp tasks
This will raise a alert when at least one of the bond members is down.
Adapted from awesome-prometheus-alerts [1].

[1] https://samber.github.io/awesome-prometheus-alerts/rules.html#rule-host-and-hardware-1-34
Raise alert on degraded network bonds
This change adds a new Prometheus alert HostNetworkBondSingleLink which
will be raised when a bond is configured with only one member. This can
happen when NetworkManager detects that a bond member is down at boot
time. This would fail to be detected by the HostNetworkBondDegraded
alert.
Add alert to detect bonds with a single link
Current instructions have a recursive copy:

``cp: cannot copy a directory, '/var/lib/libvirt/images', into itself, '/var/lib/libvirt/images/backup/images'``
@markgoddard markgoddard requested a review from a team as a code owner April 25, 2024 09:52
@markgoddard markgoddard self-assigned this Apr 25, 2024
@markgoddard markgoddard merged commit a3f600f into stackhpc/2023.1 Apr 25, 2024
@markgoddard markgoddard deleted the 2023.1-zed-merge branch April 25, 2024 12:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants