Merged
56 commits
987f3be
Fix default when ANSIBLE_SERIAL not set
jovial Feb 15, 2023
fbe3f24
Merge pull request #396 from stackhpc/bugfix/wallaby/ansible-serial-d…
markgoddard Feb 16, 2023
d9d142d
Merge pull request #397 from stackhpc/refresh/wallaby/ubuntu-2023-02-15
jovial Feb 21, 2023
e0570cd
Bump the default worker count
jovial Feb 22, 2023
b9cbe9f
Fix release train 02-2023 renos
markgoddard Mar 1, 2023
0a2a0e1
release notes: Remove default_branch config option
markgoddard Mar 1, 2023
ac83501
Use yoga upper constraints for releasenotes tox env
markgoddard Mar 1, 2023
3f2b4d4
Run doc8 and reno lint in pep8 tox env
markgoddard Mar 1, 2023
f52fe3f
Reduce concurrency
jovial Feb 3, 2023
b1a3bb0
Configure tunnel mtu in neutron
jovial Feb 8, 2023
54c8d28
Reduce tempest concurrency to 1
jovial Feb 8, 2023
f0c6d4a
Bump timeouts to account for ceph being slow
jovial Feb 8, 2023
3c3c366
Bump MTU on external network
jovial Feb 8, 2023
210c3c0
Set storage protocol
jovial Feb 10, 2023
af2dcdb
Clarify which exchanges should remain after the forced reset of RabbitMQ
MoteHue Mar 1, 2023
d8ac5c7
Change fix-networking.yml so it doesn't overwrite all of /etc/hosts
MoteHue Mar 1, 2023
51017f8
Update doc/source/operations/rabbitmq.rst
MoteHue Mar 1, 2023
79bfa3b
Correct marker
MoteHue Mar 1, 2023
b1f78d3
Fix InstanceDown alert (#419)
technowhizz Mar 1, 2023
0d888be
Merge pull request #418 from stackhpc/multinode-improvements
MoteHue Mar 2, 2023
95bcfcc
Merge pull request #421 from stackhpc/fix-fix-networking
MoteHue Mar 2, 2023
146a20e
Merge pull request #420 from stackhpc/rabbitmq-correct-operator-docs
MoteHue Mar 2, 2023
daaf54a
Merge pull request #416 from stackhpc/xena-pep8++
markgoddard Mar 2, 2023
4d12100
Merge pull request #414 from stackhpc/xena-release-notes-fix
markgoddard Mar 2, 2023
9df33a6
Build Neutron from stackhpc fork
priteau Mar 3, 2023
936adad
Build caso container with release train
jovial Mar 3, 2023
d887546
Build caso image
jovial Mar 3, 2023
97463a8
Build blazar image
jovial Mar 3, 2023
1f97a12
Revert "Build caso image"
jovial Mar 3, 2023
f7659d0
Merge pull request #425 from stackhpc/wallaby-neutron
priteau Mar 3, 2023
557972d
Add image tags for caso and blazar
jovial Mar 3, 2023
ed1e1f1
Bump Neutron to include IPv6 metadata workaround
priteau Mar 4, 2023
16adeb3
Adds a release note for caso
jovial Mar 6, 2023
636ea20
Adds a release note for blazar
jovial Mar 6, 2023
e41de7e
Merge pull request #428 from stackhpc/wallaby-neutron-dadfailed
priteau Mar 6, 2023
b8363f8
Merge branch 'stackhpc/wallaby' into xena-neutron-dadfailed
priteau Mar 6, 2023
31b44dc
Merge branch 'stackhpc/wallaby' into feature/wallaby/caso
jovial Mar 7, 2023
957d1a9
Merge pull request #427 from stackhpc/feature/wallaby/caso
jovial Mar 7, 2023
b7b0842
Bump neutron containers to support NGS batching
jovial Mar 7, 2023
5834dea
Merge pull request #405 from stackhpc/feature/increase-pulp-workers
jovial Mar 7, 2023
388c381
Bump Neutron to include IPv6 metadata workaround
priteau Mar 7, 2023
0f40c79
Add release note for Neutron IPv6 metadata workaround
priteau Mar 8, 2023
0cf9477
Merge pull request #432 from stackhpc/wallaby-neutron-dadfailed-reno
priteau Mar 8, 2023
d01fa8a
Merge pull request #430 from stackhpc/xena-neutron-dadfailed
markgoddard Mar 8, 2023
4e6d187
Merge pull request #431 from stackhpc/feature/wallaby/ngs-batching
jovial Mar 8, 2023
46cfb11
Use newer liblasso package in Ubuntu
cityofships Mar 8, 2023
4552d37
Bump up Keystone's image tag for Ubuntu
cityofships Mar 8, 2023
c88c90e
Use newer liblasso package in Ubuntu
cityofships Mar 8, 2023
f782f82
Merge pull request #433 from stackhpc/wallaby-focal-liblasso-upgrade
cityofships Mar 8, 2023
f09071e
Bump up Keystone's image tag for Ubuntu
cityofships Mar 8, 2023
00a9a69
Merge pull request #434 from stackhpc/xena-focal-liblasso-upgrade
cityofships Mar 8, 2023
7748d7b
Add unit (GiB) to LowMemory alert (#437)
technowhizz Mar 14, 2023
dab92e7
Merge stackhpc/wallaby into stackhpc/xena
markgoddard Mar 15, 2023
1b014bc
Merge stackhpc/xena into stackhpc/yoga
markgoddard Mar 15, 2023
30ae407
Add tags for Blazar and Caso images
markgoddard Mar 15, 2023
fb1ad08
CI: Fix doc8 newline issues
markgoddard Mar 15, 2023
9 changes: 7 additions & 2 deletions .automation.conf/config.sh
@@ -19,10 +19,15 @@ if [ ! -z ${KAYOBE_ENVIRONMENT:+x} ]; then
# Seem to get servers failing to spawn with higher concurrency
export TEMPEST_CONCURRENCY=1
fi

if [[ "$KAYOBE_ENVIRONMENT" =~ "ci-multinode" ]]; then
export KAYOBE_AUTOMATION_TEMPEST_LOADLIST=tempest-full
export KAYOBE_AUTOMATION_TEMPEST_SKIPLIST=ci-multinode
# SMSLab is currently running with 1G switches. This causes tests using volumes and images to fail if
# the concurrency is set too high.
export TEMPEST_CONCURRENCY=1
export KAYOBE_AUTOMATION_TEMPEST_LOADLIST=tempest-full
export KAYOBE_AUTOMATION_TEMPEST_SKIPLIST=ci-multinode
fi

fi

if [[ -z "${KAYOBE_AUTOMATION_TEMPEST_CONF_OVERRIDES:+x}" ]] || [[ ! -e "${KAYOBE_AUTOMATION_TEMPEST_CONF_OVERRIDES}" ]]; then
@@ -23,8 +23,13 @@ volume_backed_live_migration = true
console_output = true

[volume]
storage_protocol = ceph
build_timeout = 600
min_microversion = 3.0
max_microversion = 3.66

[image]
build_timeout = 600

[dashboard]
dashboard_url = http://192.168.39.2
2 changes: 1 addition & 1 deletion doc/source/configuration/host-images.rst
@@ -43,7 +43,7 @@ The 3 partitions are:
* p0: EFI ESP bootloader
* p1: EFI BSP
* p2: LVM PV (``rootpv``)

The LVM Logical Volumes are:

============== ================== =========
2 changes: 1 addition & 1 deletion doc/source/configuration/monitoring.rst
@@ -126,4 +126,4 @@ need to be set, and the proxy URL should be set or removed.
If you want to add an alerting rule, many good examples of alerts are
available `here <https://awesome-prometheus-alerts.grep.to/>`__. They simply
need to be added to one of the ``*.rules`` files in the prometheus configuration
directory.
directory.
8 changes: 4 additions & 4 deletions doc/source/configuration/release-train.rst
@@ -233,7 +233,7 @@ see this message when you later try to run ``pulp-container-sync.yml``:
The issue is that pushing an image automatically creates a `container-push repository
<https://docs.pulpproject.org/pulp_container/restapi.html#tag/Repositories:-Container-Push>`__
which conflicts with the creation of a regular container repository of the same
name. You can resolve this conflict by deleting the distribution associated
name. You can resolve this conflict by deleting the distribution associated
with the push repository using the pulp CLI:

.. code-block:: console
@@ -242,17 +242,17 @@ with the push repository using the pulp CLI:
Started background task /pulp/api/v3/tasks/1f0a474a-b7c0-44b4-9ef4-ed633077f4d8/
.Done.

HTTP Error 404: Not Found
HTTP Error 404: Not Found
~~~~~~~~~~~~~~~~~~~~~~~~~

If your login credentials are incorrect, or lack the required permissions,
If your login credentials are incorrect, or lack the required permissions,
you will see a 404 error during ``pulp-repo-sync.yml``:

.. code-block:: console

TASK [stackhpc.pulp.pulp_repository : Sync RPM remotes into repositories] ****************************************************************************************************************************************
An exception occurred during task execution. To see the full traceback, use -vvv. The error was: Exception: Task failed to complete. (failed; 404, message='Not Found', url=URL('https://ark.stackhpc.com/pulp/content/centos/8-stream/BaseOS/x86_64/os/20211122T102435'))
failed: [localhost] (item=centos-stream-8-baseos-development) => changed=false
failed: [localhost] (item=centos-stream-8-baseos-development) => changed=false
ansible_loop_var: item
item:
name: centos-stream-8-baseos-development
2 changes: 1 addition & 1 deletion doc/source/configuration/walled-garden.rst
@@ -49,7 +49,7 @@ In some environments we have found that squid’s preference for IPv6 can
cause problems. It can be forced to prefer IPv4, by adding the following
in ``etc/kayobe/containers/squid_proxy/squid.conf``:

.. code:: none
.. code::

dns_v4_first on

2 changes: 1 addition & 1 deletion doc/source/contributor/environments/aufn-ceph.rst
@@ -198,4 +198,4 @@ We then use the CLI to create a keypair, floating IP and test VM:
openstack server add floating ip test-vm-1 `openstack floating ip list -c ID -f value`
openstack server list

which will create a VM named ``test-vm-1`` with a Cirros OS image and a default login password of 'gocubsgo'.
which will create a VM named ``test-vm-1`` with a Cirros OS image and a default login password of 'gocubsgo'.
8 changes: 4 additions & 4 deletions doc/source/contributor/environments/ci-multinode.rst
@@ -1,5 +1,5 @@
==========================
Multinode Test Environment
Multinode Test Environment
==========================

Set up hosts
@@ -21,7 +21,7 @@ Basic Kayobe Setup
2. ``sudo dnf install -y python3-virtualenv``
3. ``mkdir src`` and ``cd src``
4. Clone https://github.com/stackhpc/stackhpc-kayobe-config.git, then checkout
commit f31df6256f1b1fea99c84547d44f06c4cb74b161
commit f31df6256f1b1fea99c84547d44f06c4cb74b161
5. ``cd ..`` and ``mkdir venvs``
6. ``virtualenv venvs/kayobe`` and source ``venvs/kayobe/bin/activate``
7. ``pip install -U pip``
@@ -39,8 +39,8 @@ Config changes
4. Also under vxlan_interfaces, check vxlan_dstport is not 4789 (this causes
conflicts, change to 4790)
5. In /etc/kayobe/environments/ci-multinode/tf-networks.yml, edit admin_ips so
that the compute and controller IPs line up with the
instances that were created earlier, remove the other IPs for seed and
that the compute and controller IPs line up with the
instances that were created earlier, remove the other IPs for seed and
cephOSD
6. In /etc/kayobe/environments/ci-multinode/network-allocation.yml, remove all
the entries and just assign ``aio_ips:`` an empty set ``[]``
2 changes: 1 addition & 1 deletion doc/source/contributor/environments/index.rst
@@ -9,4 +9,4 @@ The following Kayobe environments are provided with this configuration:
ci-aio
ci-builder
ci-multinode
aufn-ceph
aufn-ceph
2 changes: 2 additions & 0 deletions doc/source/operations/rabbitmq.rst
@@ -128,6 +128,8 @@ Check to see if all OpenStack queues and exchanges have been removed from the Ra
kayobe overcloud host command run --limit controllers --show-output --command 'docker exec rabbitmq rabbitmqctl list_queues name'
kayobe overcloud host command run --limit controllers --show-output --command 'docker exec rabbitmq rabbitmqctl list_exchanges name'

There should be no queues listed, and the only exchanges listed should start with ``amq.``.

Start the OpenStack services which use RabbitMQ. Note that this will start all
matching services, even if they weren't running prior to starting this
procedure.
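
The check described in this hunk (no queues left, and only exchanges whose names start with `amq.`) can be scripted. The following is a minimal sketch, not part of the PR. It assumes the `rabbitmqctl list_exchanges name` output has already been reduced to one exchange name per line, with any header lines stripped, and it ignores blank lines on the assumption that they correspond to the unnamed default exchange. The function name and the sample exchange names are hypothetical.

```python
def unexpected_exchanges(output):
    """Return exchange names that are not RabbitMQ built-ins (amq.*).

    Assumes `output` holds one exchange name per line, with header
    lines already removed. Blank lines are skipped.
    """
    names = [line.strip() for line in output.splitlines()]
    return [name for name in names if name and not name.startswith("amq.")]


# Hypothetical listings, for illustration only.
clean_listing = "amq.direct\namq.fanout\namq.topic\n"
dirty_listing = "amq.direct\nnova\n\nneutron\n"

leftover_clean = unexpected_exchanges(clean_listing)
leftover_dirty = unexpected_exchanges(dirty_listing)
```

An empty result suggests the reset removed all OpenStack exchanges; any remaining names are the ones to investigate before restarting services.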
9 changes: 4 additions & 5 deletions etc/kayobe/ansible/fix-networking.yml
@@ -11,11 +11,10 @@
ansible_ssh_common_args: "-o StrictHostKeyChecking=no"
tasks:
- name: Ensure `hosts` file contains pulp entries
ansible.builtin.copy:
content: |
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
blockinfile:
path: /etc/hosts
marker: "# {mark} Kayobe Pulp entries"
block: |
10.0.0.34 pelican pelican.service.compute.sms-lab.cloud
10.205.3.187 pulp-server pulp-server.internal.sms-cloud
dest: /etc/hosts
become: true
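
The switch from `copy` to `blockinfile` means only the text between the two marker lines is managed, so other entries in `/etc/hosts` survive. The following is a rough Python sketch of that marker-based behaviour, not Ansible's implementation. It assumes the default `BEGIN`/`END` substitution for `{mark}` and appending at end of file when no block exists yet; the helper name and the `192.0.2.10 custom-entry` line are hypothetical.

```python
BEGIN = "# BEGIN Kayobe Pulp entries"
END = "# END Kayobe Pulp entries"


def upsert_block(text, block):
    """Insert or replace the marker-delimited block; leave other lines alone."""
    lines = text.splitlines()
    managed = [BEGIN, *block.splitlines(), END]
    if BEGIN in lines and END in lines and lines.index(BEGIN) < lines.index(END):
        # A managed block already exists: replace it in place.
        start, end = lines.index(BEGIN), lines.index(END)
        lines[start:end + 1] = managed
    else:
        # No managed block yet: append one at the end of the file.
        lines.extend(managed)
    return "\n".join(lines) + "\n"


# Hypothetical existing file with an unrelated, hand-written entry.
hosts = "127.0.0.1 localhost\n192.0.2.10 custom-entry\n"
pulp_entries = (
    "10.0.0.34 pelican pelican.service.compute.sms-lab.cloud\n"
    "10.205.3.187 pulp-server pulp-server.internal.sms-cloud\n"
)

once = upsert_block(hosts, pulp_entries)
twice = upsert_block(once, pulp_entries)
```

Running the update twice yields the same file, which is the idempotence the previous `copy` task only achieved by overwriting everything.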
2 changes: 1 addition & 1 deletion etc/kayobe/ansible/reboot.yml
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
- name: Reboot the host
hosts: seed-hypervisor:seed:overcloud:infra-vms
serial: "{{ lookup('env', 'ANSIBLE_SERIAL') | default(0) }}"
serial: "{{ lookup('env', 'ANSIBLE_SERIAL') | default(0, true) }}"
tags:
- reboot
tasks:
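
Why the second argument matters: Ansible's `env` lookup yields an empty string when the variable is unset, and Jinja's `default` filter only substitutes for undefined values unless its boolean flag is true. The sketch below imitates that rule in plain Python so the difference is visible. It is an approximation of the filter's documented behaviour, not Jinja's code, and the `Undefined` class and function name are stand-ins introduced for illustration.

```python
class Undefined:
    """Stand-in for Jinja's undefined value (hypothetical, for illustration)."""


def jinja_default(value, default_value="", boolean=False):
    """Approximate Jinja's `default` filter.

    Without `boolean`, only an undefined value is replaced. With
    `boolean=True`, any falsy value (such as "") is replaced too.
    """
    if isinstance(value, Undefined) or (boolean and not value):
        return default_value
    return value


unset_env = ""  # what the env lookup yields when ANSIBLE_SERIAL is not set

before_fix = jinja_default(unset_env, 0)        # default(0)
after_fix = jinja_default(unset_env, 0, True)   # default(0, true)
when_set = jinja_default("2", 0, True)          # ANSIBLE_SERIAL=2
```

With the old expression the empty string passes through unchanged, so `serial` never receives the intended fallback of `0`.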
1 change: 1 addition & 0 deletions etc/kayobe/containers/pulp/settings.py
@@ -1,4 +1,5 @@
CONTENT_ORIGIN='{{ pulp_url }}'
ANALYTICS=False
ANSIBLE_API_HOSTNAME='{{ pulp_url }}'
ANSIBLE_CONTENT_HOSTNAME='{{ pulp_url }}/pulp/content'
TOKEN_AUTH_DISABLED=True
1 change: 1 addition & 0 deletions etc/kayobe/environments/ci-builder/stackhpc-ci.yml
@@ -8,6 +8,7 @@ kolla_docker_namespace: stackhpc-dev
# Kolla feature flag configuration.
kolla_enable_barbican: true
kolla_enable_blazar: true
kolla_enable_caso: true
kolla_enable_central_logging: true
kolla_enable_cinder: true
kolla_enable_cloudkitty: true
@@ -0,0 +1,3 @@
[DEFAULT]
global_physnet_mtu = {{ tunnel_net_name | net_mtu }}

@@ -0,0 +1,3 @@
[ml2]
path_mtu = {{ tunnel_net_name | net_mtu }}

2 changes: 1 addition & 1 deletion etc/kayobe/environments/ci-multinode/networks.yml
@@ -81,7 +81,7 @@ internal_vlan: 101

# External network
external_cidr: 192.168.38.0/24
external_mtu: 1350
external_mtu: 1450
external_allocation_pool_start: 192.168.38.3
external_allocation_pool_end: 192.168.38.128
external_vlan: 102
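
The PR does not say why 1450 was chosen. One plausible reading, stated here as an assumption, is that the network is carried over VXLAN on a 1500-byte IPv4 underlay, where encapsulation costs 50 bytes. The arithmetic is sketched below; the constant and function names are illustrative only.

```python
# Per-packet overhead of VXLAN over IPv4, in bytes.
OUTER_IPV4_HEADER = 20
OUTER_UDP_HEADER = 8
VXLAN_HEADER = 8
INNER_ETHERNET_HEADER = 14  # the encapsulated frame carries its own L2 header

VXLAN_OVERHEAD = (
    OUTER_IPV4_HEADER + OUTER_UDP_HEADER + VXLAN_HEADER + INNER_ETHERNET_HEADER
)


def vxlan_inner_mtu(underlay_mtu):
    """Largest inner IP packet that fits in one underlay packet."""
    return underlay_mtu - VXLAN_OVERHEAD


# Assumed underlay MTU of 1500; not stated anywhere in the PR.
inner_mtu = vxlan_inner_mtu(1500)
```

Under that assumption the previous value of 1350 was simply more conservative than necessary.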
8 changes: 8 additions & 0 deletions etc/kayobe/kolla.yml
@@ -338,6 +338,14 @@ kolla_build_blocks:
&& grafana-cli plugins install grafana-piechart-panel
ironic_inspector_header: |
ADD additions-archive /
keystone_base_footer: |
{% raw %}
{% if base_package_type == 'deb' %}
RUN curl -sLO https://github.com/stackhpc/lasso/releases/download/applied%2F2.7.0-2build3/liblasso3_2.7.0-2build3_amd64.deb && \
sudo dpkg -i liblasso3_2.7.0-2build3_amd64.deb && \
rm -f liblasso3_2.7.0-2build3_amd64.deb
{% endif %}
{% endraw %}
nova_base_footer: |
# Fix for https://bugs.launchpad.net/nova/+bug/1955035, i.e.
# https://bugzilla.redhat.com/show_bug.cgi?id=2090752 on c8s
4 changes: 2 additions & 2 deletions etc/kayobe/kolla/config/prometheus/system.rules
@@ -22,7 +22,7 @@ groups:
severity: alert
annotations:
summary: "Prometheus exporter at {{ $labels.instance }} reports low memory"
description: "Available memory is {{ $value }}."
description: "Available memory is {{ $value }} GiB."

- alert: HostOomKillDetected
expr: increase(node_vmstat_oom_kill[5m]) > 0
@@ -53,7 +53,7 @@ groups:

- alert: InstanceDown
expr: up{job="node"} == 0
for: 1m
for: 5m
labels:
severity: alert
annotations:
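
The `for:` clause delays firing until the expression has been true continuously for the given duration; a single healthy sample resets the timer. The following is a simplified simulation of that behaviour, assuming a hypothetical 60-second evaluation interval. It is not Prometheus code, and the function name and sample series are made up for illustration.

```python
def alert_states(up_samples, interval_s=60, for_s=300):
    """Simulate pending/firing states for an `up == 0` rule with a `for` clause."""
    states = []
    active_since = None
    for index, up in enumerate(up_samples):
        now = index * interval_s
        if up == 0:
            if active_since is None:
                active_since = now  # the condition just became true
            held_for = now - active_since
            states.append("firing" if held_for >= for_s else "pending")
        else:
            active_since = None  # any healthy sample resets the timer
            states.append("inactive")
    return states


# One sample per minute: up, then down for six minutes, then up again.
samples = [1, 0, 0, 0, 0, 0, 0, 1]

with_5m = alert_states(samples, for_s=300)  # new rule: for: 5m
with_1m = alert_states(samples, for_s=60)   # old rule: for: 1m
```

With `for: 1m` the alert fires on the second failed sample, whereas `for: 5m` tolerates short outages and matches the "for 5 minutes" wording in the alert message.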
6 changes: 6 additions & 0 deletions etc/kayobe/kolla/globals.yml
@@ -9,11 +9,17 @@ enable_docker_repo: {% raw %}"{{ 'overcloud' not in group_names or ansible_facts

{% if kolla_base_distro == 'centos' %}
bifrost_tag: yoga-20230217T160618
blazar_tag: yoga-20230315T125157
caso_tag: yoga-20230315T125157
neutron_tag: yoga-20230309T123152
{% elif kolla_base_distro == 'rocky' %}
bifrost_tag: yoga-20230310T194732
blazar_tag: yoga-20230315T130918
caso_tag: yoga-20230315T130918
{% else %}
bifrost_tag: yoga-20230220T184947
blazar_tag: yoga-20230315T125441
caso_tag: yoga-20230315T125441
neutron_tag: yoga-20230309T123143
{% endif %}

5 changes: 4 additions & 1 deletion etc/kayobe/seed.yml
@@ -106,11 +106,14 @@ seed_pulp_container:
image: pulp/pulp
pre: "{{ kayobe_config_path }}/containers/pulp/pre.yml"
post: "{{ kayobe_config_path }}/containers/pulp/post.yml"
tag: "{{ '3.21-https' if pulp_enable_tls | bool else '3.21' }}"
tag: "{{ '3.22-https' if pulp_enable_tls | bool else '3.22' }}"
network_mode: host
# Override deploy_containers_defaults.init == true to ensure
# s6-overlay-suexec starts as pid 1
init: false
env:
PULP_CONTENT_WORKERS: "{{ ansible_facts.processor_vcpus * 2 + 1 }}"
PULP_API_WORKERS: "{{ ansible_facts.processor_vcpus * 2 + 1 }}"
volumes:
- /opt/kayobe/containers/pulp:/etc/pulp
- pulp_storage:/var/lib/pulp
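
The two templated values in this hunk are simple enough to check by hand. The sketch below restates them in Python: the worker count is twice the vCPU count plus one, and the tag gains an `-https` suffix when TLS is enabled. The function names are illustrative, and the vCPU counts are arbitrary examples rather than values from the PR.

```python
def pulp_worker_count(vcpus):
    """Mirror of `ansible_facts.processor_vcpus * 2 + 1`."""
    return vcpus * 2 + 1


def pulp_image_tag(enable_tls):
    """Mirror of the conditional tag expression for the Pulp container."""
    return "3.22-https" if enable_tls else "3.22"


workers_on_4_vcpus = pulp_worker_count(4)
workers_on_16_vcpus = pulp_worker_count(16)
tls_tag = pulp_image_tag(True)
plain_tag = pulp_image_tag(False)
```

The same value is used for both `PULP_CONTENT_WORKERS` and `PULP_API_WORKERS`, so a 4-vCPU seed would run 9 workers of each kind.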
1 change: 0 additions & 1 deletion releasenotes/config.yaml
@@ -1,4 +1,3 @@
---
# This needs to be updated to the latest release.
default_branch: stackhpc/yoga
release_tag_re: stackhpc/12\.\d+\.\d+\.\d
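
To see which tags the remaining `release_tag_re` pattern accepts, it can be tried with Python's `re` module. This is only a sketch: the tag names are hypothetical, and how reno anchors the pattern internally is not shown in the PR, so prefix matching via `match` is an assumption here.

```python
import re

# Pattern copied from releasenotes/config.yaml.
RELEASE_TAG_RE = re.compile(r"stackhpc/12\.\d+\.\d+\.\d")


def is_release_tag(tag):
    """True if the tag starts with a stackhpc/12.x.y.z style version."""
    return RELEASE_TAG_RE.match(tag) is not None


# Hypothetical tags, for illustration only.
yoga_tag_matches = is_release_tag("stackhpc/12.0.0.1")
other_series_matches = is_release_tag("stackhpc/13.0.0.1")
short_tag_matches = is_release_tag("stackhpc/12.0.0")
```

The pattern requires four numeric components after `stackhpc/` and pins the first to `12`, so tags from other release series are ignored.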
8 changes: 8 additions & 0 deletions releasenotes/notes/add-caso-f36b98453be10169.yaml
@@ -0,0 +1,8 @@
---
features:
- |
Adds ``caso`` container images. ``cASO`` is an accounting reporter
that supports Cloud Accounting Usage Records. For more information, see
the `upstream docs <https://caso.readthedocs.io/en/stable/>`__. Note that
this container does not exist in upstream Kolla and is maintained
downstream by StackHPC.
@@ -0,0 +1,9 @@
---
features:
- |
Brings in new neutron container images to add batching support to
Networking Generic Switch. This is opt in via the ``ngs_batch_requests``
configuration option and only affects Ironic deployments that use
Networking Generic Switch. See the following `PR
<https://github.com/stackhpc/networking-generic-switch/pull/54>`__ for more
details.
@@ -0,0 +1,5 @@
---
fixes:
- |
Fixes the `issue <https://dev.entrouvert.org/issues/25640>`__ with using
SAML2 federation in Keystone against NetIQ IdP.
@@ -0,0 +1,7 @@
---
fixes:
- |
Fixes the InstanceDown alerting rule wait time to be consistent with
the alert message. The alert message says "for 5 minutes" but the rule
was set to wait for 1 minute.

@@ -0,0 +1,4 @@
---
fixes:
- |
Adds the unit (GiB) to the LowMemory alert description.
@@ -0,0 +1,5 @@
---
fixes:
- |
Disables metadata proxy over IPv6 inside Neutron DHCP agent to work around
`bug 1953165 <https://bugs.launchpad.net/neutron/+bug/1953165>`__.
@@ -5,14 +5,21 @@ features:
Key packages to note are:

* Kernel

* version: 4.18.0
* release: 448.el8

* Libvirt

* version: 8.0.0
* release: 6.module_el8.7.0+1140+ff0772f9

* OVS

* version: 2.17.0
* release: 71.el8s

* OVN

* version: 22.09.0
* release: 11.el8s
@@ -5,11 +5,16 @@ features:
Key packages to note are:

* Libvirt

* version: 8.0.0
* release: 1ubuntu7.4~cloud0

* OVS

* version: 2.17.3
* release: 0ubuntu0.22.04.1~cloud0

* OVN (unchanged since last container build)

* version: 22.03.0
* release: 0ubuntu1~cloud0
7 changes: 7 additions & 0 deletions releasenotes/notes/update-pulp-3.22-aa485b7e619cd380.yaml
@@ -0,0 +1,7 @@
---
features:
- Upgrades Pulp from ``3.21`` to ``3.22``.
- Disables Pulp analytics.
- |
Sets the Pulp worker count based on available CPU cores. This may improve performance
when pulling container images to many hosts simultaneously.
2 changes: 2 additions & 0 deletions test-requirements.txt
@@ -3,3 +3,5 @@
# process, which may cause wedges in the gate later.

yamllint # GPLv3
doc8 # Apache-2.0
reno>=3.4.0 # Apache-2.0
2 changes: 2 additions & 0 deletions tox.ini
@@ -12,6 +12,8 @@ deps =
[testenv:pep8]
commands =
yamllint etc/kayobe
reno lint
doc8 README.rst doc/source --ignore D001

# StackHPC Kayobe configuration release notes:
[testenv:releasenotes]