Skip to content

(cluster/kueyen) Add RKE2 roles to Kueyen #1676

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: production
Choose a base branch
from

Conversation

gseriche
Copy link
Contributor

@gseriche gseriche commented Apr 4, 2025

No description provided.

@gseriche gseriche added the enhancement New feature or request label Apr 4, 2025
@gseriche gseriche self-assigned this Apr 4, 2025
@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch 2 times, most recently from 959b3ed to ce64a46 Compare April 4, 2025 03:01
@dtapiacl dtapiacl force-pushed the IT-6035_reprovision_kueyen_rke2 branch from ce64a46 to 5f2e961 Compare April 28, 2025 14:18
@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch from 5f2e961 to c0431a5 Compare April 30, 2025 04:29
@gseriche gseriche requested review from cbarria, jhoblitt and dtapiacl and removed request for cbarria May 5, 2025 19:50
@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch from c0431a5 to 476d1c2 Compare May 5, 2025 19:51
@gseriche gseriche marked this pull request as ready for review May 5, 2025 19:51
@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch from 476d1c2 to ef0ef8d Compare May 5, 2025 20:35
Copy link
Member

@jhoblitt jhoblitt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • The convention used for PR titles in this repo is slightly different that for k8s-cookbook as this repo controls hosts, clusters, sites, etc. The () should refer to the class or hiera layer being changed. For this PR, it should be something like (cluster/kueyen).
  • The network configuration for kueyen01-03 have multiple interfaces obtaining a lease via dhcp. This consumes IP unessicarily from the pool:
 ~ $ ssh kueyen02.dev.lsst.org ip -c a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
2: eno1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether 4c:d9:8f:6f:27:2a brd ff:ff:ff:ff:ff:ff
    altname enp4s0f0
    inet 139.229.134.77/24 brd 139.229.134.255 scope global dynamic noprefixroute eno1
       valid_lft 595sec preferred_lft 595sec
3: eno2: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
    link/ether 4c:d9:8f:6f:27:2b brd ff:ff:ff:ff:ff:ff
    altname enp4s0f1
4: ens2f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether a0:36:9f:c7:67:a4 brd ff:ff:ff:ff:ff:ff
    altname enp59s0f0
5: ens2f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether a0:36:9f:c7:67:a6 brd ff:ff:ff:ff:ff:ff
    altname enp59s0f1
    inet 139.229.134.71/24 brd 139.229.134.255 scope global dynamic noprefixroute ens2f1
       valid_lft 595sec preferred_lft 595sec
262: calif88ddbe0b4d@if2: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1450 qdisc noqueue state UP group default qlen 1000
    link/ether ee:ee:ee:ee:ee:ee brd ff:ff:ff:ff:ff:ff link-netns cni-b85b66bc-5eaa-e02b-09db-be91418b2faa
6: br2301: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether a0:36:9f:c7:67:a4 brd ff:ff:ff:ff:ff:ff
    inet 139.229.145.236/24 brd 139.229.145.255 scope global dynamic noprefixroute br2301
       valid_lft 885sec preferred_lft 885sec
7: ens2f0.2301@ens2f0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue master br2301 state UP group default qlen 1000
    link/ether a0:36:9f:c7:67:a4 brd ff:ff:ff:ff:ff:ff

and creates problems for the routing table:

 ~ $ ssh kueyen01.dev.lsst.org ip route
default via 139.229.134.254 dev eno1 proto dhcp src 139.229.134.76 metric 100 
default via 139.229.134.254 dev ens2f1 proto dhcp src 139.229.134.70 metric 101 
default via 139.229.145.254 dev br2301 proto dhcp src 139.229.145.234 metric 425 
10.42.0.7 dev calia811b3dd28b scope link 
10.42.0.74 dev cali65b1011dc9f scope link 
10.42.0.84 dev cali9eeba3290dc scope link 
10.42.0.86 dev cali7dd35812f57 scope link 
10.42.0.87 dev cali798aa5d74e3 scope link 
10.42.0.88 dev cali007204e3be2 scope link 
10.42.0.94 dev cali08eaa7eba89 scope link 
10.42.0.95 dev calia64889b34e8 scope link 
10.42.0.98 dev cali4368f414ed4 scope link 
10.42.0.100 dev cali1db263d6cfd scope link 
10.42.0.102 dev cali3ddb0661fe4 scope link 
10.42.1.0/24 via 10.42.1.0 dev flannel.1 onlink 
10.42.2.0/24 via 10.42.2.0 dev flannel.1 onlink 
10.42.3.0/24 via 10.42.3.0 dev flannel.1 onlink 
10.42.4.0/24 via 10.42.4.0 dev flannel.1 onlink 
10.42.5.0/24 via 10.42.5.0 dev flannel.1 onlink 
139.229.134.0/24 dev eno1 proto kernel scope link src 139.229.134.76 metric 100 
139.229.134.0/24 dev ens2f1 proto kernel scope link src 139.229.134.70 metric 101 
139.229.145.0/24 dev br2301 proto kernel scope link src 139.229.145.234 metric 425 
  • ldap auth for kueyen04 is broken:
 ~ $ ssh kueyen04.dev.lsst.org
Received disconnect from 139.229.134.242 port 22:2: Too many authentication failures
Disconnected from 139.229.134.242 port 22
  • kueyen04-06 are missing the bridge interface:
 ~ $ ssh kueyen06.dev.lsst.org brctl show
 ~ $ 

The convention for rebuilt clusters is that they should only have an FQDN IP on the a bridge interface.

  • kueyen05-06 are missing from ipam.cp.lsst.org

@gseriche gseriche changed the title (kueyen) Add RKE2 roles to Kueyen (cluster/kueyen) Add RKE2 roles to Kueyen May 6, 2025
@gseriche
Copy link
Contributor Author

  • ldap auth for kueyen04 is broken:
 ~ $ ssh kueyen04.dev.lsst.org
Received disconnect from 139.229.134.242 port 22:2: Too many authentication failures
Disconnected from 139.229.134.242 port 22
  • kueyen04-06 are missing the bridge interface:
 ~ $ ssh kueyen06.dev.lsst.org brctl show
 ~ $ 
  • kueyen05-06 are missing from ipam.cp.lsst.org

Access restored.

@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch 6 times, most recently from 302606f to 1fe5cea Compare May 20, 2025 20:14
@gseriche gseriche force-pushed the IT-6035_reprovision_kueyen_rke2 branch from 1fe5cea to 913d2e9 Compare May 20, 2025 20:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants