k3s server crashing on Raspberry Pi 4 (8GB) #6654

Closed
samip5 opened this issue Dec 15, 2022 · 14 comments

samip5 commented Dec 15, 2022

Environmental Info:
K3s Version: v1.24.4+k3s1 (c3f830e)
go version go1.18.1

Node(s) CPU architecture, OS, and Version:

arm64, Ubuntu 22.04 (all except two)
Linux k8s-master1 5.15.0-1021-raspi #23-Ubuntu SMP PREEMPT Fri Nov 25 15:27:43 UTC 2022 aarch64 aarch64 aarch64 GNU/Linux

amd64, Ubuntu 22.04 x 2
Linux k8s-worker-amd64-0 5.15.0-56-generic #62-Ubuntu SMP Tue Nov 22 19:54:14 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

Cluster Configuration:
1 server, 5 agents

Describe the bug:
My k3s apiserver seems to frequently crash / auto-restart

Steps To Reproduce:

Expected behavior:
I would expect it to not keep frequently crashing.

Actual behavior:
Frequent crashes / auto-restarts of service

Additional context / logs:
k3s.log

samip5 commented Dec 15, 2022

It happened again; this time I got:

Dec 15 11:11:26 k8s-master1 k3s[2140000]: E1215 11:11:26.004914 2140000 server.go:218] "Leaderelection lost"

I don't understand why or how this happens.

Updated logs:
k3s_20221215T1113.log

bbkz commented Dec 15, 2022

I had similar problems when using etcd on SD cards (industrial ones); they can't really handle etcd, as it is write-intensive.

After switching to eMMC, etcd was happy.
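A quick way to check whether a given disk can keep up with etcd's write pattern is the fio fdatasync benchmark from etcd's hardware guidance - a sketch, assuming fio is installed and the k3s datastore lives under the default /var/lib/rancher/k3s:

```sh
# Benchmark fdatasync latency on the disk that holds the k3s datastore.
# etcd's rule of thumb is a 99th-percentile fdatasync under roughly 10 ms;
# the percentiles are printed in fio's "fsync/fdatasync" latency section.
sudo mkdir -p /var/lib/rancher/k3s/fio-test
sudo fio --rw=write --ioengine=sync --fdatasync=1 \
    --directory=/var/lib/rancher/k3s/fio-test \
    --size=22m --bs=2300 --name=etcd-fsync-test
sudo rm -rf /var/lib/rancher/k3s/fio-test   # remove the test data afterwards
```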

samip5 commented Dec 15, 2022

> I had similar problems when using etcd on SD cards (industrial ones); they can't really handle etcd, as it is write-intensive.
>
> After switching to eMMC, etcd was happy.

It's not running on an SD card; it's running off an external SSD.

bbkz commented Dec 15, 2022

OK. What I also had to do to make it stable was cordon the master nodes.
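For reference, cordoning just marks the node unschedulable; pods already running there stay put unless the node is also drained. A sketch, using the server hostname from this issue:

```sh
# Stop new pods from being scheduled onto the server node
kubectl cordon k8s-master1

# Optionally move existing (non-DaemonSet) workloads off it as well
kubectl drain k8s-master1 --ignore-daemonsets --delete-emptydir-data
```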

samip5 commented Dec 15, 2022

> OK. What I also had to do to make it stable was cordon the master nodes.

That seems weird... I also only have one master. :)

bbkz commented Dec 15, 2022

I'm running it stably on 4 Fedora RPi 4s and 4 Odroid N2+ boards with 3 master nodes.

But I just found the following, and will give Raspberry Pi OS another try:

Unfortunately I don't have another idea.

brandond (Contributor) commented

If you have only a single server, there's not really any point in using etcd - especially on a Raspberry Pi, where CPU and IO are already somewhat constrained. You can't go back to sqlite from etcd, but you might consider rebuilding the cluster at some point, and not using etcd. The logs show that your storage (even if it is SSD) is not able to keep up, and it is frequently taking several seconds for etcd to sync your changes to disk - to the point where leader elections are timing out. This is almost exclusively caused by high storage fsync latency.
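The slow syncs are visible in the journal as etcd warnings; something along these lines should surface them, assuming a systemd-managed k3s install:

```sh
# etcd warns when fdatasync or applying a request takes unusually long;
# these warnings tend to cluster right before "Leaderelection lost".
journalctl -u k3s --since "2 hours ago" \
  | grep -iE "slow fdatasync|apply request took too long|leaderelection lost"
```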

If you're on a node with older iptables, you might take a look at the --prefer-bundled-bin flag available in the releases coming out this month - but that will only fix the issue with growing iptables rulesets, it will not do anything about disk latency.
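For a rebuilt single-server cluster, that roughly translates to: don't pass --cluster-init (so k3s keeps its default SQLite datastore), and opt into the bundled iptables once on a release that ships the flag. A hedged sketch of the config:

```sh
# Single-server k3s without embedded etcd: simply omit --cluster-init.
# prefer-bundled-bin makes k3s use its own iptables binaries instead of the
# host's (only relevant on releases that actually include the flag).
sudo mkdir -p /etc/rancher/k3s
cat <<'EOF' | sudo tee /etc/rancher/k3s/config.yaml
prefer-bundled-bin: true
EOF
curl -sfL https://get.k3s.io | sh -
```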

samip5 commented Dec 16, 2022

> If you have only a single server, there's not really any point in using etcd - especially on a Raspberry Pi, where CPU and IO are already somewhat constrained. You can't go back to sqlite from etcd, but you might consider rebuilding the cluster at some point, and not using etcd. The logs show that your storage (even if it is SSD) is not able to keep up, and it is frequently taking several seconds for etcd to sync your changes to disk - to the point where leader elections are timing out. This is almost exclusively caused by high storage fsync latency.
>
> If you're on a node with older iptables, you might take a look at the --prefer-bundled-bin flag available in the releases coming out this month - but that will only fix the issue with growing iptables rulesets, it will not do anything about disk latency.

The crashing was already happening when running SQLite (not sure why, though), and as SQLite doesn't keep db backups, I thought etcd would be better for that reason.

It seems that I should move the master off a Pi, or rather have it running on a CM4 with eMMC storage?
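On the backup point: with embedded etcd, k3s takes scheduled snapshots on its own and can take one on demand; with SQLite the datastore is a plain file that can be copied while k3s is stopped. A rough sketch of both, assuming default paths:

```sh
# Embedded etcd: on-demand snapshot (scheduled snapshots land in
# /var/lib/rancher/k3s/server/db/snapshots by default)
sudo k3s etcd-snapshot save

# SQLite (kine): copy the db directory with k3s stopped
sudo systemctl stop k3s
sudo cp -a /var/lib/rancher/k3s/server/db /backup/k3s-db-$(date +%F)
sudo systemctl start k3s
```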

brandond commented Dec 16, 2022

I have personally run K3s on a Pi4b with SSD using etcd with no issues. I have also used sqlite on SDHC without issues. However, in both cases I made sure that IO-intensive workloads were not using the same disk as the datastore - I put everything on NFS PVCs and minimized large image pull operations. The key is just to make sure that there's not a lot of other IO that needs to be flushed before the datastore write can complete.
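A minimal sketch of the "everything on NFS PVCs" idea - a statically provisioned NFS PersistentVolume plus a claim bound to it. The server address and export path below are placeholders, and the nodes need an NFS client (nfs-common) installed:

```sh
kubectl apply -f - <<'EOF'
apiVersion: v1
kind: PersistentVolume
metadata:
  name: nfs-example
spec:
  capacity:
    storage: 10Gi
  accessModes: ["ReadWriteMany"]
  persistentVolumeReclaimPolicy: Retain
  nfs:
    server: 192.168.1.10   # placeholder NAS address
    path: /exports/k8s     # placeholder export path
---
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: nfs-example
spec:
  accessModes: ["ReadWriteMany"]
  storageClassName: ""
  volumeName: nfs-example
  resources:
    requests:
      storage: 10Gi
EOF
```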

samip5 commented Dec 17, 2022

It seems that Longhorn was scheduled on the master, which is probably a bad thing, so I evicted it via a toleration. Let's see if that helps.
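For anyone doing the same: the usual way to keep workloads like Longhorn off a single server node is to taint it, so only pods that explicitly tolerate the taint run there. A sketch, again using the hostname from this issue:

```sh
# NoSchedule keeps new pods off the server; NoExecute would also evict
# pods already running there that don't tolerate the taint.
kubectl taint nodes k8s-master1 node-role.kubernetes.io/control-plane:NoSchedule
```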

brandond commented Dec 17, 2022

Oh yeah, that would do it. If you're going to do LH (Longhorn), try to put it on a separate physical disk from the datastore to avoid competing with it for IOPS.
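A rough sketch of that layout, assuming a second SSD dedicated to Longhorn and a Helm install; the device name and mount point are placeholders, and defaultSettings.defaultDataPath is the chart value for where replica data is stored (default /var/lib/longhorn):

```sh
# Put Longhorn's replica data on its own disk, away from the k3s datastore
sudo mkfs.ext4 /dev/sda1            # placeholder: partition on the second SSD
sudo mkdir -p /mnt/longhorn
echo '/dev/sda1 /mnt/longhorn ext4 defaults 0 2' | sudo tee -a /etc/fstab
sudo mount /mnt/longhorn

# Point Longhorn at the dedicated mount when installing/upgrading via Helm
helm upgrade --install longhorn longhorn/longhorn --namespace longhorn-system \
  --create-namespace --set defaultSettings.defaultDataPath=/mnt/longhorn
```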

samip5 commented Dec 17, 2022

Two USB enclosures with SSDs would probably end up competing for USB bandwidth (at least on a Pi 4). :)

bbkz commented Dec 17, 2022

I got myself some RasPiKeys, which are eMMC storage keys that go in the SD card slot.

caroline-suse-rancher (Contributor) commented

I'm going to convert this to a discussion just in case someone runs into the same thing. It doesn't appear to be a clear K3s bug, though.

@k3s-io k3s-io locked and limited conversation to collaborators Apr 19, 2023
@caroline-suse-rancher caroline-suse-rancher converted this issue into discussion #7317 Apr 19, 2023
