-
Notifications
You must be signed in to change notification settings - Fork 938
Description
Description
Lighthouse beacon got killed by the linux OOM killer after consuming 117GB RSS memory.
Nov 18 21:58:05 validator kernel: [1326827.608571] Out of memory: Killed process 304391 (lighthouse) total-vm:181985348kB, anon-rss:117344980kB, file-rss:0kB, shmem-rss:0kB, UID:997 pgtables:307460kB oom_score_adj:0
Looking at the logs the application seems to have got stuck in a Address updated loop where it toggles between
udp_port: 9000 and udp_port: 1046 about 500 times per second according to the logs, until it gets killed and restarts.
The problems started 21:44:21 and it got killed 21:58:05.
Nov 18 21:44:21 validator lighthouse[304391]: Nov 18 21:44:21.747 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:44:21 validator lighthouse[304391]: Nov 18 21:44:21.783 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:44:21 validator lighthouse[304391]: Nov 18 21:44:21.814 INFO Address updated ip: REDACTED, udp_port: 1046
<SIMILAR LINES CUT>
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.070 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.080 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.080 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.080 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.080 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.094 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.095 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.097 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.102 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.102 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.106 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.110 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.111 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.113 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.113 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.117 INFO Address updated ip: REDACTED, udp_port: 9000
Nov 18 21:58:05 validator lighthouse[304391]: Nov 18 21:58:05.118 INFO Address updated ip: REDACTED, udp_port: 1046
Nov 18 21:58:05 validator systemd[1]: lighthousebeacon.service: A process of this unit has been killed by the OOM killer.
Nov 18 21:58:10 validator systemd[1]: lighthousebeacon.service: Main process exited, code=killed, status=9/KILL
Nov 18 21:58:10 validator systemd[1]: lighthousebeacon.service: Failed with result 'oom-kill'.
Nov 18 21:58:10 validator systemd[1]: lighthousebeacon.service: Consumed 4d 4h 52min 37.562s CPU time.
Nov 18 21:58:15 validator systemd[1]: lighthousebeacon.service: Scheduled restart job, restart counter is at 1.
Nov 18 21:58:15 validator systemd[1]: Stopped Lighthouse Consensus Client BN (mainnet).
Nov 18 21:58:15 validator systemd[1]: lighthousebeacon.service: Consumed 4d 4h 52min 37.562s CPU time.
Nov 18 21:58:15 validator systemd[1]: Started Lighthouse Consensus Client BN (mainnet).
Nov 18 21:58:15 validator lighthouse[556287]: Nov 18 21:58:15.959 INFO Lighthouse started version: "Lighthouse/v8.0.0-e3ee7fe"
Nov 18 21:58:15 validator lighthouse[556287]: Nov 18 21:58:15.959 INFO Configured network network_name: "mainnet"
Nov 18 21:58:15 validator lighthouse[556287]: Nov 18 21:58:15.961 INFO Data directory initialised datadir: /var/lib/validator-data/lighthouse
Version
Lighthouse/v8.0.0-e3ee7fe. (binary release from github)
Present Behaviour
After restart it behaves normal.
Expected Behaviour
The application should never get into this state and should not leak memory while doing so.
Steps to resolve
Restart lighthouse beacon