Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v1.9.0-rc.0: Memory leak #5549

Closed
twoeths opened this issue May 26, 2023 · 6 comments
Closed

v1.9.0-rc.0: Memory leak #5549

twoeths opened this issue May 26, 2023 · 6 comments
Milestone

Comments

@twoeths
Copy link
Contributor

twoeths commented May 26, 2023

Describe the bug

rss keeps going up on beta mainnet node

Screenshot 2023-05-26 at 16 01 45

Expected behavior

expect it to be like stable mainnet node

Screenshot 2023-05-26 at 16 02 17

Steps to reproduce

it also happens on all goerli beta nodes

Additional context

No response

Operating system

Linux

Lodestar version or commit hash

v1.9.0-rc.0

@twoeths
Copy link
Contributor Author

twoeths commented May 29, 2023

looking in to /proc/${pid}/maps file and compare to stable mainnet node, 2 candidates I found

13939:7f9958600000-7f9958601000 r-xp 00000000 09:02 27007162                   /usr/src/lodestar/node_modules/utf-8-validate/prebuilds/linux-x64/node.napi.node   =======> 1st candidate
13961:7f9958a00000-7f9958a02000 r-xp 00000000 09:02 26935736                   /usr/src/lodestar/node_modules/bufferutil/prebuilds/linux-x64/node.napi.node =======> 2nd candidate

both point to web3 dev dependencies of prover packages (cc @nazarhussain ), but they should not be loaded in production environment

I reinstalled node modules with yarn --production and add NODE_ENV=production before our node command, proc maps file does not show dev dependencies, will need monitor the issue again

Update: still got the issue

Screenshot 2023-05-29 at 11 12 49

@twoeths
Copy link
Contributor Author

twoeths commented May 29, 2023

another direction is to review metrics in the last 30 days

Screenshot 2023-05-29 at 10 31 44

the issue happens since May 23, need to investigate commits around that day

git log --oneline --since="2023-05-22 00:00" --until="2023-05-23 23:59"
186a5e8899 fix(validator): improve error handling in attestation service (#5511)
4fd72b2b61 refactor: change archiving strategy to always store last finalized (#5520)
42c80973a5 feat: more metrics for sync committee message validation (#5516)
504a9d364f test: update the sim tests to simplify assertion logic (#5501)

@philknows philknows added this to the v1.9.0 milestone May 29, 2023
@twoeths
Copy link
Contributor Author

twoeths commented May 30, 2023

42c8097 in the last 24h

Screenshot 2023-05-30 at 11 37 29

4fd72b2b61 in the last 24h

Screenshot 2023-05-30 at 11 38 21

#5520 is the cause for this issue

@dapplion
Copy link
Contributor

dapplion commented Jun 1, 2023

@dapplion dapplion closed this as completed Jun 1, 2023
@dapplion dapplion reopened this Jun 1, 2023
@twoeths
Copy link
Contributor Author

twoeths commented Jun 6, 2023

@dapplion this is not the same issue, this issue is merely about rss and node was not crashed while #5349 is related to heap memory

@twoeths
Copy link
Contributor Author

twoeths commented Jun 24, 2023

since we revert #5520, this issue does not happen in the final v1.9.0. If we improve any strategies in archiving state, need to refer to this issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants