fatal error: concurrent map read and map write #4807

Closed
wknechtel opened this issue Nov 21, 2023 · 3 comments
Labels
defect Suspected defect such as a bug or regression

Comments

@wknechtel

Observed behavior

Seven days ago I upgraded NATS server from 2.8.4 to 2.10.5. We have a three-node cluster and I performed a rolling upgrade. There seemed to be no immediate consequences to this upgrade, and things ran well, until two days ago, when I started seeing odd errors in the logs from apps that use NATS and discovered that two of my three nodes had crashed. The system logs on those boxen both showed the same odd error: "fatal error: concurrent map read and map write", followed by a massive stack dump. Googling revealed some generic golang-related articles, but nothing NATS-specific. I had two more crashes today, same error. I'm attaching the logged stack dump.

There doesn't seem to be a specific cause that I can relate to this. Looking through the message histories we keep reveals no common message or event that correlates with the crashes. They occurred at different times of the day and, of course, on different hosts. Message volume is not particularly intense when the crashes occur. I'm at a loss.
 
nats-kablooey.log
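For context on the error itself: the Go runtime raises "fatal error: concurrent map read and map write" when it detects one goroutine reading a built-in map while another writes to it without synchronization, and this kind of fatal error cannot be caught with `recover`. The sketch below is a generic illustration of guarding a map with `sync.RWMutex`. It is not the NATS server code or the actual fix, and `guardedMap`, `newGuardedMap`, and `fill` are hypothetical names made up for this example.

```go
package main

import (
	"fmt"
	"sync"
)

// guardedMap is a hypothetical wrapper (not a NATS type) that
// serializes access to a plain Go map with a read/write mutex.
type guardedMap struct {
	mu sync.RWMutex
	m  map[string]int
}

func newGuardedMap() *guardedMap {
	return &guardedMap{m: make(map[string]int)}
}

// set takes the write lock, so no reader or other writer runs concurrently.
func (g *guardedMap) set(key string, v int) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.m[key] = v
}

// get takes the read lock; many readers may hold it at once.
func (g *guardedMap) get(key string) (int, bool) {
	g.mu.RLock()
	defer g.mu.RUnlock()
	v, ok := g.m[key]
	return v, ok
}

// size returns the number of stored keys under the read lock.
func (g *guardedMap) size() int {
	g.mu.RLock()
	defer g.mu.RUnlock()
	return len(g.m)
}

// fill starts n writer goroutines and n reader goroutines that touch
// the same map at the same time, then waits for all of them.
func fill(g *guardedMap, n int) {
	var wg sync.WaitGroup
	for i := 0; i < n; i++ {
		wg.Add(2)
		go func(i int) {
			defer wg.Done()
			g.set(fmt.Sprintf("k%d", i), i)
		}(i)
		go func(i int) {
			defer wg.Done()
			g.get(fmt.Sprintf("k%d", i)) // result ignored; only the access matters
		}(i)
	}
	wg.Wait()
}

func main() {
	g := newGuardedMap()
	fill(g, 100)
	fmt.Println(g.size()) // prints 100
}
```

Removing the locks from `set` and `get` would make the same program liable to abort with the fatal error quoted above, although not necessarily on every run. Building a Go program with the `-race` flag is a common way to surface such unsynchronized accesses during testing.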

Expected behavior

Not crash?

Server and client version

Server: 2.10.5
Clients: NPM versions 2.7.1, 2.17.0, 2.9.0, 2.12.1, 2.8.0, 2.10.2 (Various scripts have installed whatever version was current at the time the script was authored)

Host environment

VMs running Ubuntu LTS. One is on 20.04.4, the others on 18.04.1.

Steps to reproduce

No response

@wknechtel wknechtel added the defect Suspected defect such as a bug or regression label Nov 21, 2023
@derekcollison
Member

We are aware of the issue in general and already have a solution on top of main. It will be included in the 2.10.6 release, which should probably be out next week.

@derekcollison
Member

Depending on your situation, you could wait for the official release or run a nightly build in the interim.

@wknechtel
Author

Derek, thank you for the speedy response. We look forward to the next release!
