Crash "Fatal lock manager error: Process disappeared in LockManager::acquire_shmem" #7809

Closed
gerdus opened this issue Oct 18, 2023 · 16 comments

@gerdus

gerdus commented Oct 18, 2023

Log message:
Wed Oct 18 07:54:39 2023 Fatal lock manager error: Process disappeared in LockManager::acquire_shmem, errno: 22 --Invalid argument

Core dump: https://infireal.com/github-issues/firebird-crash-2023-10-18/

Version: V4.0.4.2997 snapshot binaries from the Firebird website
OS: Ubuntu 22.04.3 LTS

@AlexPeshkoff
Member

@gerdus Have you seen #7762 (comment) ?

@gerdus
Author

gerdus commented Oct 19, 2023

Sorry, I did not realize there was no snapshot archive on the website from which to get a specific snapshot build.

Bin: https://infireal.com/github-issues/firebird-crash-2023-10-18/Firebird-4.0.4.2997-0.amd64.tar.gz
Debuginfo: https://infireal.com/github-issues/firebird-crash-2023-10-18/Firebird-debuginfo-4.0.4.2997-0.amd64.tar.gz

@AlexPeshkoff
Member

AlexPeshkoff commented Oct 24, 2023 via email

@gerdus
Author

gerdus commented Oct 27, 2023

New crash this morning with the same log message:

Fri Oct 27 11:40:03 2023 Fatal lock manager error: Process disappeared in LockManager::acquire_shmem, errno: 0
Fri Oct 27 11:41:42 2023 /opt/firebird/bin/fbguard: /opt/firebird/bin/firebird terminated abnormally (-1)
Fri Oct 27 11:41:42 2023 /opt/firebird/bin/fbguard: guardian starting /opt/firebird/bin/firebird

Core Dump: https://infireal.com/github-issues/firebird-crash-2023-10-27/

@AlexPeshkoff AlexPeshkoff self-assigned this Oct 31, 2023
@AlexPeshkoff AlexPeshkoff changed the title from: Crash "Fatal lock manager error: Process disappeared in LockManager::acquire_shmem, errno: 22" to: Crash "Fatal lock manager error: Process disappeared in LockManager::acquire_shmem" Oct 31, 2023
AlexPeshkoff added a commit that referenced this issue Oct 31, 2023
@AlexPeshkoff
Member

Gerdus, please try this special build: https://drive.google.com/drive/folders/1sRlWtC2fYsPtL-Gzq0x6HpPj53ciuN6H?usp=sharing
It is exactly the snapshot you are currently using plus the fix for this bug. I expect it to help, but I certainly need your feedback.

@gerdus
Author

gerdus commented Oct 31, 2023

OK, I have deployed your build. I will monitor it and let you know the result.
Thank you.

@gerdus
Author

gerdus commented Nov 13, 2023

It has been two weeks without a crash, so I think you can close this as resolved. Thank you very much.

@fperana

fperana commented Nov 15, 2023

@AlexPeshkoff in your opinion, could this issue and issue #7762 be related to my issues #7371 and #7530? I'm still running your patched 4.0.2 version (no time to install 4.0.3), but it still crashes now and then. The problems reported here are similar or identical to the ones I'm experiencing.
I'm now thinking of upgrading to 4.0.4, but this issue makes me wonder whether it would be better to try the latest fix you've posted here.

@AlexPeshkoff
Member

Very possible. When we deal with race-related bugs, it is often a matter of luck whether the core dump that was caught contains useful information. @gerdus managed to get a useful dump, which made the cause clear and the fix possible. I'm sure that all of these issues are race-related; otherwise they would be much simpler to reproduce and fix. But certainly nobody can guarantee that there are no other races around.

@fperana

fperana commented Nov 22, 2023

Yes, of course, as a programmer I understand that very well. I'll try your fixed version in production then, and I hope it will fix those rare but still annoying crashes.
Just for info: 4.0.4 was released only two weeks ago. Do you plan to release 4.0.5 soon, given that you've hopefully fixed a server crash condition?

@AlexPeshkoff
Member

I doubt that a fix for a single problem, which is not a regression in 4.0.4, is enough reason for a 4.0.5 release.

@fperana

fperana commented Nov 22, 2023

Just another question: does this build contain the fix for #7772?

@AlexPeshkoff
Member

No, it was split off 2 days before the fix for #7772. But I hope you will have a snapshot with both very soon.

@TommiPrami
Contributor

TommiPrami commented Nov 22, 2023 via email

@fperana

fperana commented Nov 22, 2023

@AlexPeshkoff so I'll wait for that snapshot. How can I tell when it's ready, or will you prepare it for me? The best option would be a 4.0.4 release plus the fix.

@TommiPrami we're talking about a special build with a fix for the subject of this thread.

@AlexPeshkoff
Member

The snapshot should be ready tomorrow: I've merged the fix into the release branch. Just make sure it's fresh (current date). I also recommend checking http://firebirdtest.com/ to see how it passed regular QA. Note that this is exactly the same QA that is used for releases.
