[core] fix session deadlock (#2290) #2300
base: master
Conversation
@seven1240 Teacher Du and I explained this issue in detail.
@andywolk cdeveop has a system which left 3-10 deadlocked channels per day, which could be fixed by this patch. Reviewing the patch, the read_frame parts look fine, since the message will be processed on the next read. But I'm worried about the write parts. What if this happens on a sendonly channel that never reads, or in an app that only writes and never reads (I can't think of such an app offhand)? Then the message will never be processed. I think it is best to find the root cause of the deadlock instead of working around it.
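To make the concern above concrete, here is a toy Python model. It is not FreeSWITCH code and does not reproduce the patch: `ToySession`, `receive_message`, `read_frame`, `write_frame`, and `run` are all hypothetical names, and the rule "a message that cannot be handled right away is parked and only drained on the next read" is an assumption taken from the comment, not from the source tree.

```python
from collections import deque


class ToySession:
    """Toy model of a session that parks messages it cannot handle immediately."""

    def __init__(self):
        self.deferred = deque()  # messages parked for later processing
        self.handled = []        # messages that were actually processed

    def receive_message(self, msg, lock_busy):
        # Assumed rule: park the message instead of blocking on a busy lock.
        if lock_busy:
            self.deferred.append(msg)
        else:
            self.handled.append(msg)

    def read_frame(self):
        # Assumption: only the read path drains parked messages.
        while self.deferred:
            self.handled.append(self.deferred.popleft())
        return b"frame"

    def write_frame(self, frame):
        # Assumption: the write path never drains the queue.
        return len(frame)


def run(session, reads, writes):
    """Park one message, then perform the given number of writes and reads.

    Returns the messages that are still unprocessed afterwards.
    """
    session.receive_message("hold", lock_busy=True)
    for _ in range(writes):
        session.write_frame(b"audio")
    for _ in range(reads):
        session.read_frame()
    return list(session.deferred)
```

Under these assumptions, `run(ToySession(), reads=0, writes=100)` leaves the parked message unprocessed no matter how many frames are written, while a single read is enough to drain it. Whether the real write path behaves this way is exactly the open question raised in the comment.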
Hello, any news about this commit?
I have an installation where 30-40 inbound and outbound calls are getting deadlocked on a daily basis, and the call center agent connected to the deadlocked call also gets stuck until a restart or until the extension leg of the call is hung up. #2387 and #2390 resolved neither the deadlocks nor the stuck call center agents. This fix has stopped all deadlocks, and consequently there are no more stuck call center agents.
Same problem in my instance.
I can also report that this patch fixes a similar issue with stuck channels and stuck call center agents in my installation.
Why not make it pass the unit tests so that it can be merged? Many people seem to be experiencing this problem.
@jakubkarolczyk can you check it?
Can I bump this thread? This issue is a real problem, and a lot of people are affected by it.
Yesterday on Office Hours I asked about this and the related tickets which exhibit a similar problem. BKW said that it is a much deeper problem than is solved by the proposed patch and that they are refactoring the code to solve everything at once. So it is a bigger effort than we thought, no ETR.
Before I open another ticket like this, I am curious whether the issue I am finding could be related to this one. I have found that I am getting a lot of stuck calls in this scenario. This does seem like something that started or got worse after upgrading from 1.10.7, on which we were getting stuck calls but at a low rate: say 10 per week versus 100 per day. Getting a carrier to change some behaviour reduced the re-invites and does seem to have directly affected the number of dead calls. Let me know if I should submit a different report for this; I can provide more details there.
Is there any progress/update related to this issue?
I upgraded several servers from FreeSWITCH 1.10.8 to 1.10.12. This error showed up, and it is creating dozens of stuck calls per day as well as numerous client complaints on a daily basis. 1.10.8 does not have the issue. Is there any update on this? It seems like a problem that has been going on for a long time. FusionPBX support said the solution is to apply this patch and compile from source. Many users have reported that this resolves the problem. I have wanted to use the signalwire packages on debian, but they do not work due to this issue. If we can't get a solution to this, I'll need to start compiling from source.
We've resolved the issue by migrating away from prebuilt packages and compiling FS from source with this patch, and it works perfectly. However, I'm disappointed that after almost a year, this hasn't been included in the release despite its impact on many users.
My understanding from what the core developers have said is that this ticket and at least 3 others are a combination of problems deep down in the core that require extensive refactoring of source code to correct. They found some long-standing bugs in the process and aim to correct all of them in this effort. This patch works for some, but not others, and that is why. I'm as anxious as you for a permanent fix.
#2290