[2.5 rc4] Users hitting 403 (tricky to reproduce) #15070

schrd · 2022-05-23T21:20:52Z

Describe the bug
In a load test with bots users got kicked out from the meeting with a 403 "you have been removed from the meeting" message. There was an unintended configuration problem on the server which resulted in all listen only participants being connected to freeswitch instead of mediasoup. Freeswitch then consumed all available CPU on the server, top showed 0.x% idle CPU. Not all bots were able to connect and few of the humans in the meeting were kicked out.

To Reproduce
I don't know how I can reproduce this. It happend only once in several tests.

BBB version:
BBB 2.5 rc4

Desktop (please complete the following information):

OS: Linux
Browser Firefox
Version 91 ESR

Additional context

I don't know if this behaviour of kicking out users in an overload situation is intended. If it is then there should be a different message. If you don't consider this a serious bug, I'm fine with this. Just didn't want to ignore our observation

ffdixon · 2022-05-23T21:35:42Z

Thanks for reporting this issue @schrd.

mokazemi · 2022-06-16T18:02:15Z

We've also experienced this for some users in the meetings. (in both desktop and android). I also don't know where it's coming from.
403 "you have been removed from the meeting" message.
Even if the user was moderator.

There was an unintended configuration problem on the server which resulted in all listen only participants being connected to freeswitch instead of mediasoup.
...

But I haven't changed any specific configurations. Just a bit of playing with audio/video bitrates.
Next time I'll try to check browser console logs when I experienced the same issue again, to check if it has any useful information.

mokazemi · 2022-06-25T15:07:02Z

This issue happened to me again!
I noticed two things:

My internet connection was so bad, and it went on reconnecting state a few times, then suddenly I saw 403 Error
I saw this error in the console logs:

Uncaught (in promise) Call to chatMessageBeforeJoinCounter failed because Meteor is not connected

I hope it helps.

ffdixon · 2022-06-25T17:23:36Z

Thanks for sharing this @mokazemi. It looks like the client may have just lost connection to the server (after trying to reconnect). Do you know if any other users in the session experienced the same issue (i.e. the problem looks closer to your internet connection and not with the server)?

ffdixon · 2022-07-23T11:37:26Z

We've released 2.5.4 and we're still tracking this issue.

We starting to look at ValidateAuthToken/the reconnection procedure. Our theory is there may be a reconnection issue that, when triggered, will flood meteor with events, which causes a CPU spike and similar to FreeSWITCH using all the memory, causes clients to disconnect with a 403 error.

Of course, the challenge is right now to replicate this reconnection issue. @schrd We'd be interested if your able to force this happening in that release under testing load.

MBM1607 · 2022-07-25T07:38:35Z

@ffdixon we are facing this same issue, Its reproduceable under stress testing.

MBM1607 · 2022-07-25T08:32:09Z

We were able to reproduce the issue by joining with 8+ users, and remove connection to force reconnecting attempts.

Error

Uncaught TypeError: a.getSubscription(...) is null

Uncaught (in promise) Call to chatMessageBeforeJoinCounter failed because Meteor is not connected

BrentBaccala · 2022-08-04T04:02:46Z

I can reproduce this consistently.

I've seen the Call to chatMessageBeforeJoinCounter error, but it seems to come after the 403 error, so I think it's incidental to the issue.

Still investigating.

BrentBaccala · 2022-08-04T19:39:14Z

This is reproducible on 2.6.0-alpha.2, in addition to 2.5.4.

ffdixon · 2022-08-09T12:24:08Z

In reproducing the issue on 2.5, does it matter if allowDuplicateExtUserid is set to true or false? See

https://groups.google.com/g/bigbluebutton-setup/c/Qefm8dduv5Y/m/SYThs_6uAQAJ

blueiceprj · 2022-08-09T20:12:36Z

Hi. If you tracking websocket connections over userid and the parameter (allowDuplicateExtUserid) set true it could be a problem. Best solution to handle websocket connection without problem is using stomp broker (rabbitmq). With this method you can solve websocket connection problems( reconnect, heartbeat, etc) and also it will help to solve you access bbb over the load balancer. We have based on spring cloud app and multiple nginx and gateway. It was only one option for us.

BrentBaccala · 2022-08-11T22:34:38Z

@ffdixon, allowDuplicateExtUserid doesn't seem to make a difference

MBM1607 · 2022-10-13T11:07:25Z

@ffdixon After upgrading to 2.5.6, I am no longer able to reproduce this issue. Previously on version 2.5.4, I was able to join using firefox multi-containers and then trigger reconnections by turning the connection on and off repeatedly.

ffdixon · 2022-10-13T11:37:32Z

@ffdixon After upgrading to 2.5.6, I am no longer able to reproduce this issue. Previously on version 2.5.4, I was able to join using fire multi-containers and then trigger reconnections by turning the connection on and off repeatedly.

That's very positive feedback -- thanks for sharing!

MBM1607 · 2022-10-13T12:04:12Z

Is it possible that it was resolved due to #15723? seems relevant to me,

ffdixon · 2022-10-13T12:44:59Z

Check on your server, that setting is currently false by default

https://github.com/bigbluebutton/bigbluebutton/blob/v2.5.x-release/bigbluebutton-html5/private/config/settings.yml#L210

But if you enable it, it will reduce the load when users fall back to long polling (which isn't very efficient and too many users doing long polling could cause disconnects for others, which is why we introduced this setting).

ffdixon · 2022-10-27T20:00:28Z

Hi Brent, I'm curious on your tests with the latest 2.5.8 regarding 403 disconnects.

BrentBaccala · 2022-10-28T04:37:54Z

@ffdixon, an hour-long test with ten clients and a modest level of broken TCP sessions (all sessions broken once every ten seconds) yielded no client disconnects of any kind.

I'd still like to collect some more data on this issue, though.

ffdixon · 2022-10-28T09:23:35Z

@ffdixon, an hour-long test with ten clients and a modest level of broken TCP sessions (all sessions broken once every ten seconds) yielded no client disconnects of any kind.

Thanks Brent! Keep pushing the boundaries, but very positive indeed.

Davka · 2023-04-14T11:27:52Z

I I'm not sure if it's the same bug, but we have a similar problem in 2.6.1
The meeting sizes were limited to 40 participants and after a certain time even moderators / presenters are removed from the meeting with the 403 message. Re-entry is not possible. Even afterwards, if there are only 4 people in the meeting, for example, each new participant gets the 403 message.

The browser console shows following error.

I haven't found anything in the logs yet

@antobinary

hostbbb · 2023-04-14T13:28:40Z

so it looks like max_participant counter is rejecting in above browser console. Wonder if reconnects of users keep adding to the meeting count somehow. A good test could be to set max_participant to 3 for a meeting, and play around with brower refreshs and new users joining. Let me try to replicate in 2.6.1

hostbbb · 2023-04-24T11:08:42Z

#17699

See this for 2.6.4 example. can replicate with maxParticipants set to 2

schrd added the module: client label May 23, 2022

antobinary added this to the Release 2.5 milestone May 31, 2022

antobinary added the status: verify label May 31, 2022

antobinary changed the title ~~[2.5 rc4]~~ [2.5 rc4] Users hitting 403 (tricky to reproduce) May 31, 2022

mokazemi mentioned this issue Jul 23, 2022

Users ejected from Meetings #15430

Closed

antobinary modified the milestones: Release 2.5, Release 2.7 Jan 31, 2023

antobinary assigned paultrudel Apr 14, 2023

flausen mentioned this issue Apr 24, 2023

[2.6.1/2.6.3] 403 no one is able to join the meeting (problem max participants) #17699

Closed

paultrudel mentioned this issue May 4, 2023

fix: maxParticipants check counting users that have left #17814

Merged

1 task

antobinary closed this as completed May 8, 2023

antobinary modified the milestones: Release 2.7, Release 2.6 May 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[2.5 rc4] Users hitting 403 (tricky to reproduce) #15070

[2.5 rc4] Users hitting 403 (tricky to reproduce) #15070

schrd commented May 23, 2022

ffdixon commented May 23, 2022

mokazemi commented Jun 16, 2022 •

edited

mokazemi commented Jun 25, 2022

ffdixon commented Jun 25, 2022

ffdixon commented Jul 23, 2022

MBM1607 commented Jul 25, 2022

MBM1607 commented Jul 25, 2022

BrentBaccala commented Aug 4, 2022

BrentBaccala commented Aug 4, 2022

ffdixon commented Aug 9, 2022

blueiceprj commented Aug 9, 2022

BrentBaccala commented Aug 11, 2022

MBM1607 commented Oct 13, 2022 •

edited

ffdixon commented Oct 13, 2022

MBM1607 commented Oct 13, 2022

ffdixon commented Oct 13, 2022

ffdixon commented Oct 27, 2022

BrentBaccala commented Oct 28, 2022

ffdixon commented Oct 28, 2022

Davka commented Apr 14, 2023

hostbbb commented Apr 14, 2023

hostbbb commented Apr 24, 2023

[2.5 rc4] Users hitting 403 (tricky to reproduce) #15070

[2.5 rc4] Users hitting 403 (tricky to reproduce) #15070

Comments

schrd commented May 23, 2022

ffdixon commented May 23, 2022

mokazemi commented Jun 16, 2022 • edited

mokazemi commented Jun 25, 2022

ffdixon commented Jun 25, 2022

ffdixon commented Jul 23, 2022

MBM1607 commented Jul 25, 2022

MBM1607 commented Jul 25, 2022

Error

BrentBaccala commented Aug 4, 2022

BrentBaccala commented Aug 4, 2022

ffdixon commented Aug 9, 2022

blueiceprj commented Aug 9, 2022

BrentBaccala commented Aug 11, 2022

MBM1607 commented Oct 13, 2022 • edited

ffdixon commented Oct 13, 2022

MBM1607 commented Oct 13, 2022

ffdixon commented Oct 13, 2022

ffdixon commented Oct 27, 2022

BrentBaccala commented Oct 28, 2022

ffdixon commented Oct 28, 2022

Davka commented Apr 14, 2023

hostbbb commented Apr 14, 2023

hostbbb commented Apr 24, 2023

mokazemi commented Jun 16, 2022 •

edited

MBM1607 commented Oct 13, 2022 •

edited