
[WIP] updated disconnection logic and clean up unused noise #381

Merged (1 commit, Oct 16, 2022)

Conversation

digitaldan (Contributor)

Signed-off-by: Dan Cunningham <dan@digitaldan.com>

@digitaldan digitaldan self-assigned this Aug 22, 2022
@digitaldan digitaldan force-pushed the disconnect-cleanup branch 4 times, most recently from 344f4ea to 810415e on August 22, 2022 05:22
@digitaldan digitaldan changed the title [WIP] updated disconnection logic [WIP] updated disconnection logic and clean up unused noise Aug 22, 2022
app.js Outdated
```
@@ -458,41 +428,31 @@ io.use(function (socket, next) {
io.sockets.on('connection', function (socket) {
    logger.info('openHAB-cloud: Incoming openHAB connection for uuid ' + socket.handshake.uuid);
    socket.join(socket.handshake.uuid);
    // Remove openHAB from offline array if needed
    delete offlineOpenhabs[socket.handshake.uuid];
    Openhab.findOne({
```

@ssalonen commented Aug 22, 2022

This could be replaced with an atomic pre = findOneAndUpdate(.., returnOriginal=true) operation?
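
A minimal sketch of what that could look like against the Mongoose Openhab model in app.js; the update fields and the callback body here are illustrative rather than the PR's exact code:

```js
// Sketch only: one atomic read-and-write instead of findOne() + save().
// With { new: false } (Mongoose's spelling of returnOriginal = true),
// the callback receives the document as it was *before* the update.
Openhab.findOneAndUpdate(
    { uuid: socket.handshake.uuid },
    { status: 'online', last_online: new Date() },
    { new: false },
    function (error, pre) {
        if (error || !pre) {
            return;
        }
        // "pre" holds the previous state, so an offline -> online
        // transition can be detected without a second Mongo call
        if (pre.status === 'offline') {
            // send the "connected" notification here
        }
    }
);
```

Since the pre-update document comes back from the same round trip, the separate read that findOne() was doing goes away entirely.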

@digitaldan (Contributor, Author)

Nice find! Yes, I think we could in fact use this method, and as an extra bonus, save ourselves an extra Mongo call as well. I eliminated another expensive write to an unused collection in this function, so combined with this change, both could have a real positive impact on performance when we have to restart a container and thousands of openHABs try to reconnect.

@digitaldan (Contributor, Author)

Done!


👍

app.js Outdated
```
    }
});

//actually would redis be better to store? How would we coordinate who send notification?
```

@ssalonen commented Aug 22, 2022


Indeed, is this notification sending a completely independent task? It could be triggered regularly, by checking Redis for instances to notify?

Basically a completely different process.

Would offlineOpenhabs then be a Redis-backed object?
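
A rough sketch of the Redis-backed variant being floated here (not what this PR implements); the key name, grace period, and notification hook are all made up for illustration:

```js
var redis = require('redis');
var client = redis.createClient();

// On disconnect: record when this openHAB went offline.
function markOffline(uuid) {
    client.hset('offlineOpenhabs', uuid, String(Date.now()));
}

// On reconnect: drop the pending entry so no notification goes out.
function clearOffline(uuid) {
    client.hdel('offlineOpenhabs', uuid);
}

// A separate notifier process (or a single elected node) polls the hash
// and notifies for anything offline longer than the grace period.
setInterval(function () {
    client.hgetall('offlineOpenhabs', function (error, entries) {
        if (error || !entries) {
            return;
        }
        Object.keys(entries).forEach(function (uuid) {
            if (Date.now() - Number(entries[uuid]) > 5 * 60 * 1000) {
                clearOffline(uuid);
                // sendOfflineNotification(uuid); // hypothetical helper
            }
        });
    });
}, 60 * 1000);
```

With one shared hash and one poller, the "who sends the notification" question reduces to which process runs the notifier loop.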

@digitaldan (Contributor, Author)

So, that was a note I left to myself when running through the code; I did not mean to check it in :-)

If I were starting from scratch or doing a major refactor, then yes, I would use Redis (or some queue/messaging-style backend) to persist this kind of state in a more distributed-friendly fashion. Again, I refrained from rewriting too much, to keep the changes small so that if something does go wrong it's easier to debug, and quicker to get this out.


Agree, makes sense not to bundle it in here.

@digitaldan digitaldan force-pushed the disconnect-cleanup branch 5 times, most recently from dd1b618 to a074496 on August 25, 2022 03:27
@digitaldan (Contributor, Author)

I ended up removing a bunch of dead code that has never been used. I also put a size cap on our notification log, as that collection in Mongo is uncapped and has grown to over 40 GB in production for no good reason.
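
For reference, a sketch of how such a cap can be expressed in a Mongoose schema; the schema name and limits below are illustrative, not the PR's actual values:

```js
// Capped collections discard the oldest documents once the limit is hit,
// so the log can no longer grow without bound.
var NotificationLogSchema = new mongoose.Schema({
    user: mongoose.Schema.Types.ObjectId,
    message: String,
    created: { type: Date, default: Date.now }
}, {
    capped: { size: 256 * 1024 * 1024, max: 1000000 } // ~256 MB, at most 1M docs
});
```

Note that an already-existing uncapped collection has to be converted separately (MongoDB's convertToCapped command); the schema option only takes effect when the collection is first created.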

Signed-off-by: Dan Cunningham <dan@digitaldan.com>
@ssalonen commented Sep 19, 2022

For the record, this potentially resolves a race condition which leads to the cloud thinking that a client is offline even though it is connected. See #134 (comment) for more info.

Quoting from our 1on1 discussion:

The only failure scenario that comes to mind is a race condition:

  1. Client disconnects on node X; code continues to prepare to send the “disconnected” notification and store the new status to Mongo
  2. Node X hangs for a moment
  3. Client reconnects on node Y; code continues to prepare to send the “connected” notification and store the new status to Mongo
  4. Process on node Y commits to Mongo (online status)
  5. Process on node X commits to Mongo (offline status)

I realized that any false online status will always be cleared by ping/pong. However, there is no mechanism in place to correct a false offline status.

This PR utilizes unique IDs for sessions, combined with Mongo atomic query/update commands, to resolve the possible race condition.
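
In outline, the guarded disconnect path looks something like this; the connectionId field name and the notification step are paraphrased from the approach rather than copied from the diff:

```js
// Only commit "offline" if this socket still owns the session. A stale
// disconnect from node X matches nothing, because node Y has already
// stored a newer connectionId, so the false offline write is a no-op.
Openhab.findOneAndUpdate(
    { uuid: socket.handshake.uuid, connectionId: socket.connectionId },
    { status: 'offline', last_online: new Date() },
    function (error, openhab) {
        if (error || !openhab) {
            return; // a newer connection owns this session; nothing to do
        }
        // safe to schedule the "disconnected" notification here
    }
);
```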

@ssalonen

@digitaldan, how do we get this merged?

@digitaldan (Contributor, Author)

Yeah, I have been procrastinating a little, as I need to block off time to do a proper deployment to the general service once we merge. I also then need to remove a few unused (but very large) collections from Mongo. I'll probably do that Sunday, and I'll post something to the forums later today about the upcoming maintenance.
