Run DELETE on expired session based on a timed check instead of a random factor. #13

paquettea · 2015-02-11T02:55:01Z

Since the random method is based on a random factor match per request, more requests means more expiry checks in the database. Since expiry is based on time, I though it would make sense to base the checks frequency on a timed delay instead. This was added as an option, defaulting to the original random factor checks. I've also updated the readme to reflect my changes.

paquettea · 2015-02-23T19:06:50Z

@voxpelli do you have any feedback on this, if it's possible to merge or not ? Thanks.

voxpelli · 2015-02-23T20:12:37Z

Haven't had time to look at and think about this yet unfortunately.

Am I understanding it correctly that the purpose is to lessen the performance impact the random delete might cause? Or are there any other purpose for choosing this new option rather than the old one? If we are to support both we should probably explain in the Readme when one might want to chose this option rather than the standard one so that it's easy for new users to figure out.

paquettea · 2015-02-23T20:19:21Z

I've already updated the readme to reflect this option and difference between one or the other method. Default behavior is keeping random check for retro-compatibility. I've simply added the options to use a timed check if preferred.

The timed check is indeed to lessen the number of delete queries, because lots of requests means more checks, and is not relevant on elapsed time, on which expiry is based

hugowetterberg · 2015-02-24T09:52:03Z

Instead of internalising different approaches for when to purge expired sessions, as different projects can have varying needs, it might be simpler to just make the "should we purge"-logic extensible: hugowetterberg@1f5e6b0

paquettea · 2015-02-24T13:42:48Z

What your suggesting is indeed a good approach for flexibility.

Here my reflection on all this:

I think the question should be "why should it be an option?". We are talking about the "garbage collection" of elapsed session (since the SELECT makes sure that you never retrieve a timed out session). I don't see the need to cleanup the same data ~5000 times because there is 10000 requests at the same moment; especially because we are comparing a TIMED value. The first request would do the garbage collection for all elapsed sessions, and the 4999 other requests would run for nothing because time hasn't changed. It's an extreme example, but it's to make my point on the fact that the check should be only timed, because we are comparing time values.

Providing an option to have your own custom garbage collection is a nice to have, but I can't see when it could be useful to do this differently because the "need" represented here doesn't vary. Garbage collection and that's it. The only option should be to set a delay on which you want garbage collection to occur, in case you have a lot of short sessions occurring in your program that leaves a lot of garbage data in a short period of time.

Does any of that makes sense ? Thanks for your input !

focusaurus · 2015-02-24T14:38:53Z

I think @paquettea's analysis is solid here. I see no reason to issue a DELETE statement more frequently than some minimum threshold (perhaps 1 minute). I also see no reason this particularly needs to be tied to an incoming request arriving. I think basically a setInterval (maybe plus some sanity checking) would suffice.

arsnl · 2015-02-24T15:10:53Z

+1 on @paquettea solution

paquettea · 2015-02-24T16:00:00Z

I will update my PR soon to use setInterval.

focusaurus · 2015-02-24T16:05:41Z

OK great. FYI a simple configuration optional setting of the interval value with a default of 1 minute will probably get it done.

hugowetterberg · 2015-02-24T16:18:47Z

+1 with caveat: if the aim isn't flexibility it's probably better to just replace the existing purge logic entirely instead of supporting both. Can't see any real benefits with multiple solutions.

hugowetterberg · 2015-02-24T16:25:20Z

And that was not a "Bohoo!" over my solution not getting traction ;) I just don't see the value proposition in keeping the "random" method.

paquettea · 2015-02-24T16:27:46Z

haha! don't worry, I understood the right intention with the first comment :)

here is the (final ?) solution I propose :

paquettea@311f5ce

paquettea · 2015-02-24T16:30:58Z

along with paquettea@044587e

fixed self this closure issue in delete query callback

paquettea · 2015-02-24T17:38:52Z

ran the last version in my own project and seems to do the trick

voxpelli · 2015-03-06T14:52:04Z

I agree that this solution is a better one (less arbitrary as #15 points out). I've created a new branch where I've pulled it in and where I've done some alterations to it to match the coding style of the rest a bit more: https://github.com/voxpelli/node-connect-pg-simple/tree/session-pruning

Changes I've done on top of the code from here (which I squashed and pulled in its entirety):

Moved from setInterval() to setTimeout() – the interesting part is the delay between the end of one call and the start of the next call, not the time between the start of two calls.
As a consequence of moving to setTimeout() the danger of two automatic pruning processes overlapping each other is gone and therefore I see no need for the this.isPruningSessions lock that this PR added. If projects using this module wants to add additional pruning mechanisms they can deal with ensuring they don't overlap themselves. I therefore removed this.isPruningSessions to simplify things.
Added a errorLog configuration parameter so that the error message on failed pruning can be logged using the standard logging library of the project's implementing this module – like Bunyan which I myself use in some projects.
Documented the pruneSessions() method in the README
Added a callback method to the pruneSessions() so that external code can know the the pruning is done and act on any error the pruning resulted in

I also added another branch on top of this branch which added a full-blown linting setup, using the Lintlovin Grunt boilerplate, and that will be connected to Travis CI once the branch for this PR has been merged.

Any feedback on the session-pruning branch before I merge it? Will try to merge and release it later during this weekend.

paquettea · 2015-03-06T15:03:21Z

That's an even better implementation. looking forward for the merge of session-pruning branch ! :)
Thanks!

focusaurus · 2015-03-06T15:25:45Z

We might want to at least consider and document the implications for this in a clustered/multihost deployment. I don't think there's any harm in running the DELETE sql too frequently but it's worth consideration I think.

voxpelli · 2015-03-06T15:29:05Z

Looking at it now, it seems like there's no index on the expire column by default (this module wasn't originally intended for larger scale deployments) – so if anyone wants to do a follow up PR which adds an index to it then there shouldn't be any harm in running the delete-statement often.

I'll make sure to mention something about the implications in the changelog entry.

voxpelli · 2015-03-08T13:10:00Z

I now realize that this is going to be a breaking change as it will require additional code to not block graceful shutdowns. By default it will now prune session tokens infinitely no matter if the rest of the app is closed or not while previously it wouldn't do anything unless the rest of the app told it to, which meant it would shutdown the moment the rest of the app shutdown.

I'm therefore going to add a close() method before merging in the session-pruningbranch and then wait and release this as a 3.0.0 only after I've released #16 as a 2.3.0.

voxpelli · 2015-03-15T15:35:41Z

Released now as 3.0.0 due to the breaking change of requiring close() for graceful shutdown.

Alexandre Paquette added 2 commits February 10, 2015 21:45

added options to trigger session expiry check based on a timed delay

8858303

fixed option variable name

12c0396

use setInterval for expired session cleanup

311f5ce

removed feedback in console on success

044587e

_this -> self for standard

2269b74

fixed self this closure issue in delete query callback

voxpelli mentioned this pull request Mar 4, 2015

Random deletion of expired sessions #15

Closed

This was referenced Mar 6, 2015

Unbreak schema name #16

Merged

Provide a default index for the expire column #17

Closed

voxpelli self-assigned this Mar 8, 2015

voxpelli mentioned this pull request Mar 8, 2015

Interval based session pruning #18

Merged

voxpelli closed this in #18 Mar 15, 2015

voxpelli mentioned this pull request Aug 20, 2017

Query every min. #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run DELETE on expired session based on a timed check instead of a random factor. #13

Run DELETE on expired session based on a timed check instead of a random factor. #13

paquettea commented Feb 11, 2015

paquettea commented Feb 23, 2015

voxpelli commented Feb 23, 2015

paquettea commented Feb 23, 2015

hugowetterberg commented Feb 24, 2015

paquettea commented Feb 24, 2015

focusaurus commented Feb 24, 2015

arsnl commented Feb 24, 2015

paquettea commented Feb 24, 2015

focusaurus commented Feb 24, 2015 via email

hugowetterberg commented Feb 24, 2015

hugowetterberg commented Feb 24, 2015

paquettea commented Feb 24, 2015

paquettea commented Feb 24, 2015

paquettea commented Feb 24, 2015

voxpelli commented Mar 6, 2015

paquettea commented Mar 6, 2015

focusaurus commented Mar 6, 2015 via email

voxpelli commented Mar 6, 2015

voxpelli commented Mar 8, 2015

voxpelli commented Mar 15, 2015

Run DELETE on expired session based on a timed check instead of a random factor. #13

Run DELETE on expired session based on a timed check instead of a random factor. #13

Conversation

paquettea commented Feb 11, 2015

paquettea commented Feb 23, 2015

voxpelli commented Feb 23, 2015

paquettea commented Feb 23, 2015

hugowetterberg commented Feb 24, 2015

paquettea commented Feb 24, 2015

focusaurus commented Feb 24, 2015

arsnl commented Feb 24, 2015

paquettea commented Feb 24, 2015

focusaurus commented Feb 24, 2015 via email

hugowetterberg commented Feb 24, 2015

hugowetterberg commented Feb 24, 2015

paquettea commented Feb 24, 2015

paquettea commented Feb 24, 2015

paquettea commented Feb 24, 2015

voxpelli commented Mar 6, 2015

paquettea commented Mar 6, 2015

focusaurus commented Mar 6, 2015 via email

voxpelli commented Mar 6, 2015

voxpelli commented Mar 8, 2015

voxpelli commented Mar 15, 2015