
Fix "max number of clients reached" errors and perf improvements #797

Merged
timgl merged 7 commits into master from redis-connections-error on May 19, 2020

Conversation

@timgl (Collaborator) commented on May 18, 2020

Changes

  • Add broker_pool_limit=0 (see explanation below)
  • Merge in minor improvements to the number of queries
  • Cache Team lookups by API token

This should solve Daz's issues.

Previously, Celery would spin up too many connections to Redis when many events were inserted.
See load test:
[screenshot: load test results]
Error message:
[screenshot: Redis "max number of clients reached" error]

Load testing

With 1 Standard-1x web dyno I was able to achieve an average of 20/sec.
[screenshot: load test results]

With 2 Standard-2x web dynos I was able to achieve 44/sec.
[screenshot: load test results]

After caching Teams there was no real difference in throughput, though response times and memory load were lower.
[screenshot: load test results]
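The throughput figures above come from load testing event insertion. The exact tool isn't shown in this PR, but a rough sketch of that kind of test could look like the following; the host, /capture/ endpoint path, api_key, and payload shape are placeholders for illustration, not taken from this PR.

```python
# Rough sketch of an event-insertion load test (not the script used in this PR).
# The host, endpoint path, and api_key are placeholders.
import time
from concurrent.futures import ThreadPoolExecutor

import requests

HOST = "https://posthog.example.com"
EVENT = {
    "api_key": "<project-api-token>",
    "event": "load_test_event",
    "properties": {"distinct_id": "load-test-user"},
}

def send_event(_):
    # Posts one event and returns the HTTP status code.
    return requests.post(f"{HOST}/capture/", json=EVENT, timeout=10).status_code

start = time.time()
with ThreadPoolExecutor(max_workers=20) as pool:
    statuses = list(pool.map(send_event, range(1000)))
elapsed = time.time() - start

print(f"{len(statuses) / elapsed:.1f} events/sec, status codes: {set(statuses)}")
```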

Each time, Redis used about 21 connections, so the advice would be to get a Redis plan with slightly more connections than that.
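For reference, one way to confirm the connection count on the Redis side is the INFO clients section; here is a minimal redis-py sketch (the URL is a placeholder):

```python
import redis  # redis-py client

# Placeholder URL; point this at the Redis instance under test.
r = redis.from_url("redis://localhost:6379")

# connected_clients needs to stay below the Redis plan's client limit;
# with this PR it hovered around ~21 under load.
print(r.info("clients")["connected_clients"])
```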

Checklist

  • All querysets/queries filter by Team (if applicable)
  • Backend tests (if applicable)

@timgl requested a review from EDsCODE (May 18, 2020 19:03)
@timgl temporarily deployed to posthog-redis-connectio-lfxp5v (May 18, 2020 19:04)
@timgl changed the title from Try broker_pool_limit to Fix "max number of clients reached" errors and perf improvements (May 18, 2020)
@timgl temporarily deployed to posthog-redis-connectio-lfxp5v (May 18, 2020 21:41)
@EDsCODE (Member) left a comment

Code LGTM. I'm not sure how to mimic the conditions for the client failure locally.

try:
    team_id = Team.objects.only('pk').get(api_token=token).pk
    if team_id:
        TEAM_ID_CACHE[token] = team_id
@EDsCODE (Member) commented on this diff:

does this cache ever get flushed?

@timgl (Collaborator, author) replied:

It does when the server gets restarted. However, the token is fairly static; I don't think there are many situations where you'd change it.
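To make the pattern under discussion concrete, here is a minimal sketch of a module-level team-ID cache, assuming TEAM_ID_CACHE is a plain dict defined next to the lookup; the helper name get_team_id_cached, the import path, and the DoesNotExist handling are illustrative, not the exact code in this PR.

```python
# Illustrative sketch only; helper name and error handling are assumptions.
from posthog.models import Team  # assumed import path

TEAM_ID_CACHE: dict = {}

def get_team_id_cached(token: str):
    # Repeat lookups for the same api_token are served from memory,
    # saving one Team query per captured event.
    if token in TEAM_ID_CACHE:
        return TEAM_ID_CACHE[token]
    try:
        team_id = Team.objects.only('pk').get(api_token=token).pk
        if team_id:
            TEAM_ID_CACHE[token] = team_id
        return team_id
    except Team.DoesNotExist:
        return None
```

Because the dict lives at module scope it persists for the lifetime of the process, which is why it is only "flushed" on a server restart.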

@@ -20,6 +20,10 @@
# Load task modules from all registered Django app configs.
app.autodiscover_tasks()

# Make sure Redis doesn't add too many connections
# https://stackoverflow.com/questions/47106592/redis-connections-not-being-released-after-celery-task-is-complete
app.conf.broker_pool_limit = 0
@EDsCODE (Member) commented:

Makes sense. Curious whether needing to open and close connections will become a bottleneck anyway, but the load testing seems to hold up.

@timgl (Collaborator, author) replied:

Hm, yes, I think it does slow things down a little, but at least it works, especially with the cheaper Redis instances.
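For context on the tradeoff being discussed: in Celery, setting broker_pool_limit to 0 (or None) disables the broker connection pool, so a connection is established and closed for every task publish rather than held open. A small sketch of the two options; the app object stands in for the existing one in celery.py, and the alternative value is illustrative.

```python
from celery import Celery

app = Celery("posthog")  # stands in for the existing app object in celery.py

# With the pool disabled, each .delay()/.apply_async() opens and closes its
# own Redis connection: the concurrent connection count stays low, at the
# cost of a little extra per-publish overhead.
app.conf.broker_pool_limit = 0

# Alternatively, a small finite pool keeps a few connections open, trading a
# handful of extra Redis clients for less connection churn:
# app.conf.broker_pool_limit = 2
```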

@timgl merged commit b69411f into master (May 19, 2020)
@timgl deleted the redis-connections-error branch (May 19, 2020 10:02)