[#1847] Add middleware that cleans up the pylons response string. #2262

joetsoi · 2015-02-04T12:08:46Z

(fixes #1847)
Once the response string has been served, this middleware replaces the response string with a dummy one, so the original response can be garbage collected. This is lifted from https://code.google.com/p/modwsgi/wiki/RegisteringCleanupCode. The middleware can be switched on by setting 'ckan.use_pylons_response_cleanup_middleware = true' in your development.ini.
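For readers who want to see the shape of this pattern, here is a minimal sketch of a WSGI wrapper that runs a cleanup callback when the server closes the response iterable. It is not the code from this PR: the class names, the `demo.response` environ key, the `DemoResponse` object and the `serve_once` helper are all hypothetical stand-ins, used only so the example runs on its own.

```python
class CloseCallbackIterable:
    """Wrap a WSGI response iterable and run a callback when it is closed."""

    def __init__(self, iterable, callback, environ):
        self._iterable = iterable
        self._callback = callback
        self._environ = environ

    def __iter__(self):
        for chunk in self._iterable:
            yield chunk

    def close(self):
        # WSGI servers call close() once the response has been sent.
        try:
            close = getattr(self._iterable, "close", None)
            if close is not None:
                close()
        finally:
            self._callback(self._environ)


class ExecuteOnCompletion:
    """Middleware that calls callback(environ) after each request."""

    def __init__(self, app, callback):
        self._app = app
        self._callback = callback

    def __call__(self, environ, start_response):
        try:
            result = self._app(environ, start_response)
        except BaseException:
            # The app failed before returning an iterable: clean up now.
            self._callback(environ)
            raise
        return CloseCallbackIterable(result, self._callback, environ)


class DemoResponse:
    """Hypothetical stand-in for a framework response object."""

    def __init__(self, body):
        self.body = body


def demo_app(environ, start_response):
    response = DemoResponse(b"x" * 1024)
    environ["demo.response"] = response  # hypothetical key, for this sketch only
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [response.body]


def clear_response_body(environ):
    # Swap the large body for a small dummy so the original can be collected.
    response = environ.get("demo.response")
    if response is not None:
        response.body = b"response cleared by cleanup middleware"


def serve_once(app):
    """Drive the app like a very small WSGI server: iterate, then close."""
    environ = {}
    result = app(environ, lambda status, headers: None)
    try:
        served = b"".join(result)
    finally:
        if hasattr(result, "close"):
            result.close()
    return served, environ


wrapped = ExecuteOnCompletion(demo_app, clear_response_body)
served, environ = serve_once(wrapped)
```

The client still receives the full body, since the callback only fires from `close()`, after the iterable has been consumed.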

So I compared memory usage with and without this enabled. Everyone likes graphs, right?

Notably, memory usage still grows with this enabled, but it is a bit more stable. I also examined the memory: it has been returned to Python's free lists but has not been released by the Python process (see http://effbot.org/pyfaq/why-doesnt-python-release-the-memory-when-i-delete-a-large-object.htm). So peak memory use will still scale with the number of threads.

The other way around it is to use something like gunicorn + gevent. This isn't a fair direct comparison, as it only uses one worker process, but I mainly suggest it because you can set worker processes to restart after serving a maximum number of requests (see http://docs.gunicorn.org/en/develop/configure.html#max-requests), which releases any memory that the process has been allocated but is not using.
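As a rough illustration of that setup, a gunicorn configuration file is plain Python, so the restart behaviour could be sketched as below. The setting names are gunicorn's own; the file name and every value here are assumptions chosen for the example, not figures from this PR.

```python
# gunicorn.conf.py (hypothetical file name; all values are illustrative)

# One worker process, matching the single-worker comparison described above.
workers = 1

# Use the gevent worker class (requires the gevent package to be installed).
worker_class = "gevent"

# Restart a worker after it has served this many requests, so memory that
# the process holds but no longer uses goes back to the operating system.
max_requests = 1000

# Add a random offset to max_requests so that, with several workers,
# they do not all restart at the same moment.
max_requests_jitter = 50
```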

Anyway, this isn't critical, as it only fixes an edge case for a very particular workload that most people will not come across. (Still needs tests.)

wardi · 2015-02-10T12:09:44Z

ckan/config/middleware.py

+    def __init__(self, iterable, callback, environ):
+        self.__iterable = iterable
+        self.__callback = callback
+        self.__environ = environ


ew. private members with name munging
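For context on the objection, here is a small sketch of what name mangling does to attributes with two leading underscores. `LoggingGenerator` and `PlainGenerator` are hypothetical classes made up for this illustration; only the `__iterable` attribute name comes from the diff above.

```python
class Generator:
    def __init__(self, iterable):
        # Two leading underscores: stored as _Generator__iterable.
        self.__iterable = iterable


class LoggingGenerator(Generator):
    def first(self):
        # Inside this class the name is mangled to _LoggingGenerator__iterable,
        # which was never set, so this raises AttributeError.
        return next(iter(self.__iterable))


class PlainGenerator:
    def __init__(self, iterable):
        # Single underscore: private by convention only, no mangling.
        self._iterable = iterable


attribute_names = sorted(vars(Generator([1, 2])))
plain_names = sorted(vars(PlainGenerator([1, 2])))

try:
    LoggingGenerator([1, 2]).first()
    subclass_error = None
except AttributeError as exc:
    subclass_error = exc
```

A single leading underscore keeps the attribute reachable from subclasses and easier to inspect while debugging.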

joetsoi · 2015-02-10T12:34:13Z

@aliceh75, it would be nice if you could test this, as it's basically your fix without the timer. You might also want to look into the gunicorn + gevent setup I suggested, to make use of the process restart after serving a number of requests.

aliceh75 · 2015-02-12T16:55:50Z

Hi,

I can confirm this stabilizes the memory. The test I was just running (continuous loop fetching 10,000 rows with 4 workers) saw the memory go from 800M to 1.8G and back quite rapidly. With the middleware enabled it is far more stable (It's been oscillating between 793M and 823M for a while now, most of the time staying around 810M).

This is a great improvement :-) Thank you for doing this.

aliceh75 · 2015-02-12T16:56:15Z

mod_wsgi can also restart after a number of requests btw.

amercader · 2015-02-12T17:18:53Z

🎉 woo, nice. Great work @joetsoi

wardi · 2015-03-30T18:12:43Z

@joetsoi I've made a couple of small changes: https://github.com/wardi/ckan/tree/1847-pylons-clean-middleware Would you take a look and cherry-pick or merge them into your branch?

wardi · 2015-03-31T12:51:56Z

@joetsoi this link makes it easier to see the two commits https://github.com/wardi/ckan/commits/1847-pylons-clean-middleware Would you like a PR against your PR? :-)

joetsoi · 2015-04-01T19:52:01Z

@wardi, we should just all give each other edit permissions to each other's ckan forks, that'd be distributed version control in action :p

I've fixed up the tests/changelog and made it switch on by default, no point having it if it's not going to be used.

wardi · 2015-04-01T20:04:45Z

@joetsoi sounds good. is this good to go?

joetsoi · 2015-04-01T20:11:55Z

@wardi yeah it's all ready, it's just coveralls moaning.

joetsoi added WIP and removed WIP labels Feb 4, 2015

[ckan#1874] Add a test for pylons response cleanup middleware

68d384c

Just test that the homepage runs fine with the middleware activated

joetsoi force-pushed the 1847-pylons-clean-middleware branch from b4c9676 to 68d384c Compare February 5, 2015 16:24

[ckan#1847] add ckan.use_pylons_response_cleanup_middleware to docs

a7d5599

and deployment.ini_tmpl

joetsoi force-pushed the 1847-pylons-clean-middleware branch from efb0d51 to a7d5599 Compare February 5, 2015 17:07

wardi reviewed Feb 10, 2015

amercader added this to the CKAN 2.4 milestone Feb 10, 2015

davidread assigned wardi Feb 10, 2015

This was referenced Mar 6, 2015

Test the datastore retrival of large datasets for a visualisation OCHA-DAP/hdx-ckan#2379

Closed

make sure response strings can be garbage collected OCHA-DAP/hdx-ckan#2409

Open

wardi added 2 commits March 30, 2015 13:52

[ckan#1847] clean up cleanup middleware a bit

1d3d5fc

[ckan#1847] call close only when our generator is closed

677056d

joetsoi added 5 commits April 1, 2015 17:49

[ckan#1847] enabled response string cleanup middleware by default

7b070a2

Merge branch 'master' into 1847-pylons-clean-middleware

7bdca1f

[ckan#1847] Fix branch for new_tests -> tests

598cc6c

[ckan#1847] remove new_test references

1f1d0fb

[ckan#1847] Add changelog entry

9de6f5f

wardi merged commit 9de6f5f into ckan:master Apr 1, 2015

joetsoi deleted the 1847-pylons-clean-middleware branch April 1, 2015 22:08
