Compress logs to use less disk space #8028

gnprice · 2018-01-08T17:47:33Z

On the chat.zulip.org server, /var/log/zulip/ takes up a total of 4.8 GB of space (*). The bulk of that disk usage would end up looking the same on any other busy Zulip server with our default configuration. That's kind of a lot, and we can do better.

The space breaks down approximately as follows:

1.5GB server.log and older versions
1.0GB django.log and older versions
1.0GB tornado.log and older versions
1.0GB other (some of which may not come from default Zulip behavior)

The reasons it adds up to this much:

For server.log, we have rotation, but we let the file grow past 500MB before rotating it, and we compress only the .2 and up. So there are two 500MB files, plus 9 more files (up to .10) which are compressed to about 50MB each. This is configured in /etc/logrotate.d/zulip, which comes from puppet/zulip/files/logrotate/zulip. We should reduce the size we let the file grow to before rotation, and perhaps keep more files so that we retain the same total amount of log information.
For django.log and tornado.log, we have rotation but no compression. We keep 10 old files for each, and each file is 100MiB in size. This is configured through supervisord, from puppet/zulip/templates/supervisor/zulip.conf.template.erb. Unfortunately supervisor doesn't support log compression, which would be the real solution for this. See "Fixed and improved logrotate file" (#3385) and the supervisor issue Tim linked to from there. One solution would be to disable supervisor's log rotation and use logrotate instead (see a blog post), though that post's solution involves the copytruncate option, which isn't great. Following more breadcrumbs, there's this suggestion on a supervisor PR: use logrotate, and "send[] SIGUSR2 to Supervisor to signal it to reopen the logs." Maybe that'd work.
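
As a rough sanity check on the numbers above, here is a minimal Python sketch of the worst-case footprint arithmetic. The function name, the 10:1 compression ratio (inferred from 500MB files compressing to about 50MB), and the 100MB / 54-file alternative are all assumptions for illustration, not proposed defaults. It also treats MB and MiB as interchangeable, which is close enough at this precision.

```python
def rotated_footprint_mb(max_size_mb, kept_files, uncompressed_old, compression_ratio):
    """Worst-case disk use (in MB) of one log under size-based rotation.

    max_size_mb       -- size the live file reaches before it is rotated
    kept_files        -- number of rotated files kept (.1 .. .N)
    uncompressed_old  -- how many of the rotated files stay uncompressed
    compression_ratio -- assumed raw-to-compressed ratio (hypothetical: 10)
    """
    live = max_size_mb
    plain = min(kept_files, uncompressed_old) * max_size_mb
    compressed = max(kept_files - uncompressed_old, 0) * max_size_mb / compression_ratio
    return live + plain + compressed


# server.log today: live file + uncompressed .1 + compressed .2 through .10
server_now = rotated_footprint_mb(500, 10, 1, 10)

# django.log / tornado.log today: 10 old files, none compressed
django_now = rotated_footprint_mb(100, 10, 10, 10)

# Hypothetical alternative for server.log: rotate at 100MB and keep 54 files,
# so the raw log retained stays at 55 * 100MB = 11 * 500MB.
server_alt = rotated_footprint_mb(100, 54, 1, 10)

print(server_now, django_now, server_alt)
```

Under these assumptions, the current settings come out at roughly 1.45GB for server.log and 1.1GB each for django.log and tornado.log, in line with the measured breakdown, while the smaller rotation size roughly halves the server.log footprint for the same amount of retained log data.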

(*) Actually 5.3 GB, but at least 0.5GB of that is in files that it doesn't look like we currently generate, namely tornado.log*gz.

zulipbot · 2018-01-08T17:47:34Z

Hello @zulip/server-production members, this issue was labeled with the area: production label, so you may want to check it out!

devZer0 · 2019-02-10T21:12:40Z

Zulip apparently comes with a logrotate.d configuration file, but it appears to be incomplete: it compresses only some of the logs, not all the relevant ones. At least django.log and tornado.log grow quickly, in addition to server.log.

timabbott · 2019-02-11T22:56:26Z

That is by design: the log files managed by Supervisor (e.g. django.log and tornado.log) can't easily also use logrotate; see the discussion above. It'd be nice to improve this, but it's not trivial and could easily break things, so we'd need to carefully test one of the proposals mentioned above.
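
For anyone who wants to experiment with the rename-then-signal proposal before touching a real logrotate config, the sequence can be sketched in Python roughly as follows. This is only an illustration of the steps logrotate would perform, not Zulip's or Supervisor's implementation: `rotate_once`, `signal_supervisord`, and the pidfile path passed to it are hypothetical names. The one external fact relied on is that supervisord documents SIGUSR2 as closing and reopening its main log and all child log files.

```python
import gzip
import os
import shutil
import signal


def signal_supervisord(pidfile):
    """Ask supervisord to reopen its log files (pidfile path is an assumption)."""
    with open(pidfile) as f:
        pid = int(f.read().strip())
    os.kill(pid, signal.SIGUSR2)


def rotate_once(log_path, reopen, keep=10):
    """Rotate log_path once, keeping at most `keep` compressed archives.

    `reopen` is a callable that makes the writer reopen its log file,
    e.g. lambda: signal_supervisord("/path/to/supervisord.pid").
    """
    # Drop the oldest archive, then shift the rest up: .9.gz -> .10.gz, ...
    oldest = f"{log_path}.{keep}.gz"
    if os.path.exists(oldest):
        os.remove(oldest)
    for n in range(keep - 1, 0, -1):
        src = f"{log_path}.{n}.gz"
        if os.path.exists(src):
            os.rename(src, f"{log_path}.{n + 1}.gz")

    # Move the live file aside. Unlike copytruncate, nothing is copied or
    # truncated; the writer keeps its open handle until it reopens.
    rotated = f"{log_path}.1"
    os.rename(log_path, rotated)
    reopen()

    # Compressing immediately is racy if the writer has not reopened yet;
    # logrotate's delaycompress option avoids this by waiting one cycle.
    with open(rotated, "rb") as src, gzip.open(f"{rotated}.gz", "wb") as dst:
        shutil.copyfileobj(src, dst)
    os.remove(rotated)
```

The part that would need the careful testing is the window between the rename and the reopen: any lines written in that window land in the renamed file, so compressing it too early could lose them.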

timabbott · 2021-05-25T23:13:22Z

As of 93d661d, we have log rotation for all files that Zulip generates. So the only remaining issue is the supervisord one. Sadly, Supervisor/supervisor#322 remains unresolved.

gnprice added the area: production label Jan 8, 2018
