Fix CPU load of dockerapi container #5376

Merged: 2 commits into mailcow:staging from mstilkerich:fix_dockerapi_cpuload on Aug 28, 2023

Conversation

mstilkerich
Contributor

Previously the handle_pubsub_messages() loop was executing every 10ms when there was no message available. This creates a constant CPU load of around 6% on my system.

Now reading from the redis network socket will block (the coroutine) for up to 30s before it returns when no message is available. Using channel.listen() would be even better, but it lacks the ignore_subscribe_messages option and I could not figure out how to filter the returned messages.
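A minimal sketch of the change, assuming the loop is built on redis-py's asyncio PubSub API (the variable names are illustrative, not the exact dockerapi code):

```python
# Before: with no timeout, get_message() returns None immediately when
# nothing is queued, so the enclosing loop wakes every 10ms and burns CPU.
msg = await pubsub.get_message(ignore_subscribe_messages=True)

# After: a timeout makes get_message() block this coroutine (not the
# whole event loop) for up to 30s while waiting for data on the socket.
msg = await pubsub.get_message(ignore_subscribe_messages=True, timeout=30)
```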

Note: I have very limited, basically non-existent knowledge of Python, but I read up on asyncio and believe I understand the changes I made. Nonetheless, this should of course be cross-checked. I verified that the call does not block the entire event loop, since HTTP requests to dockerapi are still answered immediately.

@milkmaker
Collaborator

Thanks for contributing!

I noticed that you didn't select staging as your base branch. Please change the base branch to staging.
See the attached picture on how to change the base branch to staging: [image: check_prs_if_on_staging.png]

@mstilkerich changed the base branch from master to staging on August 5, 2023, 19:15
@FreddleSpl0it
Collaborator

In the Telegram discussion we had, I pointed to the wrong timeout function. This is the correct one:

await asyncio.sleep(0.01)

This is also the 10ms loop delay @mstilkerich described. I think it would be enough to set this timeout to 250ms and see if the CPU load decreases.
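A sketch of that alternative in context (the loop shape and the pubsub variable are illustrative, assuming redis-py's asyncio PubSub; 0.25 is the value proposed above):

```python
while True:
    msg = await pubsub.get_message(ignore_subscribe_messages=True)
    if msg is not None:
        print(msg)  # real code would dispatch the message here
    # Proposed change: sleep 250ms instead of 10ms, so an idle loop
    # wakes four times per second instead of a hundred.
    await asyncio.sleep(0.25)
```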

@mstilkerich
Contributor Author

That would certainly fix the CPU load, but it would also add up to 250ms of extra delay to the processing of each redis message.

I think what you want is:

  • As long as there is no redis message to process, the task / coroutine should be blocked so the event loop does not schedule it.
    • This is done by adding the timeout to get_message(), which limits the wake-ups to once every 30s when there is no message. If a new message arrives, it becomes available for dispatch by the event loop right away.
    • Better would be listen(), since it blocks until a message is available for processing, but I could not use it for the reasons described above.
  • After processing a redis message, we should give other coroutines a chance to run. This is achieved by the asyncio.sleep(0.01) at the end. In fact, for this purpose we should pass 0 instead, as it appears to offer an optimized code path for selecting other coroutines if any are available for execution.
  • The surrounding async_timeout.timeout() serves, to my understanding, as a watchdog in case processing one message takes longer than expected, and will kill it in that case. The timeout value here must be chosen large enough to accommodate the execution time, including blocking time, of the contained code. (See the sketch after this list.)
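Putting the pieces together, a rough sketch of the resulting loop, assuming redis-py's asyncio API; the 60s watchdog value and the message handling are illustrative, only the pattern is what the PR implements:

```python
import asyncio
import async_timeout
from redis import asyncio as aioredis

async def handle_pubsub_messages(channel: aioredis.client.PubSub):
    while True:
        try:
            # Watchdog: chosen larger than the 30s get_message() timeout
            # below, so it only fires if processing genuinely hangs.
            async with async_timeout.timeout(60):
                # Blocks this coroutine (not the event loop) for up to
                # 30s when no message is waiting on the socket.
                message = await channel.get_message(
                    ignore_subscribe_messages=True, timeout=30
                )
                if message is not None:
                    print(message)  # real code dispatches the message here
                # Yield control; sleep(0) takes an optimized path that
                # just lets other ready coroutines run.
                await asyncio.sleep(0)
        except asyncio.TimeoutError:
            pass
```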

@mstilkerich
Copy link
Contributor Author

AFAIK everything is in place here. Just in case you are waiting for anything from my side, please let me know.

@FreddleSpl0it merged commit 9ba5c13 into mailcow:staging on Aug 28, 2023
@mstilkerich deleted the fix_dockerapi_cpuload branch on October 13, 2023, 14:08