Add a concurrency model with ThreadPoolExecutor #5011
Conversation
I've been meaning to do this myself. I'll review this as soon as possible.
This PR requires some adjustments before we can merge it. Some of them are outlined in the diff itself.
We also require:
- Unit tests to ensure this works correctly.
- A requirements file for Python 2.7 users in `requirements/extra/`, with the appropriate version markers.
- Documentation adjustments.
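As a sketch of what such a requirements file could look like (the file name and version pin are assumptions, not from the PR), a PEP 508 environment marker can restrict the `futures` backport to Python 2, since `concurrent.futures` only entered the standard library in Python 3.2:

```
# requirements/extra/thread.txt (hypothetical path and name)
futures>=3.1.1; python_version < "3.2"
```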
```python
signal_safe = False

def __init__(self, *args, **kwargs):
    super().__init__(*args, **kwargs)
```
Zero-argument `super()` calls are only valid with Python 3. Celery 4.x still supports Python 2.7, so we'll need to adjust that.
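To illustrate the portable form the reviewer is asking for (class names here are illustrative, not the PR's actual classes): the explicit-argument `super()` call works on both Python 2.7 and 3.x, while the zero-argument form is Python 3 only.

```python
class BasePool(object):
    """Hypothetical stand-in for the pool base class."""
    def __init__(self, limit=None):
        self.limit = limit

class ThreadPool(BasePool):
    def __init__(self, limit=None):
        # super(ThreadPool, self) is valid on Python 2.7 and 3.x;
        # the bare super() form would raise a SyntaxError on 2.7.
        super(ThreadPool, self).__init__(limit)

pool = ThreadPool(limit=4)
```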
```python
def on_stop(self):
    self.executor.shutdown()
    super().on_stop()
```
Zero-argument `super()` calls are only valid with Python 3. Celery 4.x still supports Python 2.7, so we'll need to adjust that.
```python
def _get_info(self):
    return {
        'max-concurrency': self.limit,
        'threads': len(self.executor._threads)
    }
```
I'm not a big fan of using private APIs. Can we change this to something more sensible?
The actual number of threads in the executor is not accessible through a public API, so I can do one of two things: either modify concurrent/futures/thread.py to create one (a modification to a core Python library), or remove the actual thread count from the concurrency model's info (i.e. remove line 39 of the diff).
The former is not certain to be accepted by the PSF. I already have a pull request pending for Python 3.8, so I can try anyway?
Seems like you are correct. There's no public API for that.
I think a TODO comment to change that once it lands is sufficient.
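Until such an API lands, one public-API workaround (not the PR's code; a sketch assuming Python 3.7+, where `ThreadPoolExecutor` gained the `initializer` argument) is to count worker threads ourselves as they start, instead of reading the private `_threads` set:

```python
import threading
from concurrent.futures import ThreadPoolExecutor

class CountingThreadPoolExecutor(ThreadPoolExecutor):
    """Hypothetical wrapper that tracks how many worker threads have
    started, using only the public `initializer` hook (Python 3.7+)."""

    def __init__(self, max_workers=None):
        self._worker_count = 0
        self._count_lock = threading.Lock()
        super().__init__(max_workers=max_workers,
                         initializer=self._on_worker_start)

    def _on_worker_start(self):
        # Runs once in each worker thread as it starts.
        with self._count_lock:
            self._worker_count += 1

    @property
    def thread_count(self):
        with self._count_lock:
            return self._worker_count
```

Workers are spawned lazily, so `thread_count` reflects threads that have actually started, which is also what `len(executor._threads)` reports.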
Thank you for the review, I will apply your recommendations in the next week.
@whuji Did you get a chance to work on this?
Not yet. I am doing it right now.
Hello,
I don't know if it would be useful to someone else, but I needed to implement a concurrency model based on threads (and not processes) in Celery, because I wanted to pass Future objects between tasks and some coroutines, and those are not picklable.
Of course it does not scale out and it has limitations (because of the GIL), but it can be useful for people who want to share memory between tasks and another thread (for example the asyncio event loop) without blocking, as the 'solo' concurrency model does.
I am open to comments. Maybe this model won't be needed anymore in Celery 5, as it could be replaced by an asyncio loop, which is not possible in Celery 4.
And I would be happy to help on a massive asyncio refactoring for Celery 5.
Regards
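To illustrate the use case described above (a standalone sketch, not code from this PR): with a thread-based pool, worker threads can share an asyncio event loop running in another thread and hand coroutines to it via `asyncio.run_coroutine_threadsafe`, which returns a `concurrent.futures.Future` without any pickling.

```python
import asyncio
import threading

# Run an asyncio event loop in a dedicated background thread.
loop = asyncio.new_event_loop()
t = threading.Thread(target=loop.run_forever, daemon=True)
t.start()

async def double(x):
    await asyncio.sleep(0.01)
    return x * 2

# From any worker thread: schedule the coroutine on the shared loop
# and get back a thread-safe concurrent.futures.Future.
future = asyncio.run_coroutine_threadsafe(double(21), loop)
print(future.result(timeout=5))  # prints 42

loop.call_soon_threadsafe(loop.stop)
```

This only works because the tasks and the loop share one process's memory; a process-based pool would have to pickle the Future, which is exactly what fails.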