
Add function parameters to the task attributes #5363

Open
tlinhart opened this issue Feb 26, 2019 · 6 comments

Comments


tlinhart commented Feb 26, 2019

I'm building a web application that lets users schedule jobs defined as Celery tasks. I'm trying to decouple the components as much as possible -- the application doesn't have access to the tasks' code and uses send_task() to execute them. The only shared piece should be the broker (and possibly the result backend). When adding a new job, users have to pick a task and provide argument values for it. I would like to render a form that contains the list of possible parameters. I'm able to get the list of tasks registered with the workers via a broadcast call:

celery.control.broadcast('registered', reply=True)
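For reference, the reply from this broadcast is a list with one mapping per responding worker, each mapping the worker's node name to its list of registered task names. A small helper can flatten it (the node and task names below are made up for illustration):

```python
def flatten_registered(reply):
    """Collect the union of task names reported by all workers.

    `reply` is assumed to have the shape returned by
    celery.control.broadcast('registered', reply=True):
    a list of {node_name: [task_name, ...]} mappings.
    """
    return {task for node in reply for tasks in node.values() for task in tasks}

# Hypothetical reply from two workers:
reply = [
    {'celery@worker1': ['add_numbers', 'send_email']},
    {'celery@worker2': ['add_numbers']},
]
print(sorted(flatten_registered(reply)))  # → ['add_numbers', 'send_email']
```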

What I'm not able to get, though, is the list of parameters the task accepts. I'm aware that I can call the broadcast() method like this:

celery.control.broadcast(
    'registered', reply=True, arguments={
        'taskinfoitems': [...]
    })

to also return task attributes, but there is no task attribute that holds the parameters of the original function (I'm referring to app/base.py). If there were such an attribute, I would be able to parse it out of the broadcast reply (which returns a list of strings, as per the code).

Or maybe there's another way to achieve my goal?

@tlinhart (Author)

I found a solution, so I'm sharing it in case someone runs into the same problem. I defined a custom base class for my tasks that adds a __parameters__ attribute:

import inspect
from celery import Celery, Task

celery = Celery('tasks', broker='pyamqp://guest@localhost//')

class BaseTask(Task):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)

        # __wrapped__ holds the original decorated function, so its
        # signature gives us the parameter names in declaration order.
        signature = inspect.signature(self.__wrapped__)
        self.__parameters__ = list(signature.parameters)

@celery.task(base=BaseTask, name='add_numbers')
def add(x, y):
    return x + y
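The inspection step itself can be checked without a worker; for a plain function, inspect.signature yields the parameter names in declaration order:

```python
import inspect

def add(x, y):
    return x + y

# This mirrors what BaseTask.__init__ stores in __parameters__.
parameters = list(inspect.signature(add).parameters)
print(parameters)  # → ['x', 'y']
```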

In my application, I query the workers for registered tasks using a broadcast and parse the results:

import ast
import re
from celery import Celery

celery = Celery(broker='pyamqp://guest@localhost//')

def get_available_tasks():
    reply = celery.control.broadcast(
        'registered', reply=True,
        arguments={'taskinfoitems': ['__parameters__']})
    # reply is a list of {node_name: [task_string, ...]} mappings;
    # collect the union of task strings across all workers.
    reg_tasks = {t for node in reply for tasks in node.values() for t in tasks}

    tasks = []
    for task in reg_tasks:
        # Each entry looks like "task_name [__parameters__=['x', 'y']]".
        name = re.search(r'[\w\.]+', task).group(0)
        match = re.search(r'\[__parameters__=(\[.*\])\]', task)
        # ast.literal_eval only accepts Python literals, unlike eval().
        parameters = ast.literal_eval(match.group(1)) if match else []
        tasks.append({'name': name, 'parameters': parameters})

    return tasks

It's probably not the cleanest solution, but it works perfectly for my use case.
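The parsing step can be exercised in isolation on a sample reply string (the entry below mirrors the `name [attr=value]` rendering assumed by the regex above; using ast.literal_eval instead of eval() means only Python literals are accepted):

```python
import ast
import re

# Hypothetical entry as rendered in the 'registered' reply:
task = "add_numbers [__parameters__=['x', 'y']]"

name = re.search(r'[\w\.]+', task).group(0)
match = re.search(r'\[__parameters__=(\[.*\])\]', task)
parameters = ast.literal_eval(match.group(1)) if match else []

print(name, parameters)  # → add_numbers ['x', 'y']
```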

auvipy (Member) commented Feb 21, 2021


If you are able to come up with a possible proof-of-concept improvement request, you are more than welcome.

@tlinhart (Author)

I'm afraid I won't have time to look into this anytime soon.

@thedrow thedrow added this to the Future milestone Feb 28, 2021
woutdenolf (Contributor) commented Jan 14, 2022

I was about to open a new issue, but as far as I understand it is related to this one: "What is the Celery way to decouple client and worker code?"

@tlinhart I was trying to understand your solution but couldn't figure it out (I'm new to Celery).

Worker node

Suppose I have installed a python project on the worker node with this celery application:

# myproject.heavy_app

from celery import Celery
from heavy_project import heavy_function

app = Celery()
app.config_from_object("celeryconfig", force=True)

@app.task
def heavy_task(a, b):
    return heavy_function(a, b)

On the worker node I can then launch a worker that serves this application like this:

celery -A myproject.heavy_app worker

Client node

To use the Celery app on the client node I have to import it just to get the task signature! This is not what I want, because I don't want my client to depend on the worker's requirements (heavy_project in this case):

# myjob.py

from myproject.heavy_app import heavy_task

future = heavy_task.delay(1, 1)
assert future.get(timeout=10) == 2

and run it with

python myjob.py

Solutions

Solutions to decouple client and worker code that I'm not satisfied with:

  • import heavy_function inside heavy_task
  • define the Celery app twice:
    1. once for the client (not including the implementation, only the task signature)
    2. once for the worker (including the implementation)

@woutdenolf (Contributor)

Ok sorry, I found the solution in the docs. On the client side I can do this:

# myjob.py

from celery.execute import send_task

future = send_task("myproject.heavy_app.heavy_task", args=(1, 1))
future.get(timeout=10)

and run it with

python myjob.py

@woutdenolf (Contributor)

Maybe this should be used in the "getting started" docs? It was not obvious to me.
