
Celery Worker Service (a.k.a. data-processor)

This repository contains a Dockerized Celery worker that processes background tasks in a distributed system.

Features

  • Executes tasks related to data preprocessing, modeling, and prediction
  • Related tasks are composed into workflows (essentially chains of tasks)
  • Because the backend connects to the same message broker (RabbitMQ, Redis, etc.), it can dispatch tasks by name and the worker picks them up as soon as it is ready (see the sketch after this list)
  • Decoupled from the backend, so workers are easy to scale and replicate
  • Supports configurable task execution and concurrency settings
  • Logs task execution status
  • Can send telemetry directly to an OpenTelemetry monitoring stack, if needed
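
Dispatching by name means the backend never has to import the worker's code. Below is a minimal sketch of both patterns, assuming illustrative task names ("tasks.preprocess", "tasks.train") rather than this repository's actual registered names:

# Backend-side dispatch: publish a task by its registered name.
from celery import Celery, chain, signature

app = Celery(broker="amqp://admin:adminadmin@rabbitmq:5672/")

# Any worker listening on the queue that has a task registered under
# "tasks.preprocess" will pick this message up whenever it is ready.
app.send_task("tasks.preprocess", args=["raw.csv"])

# Workflow: chain tasks so the result of each step feeds the next one.
chain(
    signature("tasks.preprocess", args=["raw.csv"]),
    signature("tasks.train"),
).apply_async()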

Requirements

  • Docker
  • A message broker (e.g., Redis or RabbitMQ)
  • A backend for task results (optional, e.g., Redis, PostgreSQL)
  • An S3-compatible object store (e.g., MinIO, AWS S3)

Environment Variables

The following environment variables must be set for the worker to function correctly:

Variable | Description | Default
CELERY_BROKER_CONNECTION | URL of the message broker (Redis, RabbitMQ) | None
CELERY_BACKEND_CONNECTION | URL of the backend for storing task results | None
CELERY_DEFAULT_QUEUE | Default queue name used by Celery when no custom queue is specified | tasks
S3_ENDPOINT_URL | URL of the S3-compatible storage system | None
S3_BUCKET_NAME | Name of the S3-compatible bucket used for data exchange between tasks | celery-data-holder
S3_ACCESS_KEY_ID | Access key ID for private S3-compatible bucket(s) | None
S3_SECRET_ACCESS_KEY | Secret access key for private S3-compatible bucket(s) | None
C_FORCE_ROOT | Forces Celery to run workers as root | false
MLFLOW_TRACKING_URI | URI of the MLflow tracking server | None
MPLCONFIGDIR | Custom path for the Matplotlib cache directory | /usr/src/app/artifacts

All required environment variables can be copied from the file.
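
As a rough sketch of how these variables are typically wired together, assuming a layout like the app.core.celery module referenced under Logs and Monitoring (the exact internals may differ):

import os
import boto3
from celery import Celery

# Celery app configured from the environment variables above.
app = Celery(
    "data-processor",
    broker=os.environ["CELERY_BROKER_CONNECTION"],
    backend=os.environ["CELERY_BACKEND_CONNECTION"],
)
app.conf.task_default_queue = os.getenv("CELERY_DEFAULT_QUEUE", "tasks")

# S3-compatible client used to exchange data between workflow tasks.
s3 = boto3.client(
    "s3",
    endpoint_url=os.environ["S3_ENDPOINT_URL"],
    aws_access_key_id=os.environ["S3_ACCESS_KEY_ID"],
    aws_secret_access_key=os.environ["S3_SECRET_ACCESS_KEY"],
)
bucket = os.getenv("S3_BUCKET_NAME", "celery-data-holder")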

Usage

Build and Run with Docker

# Build the Docker image
docker build -t data-processor .

# Run the worker container
docker run -d \
  --name mlops_data_processor \
  --env CELERY_BROKER_CONNECTION=amqp://admin:adminadmin@rabbitmq:5672/ \
  --env CELERY_BACKEND_CONNECTION=db+postgresql://celery:adminadmin@postgres:5432/celery_storage \
  data-processor

Docker Compose

To deploy with Docker Compose, create a docker-compose.yml file:

services:
  data-processor:
    build: .
    environment:
      - CELERY_BROKER_CONNECTION=amqp://admin:adminadmin@rabbitmq:5672/
      - CELERY_BACKEND_CONNECTION=db+postgresql://celery:adminadmin@postgres:5432/celery_storage
    depends_on:
      - rabbitmq
      - postgres

  rabbitmq:
    image: rabbitmq:latest
  postgres:
    image: postgres:latest

Run the service:

docker-compose up -d

Scaling Workers

You can scale the number of Celery workers dynamically:

docker-compose up --scale data-processor=3 -d

Logs and Monitoring

To check worker logs:

docker logs -f mlops_data_processor

To monitor tasks:

celery -A app.core.celery.app status
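
The same information is available programmatically through Celery's inspection API; a minimal sketch, assuming the app object is importable from app.core.celery as the command above suggests:

# Query running workers via Celery's remote-control inspect API.
from app.core.celery import app  # assumed import path

inspector = app.control.inspect()
print(inspector.active())      # tasks currently executing on each worker
print(inspector.registered())  # task names each worker has registered
print(inspector.stats())       # per-worker statistics (pool size, counters)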

Contributing

Contributions are welcome! Feel free to submit a pull request or open an issue.
