📽️ Movie Recommendation System using Ploomber and DuckDB

This project is a modular and scalable Movie Recommendation System built using Ploomber for pipeline orchestration and DuckDB for efficient in-process analytics. It demonstrates how modern data tools can be combined to build a fast, lightweight, and reproducible machine learning pipeline for content-based movie recommendations.

🔧 Tech Stack

Ploomber – For defining, orchestrating, and running modular data pipelines.
DuckDB – An in-process SQL OLAP database optimized for analytical workloads.
Python – Core language for data processing and modeling.
scikit-learn – For building and training the recommendation model.
FastAPI – For serving the model via a lightweight REST API (if included).
Docker – For containerized deployment and reproducibility.

💡 Features

Cleans and transforms raw movie metadata.
Generates movie embeddings using TF-IDF on genres, overviews, and more.
Computes cosine similarity for movie recommendations.
Returns top-N similar movies for a given title.
Powered by SQL queries on DuckDB for fast, memory-efficient processing.
Modular pipeline with Ploomber for easy debugging, testing, and extension.

🚀 Use Case

This system is ideal for:

Small to medium-scale movie recommendation tasks
Educational purposes in data science and ML pipelines
A starting point for building more advanced recommender systems using collaborative filtering or deep learning

Set up - with Docker

docker build -t movierec:latest -f Dockerfile .

docker run -it -p 8000:8000 movierec:latest

Explore container

Open new terminal window & docker ps & copy container id

docker docker exec -ti YOURCONTAINERID /bin/bash

./movies_data.duckdb

Explore the database

Navigate to Browser

Navigate to http://localhost:8000 in browser

Navigate to http://localhost:8000/docs in browser

Set up - one step at a time

Create new environment

conda create --name poetry-env python=3.10

Activate environment

conda activate poetry-env

Install poetry

pip install poetry

Install dependencies

poetry lock
poetry install

Run the as a Ploomber pipeline

cd mini-projects/
poetry run ploomber build

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
movie_rec_system		movie_rec_system
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
fastapi_duckdb.drawio		fastapi_duckdb.drawio
fastapi_duckdb.pdf		fastapi_duckdb.pdf
pipeline.yaml		pipeline.yaml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📽️ Movie Recommendation System using Ploomber and DuckDB

🔧 Tech Stack

💡 Features

🚀 Use Case

Set up - with Docker

Explore container

Navigate to Browser

Set up - one step at a time

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Mishalmp/movie-recommendations-system

Folders and files

Latest commit

History

Repository files navigation

📽️ Movie Recommendation System using Ploomber and DuckDB

🔧 Tech Stack

💡 Features

🚀 Use Case

Set up - with Docker

Explore container

Navigate to Browser

Set up - one step at a time

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages