Welcome to the Reinforcement Learning (RL) course repository!
This repository serves as the central hub for our multi-agent box-pushing RL environment course. Here you will find the course environment, exercises, and assignment instructions.
Before you can run the simulations, you need to install the required Python dependencies. The environment relies on `pygame` for graphics, `minigrid` / `pettingzoo` for RL interfaces, and `unified-planning` with `fast-downward` for solving PDDL planning problems.
Since modern operating systems (like macOS) protect the system Python environment, you must create a Virtual Environment before installing the packages:
```bash
# 1. Create a virtual environment named 'venv'
python3 -m venv venv

# 2. Activate the virtual environment
source venv/bin/activate

# 3. Install the required dependencies safely
pip install -r requirements.txt
```

We have included a full end-to-end visualizer that automatically builds a PDDL domain from a 2D grid, sends it to the Fast Downward planner, and then plays the optimal solution back visually on your screen.
To run the large multi-agent simulation where two agents cooperate to push a Big Box:
```bash
python3 visualize_plan.py
```

This script will:

- Load a hardcoded 8x8 ASCII map.
- Generate `domain.pddl` and `problem.pddl` in the `pddl/` folder.
- Call the classical planner to find the shortest sequence of actions.
- Launch a `pygame` window and execute the actions step-by-step.
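For reference, parsing an ASCII map into grid coordinates can be sketched as follows. The character set here (`#` walls, `A` agents, `b` small box, `B` Big Box, `G` goal, `.` floor) is an assumption for illustration; the actual map format used by `visualize_plan.py` may differ.

```python
# Hypothetical sketch of parsing an ASCII map into grid coordinates.
# The character meanings are assumed, not taken from the course code.
ASCII_MAP = """\
########
#A.....#
#..b...#
#..BB..#
#......#
#....G.#
#.....A#
########"""

def parse_map(text):
    """Return a dict mapping each non-floor character to its (row, col) cells."""
    objects = {}
    for row, line in enumerate(text.splitlines()):
        for col, ch in enumerate(line):
            if ch != ".":
                objects.setdefault(ch, []).append((row, col))
    return objects

grid = parse_map(ASCII_MAP)
print(grid["A"])  # two agent positions: [(1, 1), (6, 6)]
print(grid["B"])  # the Big Box occupies two adjacent cells: [(3, 3), (3, 4)]
```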
Here is an overview of the core files in this repository and what they do:
- `environment/multi_agent_env.py`: The heart of the simulation! Defines the `MultiAgentBoxPushEnv` class, which inherits from PettingZoo's `ParallelEnv`. It handles the core physics: small box pushes, two-agent joint Big Box pushes, grid overlaps, and generating visual frames for the agents.
- `environment/box_push_env.py`: A simpler, single-agent Gym environment, used for basic training and earlier exercises before moving to multi-agent.
- `environment/objects.py`: Defines the visual rendering rules for our custom grid objects (`AgentObj`, `SmallBox`, `BigBox`) using standard PyGame polygon rendering.
- `environment/wrappers.py`: Contains RL wrappers that increase difficulty:
  - `StochasticActionWrapper`: adds a chance for agent actions to fail.
  - `NoisyObservationWrapper`: adds visual static/noise to the agent's observation matrix.
- `environment/pddl_extractor.py`: The bridge between the Python grid and classical planning. It parses the live environment state and writes valid `domain` and `problem` PDDL files.
- `planner/pddl_solver.py`: Connects to the `unified-planning` library and pipes the generated PDDL files into the `fast-downward` engine, returning a parsed list of steps if a valid solution exists.
- `visualize_plan.py`: The main testing script. It strings together the Environment, the Extractor, and the Solver, then renders the output visually.
- `exercises/README.md`: Contains the homework assignments for students taking this course (e.g. creating custom maps, integrating wrappers, adding constraints).
IMPORTANT: When working on your assignments, you must create a new branch for each exercise. Your branch name must follow this format:
`student-{firstname}-{lastname}-{exercise}`
For example:
- `student-yossi-cohen-ex1`
- `student-sarah-levi-ex2`
Please ensure you adhere to this naming convention, as it will be used for grading and tracking your progress.
Submission consists of two parts — both are required for a complete grade.
When you open a Pull Request, your branch must include all of the following files:
| File | Description |
|---|---|
| `llm_pipeline.py` (or similar name) | Your pipeline script that queries the LLM, generates the PDDL files, and runs the planner |
| `pddl/domain.pddl` | The generated PDDL domain file |
| `pddl/problem.pddl` | The generated PDDL problem file |
| `pddl_to_map.py` (or similar name) | A script that parses your `domain.pddl` / `problem.pddl` and translates them back into an ASCII map recognized by the visualizer |
| `planner_output.txt` | The full terminal log from running the planner (Fast Downward output) |
How to capture the terminal log:
```bash
python3 visualize_plan.py 2>&1 | tee planner_output.txt
```

This prints to the terminal and saves everything to `planner_output.txt` simultaneously.
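As a starting point for the `pddl_to_map.py` part of the submission, the translation back from PDDL might begin by extracting object positions from the generated `problem.pddl`. The predicate and cell-naming scheme below, `(at <obj> cell_<row>_<col>)`, is purely an assumption; adapt the pattern to whatever your pipeline actually emits.

```python
import re

# Hypothetical sketch: pull object positions out of a problem.pddl :init
# section. The "(at <obj> cell_<row>_<col>)" naming is an assumption.
SAMPLE_INIT = """
(:init
  (at agent1 cell_1_1)
  (at agent2 cell_6_6)
  (at bigbox cell_3_3)
)
"""

def positions(init_text):
    """Return a dict mapping object names to (row, col) grid positions."""
    pat = re.compile(r"\(at\s+(\w+)\s+cell_(\d+)_(\d+)\)")
    return {name: (int(r), int(c)) for name, r, c in pat.findall(init_text)}

print(positions(SAMPLE_INIT))
# {'agent1': (1, 1), 'agent2': (6, 6), 'bigbox': (3, 3)}
```

From a dict like this, painting the characters onto a blank grid of `.` cells reproduces an ASCII map the visualizer can load.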
In addition to the Pull Request, you will present your work live in front of the course instructor.
During the demo you are expected to:
- Run your full pipeline end-to-end from the terminal.
- Show the planner finding a valid plan.
- Run the visual simulator and demonstrate the agents reaching the goal state on your map.
- Explain your prompting strategy — how you described the world to the LLM and what design choices you made.
No submission is considered complete without the live demo.