Reinforcement Learning — From Scratch to Agentic AI

A collection of hands-on RL projects built alongside my Reinforcement Learning From Scratch series on Medium. Each project is a self-contained implementation that accompanies an article — from tabular Q-Learning all the way to an LLM agent trained with RL to use tools.

Projects

#	Project	Algorithm	Concepts
01	Grid World Navigator	Q-Learning	Custom Gym env, reward shaping, epsilon-greedy
02	Blackjack Strategy Learner	Monte Carlo	First-visit MC, model-free RL, value function
03	CliffWalking: TD vs MC	TD(0) vs MC	Temporal difference, bias-variance tradeoff
04	CliffWalking: SARSA vs Q-Learning	SARSA, Q-Learning	On-policy vs off-policy, path visualization
05	LunarLander with DQN	DQN	Deep RL, replay buffer, target network
06	MountainCar with A2C	A2C	Actor-Critic, continuous action space
07	BipedalWalker with PPO	PPO	Clip ratio, surrogate objective
08	LLM Tool-Use Agent	PPO + LLM	Agentic AI, tool selection, custom Gym env

Each folder is fully self-contained. Install that project's dependencies and run it independently.

Setup

git clone https://github.com/yourusername/reinforcement-learning-projects.git
cd reinforcement-learning-projects

# Go into any project
cd project-01-gridworld
pip install -r requirements.txt
python train.py

Python 3.10+ recommended.

Tech Stack

Gymnasium — RL environments
Stable-Baselines3 — PPO, A2C implementations
PyTorch — Neural networks for deep RL projects
NumPy & Matplotlib — Computation and visualization

Author

Ebad Sayed — Final year, IIT (ISM) Dhanbad, Co-founder of Voke AI

Connect: LinkedIn · GitHub · X

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Blackjack Strategy Learner		Blackjack Strategy Learner
Cliff Walking		Cliff Walking
Grid World Navigator		Grid World Navigator
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning — From Scratch to Agentic AI

Projects

Setup

Tech Stack

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning — From Scratch to Agentic AI

Projects

Setup

Tech Stack

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages