# Building DQN Atari Agents -- Notebook Index

## Vizuara AI

Welcome to the **Building DQN Atari Agents** notebook series. These four interactive notebooks take you from the core limitation of tabular Q-learning (a table cannot scale to large state spaces) to training a Deep Q-Network (DQN) that plays Atari games from raw pixels.

---

## Notebooks

### 01 -- From Tables to Neural Networks: The DQN Architecture
**What you will build:** The DQN convolutional neural network that maps raw pixel frames to Q-values.

Key topics:
- Why Q-tables fail for large state spaces
- Function approximation with neural networks
- The DQN CNN architecture (convolutional layers, tensor dimensions, forward pass)
- Atari preprocessing pipeline (grayscale, resize, frame stacking)
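
The dimension bookkeeping for the convolutional layers can be checked with plain arithmetic before writing any PyTorch. The sketch below assumes the layer hyperparameters reported in Mnih et al. (2015), namely a stack of four 84x84 grayscale frames followed by three unpadded convolutions; the notebook itself may use different values, and `conv_output_size` and `trace_shapes` are hypothetical helper names.

```python
def conv_output_size(size: int, kernel: int, stride: int, padding: int = 0) -> int:
    """Spatial output size of a square convolution (no dilation)."""
    return (size + 2 * padding - kernel) // stride + 1


# Assumed from Mnih et al. (2015): (out_channels, kernel, stride) per conv layer.
ASSUMED_CONV_LAYERS = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]


def trace_shapes(input_size: int = 84, in_channels: int = 4):
    """Return (channels, height, width) for the input and after each conv layer."""
    shapes = [(in_channels, input_size, input_size)]
    size = input_size
    for out_channels, kernel, stride in ASSUMED_CONV_LAYERS:
        size = conv_output_size(size, kernel, stride)
        shapes.append((out_channels, size, size))
    return shapes


shapes = trace_shapes()
channels, height, width = shapes[-1]
# Number of inputs the first fully connected layer would receive.
flattened_features = channels * height * width

for shape in shapes:
    print(shape)
print("flattened features:", flattened_features)
```

Under these assumptions the spatial size shrinks from 84 to 20 to 9 to 7, so the first fully connected layer would take 64 x 7 x 7 = 3136 inputs.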

### 02 -- Experience Replay and Target Networks
**What you will build:** The two innovations that make training deep Q-networks stable.

Key topics:
- Why correlated, sequential samples destabilize training and cause catastrophic forgetting
- Experience replay buffer implementation
- The moving target problem and target networks
- DQN loss function with TD error computation
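
As a rough illustration of the replay buffer and the TD target listed above, here is a minimal standard-library sketch. The names `ReplayBuffer`, `Transition`, `td_target`, and `td_error` are hypothetical and not necessarily those used in the notebook, which works with PyTorch tensors rather than plain Python values. The target follows the usual form y = r + γ · max Q_target(s', a'), with no bootstrapping on terminal transitions.

```python
import random
from collections import deque, namedtuple

Transition = namedtuple("Transition", ["state", "action", "reward", "next_state", "done"])


class ReplayBuffer:
    """Fixed-capacity FIFO buffer; the oldest transitions are evicted first."""

    def __init__(self, capacity: int):
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size: int, rng=random):
        # Uniform sampling without replacement breaks up temporal correlation.
        return rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


def td_target(reward, next_q_values, done, gamma=0.99):
    """y = r + gamma * max_a' Q_target(s', a'); no bootstrap on terminal steps."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def td_error(q_value, target):
    """Difference between the target and the current estimate Q(s, a)."""
    return target - q_value


buffer = ReplayBuffer(capacity=3)
for i in range(5):
    buffer.push(state=i, action=0, reward=1.0, next_state=i + 1, done=False)
print(len(buffer))                            # capacity caps the size at 3
print([t.state for t in buffer.memory])       # only the three newest remain
print(buffer.sample(2, rng=random.Random(0)))
```

In a full implementation, `next_q_values` would come from the target network, a periodically updated copy of the online network, so the regression target does not shift on every gradient step.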

### 03 -- Training a DQN Agent to Play Pong
**What you will build:** A complete end-to-end DQN training pipeline.

Key topics:
- Epsilon-greedy exploration with annealing
- Complete training loop with all components
- Training on Pong from raw pixels
- Evaluation and visualization of learned behavior
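
Epsilon-greedy exploration with annealing can be sketched in a few lines. The schedule below is linear, and the default values (start at 1.0, end at 0.1, anneal over 1,000,000 steps) are assumptions taken from Mnih et al. (2015); a notebook designed for short Colab runs likely uses a much faster decay. `linear_epsilon` and `select_action` are hypothetical names.

```python
import random


def linear_epsilon(step, eps_start=1.0, eps_end=0.1, decay_steps=1_000_000):
    """Linearly interpolate from eps_start to eps_end, then hold at eps_end."""
    fraction = min(max(step, 0) / decay_steps, 1.0)
    return eps_start + fraction * (eps_end - eps_start)


def select_action(q_values, epsilon, rng):
    """With probability epsilon pick a random action, otherwise the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


rng = random.Random(0)
q_values = [0.1, 0.7, 0.3]  # made-up Q-values for three actions
for step in (0, 500_000, 1_000_000, 2_000_000):
    eps = linear_epsilon(step)
    print(step, round(eps, 3), select_action(q_values, eps, rng))
```

Early in training nearly every action is random; once epsilon reaches its floor the agent mostly exploits its Q-estimates while still exploring occasionally.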

### 04 -- Double DQN and Game Generalization
**What you will build:** Double DQN to fix overestimation bias, plus analysis across games.

Key topics:
- The Q-value overestimation problem
- Double DQN: decoupling selection and evaluation
- DQN performance across the 49-game Atari benchmark used by Mnih et al. (2015)
- The DQN family: Dueling DQN, Prioritized Replay, Rainbow
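
The difference between the standard DQN target and the Double DQN target comes down to which network selects the next action. The sketch below uses plain Python lists and made-up Q-values for illustration; the function names are hypothetical, and the notebook's version operates on batched tensors.

```python
def argmax(values):
    """Index of the largest value (first one wins on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def dqn_target(reward, next_q_target, done, gamma=0.99):
    """Standard DQN: the target network both selects and evaluates the action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, done, gamma=0.99):
    """Double DQN: the online network selects, the target network evaluates."""
    if done:
        return reward
    best_action = argmax(next_q_online)
    return reward + gamma * next_q_target[best_action]


# Made-up Q-values for the next state (three actions).
next_q_online = [1.0, 2.0, 1.5]
next_q_target = [1.2, 1.0, 3.0]  # action 2 happens to be overestimated here

standard = dqn_target(0.0, next_q_target, done=False, gamma=0.5)
double = double_dqn_target(0.0, next_q_online, next_q_target, done=False, gamma=0.5)
print("DQN target:       ", standard)
print("Double DQN target:", double)
```

Because the evaluated action is chosen by a different network, the Double DQN target can never exceed the standard target for the same inputs, which is what dampens the overestimation bias.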

---

## Prerequisites
- Basic understanding of reinforcement learning (states, actions, rewards)
- Familiarity with Q-learning (Q-values, Bellman equation)
- Basic PyTorch (tensors, nn.Module, gradient descent)

## Environment
- Google Colab with a T4 GPU
- Each notebook is self-contained and includes its own `pip install` cells
- Each notebook runs in under 10 minutes

## References
1. Mnih, V., et al. (2013). "Playing Atari with Deep Reinforcement Learning." arXiv:1312.5602.
2. Mnih, V., et al. (2015). "Human-level control through deep reinforcement learning." Nature, 518(7540), 529-533.
3. van Hasselt, H., Guez, A., & Silver, D. (2015). "Deep Reinforcement Learning with Double Q-Learning." arXiv:1509.06461.