This is an exercise using Actor-Critic and Deep Q-Networks on OpenAI Gym's LunarLander-v2, in both its discrete and continuous versions. The discrete-action model is built from straightforward nn.Linear layers with ReLU activations, totaling 18,180 parameters and 279 kilobytes of weight storage.
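The discrete architecture described above can be sketched as follows. The exact layer widths are an assumption (not stated in this README): two 128-unit hidden layers are chosen here because they reproduce the quoted 18,180-parameter count for LunarLander-v2's 8-dimensional state and 4 discrete actions.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Sketch of the discrete DQN: 8-dim state in, one Q-value per action out."""
    def __init__(self, state_dim=8, action_dim=4, hidden=128):  # hidden=128 is assumed
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim),  # Q(s, a) for each of the 4 actions
        )

    def forward(self, state):
        return self.net(state)

# Parameter count: (8*128 + 128) + (128*128 + 128) + (128*4 + 4) = 18,180
model = QNetwork()
n_params = sum(p.numel() for p in model.parameters())
```

Greedy action selection is then just an argmax over the network's output, e.g. `model(state).argmax(dim=-1)`.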
For the continuous RL models (Actor/Critic), the Actor outputs a Gaussian (Normal) distribution over the action space conditioned on the given state, while the Critic is a Q-value function that outputs a scalar score for a given state-action pair. Noise is injected into the action outputs to encourage additional exploration. With proper hyperparameters, the Actor/Critic setup converged in roughly a quarter of the episodes the Deep Q-Network needed in the discrete action space.
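The Actor/Critic interface described above can be sketched roughly as below. This is an assumed implementation, not the repository's actual code: layer widths, the log-std clamp range, and the noise scale are all illustrative choices.

```python
import torch
import torch.nn as nn
from torch.distributions import Normal

class Actor(nn.Module):
    """Maps a state to a Normal distribution over the 2-dim continuous action space."""
    def __init__(self, state_dim=8, action_dim=2, hidden=128):  # sizes are assumed
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.mean = nn.Linear(hidden, action_dim)
        self.log_std = nn.Linear(hidden, action_dim)

    def forward(self, state):
        h = self.body(state)
        # Clamp log-std for numerical stability (range is an illustrative choice)
        std = self.log_std(h).clamp(-20, 2).exp()
        return Normal(self.mean(h), std)

class Critic(nn.Module):
    """Q-value function: scores a (state, action) pair with a single scalar."""
    def __init__(self, state_dim=8, action_dim=2, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state, action):
        return self.net(torch.cat([state, action], dim=-1))

# Sample an action, then add exploration noise (scale 0.1 is an assumption)
actor, critic = Actor(), Critic()
state = torch.zeros(1, 8)
action = actor(state).sample()
noisy_action = (action + 0.1 * torch.randn_like(action)).clamp(-1.0, 1.0)
q_value = critic(state, noisy_action)
```

The clamp keeps the noisy action inside LunarLanderContinuous-v2's [-1, 1] action bounds.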
Continuous action space (top) and discrete action space (bottom) results from testing. There may have been training errors in the discrete run, but the continuous agent clearly lands more smoothly on the terrain despite fewer training episodes.
- Clone the repository:

```bash
git clone https://github.com/CodeKnight314/Lunar-Lander-RL.git
cd Lunar-Lander-RL
```

- Create and activate a virtual environment (optional but recommended):

```bash
python -m venv lunar-env
source lunar-env/bin/activate
```

- Install the necessary packages:

```bash
pip install -r src/requirements.txt
```

You can start the training process via main.py:

```bash
python src/main.py --c path/to/config --o path/to/output --train --env [discrete/continuous]
```

This will train the model with the given config, record it in simulation, and save the results to the specified directory. Use the --env flag to select the discrete or continuous action space.
You can start and save evaluation results via main.py:
```bash
python src/main.py --c path/to/config \
                   --o path/to/output \
                   --w path/to/weights \
                   --test \
                   --env [discrete/continuous]
```
