MuJoCo Humanoid Reinforcement Learning Experiment

This project uses Stable Baselines 3 to train a MuJoCo Humanoid to stand up with reinforcement learning.

Converting to Jupyter Notebook for Colab

To convert the humanoid_rl.py file to a Jupyter notebook for Google Colab:

Option 1: upload the file to Google Colab:
- Go to Google Colab
- Click File > Upload notebook > Choose file and select humanoid_rl.py

Option 2: create a new notebook manually and copy each section between the # %% markers into its own cell:
- Add code blocks as code cells
- Add markdown sections (those starting with # %% [markdown]) as text cells, removing the # %% [markdown] marker and the leading # from each line
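
The manual copy step can also be scripted. The following is a minimal sketch, not part of this repository, that splits a # %% delimited script into cells and builds a notebook in the nbformat 4 JSON layout using only the standard library. The function names percent_script_to_notebook and convert_file are hypothetical helpers introduced for illustration.

```python
import json
import re

# Matches cell markers such as "# %%" or "# %% [markdown]" at the start of a line.
CELL_MARKER = re.compile(r"^# %%(.*)$")


def percent_script_to_notebook(source: str) -> dict:
    """Split a '# %%'-delimited script into an nbformat-4 notebook dict."""
    cells = []
    kind, lines = "code", []

    def flush():
        # Turn the lines collected so far into one cell (skipping empty cells).
        text = "\n".join(lines).strip("\n")
        if not text:
            return
        if kind == "markdown":
            # Markdown is stored as comments in the script: drop the leading "# ".
            text = "\n".join(re.sub(r"^# ?", "", ln) for ln in text.split("\n"))
            cells.append({"cell_type": "markdown", "metadata": {}, "source": text})
        else:
            cells.append({"cell_type": "code", "metadata": {}, "source": text,
                          "outputs": [], "execution_count": None})

    for line in source.splitlines():
        match = CELL_MARKER.match(line)
        if match:
            flush()
            kind = "markdown" if "[markdown]" in match.group(1) else "code"
            lines = []
        else:
            lines.append(line)
    flush()
    return {"cells": cells, "metadata": {}, "nbformat": 4, "nbformat_minor": 4}


def convert_file(script_path: str, notebook_path: str) -> None:
    """Hypothetical helper: read a script and write the notebook JSON next to it."""
    with open(script_path, encoding="utf-8") as f:
        notebook = percent_script_to_notebook(f.read())
    with open(notebook_path, "w", encoding="utf-8") as f:
        json.dump(notebook, f, indent=1)
```

Calling convert_file("humanoid_rl.py", "humanoid_rl.ipynb") would then produce a file that can be uploaded through File > Upload notebook. The jupytext package offers the same conversion as a maintained tool and may be preferable for anything beyond a quick one-off.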

Add this at the beginning of your notebook to install dependencies:

!pip install gymnasium[mujoco]==0.28.1
!pip install stable-baselines3==1.8.0

The experiment trains a PPO (Proximal Policy Optimization) agent on the HumanoidStandup-v4 environment from Gymnasium.
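
As a rough illustration of what such a training run looks like, here is a minimal sketch. It is not the code from humanoid_rl.py: it uses the library's default PPO hyperparameters, and the function names train and evaluate, the timestep budget, and the seed are assumptions made for the example. It also assumes a Stable Baselines 3 release built on Gymnasium (2.0 or later); earlier releases target the older gym package.

```python
ENV_ID = "HumanoidStandup-v4"  # Gymnasium's ID for the MuJoCo stand-up task


def train(total_timesteps: int = 100_000, seed: int = 0):
    """Train a PPO agent with default hyperparameters and return the model.

    The timestep budget and seed are illustrative values, not the ones
    used in the experiment.
    """
    # Imported lazily so this module loads even without the RL stack installed.
    from stable_baselines3 import PPO

    # Passing the environment ID as a string lets SB3 create and wrap the env.
    model = PPO("MlpPolicy", ENV_ID, seed=seed, verbose=1)
    model.learn(total_timesteps=total_timesteps)
    return model


def evaluate(model, n_episodes: int = 5):
    """Return (mean, std) of the episode return over a few evaluation episodes."""
    from stable_baselines3.common.evaluation import evaluate_policy

    return evaluate_policy(model, model.get_env(), n_eval_episodes=n_episodes)
```

In a notebook, model = train() followed by evaluate(model) would give a first baseline number; a useful stand-up policy typically needs far more than the illustrative 100,000 steps.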

Three parameters are varied to evaluate their impact on performance.

Results are visualized with training curves and performance metrics.
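
Raw episode returns are noisy, so training curves are usually smoothed before plotting. The sketch below shows one common approach, a trailing moving average, followed by a small plotting helper. It is an assumption about how such curves could be drawn, not the plotting code used in this project; the names moving_average and plot_curves, the window size, and the output path are hypothetical.

```python
def moving_average(values, window):
    """Trailing moving average; early points average over the values seen so far."""
    if window < 1:
        raise ValueError("window must be at least 1")
    smoothed, running_sum = [], 0.0
    for i, value in enumerate(values):
        running_sum += value
        if i >= window:
            # Drop the value that just left the window.
            running_sum -= values[i - window]
        smoothed.append(running_sum / min(i + 1, window))
    return smoothed


def plot_curves(curves, window=20, path="learning_curves.png"):
    """Plot smoothed returns for several configurations.

    `curves` maps a configuration label to its list of episode returns.
    """
    # Imported lazily so the smoothing helper works without matplotlib.
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    for label, returns in curves.items():
        ax.plot(moving_average(returns, window), label=label)
    ax.set_xlabel("Episode")
    ax.set_ylabel(f"Return (moving average, window={window})")
    ax.legend()
    fig.savefig(path)
    plt.close(fig)
```

For example, moving_average([1, 2, 3, 4], 2) yields [1.0, 1.5, 2.5, 3.5]. A larger window gives a smoother curve at the cost of lagging behind sudden changes in performance.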
