Final project for Bayesian theory and computation @ PKU (2021 Spring).
This project trains an RL agent to drive in the CarRacing-v0
environment using features extracted by a Q-Consistency regularized VAE (QC-VAE).
The overall pipeline is illustrated in the figure below.
You can find our thesis here.
This project depends on the following python packages:
- pytorch
- cudatoolkit=10.2
- tensorflow=1.15.0
- tqdm
- Pillow
- gym
- pybox2d
- pyvirtualdisplay
as well as the following Linux libraries:
- xvfb
For your convenience, you can run the setup script to configure the runtime environment:
sudo bash setup.sh
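If you work on a headless server, CarRacing-v0 still needs a display to render into; that is what the xvfb dependency above is for. The snippet below is a minimal smoke test (not part of this repo) showing how pyvirtualdisplay can wrap xvfb around a gym rollout; it assumes the pre-0.26 gym API that matches the package versions listed above:

```python
import gym
from pyvirtualdisplay import Display

# Start an xvfb-backed virtual display so CarRacing-v0 can render off-screen.
display = Display(visible=False, size=(1400, 900))
display.start()

env = gym.make("CarRacing-v0")
obs = env.reset()                                  # 96x96x3 RGB frame
for _ in range(10):
    obs, reward, done, info = env.step(env.action_space.sample())
    if done:
        obs = env.reset()
env.close()
display.stop()
```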
Sub-modules:
- OpenAI-GYM-CarRacing-DQN: contains a pre-trained expert DQN agent whose actions are used when training our Q-network
- TD3: forked from this repo and modified by Zihan Mao; contains the DDPG (and CNN-DDPG) components
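These pieces fit together roughly as follows (see train_agent.py below): once the QC-VAE is trained, it is frozen and its latent code stands in for raw pixels as the DDPG agent's state. The sketch below illustrates that feature-extraction step only; the `QCVAE`-style `encode()` interface, the use of the latent mean, and the `select_action` call are assumptions for illustration, not necessarily the exact names used in the repo:

```python
import torch

@torch.no_grad()
def extract_features(vae, frame):
    """Encode a preprocessed frame tensor of shape (1, C, H, W) into a latent code.

    Assumes the QC-VAE exposes encode(x) -> (mu, logvar); the latent mean is
    used as a deterministic, low-dimensional state for the DDPG agent.
    """
    vae.eval()
    mu, _ = vae.encode(frame)
    return mu.squeeze(0)

# Illustrative control loop: the agent never sees pixels, only latent features.
# state = extract_features(vae, preprocess(obs))
# action = agent.select_action(state.cpu().numpy())
# obs, reward, done, info = env.step(action)
```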
In this repo:
VAE.py
: implement a VAE with convolutional layers
train_critic.py
: train the Q-network using actions performed by the expert DQN agent
train_vae.py
: train the QC-VAE with the usual VAE loss plus our Q-consistency loss computed by the pre-trained Q-network (see the sketch after this list)
train_agent.py
: train the DDPG agent on features extracted by the QC-VAE
baseline.py
: train the baseline agent (DDPG with convolutional layers) directly on the raw image inputs
util.py
: helper functions for processing image inputs
model/
: model folder
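For orientation, the objective optimized in train_vae.py combines the usual VAE terms with the Q-consistency regularizer. The sketch below is a minimal, hypothetical PyTorch rendering of that idea: the exact form of the consistency term (here, matching the frozen Q-network's outputs on the reconstruction to its outputs on the original frame), the weights `beta`/`lam`, and the `vae.encode`/`vae.decode` interface are assumptions, not the repo's exact code.

```python
import torch
import torch.nn.functional as F

def qcvae_loss(vae, q_net, obs, beta=1.0, lam=1.0):
    """Hypothetical QC-VAE objective: standard VAE terms plus a Q-consistency
    penalty computed with a frozen, pre-trained Q-network.

    Assumes `vae` exposes encode(x) -> (mu, logvar) and decode(z), and that
    `q_net` maps image observations to per-action Q-values.
    """
    mu, logvar = vae.encode(obs)
    std = torch.exp(0.5 * logvar)
    z = mu + std * torch.randn_like(std)          # reparameterization trick
    recon = vae.decode(z)

    # Usual VAE loss: reconstruction error + KL divergence to the unit Gaussian prior.
    recon_loss = F.mse_loss(recon, obs, reduction="mean")
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())

    # Q-consistency term (one plausible form): the frozen Q-network should rate
    # the reconstruction the same way it rates the original frame, so the latent
    # code is pushed to retain decision-relevant information.
    with torch.no_grad():
        q_target = q_net(obs)
    q_consistency = F.mse_loss(q_net(recon), q_target)

    return recon_loss + beta * kl + lam * q_consistency
```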
Code Author: Zihan Mao -- Personal Email, Educational Email
Project Link: https://github.com/Mzhhh/VAEFRL
Thanks to the related repositories on GitHub and various questions on Stack Overflow, without which I couldn't have written a single line of bug-free code.