
Soft Actor-Critic for CaRL


Car game for Reinforcement Learning


Actual in-game footage; the policy network is playing the game.

For more information about the game itself, check out CaRL. Unlike most other SAC car games, this implementation relies on images instead of distance vectors as the state vector for the network. A convolutional autoencoder is used to reduce the image dimension. This is an example of how to use the simple car game with a reinforcement learning algorithm.

1. Getting Started

You may want to play the game manually first to get a feeling for it (see 1.2).
Inside the folder SavedWeights you can find pretrained parameters for the autoencoder and the policy network.

1.1. Requirements

The game itself needs:

pip install numpy pygame opencv-python

The SAC implementation uses PyTorch and the autoencoder uses TensorFlow:

pip install torch torchvision
pip install tensorflow

(Sorry, you need to install both.)

1.2. Play the game manually

The game can be played manually using W, A, S and D for steering. Press E to engage the 'autopilot' (policy network) and R to disable it again. Press Q to quit the game.
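
As a rough idea of how this control scheme maps onto pygame events, here is a minimal sketch (illustrative only, not the actual loop from PygamePlayCar.py):

```python
import pygame

# Illustrative sketch of the control scheme described above,
# not the actual event loop from PygamePlayCar.py.
pygame.init()
screen = pygame.display.set_mode((640, 480))
autopilot = False
running = True
while running:
    for event in pygame.event.get():
        if event.type == pygame.KEYDOWN:
            if event.key == pygame.K_e:
                autopilot = True    # engage the policy network
            elif event.key == pygame.K_r:
                autopilot = False   # back to manual control
            elif event.key == pygame.K_q:
                running = False     # quit the game
    keys = pygame.key.get_pressed()
    throttle = float(keys[pygame.K_w]) - float(keys[pygame.K_s])
    steering = float(keys[pygame.K_d]) - float(keys[pygame.K_a])
    # ... feed (steering, throttle) to the game, or query the policy
    # network instead while autopilot is True ...
pygame.quit()
```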

1.3. Train your own SAC

To train the SAC yourself, just check out TrainSoftActorCritc.ipynb. One nice thing is that you can check the current training progress by playing the game manually with the last saved weights of the policy function.
What is really important is that you design the reward function inside the game so it suits your needs. For example, I did quite a few experiments to get the car going straight without shaking and wiggling around too much.
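
To give an idea of what such reward shaping can look like, here is a hypothetical sketch (not the game's actual scoring code; `speed`, `action`, and `prev_action` are assumed quantities):

```python
import numpy as np

def shaped_reward(speed, action, prev_action, smooth_weight=0.1):
    """Hypothetical shaped reward: progress minus a penalty for shaky inputs.

    speed       -- forward progress of the car in this step
    action      -- current (steering, throttle) pair
    prev_action -- previous (steering, throttle) pair
    """
    # Penalize large changes between consecutive actions so the policy
    # learns to drive straight without wiggling around.
    shakiness = np.sum(np.abs(np.asarray(action) - np.asarray(prev_action)))
    return speed - smooth_weight * shakiness
```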

2. Technical details

The environment outputs an image, which a convolutional autoencoder reduces to 55 dimensions. Appended to this encoded state are the last two actions taken by the agent. This is done to control the shakiness of the driving: the environment penalizes shaky driving based on the past actions. Hence, the agent receives a vector with 59 entries.

2.1. Convolutional autoencoder

In order to train the SAC successfully, we need to reduce the dimensions of the image, and an autoencoder is used for that. Without going into full detail here: an autoencoder consists of an encoder and a decoder. The encoder reduces the dimension of the image, and the decoder tries to reconstruct the image from the encoded vector. The encoded vector is what we are interested in for SAC; the decoder is only used during training, so that the encoded vector is somewhat meaningful.
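
A minimal Keras sketch of such an autoencoder (the 64x64 input size and the layer widths are assumptions for illustration; only the 55-dimensional bottleneck comes from this README):

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Sketch of a convolutional autoencoder with a 55-dimensional bottleneck.
# The input size (64x64, 1 channel) and layer widths are assumptions;
# only the bottleneck size of 55 comes from this README.
encoder = models.Sequential([
    layers.Input(shape=(64, 64, 1)),
    layers.Conv2D(16, 3, strides=2, padding="same", activation="relu"),
    layers.Conv2D(32, 3, strides=2, padding="same", activation="relu"),
    layers.Flatten(),
    layers.Dense(55, activation="relu"),  # encoded state for SAC
])
decoder = models.Sequential([
    layers.Input(shape=(55,)),
    layers.Dense(16 * 16 * 32, activation="relu"),
    layers.Reshape((16, 16, 32)),
    layers.Conv2DTranspose(16, 3, strides=2, padding="same", activation="relu"),
    layers.Conv2DTranspose(1, 3, strides=2, padding="same", activation="sigmoid"),
])
autoencoder = models.Sequential([encoder, decoder])
# Binary images, so binary cross-entropy is a natural reconstruction loss.
autoencoder.compile(optimizer="adam", loss="binary_crossentropy")
# Training: autoencoder.fit(images, images, ...); at inference time only
# encoder.predict(images) is used to produce the state vector for SAC.
```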

Here is an example of what the autoencoder does:

[Input image | Reconstructed output image]

Note that the images here are binary. You can of course increase the dimension of the encoded vector to get a better output image. However, this also means that SAC has more dimensions to explore (training of SAC will take longer and might even become unstable).

2.2. State vector

Inside PygamePlayCar.py you can find self.action_space, which represents the state of the environment. self.action_space consists of the encoded vector and the past two actions taken by the agent. I added the two actions because the scoring system of the game penalizes shaky inputs. The length of the state vector is 59.
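
Assembling the state could look roughly like this (a sketch; it assumes each action has two components, e.g. steering and throttle, which matches 55 + 2 × 2 = 59):

```python
import numpy as np

def build_state(encoded, last_action, second_last_action):
    """Concatenate the 55-d encoded image with the two most recent
    actions (assumed 2-d each: steering, throttle) -> 59 entries."""
    state = np.concatenate([encoded, last_action, second_last_action])
    assert state.shape == (59,)
    return state

# Usage with dummy data:
state = build_state(np.zeros(55), np.array([0.1, 0.8]), np.array([0.0, 0.7]))
```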

Authors

License

No license yet

Acknowledgments
