tuxkart-ai

LSTM model arch: (for RL agent)

==========================================================================================
Layer (type:depth-idx)                   Output Shape              Param #
==========================================================================================
Net                                      --                        --
├─LSTM: 1-1                              [5, 8, 256]               --
│    └─LSTM: 2-1                         [5, 8, 256]               1,052,672
├─Actor: 1-2                             [8, 13]                   --
│    └─Sequential: 2-2                   [8, 13]                   --
│    │    └─Linear: 3-1                  [8, 128]                  32,896
│    │    └─Tanh: 3-2                    [8, 128]                  --
│    │    └─Linear: 3-3                  [8, 13]                   1,677
├─Critic: 1-3                            [8, 1]                    --
│    └─Sequential: 2-3                   [8, 1]                    --
│    │    └─Linear: 3-4                  [8, 128]                  32,896
│    │    └─Tanh: 3-5                    [8, 128]                  --
│    │    └─Linear: 3-6                  [8, 1]                    129
==========================================================================================
Total params: 1,120,270
Trainable params: 1,120,270
Non-trainable params: 0
Total mult-adds (M): 42.65
==========================================================================================
Input size (MB): 0.04
Forward/backward pass size (MB): 0.10
Params size (MB): 4.48
Estimated Total Size (MB): 4.62
==========================================================================================

VAE model arch: (for representation learning)

==========================================================================================
Layer (type:depth-idx)                   Output Shape              Param #
==========================================================================================
ConvVAE                                  --                        --
├─Encoder: 1-1                           [8, 128]                  --
│    └─Sequential: 2-1                   [8, 1, 75, 50]            --
│    │    └─Conv2d: 3-1                  [8, 128, 149, 99]         12,800
│    │    └─ReLU: 3-2                    [8, 128, 149, 99]         --
│    │    └─BatchNorm2d: 3-3             [8, 128, 149, 99]         256
│    │    └─Conv2d: 3-4                  [8, 256, 150, 100]        524,288
│    │    └─ReLU: 3-5                    [8, 256, 150, 100]        --
│    │    └─BatchNorm2d: 3-6             [8, 256, 150, 100]        512
│    │    └─Conv2d: 3-7                  [8, 128, 149, 99]         524,288
│    │    └─ReLU: 3-8                    [8, 128, 149, 99]         --
│    │    └─BatchNorm2d: 3-9             [8, 128, 149, 99]         256
│    │    └─Conv2d: 3-10                 [8, 1, 75, 50]            1,152
│    └─Linear: 2-2                       [8, 128]                  480,128
│    └─Linear: 2-3                       [8, 128]                  480,128
├─Decoder: 1-2                           [8, 1, 600, 400]          --
│    └─Linear: 2-4                       [8, 3750]                 483,750
│    └─Sequential: 2-5                   [8, 1, 600, 400]          --
│    │    └─ConvTranspose2d: 3-11        [8, 128, 149, 99]         1,280
│    │    └─ReLU: 3-12                   [8, 128, 149, 99]         --
│    │    └─ConvTranspose2d: 3-13        [8, 256, 149, 99]         295,168
│    │    └─ReLU: 3-14                   [8, 256, 149, 99]         --
│    │    └─ConvTranspose2d: 3-15        [8, 128, 296, 196]        524,416
│    │    └─ReLU: 3-16                   [8, 128, 296, 196]        --
│    │    └─ConvTranspose2d: 3-17        [8, 1, 600, 400]          12,801
==========================================================================================
Total params: 3,341,223
Trainable params: 3,341,223
Non-trainable params: 0
Total mult-adds (G): 429.30
==========================================================================================
Input size (MB): 7.68
Forward/backward pass size (MB): 1828.52
Params size (MB): 13.36
Estimated Total Size (MB): 1849.57
==========================================================================================

TODO:

keep a moving finish line?
clean up encoding infos
use stackedvec - stable_baselines_3
train VAE on RGB images instead of using grayscale image as model input
another regularization step (maybe) would be to take x number of actions randomly, like not sample from the dist but totally random steps and have the model recover from that - so as to see how the model recovers from that state?

Observations:

Half Precision doesn't work
Training on RGB image absolutely does nothing

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
models/vae		models/vae
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
inspect_vae.py		inspect_vae.py
tests.py		tests.py
train.py		train.py
train_vae.py		train_vae.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

models/vae

models/vae

src

src

.gitattributes

.gitattributes

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

eval.py

eval.py

inspect_vae.py

inspect_vae.py

tests.py

tests.py

train.py

train.py

train_vae.py

train_vae.py

Repository files navigation

tuxkart-ai

VAE model arch: (for representation learning)

TODO:

Observations:

References

About

Releases

Packages

Languages

License

notjedi/tuxkart-ai

Folders and files

Latest commit

History

Repository files navigation

tuxkart-ai

VAE model arch: (for representation learning)

TODO:

Observations:

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages