Model needs more training for it to have better scores. Right now, it trains for only 10000 episodes.
The code runs Google Colab, still needs more debugging.
- Fix the overfitting issue in DDQN
- Add a save_model and load_model method
- Graph and analyse the loss and progress of the model