design a reinforcement learning algorithm that leverages prior experience to figure out how to solve new tasks quickly
- CartPole-v1 -
cart
- MountainCarContinuous-v0 -
car
- Acrobot-v1 -
acro
--mode=train --trained_model=cart --test_game=cart
--mode=fine --trained_model=acro --test_game=cart
--mode=transfer --trained_model=cart --trained_model2=acro --test_game=car