Sarsa(Lambda) algorithm
Train
python3 run_sarsa_lambda.py --verbose
Test
python3 run_sarsa_lambda.py --verbose --iter 10 --load_path weights.npy
TAMER algorithm
Train
python3 run_tamer.py --verbose
Test
python3 run_tamer.py --verbose --load_path tamer.npy
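
Command-line sketch
The commands above use three flags: --verbose, --iter and --load_path. As a rough illustration of how such flags could be handled, here is a minimal sketch using Python's standard argparse module. It is not taken from run_sarsa_lambda.py or run_tamer.py. The names build_parser and describe, the default values, and the rule that passing --load_path means "test" are all assumptions made for this example.

```python
import argparse


def build_parser():
    """Hypothetical parser covering only the flags shown in the commands above."""
    parser = argparse.ArgumentParser(description="Sketch of a train/test command line")
    # --verbose: assumed to be a plain on/off switch that takes no value
    parser.add_argument("--verbose", action="store_true")
    # --iter: assumed to be an integer count; the default of None is a guess
    parser.add_argument("--iter", type=int, default=None)
    # --load_path: path to a saved weights file such as weights.npy or tamer.npy
    parser.add_argument("--load_path", type=str, default=None)
    return parser


def describe(argv):
    """Return (mode, iterations, load_path, verbose) for a list of arguments."""
    args = build_parser().parse_args(argv)
    # Assumption: supplying saved weights means the run is a test run
    mode = "test" if args.load_path else "train"
    return mode, args.iter, args.load_path, args.verbose


if __name__ == "__main__":
    print(describe(["--verbose", "--iter", "10", "--load_path", "weights.npy"]))
```

Under these assumptions, describe(["--verbose"]) is treated as a training run, and adding --load_path switches it to a test run. The real scripts may interpret the flags differently.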