## Solving the taxi problem using Q-learning

To demonstrate the problem, let's say our agent is a taxi driver. There are four locations, and
the agent has to pick up a passenger at one location and drop them off at another. The
agent receives +20 points as a reward for a successful drop-off and -1 point for every time
step it takes. The agent also loses 10 points for every illegal pickup or drop-off. So the goal of
our agent is to learn to pick up and drop off passengers at the correct locations in as few
time steps as possible, without attempting illegal pickups or drop-offs.
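
To make the reward scheme concrete, the sketch below tallies the total reward of a single episode. It assumes that every step earns exactly one of the three rewards described above (-1 for an ordinary step, -10 for an illegal pickup or drop-off, +20 for the final successful drop-off). The function name `episode_return` and its parameters are illustrative and not part of Gym.

```python
def episode_return(ordinary_steps, illegal_actions, delivered=True):
    """Total reward of one episode under the assumed reward scheme."""
    STEP_REWARD = -1       # every ordinary time step
    ILLEGAL_REWARD = -10   # every illegal pickup or drop-off
    DROPOFF_REWARD = 20    # the final successful drop-off

    total = ordinary_steps * STEP_REWARD + illegal_actions * ILLEGAL_REWARD
    if delivered:
        total += DROPOFF_REWARD
    return total

print(episode_return(ordinary_steps=12, illegal_actions=0))  # 8
print(episode_return(ordinary_steps=30, illegal_actions=2))  # -30
```

A short, clean episode ends with a small positive total, while a long episode with illegal actions ends deep in the negatives. This is the pattern to look for in the training output later on.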

In [1]:
#Importing the required libraries

import gym
import random
import numpy as np

In [2]:
env = gym.make("Taxi-v1")

[2020-02-26 22:48:09,344] Making new env: Taxi-v1


In [4]:
env.render()

+---------+
|R: | : :G|
| : : : : |
| : : : : |
| | : | : |
|Y| : |B: |
+---------+



In [5]:
alpha = 0.4
gamma = 0.999
epsilon = 0.017

# Initialize the Q table as a dictionary that maps each (state, action) pair to its Q value

q = {}
for s in range(env.nS):
    for a in range(env.nA):
        q[(s,a)] = 0.0
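
The number of states reported by `env.nS` comes from how the environment packs its situation into a single integer. The sketch below assumes the encoding used by Gym's Taxi implementation: 25 taxi positions on the 5x5 grid, 5 passenger locations (the four landmarks plus "in the taxi"), and 4 destinations, which gives 500 states. The helpers `encode` and `decode` are written out here for illustration only; treat them as an assumption rather than the exact code of the environment version used in this notebook.

```python
def encode(taxi_row, taxi_col, passenger_loc, destination):
    # Mixed-radix packing of (row, col, passenger, destination) with sizes (5, 5, 5, 4)
    state = taxi_row
    state = state * 5 + taxi_col
    state = state * 5 + passenger_loc
    state = state * 4 + destination
    return state

def decode(state):
    # Reverse of encode: peel off the components from least to most significant
    destination = state % 4
    state //= 4
    passenger_loc = state % 5
    state //= 5
    taxi_col = state % 5
    taxi_row = state // 5
    return taxi_row, taxi_col, passenger_loc, destination

n_states = 5 * 5 * 5 * 4   # assumed size of the state space
n_actions = 6              # actions 0..5, as seen in the printed Q table
print(n_states * n_actions)  # number of entries in the Q table
```

Under these assumptions the dictionary `q` holds 3000 entries, one per (state, action) pair.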

In [6]:
print(q)

{(0, 0): 0.0, (0, 1): 0.0, (0, 2): 0.0, (0, 3): 0.0, (0, 4): 0.0, (0, 5): 0.0, (1, 0): 0.0, (1, 1): 0.0, (1, 2): 0.0, (1, 3): 0.0, (1, 4): 0.0, (1, 5): 0.0, ...}

In [8]:
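# Let's create a function for updating the Q table using the Q-learning rule:
# Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
def update_q_table(prev_state, action, reward, next_state, alpha, gamma):
    # Highest Q value available from the next state
    qa = max(q[(next_state, a)] for a in range(env.nA))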
    q[(prev_state,action)] += alpha*(reward + gamma*qa - q[(prev_state,action)])
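
As a worked example of the update rule, the sketch below applies it twice to a toy Q table that starts at zero, using the same `alpha` and `gamma` as above. The two-state table and the function name `update` are stand-ins chosen for illustration, so that the example runs without the environment.

```python
alpha = 0.4
gamma = 0.999
n_actions = 6

# Toy Q table with two states, all values starting at zero
q = {(s, a): 0.0 for s in range(2) for a in range(n_actions)}

def update(prev_state, action, reward, next_state):
    # Best value reachable from the next state
    best_next = max(q[(next_state, a)] for a in range(n_actions))
    # Target the current estimate is moved towards
    td_target = reward + gamma * best_next
    q[(prev_state, action)] += alpha * (td_target - q[(prev_state, action)])

update(prev_state=0, action=2, reward=-1, next_state=1)
print(q[(0, 2)])  # -0.4

# A second identical transition moves the estimate further towards -1
update(prev_state=0, action=2, reward=-1, next_state=1)
print(q[(0, 2)])
```

With all next-state values at zero, the first update is 0 + 0.4 * (-1 + 0.999 * 0 - 0) = -0.4. Each repeat closes 40% of the remaining gap to the target, which is the role of `alpha`.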
    

In [10]:
# Let's define the function for the epsilon-greedy policy
def epsilon_greedy_policy(state,epsilon):
    if random.uniform(0,1)< epsilon:
        return env.action_space.sample()
    else:
        return max(list(range(env.nA)), key = lambda x: q[(state,x)])
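
To see how rarely the policy explores with `epsilon = 0.017`, the sketch below runs the same selection rule on a hand-made list of Q values for a single state. The function `epsilon_greedy`, the sample values, and the seeded generator are assumptions made for this illustration; they stand in for `env` and `q` so the example is self-contained.

```python
import random

def epsilon_greedy(q_values, epsilon, rng=random):
    """q_values: list of Q values for one state, indexed by action."""
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random
        return rng.randrange(len(q_values))
    # Exploit: pick the action with the highest Q value
    return max(range(len(q_values)), key=lambda a: q_values[a])

rng = random.Random(0)  # seeded so the run is repeatable
q_values = [0.0, -0.4, 1.5, 0.2, -1.0, 0.0]  # action 2 is the greedy choice

picks = [epsilon_greedy(q_values, epsilon=0.017, rng=rng) for _ in range(10_000)]
greedy_share = picks.count(2) / len(picks)
print(greedy_share)
```

The greedy action should be chosen in roughly 98 to 99 percent of the draws, since exploration happens with probability 0.017 and even then picks the greedy action one time in six.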

In [11]:
# Let's now perform Q-learning

nb_episode = 80000
for epis in range(nb_episode):
    r = 0
    # First, reset the environment to get the initial state
    prev_state = env.reset()
    while True:
        # Select an action with the epsilon-greedy policy
        action = epsilon_greedy_policy(prev_state, epsilon)
        # Take the selected action and move to the next state
        next_state, reward, done, _ = env.step(action)
        # Update the Q value
        update_q_table(prev_state, action, reward, next_state, alpha, gamma)
        # The next state becomes the previous state for the next step
        prev_state = next_state

        # Accumulate the reward in r
        r += reward
        # If done, i.e. we have reached the terminal state of the episode,
        # break the loop and start the next episode
        if done:
            break
    print('total reward: ', r)

env.close()

total reward:  -2357
total reward:  -925
total reward:  -124
total reward:  -2212
total reward:  -628
total reward:  -536
total reward:  -420
total reward:  -2368
total reward:  -844
total reward:  -558
total reward:  -1049
total reward:  -220
total reward:  -189
total reward:  -55
total reward:  18
total reward:  -911
total reward:  -126
total reward:  15
total reward:  -428
total reward:  -198
total reward:  -43
total reward:  -19
total reward:  -284
total reward:  -180
total reward:  -205
total reward:  -116
total reward:  -180
total reward:  -344
total reward:  -282
total reward:  -337
total reward:  -72
total reward:  15
total reward:  -454
total reward:  -215
total reward:  -37
total reward:  -204
total reward:  -231
total reward:  -273
total reward:  -70
total reward:  -1882
total reward:  -70
total reward:  9
total reward:  -157
total reward:  -221
total reward:  -285
total reward:  -164
total reward:  -7
total reward:  -120
total reward:  -193
total reward:  -51
...

total reward:  7
total reward:  4
total reward:  15
total reward:  11
total reward:  -1
total reward:  12
total reward:  11
total reward:  11
total reward:  13
total reward:  9
total reward:  9
total reward:  14
total reward:  17
total reward:  10
total reward:  11
total reward:  10
total reward:  11
total reward:  6
total reward:  -3
total reward:  -15
total reward:  6
total reward:  9
total reward:  6
total reward:  -16
total reward:  8
total reward:  6
total reward:  18
total reward:  12
total reward:  15
total reward:  7
total reward:  6
total reward:  15
total reward:  13
total reward:  5
total reward:  10
total reward:  6
total reward:  10
total reward:  14
total reward:  8
total reward:  8
total reward:  8
total reward:  10
total reward:  17
total reward:  9
total reward:  16
total reward:  6
total reward:  18
total reward:  17
total reward:  18
total reward:  8
total reward:  9
total reward:  7
total reward:  12
total reward:  7
total reward:  13
total reward:  15
...

total reward:  6
total reward:  6
total reward:  7
total reward:  10
total reward:  17
total reward:  8
total reward:  16
total reward:  9
total reward:  8
total reward:  8
total reward:  14
total reward:  8
total reward:  11
total reward:  10
total reward:  14
total reward:  10
total reward:  6
total reward:  0
total reward:  17
total reward:  6
total reward:  12
total reward:  7
total reward:  6
total reward:  7
total reward:  10
total reward:  3
total reward:  18
total reward:  8
total reward:  2
total reward:  10
total reward:  8
total reward:  8
total reward:  6
total reward:  13
total reward:  8
total reward:  18
total reward:  14
total reward:  12
total reward:  6
total reward:  10
total reward:  1
total reward:  14
total reward:  16
total reward:  10
total reward:  10
total reward:  12
total reward:  18
total reward:  12
total reward:  1
total reward:  12
total reward:  5
total reward:  14
total reward:  -5
total reward:  15
total reward:  9
total reward:  6
total reward:  3
...

total reward:  8
total reward:  14
total reward:  17
total reward:  6
total reward:  6
total reward:  9
total reward:  8
total reward:  19
total reward:  10
total reward:  5
total reward:  10
total reward:  18
total reward:  10
total reward:  1
total reward:  7
total reward:  9
total reward:  6
total reward:  15
total reward:  9
total reward:  13
total reward:  5
total reward:  17
total reward:  4
total reward:  4
total reward:  7
total reward:  16
total reward:  13
total reward:  11
total reward:  -6
total reward:  7
total reward:  5
total reward:  6
total reward:  17
total reward:  13
total reward:  19
total reward:  15
total reward:  9
total reward:  5
total reward:  8
total reward:  6
total reward:  14
total reward:  14
total reward:  12
total reward:  14
total reward:  13
total reward:  10
total reward:  17
total reward:  5
total reward:  16
total reward:  7
total reward:  8
total reward:  15
total reward:  4
total reward:  11
total reward:  6
total reward:  11
total reward:  7
...

total reward:  10
total reward:  8
total reward:  7
total reward:  7
total reward:  9
total reward:  8
total reward:  5
total reward:  6
total reward:  13
total reward:  13
total reward:  8
total reward:  10
total reward:  15
total reward:  13
total reward:  7
total reward:  9
total reward:  19
total reward:  6
total reward:  9
total reward:  8
total reward:  15
total reward:  14
total reward:  -5
total reward:  12
total reward:  9
total reward:  7
total reward:  6
total reward:  9
total reward:  11
total reward:  6
total reward:  9
total reward:  7
total reward:  17
total reward:  9
total reward:  6
total reward:  12
total reward:  11
total reward:  13
total reward:  14
total reward:  10
total reward:  8
total reward:  17
total reward:  8
total reward:  8
total reward:  16
total reward:  9
total reward:  14
total reward:  9
total reward:  14
total reward:  11
total reward:  10
total reward:  6
total reward:  9
total reward:  7
total reward:  8
total reward:  10
total reward:  7
...

total reward:  14
total reward:  14
total reward:  7
total reward:  9
total reward:  10
total reward:  8
total reward:  13
total reward:  12
total reward:  7
total reward:  5
total reward:  8
total reward:  13
total reward:  11
total reward:  7
total reward:  7
total reward:  5
total reward:  7
total reward:  7
total reward:  4
total reward:  11
total reward:  7
total reward:  9
total reward:  10
total reward:  7
total reward:  6
total reward:  9
total reward:  13
total reward:  12
total reward:  17
total reward:  10
total reward:  9
total reward:  8
total reward:  12
total reward:  9
total reward:  7
total reward:  15
total reward:  6
total reward:  15
total reward:  4
total reward:  11
total reward:  9
total reward:  18
total reward:  9
total reward:  10
total reward:  10
total reward:  11
total reward:  15
total reward:  9
total reward:  4
total reward:  7
total reward:  8
total reward:  5
total reward:  7
total reward:  7
total reward:  11
total reward:  6
total reward:  13
...

total reward:  7
total reward:  2
total reward:  7
total reward:  8
total reward:  -2
total reward:  7
total reward:  7
total reward:  7
total reward:  5
total reward:  15
total reward:  9
total reward:  11
total reward:  -3
total reward:  18
total reward:  13
total reward:  12
total reward:  15
total reward:  9
total reward:  9
total reward:  3
total reward:  6
total reward:  11
total reward:  11
total reward:  8
total reward:  19
total reward:  6
total reward:  8
total reward:  17
total reward:  13
total reward:  11
total reward:  19
total reward:  16
total reward:  5
total reward:  6
total reward:  18
total reward:  3
total reward:  7
total reward:  12
total reward:  6
total reward:  10
total reward:  5
total reward:  7
total reward:  9
total reward:  7
total reward:  4
total reward:  6
total reward:  18
total reward:  8
total reward:  9
total reward:  5
total reward:  15
total reward:  15
total reward:  11
total reward:  14
total reward:  15
total reward:  9
total reward:  8
...

total reward:  -4
total reward:  19
total reward:  11
total reward:  13
total reward:  6
total reward:  3
total reward:  9
total reward:  13
total reward:  7
total reward:  10
total reward:  9
total reward:  7
total reward:  9
total reward:  13
total reward:  11
total reward:  9
total reward:  8
total reward:  11
total reward:  9
total reward:  -6
total reward:  8
total reward:  7
total reward:  9
total reward:  6
total reward:  6
total reward:  6
total reward:  15
total reward:  7
total reward:  11
total reward:  14
total reward:  -1
total reward:  18
total reward:  11
total reward:  8
total reward:  14
total reward:  17
total reward:  5
total reward:  9
total reward:  11
total reward:  16
total reward:  15
total reward:  12
total reward:  7
total reward:  13
total reward:  9
total reward:  3
total reward:  5
total reward:  9
total reward:  8
total reward:  10
total reward:  -3
total reward:  6
total reward:  10
total reward:  7
total reward:  10
total reward:  3
total reward:  9
...

total reward:  7
total reward:  6
total reward:  15
total reward:  11
total reward:  7
total reward:  14
total reward:  9
total reward:  8
total reward:  3
total reward:  6
total reward:  6
total reward:  16
total reward:  5
total reward:  12
total reward:  12
total reward:  9
total reward:  7
total reward:  -2
total reward:  8
total reward:  12
total reward:  8
total reward:  5
total reward:  16
total reward:  7
total reward:  8
total reward:  9
total reward:  13
total reward:  8
total reward:  12
total reward:  15
total reward:  5
total reward:  7
total reward:  11
total reward:  9
total reward:  10
total reward:  8
total reward:  9
total reward:  10
total reward:  11
total reward:  10
total reward:  5
total reward:  11
total reward:  17
total reward:  9
total reward:  11
total reward:  10
total reward:  -3
total reward:  6
total reward:  18
total reward:  5
total reward:  5
total reward:  8
total reward:  15
total reward:  9
total reward:  13
total reward:  6
total reward:  0
...

total reward:  13
total reward:  9
total reward:  10
total reward:  18
total reward:  9
total reward:  15
total reward:  13
total reward:  11
total reward:  14
total reward:  14
total reward:  12
total reward:  10
total reward:  4
total reward:  5
total reward:  13
total reward:  8
total reward:  7
total reward:  12
total reward:  5
total reward:  19
total reward:  9
total reward:  11
total reward:  4
total reward:  7
total reward:  7
total reward:  2
total reward:  9
total reward:  12
total reward:  8
total reward:  12
total reward:  16
total reward:  13
total reward:  7
total reward:  11
total reward:  11
total reward:  9
total reward:  7
total reward:  12
total reward:  4
total reward:  12
total reward:  9
total reward:  13
total reward:  10
total reward:  8
total reward:  14
total reward:  -3
total reward:  10
total reward:  15
total reward:  9
total reward:  6
total reward:  4
total reward:  6
total reward:  11
total reward:  6
total reward:  10
total reward:  9
total reward:  4
...

total reward:  9
total reward:  9
total reward:  9
total reward:  10
total reward:  5
total reward:  10
total reward:  7
total reward:  11
total reward:  8
total reward:  8
total reward:  5
total reward:  8
total reward:  13
total reward:  14
total reward:  10
total reward:  9
total reward:  10
total reward:  7
total reward:  11
total reward:  3
total reward:  8
total reward:  6
total reward:  12
total reward:  16
total reward:  11
total reward:  6
total reward:  12
total reward:  7
total reward:  18
total reward:  12
total reward:  9
total reward:  4
total reward:  8
total reward:  5
total reward:  1
total reward:  8
total reward:  12
total reward:  12
total reward:  8
total reward:  2
total reward:  13
total reward:  9
total reward:  7
total reward:  7
total reward:  10
total reward:  -1
total reward:  17
total reward:  -3
total reward:  10
total reward:  6
total reward:  6
total reward:  11
total reward:  13
total reward:  6
total reward:  8
total reward:  7
total reward:  16
...

total reward:  1
total reward:  13
total reward:  12
total reward:  8
total reward:  9
total reward:  7
total reward:  9
total reward:  15
total reward:  4
total reward:  18
total reward:  9
total reward:  14
total reward:  7
total reward:  17
total reward:  6
total reward:  11
total reward:  13
total reward:  6
total reward:  9
total reward:  15
total reward:  12
total reward:  14
total reward:  13
total reward:  19
total reward:  11
total reward:  8
total reward:  6
total reward:  5
total reward:  4
total reward:  14
total reward:  7
total reward:  4
total reward:  5
total reward:  -3
total reward:  9
total reward:  16
total reward:  10
total reward:  9
total reward:  7
total reward:  7
total reward:  -4
total reward:  14
total reward:  8
total reward:  7
total reward:  11
total reward:  9
total reward:  5
total reward:  7
total reward:  16
total reward:  12
total reward:  5
total reward:  18
total reward:  3
total reward:  5
total reward:  11
total reward:  5
total reward:  12
...

total reward:  10
total reward:  11
total reward:  8
total reward:  10
total reward:  19
total reward:  7
total reward:  13
total reward:  7
total reward:  8
total reward:  9
total reward:  17
total reward:  10
total reward:  16
total reward:  8
total reward:  4
total reward:  18
total reward:  12
total reward:  16
total reward:  8
total reward:  12
total reward:  11
total reward:  12
total reward:  -3
total reward:  9
total reward:  6
total reward:  11
total reward:  8
total reward:  6
total reward:  15
total reward:  14
total reward:  18
total reward:  8
total reward:  10
total reward:  4
total reward:  9
total reward:  12
total reward:  11
total reward:  12
total reward:  11
total reward:  13
total reward:  16
total reward:  4
total reward:  13
total reward:  6
total reward:  13
total reward:  9
total reward:  14
total reward:  9
total reward:  17
total reward:  11
total reward:  17
total reward:  -4
total reward:  11
total reward:  8
total reward:  9
total reward:  10
...

total reward:  1
total reward:  8
total reward:  12
total reward:  -1
total reward:  3
total reward:  14
total reward:  11
total reward:  11
total reward:  10
total reward:  8
total reward:  5
total reward:  8
total reward:  4
total reward:  15
total reward:  14
total reward:  7
total reward:  15
total reward:  10
total reward:  5
total reward:  12
total reward:  6
total reward:  7
total reward:  10
total reward:  10
total reward:  14
total reward:  16
total reward:  13
total reward:  6
total reward:  4
total reward:  8
total reward:  13
total reward:  13
total reward:  6
total reward:  10
total reward:  8
total reward:  7
total reward:  10
total reward:  12
total reward:  7
total reward:  10
total reward:  12
total reward:  14
total reward:  12
total reward:  7
total reward:  4
total reward:  11
total reward:  6
total reward:  14
total reward:  10
total reward:  6
total reward:  7
total reward:  12
total reward:  7
total reward:  10
total reward:  6
total reward:  9
total reward:  10


total reward:  10
total reward:  7
total reward:  15
total reward:  19
total reward:  9
total reward:  14
total reward:  11
total reward:  13
total reward:  9
total reward:  15
total reward:  8
total reward:  9
total reward:  9
total reward:  8
total reward:  6
total reward:  8
total reward:  8
total reward:  7
total reward:  6
total reward:  11
total reward:  6
total reward:  6
total reward:  10
total reward:  9
total reward:  4
total reward:  6
total reward:  14
total reward:  10
total reward:  12
total reward:  6
total reward:  13
total reward:  8
total reward:  8
total reward:  9
total reward:  9
total reward:  -4
total reward:  11
total reward:  10
total reward:  10
total reward:  16
total reward:  5
total reward:  8
total reward:  8
total reward:  15
total reward:  7
total reward:  16
total reward:  7
total reward:  9
total reward:  10
total reward:  15
total reward:  9
total reward:  11
total reward:  7
total reward:  9
total reward:  8
total reward:  18
total reward:  11
total 

total reward:  8
total reward:  10
total reward:  8
total reward:  10
total reward:  13
total reward:  13
total reward:  0
total reward:  7
total reward:  10
total reward:  10
total reward:  14
total reward:  11
total reward:  9
total reward:  12
total reward:  10
total reward:  11
total reward:  9
total reward:  11
total reward:  8
total reward:  11
total reward:  5
total reward:  4
total reward:  13
total reward:  6
total reward:  14
total reward:  15
total reward:  6
total reward:  7
total reward:  -2
total reward:  8
total reward:  14
total reward:  14
total reward:  12
total reward:  13
total reward:  6
total reward:  10
total reward:  12
total reward:  14
total reward:  0
total reward:  4
total reward:  8
total reward:  8
total reward:  6
total reward:  7
total reward:  6
total reward:  6
total reward:  13
total reward:  4
total reward:  8
total reward:  8
total reward:  11
total reward:  10
total reward:  -3
total reward:  17
total reward:  15
total reward:  10
total reward:  15

total reward:  10
total reward:  8
total reward:  7
total reward:  10
total reward:  15
total reward:  9
total reward:  12
total reward:  5
total reward:  8
total reward:  11
total reward:  17
total reward:  12
total reward:  8
total reward:  9
total reward:  17
total reward:  8
total reward:  6
total reward:  13
total reward:  4
total reward:  10
total reward:  7
total reward:  10
total reward:  8
total reward:  12
total reward:  8
total reward:  15
total reward:  12
total reward:  8
total reward:  14
total reward:  10
total reward:  10
total reward:  11
total reward:  7
total reward:  16
total reward:  5
total reward:  -2
total reward:  14
total reward:  10
total reward:  11
total reward:  18
total reward:  13
total reward:  9
total reward:  10
total reward:  12
total reward:  7
total reward:  10
total reward:  9
total reward:  6
total reward:  19
total reward:  -3
total reward:  12
total reward:  15
total reward:  18
total reward:  10
total reward:  14
total reward:  9
...

total reward:  14
total reward:  13
total reward:  10
total reward:  9
total reward:  16
total reward:  15
total reward:  16
total reward:  -4
total reward:  19
total reward:  14
total reward:  16
total reward:  10
total reward:  14
total reward:  7
total reward:  14
total reward:  6
total reward:  -4
total reward:  9
total reward:  17
total reward:  14
total reward:  9
total reward:  12
total reward:  15
total reward:  9
total reward:  8
total reward:  9
total reward:  6
total reward:  10
total reward:  13
total reward:  6
total reward:  7
total reward:  2
total reward:  -3
total reward:  10
total reward:  -1
total reward:  12
total reward:  10
total reward:  14
total reward:  15
total reward:  13
total reward:  13
total reward:  10
total reward:  9
total reward:  8
total reward:  4
total reward:  8
total reward:  15
total reward:  10
total reward:  9
total reward:  10
total reward:  19
total reward:  12
total reward:  10
total reward:  11
total reward:  15
total reward:  13
...

total reward:  12
total reward:  3
total reward:  5
total reward:  8
total reward:  7
total reward:  11
total reward:  15
total reward:  6
total reward:  3
total reward:  7
total reward:  8
total reward:  8
total reward:  9
total reward:  12
total reward:  11
total reward:  16
total reward:  9
total reward:  11
total reward:  18
total reward:  14
total reward:  8
total reward:  11
total reward:  6
total reward:  10
total reward:  7
total reward:  6
total reward:  10
total reward:  10
total reward:  -4
total reward:  12
total reward:  14
total reward:  9
total reward:  11
total reward:  11
total reward:  12
total reward:  8
total reward:  9
total reward:  13
total reward:  9
total reward:  8
total reward:  15
total reward:  9
total reward:  13
total reward:  8
total reward:  5
total reward:  2
total reward:  12
total reward:  9
total reward:  6
total reward:  13
total reward:  6
total reward:  7
total reward:  -1
total reward:  8
total reward:  6
total reward:  11
total reward:  14
...

total reward:  7
total reward:  15
total reward:  12
total reward:  3
total reward:  9
total reward:  -4
total reward:  8
total reward:  3
total reward:  -3
total reward:  5
total reward:  10
total reward:  5
total reward:  16
total reward:  11
total reward:  11
total reward:  17
total reward:  5
total reward:  12
total reward:  13
total reward:  8
total reward:  15
total reward:  8
total reward:  9
total reward:  15
total reward:  10
total reward:  16
total reward:  14
total reward:  0
total reward:  11
total reward:  13
total reward:  17
total reward:  9
total reward:  14
total reward:  17
total reward:  10
total reward:  6
total reward:  7
total reward:  7
total reward:  8
total reward:  4
total reward:  8
total reward:  6
total reward:  12
total reward:  13
total reward:  18
total reward:  4
total reward:  9
total reward:  9
total reward:  11
total reward:  14
total reward:  11
total reward:  8
total reward:  10
total reward:  12
total reward:  14
total reward:  13
total reward:  1

total reward:  7
total reward:  12
total reward:  12
total reward:  13
total reward:  13
total reward:  10
total reward:  6
total reward:  7
total reward:  6
total reward:  8
total reward:  13
total reward:  14
total reward:  9
total reward:  9
total reward:  15
total reward:  9
total reward:  13
total reward:  10
total reward:  13
total reward:  9
total reward:  12
total reward:  0
total reward:  6
total reward:  8
total reward:  10
total reward:  -6
total reward:  15
total reward:  11
total reward:  11
total reward:  7
total reward:  7
total reward:  17
total reward:  2
total reward:  16
total reward:  6
total reward:  -2
total reward:  14
total reward:  11
total reward:  -4
total reward:  8
total reward:  14
total reward:  13
total reward:  11
total reward:  4
total reward:  7
total reward:  12
total reward:  12
total reward:  9
total reward:  10
total reward:  7
total reward:  19
total reward:  15
total reward:  9
total reward:  15
total reward:  4
total reward:  15
...

total reward:  10
total reward:  10
total reward:  10
total reward:  11
total reward:  11
total reward:  12
total reward:  -6
total reward:  10
total reward:  8
total reward:  13
total reward:  13
total reward:  6
total reward:  8
total reward:  14
total reward:  8
total reward:  14
total reward:  7
total reward:  6
total reward:  10
total reward:  7
total reward:  15
total reward:  14
total reward:  9
total reward:  12
total reward:  13
total reward:  7
total reward:  12
total reward:  7
total reward:  8
total reward:  18
total reward:  10
total reward:  8
total reward:  12
total reward:  13
total reward:  12
total reward:  8
total reward:  8
total reward:  7
total reward:  11
total reward:  13
total reward:  18
total reward:  8
total reward:  5
total reward:  19
total reward:  6
total reward:  11
total reward:  17
total reward:  14
total reward:  6
total reward:  8
total reward:  12
total reward:  14
total reward:  13
total reward:  13
total reward:  8
total reward:  -3
...

total reward:  16
total reward:  9
total reward:  15
total reward:  9
total reward:  6
total reward:  15
total reward:  11
total reward:  7
total reward:  6
total reward:  13
total reward:  14
total reward:  13
total reward:  5
total reward:  15
total reward:  13
total reward:  6
total reward:  9
total reward:  12
total reward:  12
total reward:  11
total reward:  8
total reward:  14
total reward:  12
total reward:  11
total reward:  8
total reward:  4
total reward:  5
total reward:  8
total reward:  11
total reward:  11
total reward:  15
total reward:  8
total reward:  10
total reward:  7
total reward:  14
total reward:  8
total reward:  13
total reward:  14
total reward:  10
total reward:  12
total reward:  11
total reward:  7
total reward:  11
total reward:  9
total reward:  -1
total reward:  14
total reward:  9
total reward:  10
total reward:  14
total reward:  9
total reward:  -1
total reward:  5
total reward:  10
total reward:  6
total reward:  8
total reward:  10
...

total reward:  6
total reward:  9
total reward:  7
total reward:  15
total reward:  9
total reward:  5
total reward:  12
total reward:  9
total reward:  10
total reward:  14
total reward:  4
total reward:  6
total reward:  -4
total reward:  11
total reward:  9
total reward:  9
total reward:  12
total reward:  9
total reward:  11
total reward:  11
total reward:  6
total reward:  8
total reward:  7
total reward:  7
total reward:  6
total reward:  11
total reward:  15
total reward:  11
total reward:  5
total reward:  9
total reward:  8
total reward:  12
total reward:  8
total reward:  8
total reward:  0
total reward:  11
total reward:  9
total reward:  5
total reward:  8
total reward:  10
total reward:  10
total reward:  12
total reward:  10
total reward:  6
total reward:  7
total reward:  15
total reward:  9
total reward:  7
total reward:  10
total reward:  9
total reward:  8
total reward:  6
total reward:  16
total reward:  8
total reward:  12
total reward:  12
total reward:  18
...

total reward:  10
total reward:  5
total reward:  11
total reward:  16
total reward:  13
total reward:  8
total reward:  14
total reward:  17
total reward:  13
total reward:  11
total reward:  7
total reward:  8
total reward:  8
total reward:  8
total reward:  11
total reward:  16
total reward:  7
total reward:  5
total reward:  6
total reward:  12
total reward:  7
total reward:  12
total reward:  7
total reward:  6
total reward:  10
total reward:  8
total reward:  6
total reward:  4
total reward:  10
total reward:  -1
total reward:  18
total reward:  8
total reward:  18
total reward:  8
total reward:  13
total reward:  9
total reward:  -2
total reward:  3
total reward:  9
total reward:  12
total reward:  14
total reward:  14
total reward:  12
total reward:  5
total reward:  14
total reward:  12
total reward:  13
total reward:  8
total reward:  5
total reward:  5
total reward:  14
total reward:  9
total reward:  11
total reward:  6
total reward:  6
total reward:  9
total reward:  7
...

total reward:  4
total reward:  9
total reward:  11
total reward:  10
total reward:  16
total reward:  8
total reward:  9
total reward:  7
total reward:  7
total reward:  10
total reward:  7
total reward:  9
total reward:  6
total reward:  18
total reward:  18
total reward:  17
total reward:  11
total reward:  11
total reward:  11
total reward:  10
total reward:  9
total reward:  18
total reward:  9
total reward:  16
total reward:  7
total reward:  13
total reward:  4
total reward:  9
total reward:  7
total reward:  -1
total reward:  8
total reward:  12
total reward:  13
total reward:  2
total reward:  14
total reward:  5
total reward:  11
total reward:  5
total reward:  5
total reward:  14
total reward:  17
total reward:  9
total reward:  8
total reward:  7
total reward:  10
total reward:  6
total reward:  6
total reward:  8
total reward:  11
total reward:  7
total reward:  14
total reward:  14
total reward:  9
total reward:  9
total reward:  17
total reward:  10
total reward:  10
...

total reward:  8
total reward:  15
total reward:  10
total reward:  7
total reward:  10
total reward:  12
total reward:  6
total reward:  3
total reward:  10
total reward:  12
total reward:  8
total reward:  0
total reward:  9
total reward:  11
total reward:  17
total reward:  11
total reward:  11
total reward:  9
total reward:  12
total reward:  19
total reward:  12
total reward:  16
total reward:  6
total reward:  15
total reward:  6
total reward:  9
total reward:  14
total reward:  5
total reward:  8
total reward:  14
total reward:  8
total reward:  7
total reward:  -3
total reward:  9
total reward:  13
total reward:  8
total reward:  8
total reward:  16
total reward:  17
total reward:  9
total reward:  6
total reward:  16
total reward:  5
total reward:  9
total reward:  8
total reward:  12
total reward:  7
total reward:  8
total reward:  6
total reward:  11
total reward:  16
total reward:  17
total reward:  13
total reward:  10
total reward:  8
total reward:  6
total reward:  -4
...

total reward:  19
total reward:  7
total reward:  9
total reward:  13
total reward:  9
total reward:  6
total reward:  7
total reward:  10
total reward:  0
total reward:  9
total reward:  6
total reward:  9
total reward:  11
total reward:  12
total reward:  13
total reward:  13
total reward:  9
total reward:  15
total reward:  5
total reward:  8
total reward:  7
total reward:  12
total reward:  5
total reward:  11
total reward:  10
total reward:  16
total reward:  10
total reward:  9
total reward:  11
total reward:  9
total reward:  8
total reward:  8
total reward:  5
total reward:  8
total reward:  14
total reward:  10
total reward:  10
total reward:  11
total reward:  13
total reward:  8
total reward:  11
total reward:  10
total reward:  9
total reward:  5
total reward:  9
total reward:  8
total reward:  11
total reward:  5
total reward:  4
total reward:  4
total reward:  13
total reward:  8
total reward:  8
total reward:  17
total reward:  9
total reward:  8
total reward:  10
...

total reward:  -2
total reward:  9
total reward:  9
total reward:  -2
total reward:  12
total reward:  11
total reward:  5
total reward:  13
total reward:  9
total reward:  13
total reward:  12
total reward:  14
total reward:  10
total reward:  10
total reward:  19
total reward:  13
total reward:  12
total reward:  15
total reward:  9
total reward:  15
total reward:  6
total reward:  15
total reward:  7
total reward:  15
total reward:  11
total reward:  5
total reward:  8
total reward:  11
total reward:  10
total reward:  12
total reward:  12
total reward:  10
total reward:  10
total reward:  10
total reward:  11
total reward:  6
total reward:  7
total reward:  11
total reward:  12
total reward:  13
total reward:  13
total reward:  9
total reward:  5
total reward:  7
total reward:  14
total reward:  8
total reward:  6
total reward:  6
total reward:  8
total reward:  8
total reward:  17
total reward:  8
total reward:  -2
total reward:  7
total reward:  5
total reward:  13
total reward:  8
total reward:  13
total reward:  8
total reward:  9
total reward:  9
total reward:  8
total reward:  7
total reward:  9
total reward:  10
total reward:  10
total reward:  9
total reward:  16
total reward:  16
total reward:  11
total reward:  5
total reward:  13
total reward:  6
total reward:  8
total reward:  7
total reward:  13
total reward:  10
total reward:  8
total reward:  18
total reward:  14
total reward:  9
total reward:  14
total reward:  8
total reward:  5
total reward:  8
total reward:  7
total reward:  10
total reward:  13
total reward:  7
total reward:  13
total reward:  17
total reward:  12
total reward:  8
total reward:  13
total reward:  13
total reward:  11
total reward:  10
total reward:  7
total reward:  7
total reward:  11
total reward:  10
total reward:  9
total reward:  5
total reward:  11
total reward:  13
total reward:  0
total reward:  10
total reward:  9
total reward:  13
total reward:  7
total reward:  7
total reward:  -15
total reward:  9
total reward:  9
total reward:  10
total reward:  4
total reward:  6
total reward:  6
total reward:  17
total reward:  3
total reward:  15
total reward:  9
total reward:  16
total reward:  6
total reward:  9
total reward:  7
total reward:  11
total reward:  8
total reward:  9
total reward:  11
total reward:  5
total reward:  16
total reward:  9
total reward:  6
total reward:  15
total reward:  14
total reward:  14
total reward:  11
total reward:  10
total reward:  18
total reward:  8
total reward:  9
total reward:  5
total reward:  18
total reward:  7
total reward:  14
total reward:  9
total reward:  8
total reward:  8
total reward:  6
total reward:  12
total reward:  14
total reward:  4
total reward:  8
total reward:  8
total reward:  10
total reward:  11
total reward:  5
total reward:  12
total reward:  5
total reward:  -10
total reward:  3
total reward:  14
total reward:  10
total reward:  19
total reward:  8
total reward:  12
total reward:  13
total reward:  7
total reward:  9
total reward:  7
total reward:  6
total reward:  14
total reward:  5
total reward:  11
total reward:  7
total reward:  10
total reward:  10
total reward:  18
total reward:  17
total reward:  14
total reward:  4
total reward:  -2
total reward:  11
total reward:  5
total reward:  7
total reward:  9
total reward:  11
total reward:  7
total reward:  9
total reward:  16
total reward:  10
total reward:  9
total reward:  17
total reward:  18
total reward:  10
total reward:  6
total reward:  4
total reward:  9
total reward:  18
total reward:  8
total reward:  6
total reward:  7
total reward:  11
total reward:  13
total reward:  11
total reward:  11
total reward:  10
total reward:  -2
total reward:  -3
total reward:  8
total reward:  15
total reward:  10
total reward:  5
total reward:  10
total reward:  14
total reward:  12
total reward:  8
total reward:  5
total reward:  10
total reward:  10
total reward:  6
total reward:  12
total reward:  7
total reward:  -3
total reward:  11
total reward:  18
total reward:  14
total reward:  18
total reward:  8
total reward:  13
total reward:  8
total reward:  18
total reward:  10
total reward:  10
total reward:  13
total reward:  5
total reward:  15
total reward:  8
total reward:  8
total reward:  9
total reward:  8
total reward:  11
total reward:  13
total reward:  9
total reward:  9
total reward:  13
total reward:  9
total reward:  8
total reward:  10
total reward:  14
total reward:  8
total reward:  6
total reward:  6
total reward:  12
total reward:  8
total reward:  9
total reward:  11
total reward:  7
total reward:  9
total reward:  11
total reward:  7
total reward:  11
total reward:  15
total reward:  5
total reward:  13
total reward:  5
total reward:  7
total reward:  3
total reward:  11
total reward:  10
total reward:  8
total reward:  14
total reward:  12
total reward:  4
total reward:  6
total reward:  9
total reward:  12
total reward:  11
total reward:  -2
total reward:  16
total reward:  15
total reward:  18
total reward:  10
total reward:  10
total reward:  10
total reward:  13
total reward:  6
total reward:  17
total reward:  14
total reward:  9
total reward:  7
total reward:  10
total reward:  14
total reward:  8
total reward:  8
total reward:  8
total reward:  8
total reward:  14
total reward:  12
total reward:  9
total reward:  13
total reward:  18
total reward:  13
total reward:  11
total reward:  8
total reward:  11
total reward:  7
total reward:  15
total reward:  9
total reward:  -1
total reward:  8
total reward:  9
total reward:  12
total reward:  12
total reward:  7
total reward:  7
total reward:  10
total reward:  10
total reward:  11
total reward:  11
total reward:  10
total reward:  6
total reward:  5
total reward:  7
total reward:  7
total reward:  5
total reward:  15
total reward:  12
total reward:  8
total reward:  7
total reward:  6
total reward:  10
total reward:  10
total reward:  12
total reward:  6
total reward:  10
total reward:  9
total reward:  9
total reward:  8
total reward:  11
total reward:  11
total reward:  15
total reward:  13
total reward:  6
total reward:  -5
total reward:  14
total reward:  7
total reward:  8
total reward:  11
total reward:  7
total reward:  10
total reward:  -1
total reward:  5
total reward:  9
total reward:  11
total reward:  4
total reward:  5
total reward:  4
total reward:  11
total reward:  10
total reward:  10
total reward:  10
total reward:  9
total reward:  7
total reward:  15
total reward:  8
total reward:  11
total reward:  8
total reward:  9
total reward:  6
total reward:  5
total reward:  14
total reward:  12
total reward:  9
total reward:  13
total reward:  13
total reward:  11
total reward:  7
total reward:  10
total reward:  8
total reward:  15
total reward:  7
total reward:  12
total reward:  10
total reward:  9
total reward:  12
total reward:  9
total reward:  15
total reward:  12
total reward:  10
total reward:  8
total reward:  7
total reward:  15
total reward:  7
total reward:  8
total reward:  7
total reward:  7
total reward:  7
total reward:  18
total reward:  11
total reward:  8
total reward:  8
total reward:  9
total reward:  12
total reward:  15
total reward:  10
total reward:  12
total reward:  11
total reward:  5
total reward:  17
total reward:  5
total reward:  14
total reward:  12
total reward:  6
total reward:  5
total reward:  7
total reward:  5
total reward:  13
total reward:  7
total reward:  7
total reward:  9
total reward:  13
total reward:  18
total reward:  6
total reward:  8
total reward:  14
total reward:  12
total reward:  5
total reward:  10
total reward:  9
total reward:  16
total reward:  6
total reward:  -1
total reward:  5
total reward:  12
total reward:  7
total reward:  8
total reward:  9
total reward:  14
total reward:  12
total reward:  6
total reward:  11
total reward:  -6
total reward:  8
total reward:  5
total reward:  17
total reward:  15
total reward:  -1
total reward:  6
total reward:  16
total reward:  8
total reward:  5
total reward:  13
total reward:  8
total reward:  8
total reward:  7
total reward:  8
total reward:  10
total reward:  8
total reward:  15
total reward:  10
total reward:  8
total reward:  12
total reward:  8
total reward:  12
total reward:  15
total reward:  6
total reward:  11
total reward:  10
total reward:  7
total reward:  11
total reward:  14
total reward:  5
total reward:  14
total reward:  9
total reward:  9
total reward:  -1
total reward:  16
total reward:  12
total reward:  6
total reward:  7
total reward:  3
total reward:  6
total reward:  4
total reward:  9
total reward:  13
total reward:  8
total reward:  9
total reward:  10
total reward:  15
total reward:  6
total reward:  8
total reward:  15
total reward:  17
total reward:  15
total reward:  6
total reward:  9
total reward:  3
total reward:  11
total reward:  13
total reward:  7
total reward:  14
total reward:  7
total reward:  14
total reward:  18
total reward:  9
total reward:  11
total reward:  7
total reward:  14
total reward:  6
total reward:  7
total reward:  7
total reward:  10
total reward:  5
total reward:  7
total reward:  11
total reward:  9
total reward:  13
total reward:  7
total reward:  8
total reward:  6
total reward:  13
total reward:  6
total reward:  14
total reward:  10
total reward:  16
total reward:  13
total reward:  11
total reward:  10
total reward:  10
total reward:  12
total reward:  15
total reward:  14
total reward:  13
total reward:  15
total reward:  8
total reward:  8
total reward:  13
total reward:  14
total reward:  14
total reward:  8
total reward:  14
total reward:  9
total reward:  9
total reward:  13
total reward:  2
total reward:  15
total reward:  -6
total reward:  8
total reward:  14
total reward:  15
total reward:  9
total reward:  7
total reward:  8
total reward:  10
total reward:  9
total reward:  7
total reward:  6
total reward:  4
total reward:  8
total reward:  11
total reward:  11
total reward:  15
total reward:  9
total reward:  5
total reward:  -3
total reward:  7
total reward:  6
total reward:  5
total reward:  10
total reward:  9
total reward:  13
total reward:  14
total reward:  13
total reward:  9
total reward:  13
total reward:  13
total reward:  5
total reward:  14
total reward:  7
total reward:  9
total reward:  14
total reward:  3
total reward:  12
total reward:  9
total reward:  12
total reward:  8
total reward:  -6
total reward:  3
total reward:  -3
total reward:  15
total reward:  11
total reward:  13
total reward:  -5
total reward:  7
total reward:  -4
total reward:  13
total reward:  10
total reward:  9
total reward:  11
total reward:  0
total reward:  12
total reward:  19
total reward:  12
total reward:  13
total reward:  8
total reward:  11
total reward:  8
total reward:  5
total reward:  7
total reward:  7
total reward:  6
total reward:  9
total reward:  11
total reward:  9
total reward:  12
total reward:  11
total reward:  4
total reward:  11
total reward:  11
total reward:  10
total reward:  7
total reward:  8
total reward:  6
total reward:  13
total reward:  7
total reward:  -4
total reward:  8
total reward:  12
total reward:  11
total reward:  4
total reward:  15
total reward:  8
total reward:  18
total reward:  12
total reward:  9
total reward:  12
total reward:  7
total reward:  6
total reward:  13
total reward:  7
total reward:  9
total reward:  14
total reward:  5
total reward:  15
total reward:  7
total reward:  17
total reward:  11
total reward:  8
total reward:  8
total reward:  8
total reward:  11
total reward:  8
total reward:  10
total reward:  8
total reward:  -3
total reward:  7
total reward:  15
total reward:  7
total reward:  13
total reward:  5
total reward:  6
total reward:  9
total reward:  10
total reward:  12
total reward:  5
total reward:  4
total reward:  13
total reward:  14
total reward:  13
total reward:  12
total reward:  13
total reward:  6
total reward:  8
total reward:  13
total reward:  7
total reward:  9
total reward:  7
total reward:  12
total reward:  13
total reward:  7
total reward:  7
total reward:  9
total reward:  8
total reward:  8
total reward:  -3
total reward:  12
total reward:  8
total reward:  8
total reward:  6
total reward:  15
total reward:  6
total reward:  7
total reward:  14
total reward:  7
total reward:  10
total reward:  5
total reward:  8
total reward:  12
total reward:  10
total reward:  9
total reward:  7
total reward:  8
total reward:  13
total reward:  14
total reward:  8
total reward:  -2
total reward:  10
total reward:  10
total reward:  12
total reward:  11
total reward:  -3
total reward:  11
total reward:  14
total reward:  12
total reward:  9
total reward:  16
total reward:  8
total reward:  11
total reward:  15
total reward:  13
total reward:  6
total reward:  5
total reward:  15
total reward:  1
total reward:  7
total reward:  14
total reward:  15
total reward:  15
total reward:  7
total reward:  6
total reward:  13
total reward:  14
total reward:  7
total reward:  8
total reward:  -3
total reward:  5
total reward:  10
total reward:  6
total reward:  18
total reward:  13
total reward:  -6
total reward:  8
total reward:  7
total reward:  11
total reward:  16
total reward:  13
total reward:  15
total reward:  16
total reward:  10
total reward:  -3
total reward:  11
total reward:  7
total reward:  5
total reward:  14
total reward:  9
total reward:  -2
total reward:  17
total reward:  5
total reward:  15
total reward:  15
total reward:  2
total reward:  7
total reward:  7
total reward:  7
total reward:  9
total reward:  11
total reward:  9
total reward:  7
total reward:  14
total reward:  12
total reward:  9
total reward:  10
total reward:  11
total reward:  14
total reward:  7
total reward:  4
total reward:  8
total reward:  10
total reward:  8
total reward:  6
total reward:  13
total reward:  6
total reward:  7
total reward:  15
total reward:  10
total reward:  15
total reward:  9
total reward:  13
total reward:  11
total reward:  9
total reward:  7
total reward:  10
total reward:  8
total reward:  12
total reward:  6
total reward:  10
total reward:  14
total reward:  9
total reward:  18
total reward:  15
total reward:  17
total reward:  9
total reward:  13
total reward:  9
total reward:  15
total reward:  17
total reward:  3
total reward:  9
total reward:  17
total reward:  10
total reward:  12
total reward:  10
total reward:  6
total reward:  13
total reward:  7
total reward:  10
total reward:  5
total reward:  11
total reward:  7
total reward:  5
total reward:  8
total reward:  6
total reward:  13
total reward:  14
total reward:  16
total reward:  7
total reward:  7
total reward:  9
total reward:  6
total reward:  7
total reward:  12
total reward:  15
total reward:  6
total reward:  6
total reward:  5
total reward:  7
total reward:  5
total reward:  17
total reward:  13
total reward:  10
total reward:  -2
total reward:  12
total reward:  11
total reward:  10
total reward:  16
total reward:  7
total reward:  7
total reward:  7
total reward:  7
total reward:  9
total reward:  6
total reward:  15
total reward:  11
total reward:  13
total reward:  11
total reward:  13
total reward:  18
total reward:  16
total reward:  9
total reward:  10
total reward:  15
total reward:  9
total reward:  4
total reward:  11
total reward:  7
total reward:  4
total reward:  18
total reward:  10
total reward:  16
total reward:  8
total reward:  10
total reward:  14
total reward:  9
total reward:  8
total reward:  18
total reward:  9
total reward:  14
total reward:  8
total reward:  9
total reward:  14
total reward:  9
total reward:  7
total reward:  8
total reward:  12
total reward:  12
total reward:  9
total reward:  4
total reward:  9
total reward:  12
total reward:  9
total reward:  11
total reward:  8
total reward:  11
total reward:  10
total reward:  19
total reward:  3
total reward:  8
total reward:  6
total reward:  11
total reward:  11
total reward:  11
total reward:  -3
total reward:  13
total reward:  9
total reward:  -3
total reward:  6
total reward:  16
total reward:  1
total reward:  11
total reward:  14
total reward:  10
total reward:  6
total reward:  9
total reward:  15
total reward:  9
total reward:  8
total reward:  7
total reward:  2
total reward:  13
total reward:  7
total reward:  11
total reward:  10
total reward:  10
total reward:  10
total reward:  18
total reward:  11
total reward:  6
total reward:  15
total reward:  16
total reward:  16
total reward:  11
total reward:  6
total reward:  8
total reward:  9
total reward:  7
total reward:  14
total reward:  15
total reward:  10
total reward:  6
total reward:  7
total reward:  14
total reward:  17
total reward:  6
total reward:  11
total reward:  12
total reward:  11
total reward:  7
total reward:  9
total reward:  7
total reward:  14
total reward:  6
total reward:  9
total reward:  7
total reward:  7
total reward:  9
total reward:  14
total reward:  4
total reward:  9
total reward:  10
total reward:  7
total reward:  10
total reward:  16
total reward:  8
total reward:  9
total reward:  6
total reward:  11
total reward:  14
total reward:  10
total reward:  12
total reward:  4
total reward:  10
total reward:  5
total reward:  8
total reward:  5
total reward:  8
total reward:  9
total reward:  6
total reward:  9
total reward:  7
total reward:  7
total reward:  -3
total reward:  8
total reward:  12
total reward:  7
total reward:  10
total reward:  10
total reward:  5
total reward:  6
total reward:  12
total reward:  12
total reward:  8
total reward:  12
total reward:  13
total reward:  8
total reward:  14
total reward:  -1
total reward:  3
total reward:  9
total reward:  11
total reward:  6
total reward:  3
total reward:  7
total reward:  11
total reward:  7
total reward:  3
total reward:  13
total reward:  10
total reward:  13
total reward:  16
total reward:  16
total reward:  8
total reward:  4
total reward:  15
total reward:  -2
total reward:  7
total reward:  6
total reward:  5
total reward:  12
total reward:  14
total reward:  8
total reward:  7
total reward:  5
total reward:  8
total reward:  8
total reward:  5
total reward:  13
total reward:  2
total reward:  9
total reward:  9
total reward:  10
total reward:  15
total reward:  9
total reward:  13
total reward:  7
total reward:  10
total reward:  8
total reward:  7
total reward:  16
total reward:  12
total reward:  12
total reward:  18
total reward:  9
total reward:  6
total reward:  10
total reward:  14
total reward:  13
total reward:  12
total reward:  7
total reward:  13
total reward:  8
total reward:  11
total reward:  5
total reward:  13
total reward:  13
total reward:  10
total reward:  11
total reward:  7
total reward:  16
total reward:  7
total reward:  8
total reward:  17
total reward:  6
total reward:  6
total reward:  9
total reward:  7
total reward:  7
total reward:  12
total reward:  5
total reward:  11
total reward:  9
total reward:  11
total reward:  10
total reward:  12
total reward:  7
total reward:  7
total reward:  8
total reward:  12
total reward:  8
total reward:  11
total reward:  8
total reward:  9
total reward:  8
total reward:  12
total reward:  9
total reward:  7
total reward:  11
total reward:  7
total reward:  13
total reward:  9
total reward:  7
total reward:  8
total reward:  8
total reward:  8
total reward:  7
total reward:  10
total reward:  9
total reward:  11
total reward:  5
total reward:  11
total reward:  12
total reward:  -1
total reward:  3
total reward:  4
total reward:  7
total reward:  17
total reward:  8
total reward:  8
total reward:  8
total reward:  5
total reward:  14
total reward:  7
total reward:  9
total reward:  5
total reward:  10
total reward:  6
total reward:  7
total reward:  16
total reward:  7
total reward:  8
total reward:  15
total reward:  8
total reward:  15
total reward:  13
total reward:  15
total reward:  6
total reward:  6
total reward:  10
total reward:  7
total reward:  19
total reward:  11
total reward:  14
total reward:  7
total reward:  9
total reward:  9
total reward:  11
total reward:  10
total reward:  10
total reward:  5
total reward:  14
total reward:  9
total reward:  6
total reward:  6
total reward:  8
total reward:  6
total reward:  14
total reward:  15
total reward:  8
total reward:  13
total reward:  6
total reward:  16
total reward:  14
total reward:  2
total reward:  13
total reward:  11
total reward:  8
total reward:  11
total reward:  13
total reward:  11
total reward:  -2
total reward:  6
total reward:  18
total reward:  13
total reward:  5
total reward:  6
total reward:  12
total reward:  17
total reward:  8
total reward:  9
total reward:  13
total reward:  17
total reward:  15
total reward:  7
total reward:  12
total reward:  13
total reward:  9
total reward:  14
total reward:  10
total reward:  9
total reward:  -1
total reward:  12
total reward:  13
total reward:  8
total reward:  9
total reward:  12
total reward:  5
total reward:  9
total reward:  11
total reward:  13
total reward:  6
total reward:  8
total reward:  5
total reward:  6
total reward:  9
total reward:  14
total reward:  11
total reward:  16
total reward:  10
total reward:  5
total reward:  15
total reward:  -2
total reward:  13
total reward:  9
total reward:  9
total reward:  -3
total reward:  10
total reward:  10
total reward:  7
total reward:  7
total reward:  12
total reward:  -3
total reward:  7
total reward:  12
total reward:  5
total reward:  13
total reward:  13
total reward:  6
total reward:  9
total reward:  7
total reward:  8
total reward:  7
total reward:  5
total reward:  17
total reward:  5
total reward:  11
total reward:  10
total reward:  9
total reward:  5
total reward:  7
total reward:  -1
total reward:  11
total reward:  6
total reward:  7
total reward:  9
total reward:  7
total reward:  5
total reward:  10
total reward:  6
total reward:  15
total reward:  4
total reward:  13
total reward:  9
total reward:  7
total reward:  7
total reward:  16
total reward:  18
total reward:  7
total reward:  7
total reward:  15
total reward:  9
total reward:  18
total reward:  11
total reward:  12
total reward:  9
total reward:  19
total reward:  10
total reward:  8
total reward:  10
total reward:  6
total reward:  11
total reward:  9
total reward:  16
total reward:  6
total reward:  6
total reward:  11
total reward:  6
total reward:  10
total reward:  8
total reward:  6
total reward:  10
total reward:  3
total reward:  0
total reward:  5
total reward:  16
total reward:  15
total reward:  8
total reward:  13
total reward:  14
total reward:  5
total reward:  12
total reward:  13
total reward:  5
total reward:  5
total reward:  7
total reward:  10
total reward:  10
total reward:  8
total reward:  5
total reward:  7
total reward:  9
total reward:  10
total reward:  8
total reward:  10
total reward:  7
total reward:  8
total reward:  12
total reward:  14
total reward:  9
total reward:  12
total reward:  15
total reward:  8
total reward:  11
total reward:  10
total reward:  9
total reward:  12
total reward:  8
total reward:  9
total reward:  7
total reward:  14
total reward:  13
total reward:  -3
total reward:  12
total reward:  9
total reward:  15
total reward:  14
total reward:  13
total reward:  -3
total reward:  14
total reward:  13
total reward:  4
total reward:  8
total reward:  2
total reward:  8
total reward:  -2
total reward:  7
total reward:  10
total reward:  7
total reward:  8
total reward:  13
total reward:  10
total reward:  13
total reward:  -2
total reward:  7
total reward:  6
total reward:  8
total reward:  12
total reward:  14
total reward:  8
total reward:  16
total reward:  9
total reward:  7
total reward:  13
total reward:  11
total reward:  18
total reward:  10
total reward:  8
total reward:  7
total reward:  15
total reward:  9
total reward:  14
total reward:  13
total reward:  12
total reward:  7
total reward:  9
total reward:  15
total reward:  8
total reward:  7
total reward:  9
total reward:  7
total reward:  12
total reward:  7
total reward:  1
total reward:  10
total reward:  13
total reward:  11
total reward:  12
total reward:  17
total reward:  14
total reward:  9
total reward:  1
total reward:  6
total reward:  6
total reward:  8
total reward:  3
total reward:  9
total reward:  13
total reward:  7
total reward:  5
total reward:  8
total reward:  14
total reward:  7
total reward:  16
total reward:  6
total reward:  7
total reward:  14
total reward:  7
total reward:  13
total reward:  18
total reward:  16
total reward:  19
total reward:  4
total reward:  12
total reward:  6
total reward:  17
total reward:  13
total reward:  16
total reward:  14
total reward:  11
total reward:  12
total reward:  13
total reward:  11
total reward:  6
total reward:  8
total reward:  14
total reward:  8
total reward:  15
total reward:  10
total reward:  10
total reward:  7
total reward:  9
total reward:  -5
total reward:  13
total reward:  6
total reward:  12
total reward:  8
total reward:  13
total reward:  8
total reward:  12
total reward:  -3
total reward:  7
total reward:  12
total reward:  9
total reward:  9
total reward:  -5
total reward:  5
total reward:  9
total reward:  7
total reward:  11
total reward:  9
total reward:  9
total reward:  7
total reward:  9
total reward:  8
total reward:  13
total reward:  11
total reward:  12
total reward:  7
total reward:  14
total reward:  11
total reward:  5
total reward:  6
total reward:  7
total reward:  13
total reward:  11
total reward:  16
total reward:  9
total reward:  15
total reward:  11
total reward:  12
total reward:  6
total reward:  11
total reward:  5
total reward:  8
total reward:  9
total reward:  11
total reward:  9
total reward:  16
total reward:  12
total reward:  -5
total reward:  11
total reward:  10
total reward:  6
total reward:  11
total reward:  11
total reward:  1
total reward:  10
total reward:  14
total reward:  7
total reward:  -4
total reward:  16
total reward:  11
total reward:  0
total reward:  7
total reward:  7
total reward:  9
total reward:  13
total reward:  4
total reward:  11
total reward:  10
total reward:  14
total reward:  7
total reward:  -3
total reward:  10
total reward:  6
total reward:  11
total reward:  10
total reward:  8
total reward:  5
total reward:  12
total reward:  10
total reward:  7
total reward:  9
total reward:  9
total reward:  15
total reward:  -4
total reward:  10
total reward:  13
total reward:  6
total reward:  8
total reward:  7
total reward:  9
total reward:  17
total reward:  8
total reward:  9
total reward:  17
total reward:  12
total reward:  8
total reward:  8
total reward:  12
total reward:  13
total reward:  10
total reward:  7
total reward:  11
total reward:  12
total reward:  14
total reward:  13
total reward:  7
total reward:  6
total reward:  7
total reward:  1
total reward:  9
total reward:  9
total reward:  8
total reward:  8
total reward:  12
total reward:  7
total reward:  12
total reward:  13
total reward:  8
total reward:  6
total reward:  8
total reward:  7
total reward:  12
total reward:  5
total reward:  -2
total reward:  0
total reward:  6
total reward:  7
total reward:  11
total reward:  16
total reward:  13
total reward:  9
total reward:  13
total reward:  -4
total reward:  7
total reward:  10
total reward:  11
total reward:  14
total reward:  6
total reward:  9
total reward:  13
total reward:  15
total reward:  13
total reward:  8
total reward:  13
total reward:  14
total reward:  11
total reward:  9
total reward:  7
total reward:  14
total reward:  17
total reward:  10
total reward:  10
total reward:  8
total reward:  12
total reward:  14
total reward:  1
total reward:  3
total reward:  10
total reward:  13
total reward:  18
total reward:  13
total reward:  2
total reward:  10
total reward:  5
total reward:  9
total reward:  9
total reward:  7
total reward:  7
total reward:  10
total reward:  10
total reward:  9
total reward:  12
total reward:  8
total reward:  4
total reward:  13
total reward:  14
total reward:  14
total reward:  10
total reward:  9
total reward:  11
total reward:  8
total reward:  4
total reward:  17
total reward:  12
total reward:  10
total reward:  4
total reward:  15
total reward:  5
total reward:  13
total reward:  6
total reward:  8
total reward:  7
total reward:  8
total reward:  7
total reward:  7
total reward:  7
total reward:  4
total reward:  14
total reward:  13
total reward:  14
total reward:  9
total reward:  8
total reward:  11
total reward:  5
total reward:  12
total reward:  9
total reward:  -3
total reward:  9
total reward:  8
total reward:  7
total reward:  12
total reward:  10
total reward:  9
total reward:  8
total reward:  8
total reward:  17
total reward:  14
total reward:  18
total reward:  12
total reward:  14
total reward:  14
total reward:  8
total reward:  6
total reward:  8
total reward:  14
total reward:  8
total reward:  15
total reward:  11
total reward:  10
total reward:  8
total reward:  10
total reward:  16
total reward:  12
total reward:  9
total reward:  11
total reward:  11
total reward:  -1
total reward:  11
total reward:  6
total reward:  5
total reward:  13
total reward:  8
total reward:  -2
total reward:  7
total reward:  8
total reward:  8
total reward:  4
total reward:  9
total reward:  12
total reward:  5
total reward:  9
total reward:  15
total reward:  17
total reward:  12
total reward:  12
total reward:  14
total reward:  11
total reward:  15
total reward:  7
total reward:  10
total reward:  9
total reward:  5
total reward:  7
total reward:  7
total reward:  11
total reward:  7
total reward:  15
total reward:  11
total reward:  -5
total reward:  11
total reward:  12
total reward:  1
total reward:  10
total reward:  13
total reward:  12
total reward:  12
total reward:  11
total reward:  14
total reward:  9
total reward:  13
total reward:  8
total reward:  14
total reward:  9
total reward:  9
total reward:  7
total reward:  16
total reward:  12
total reward:  18
total reward:  7
total reward:  10
total reward:  11
total reward:  6
total reward:  10
total reward:  18
total reward:  1
total reward:  17
total reward:  17
total reward:  4
total reward:  8
total reward:  7
total reward:  16
total reward:  11
total reward:  12
total reward:  6
total reward:  12
total reward:  7
total reward:  5
total reward:  8
total reward:  10
total reward:  14
total reward:  12
total reward:  10
total reward:  13
total reward:  8
total reward:  6
total reward:  11
total reward:  11
total reward:  7
total reward:  15
total reward:  7
total reward:  7
total reward:  11
total reward:  12
total reward:  14
total reward:  5
total reward:  10
total reward:  16
total reward:  11
total reward:  7
total reward:  11
total reward:  14
total reward:  14
total reward:  11
total reward:  13
total reward:  17
total reward:  6
total reward:  8
total reward:  7
total reward:  8
total reward:  4
total reward:  8
total reward:  10
total reward:  7
total reward:  10
total reward:  10
total reward:  8
total reward:  13
total reward:  10
total reward:  15
total reward:  16
total reward:  6
total reward:  10
total reward:  18
total reward:  3
total reward:  14
total reward:  5
total reward:  9
total reward:  6
total reward:  5
total reward:  9
total reward:  0
total reward:  12
total reward:  9
total reward:  14
total reward:  5
total reward:  14
total reward:  5
total reward:  19
total reward:  9
total reward:  11
total reward:  7
total reward:  9
total reward:  13
total reward:  4
total reward:  15
total reward:  9
total reward:  14
total reward:  6
total reward:  13
total reward:  9
total reward:  14
total reward:  15
total reward:  8
total reward:  9
total reward:  8
total reward:  7
total reward:  14
total reward:  16
total reward:  4
total reward:  9
total reward:  10
total reward:  10
total reward:  6
total reward:  14
total reward:  11
total reward:  9
total reward:  13
total reward:  5
total reward:  9
total reward:  5
total reward:  10
total reward:  12
total reward:  15
total reward:  11
total reward:  10
total reward:  8
total reward:  9
total reward:  9
total reward:  9
total reward:  17
total reward:  5
total reward:  13
total reward:  4
total reward:  10
total reward:  9
total reward:  8
total reward:  8
total reward:  7
total reward:  8
total reward:  17
total reward:  12
total reward:  15
total reward:  7
total reward:  9
total reward:  9
total reward:  10
total reward:  9
total reward:  7
total reward:  5
total reward:  -7
total reward:  5
total reward:  5
total reward:  15
total reward:  -2
total reward:  16
total reward:  14
total reward:  9
total reward:  7
total reward:  8
total reward:  14
total reward:  5
total reward:  12
total reward:  10
total reward:  18
total reward:  7
total reward:  8
total reward:  17
total reward:  12
total reward:  11
total reward:  9
total reward:  13
total reward:  16
total reward:  8
total reward:  9
total reward:  -3
total reward:  14
total reward:  18
total reward:  17
total reward:  9
total reward:  6
total reward:  7
total reward:  10
total reward:  12
total reward:  19
total reward:  9
total reward:  15
total reward:  9
total reward:  11
total reward:  15
total reward:  -6
total reward:  10
total reward:  8
total reward:  7
total reward:  14
total reward:  0
total reward:  8
total reward:  9
total reward:  15
total reward:  7
total reward:  13
total reward:  6
total reward:  3
total reward:  7
total reward:  9
total reward:  11
total reward:  17
total reward:  8
total reward:  10
total reward:  14
total reward:  9
total reward:  9
total reward:  6
total reward:  14
total reward:  14
total reward:  15
total reward:  19
total reward:  0
total reward:  10
total reward:  13
total reward:  7
total reward:  10
total reward:  7
total reward:  8
total reward:  10
total reward:  11
total reward:  9
total reward:  14
total reward:  5
total reward:  12
total reward:  10
total reward:  -13
total reward:  12
total reward:  5
total reward:  5
total reward:  12
total reward:  13
total reward:  13
total reward:  6
total reward:  13
total reward:  11
total reward:  8
total reward:  1
total reward:  6
total reward:  -2
total reward:  8
total reward:  16
total reward:  10
total reward:  18
total reward:  8
total reward:  10
total reward:  -1
total reward:  14
total reward:  7
total reward:  7
total reward:  15
total reward:  15
total reward:  14
total reward:  10
total reward:  9
total reward:  9
total reward:  8
total reward:  7
total reward:  10
total reward:  14
total reward:  0
total reward:  12
total reward:  6
total reward:  13
total reward:  7
total reward:  5
total reward:  12
total reward:  4
total reward:  0
total reward:  13
total reward:  12
total reward:  4
total reward:  11
total reward:  11
total reward:  14
total reward:  9
total reward:  8
total reward:  16
total reward:  13
total reward:  16
total reward:  9
total reward:  18
total reward:  8
total reward:  8
total reward:  5
total reward:  7
total reward:  8
total reward:  16
total reward:  13
total reward:  7
total reward:  8
total reward:  8
total reward:  4
total reward:  19
total reward:  10
total reward:  16
total reward:  7
total reward:  6
total reward:  16
total reward:  8
total reward:  5
total reward:  5
total reward:  10
total reward:  15
total reward:  11
total reward:  12
total reward:  13
total reward:  7
total reward:  13
total reward:  11
total reward:  10
total reward:  14
total reward:  10
total reward:  -3
total reward:  -2
total reward:  5
total reward:  5
total reward:  9
total reward:  11
total reward:  13
total reward:  8
total reward:  -9
total reward:  7
total reward:  7
total reward:  12
total reward:  14
total reward:  9
total reward:  14
total reward:  14
total reward:  15
total reward:  12
total reward:  8
total reward:  12
total reward:  13
total reward:  10
total reward:  11
total reward:  5
total reward:  18
total reward:  11
total reward:  14
total reward:  10
total reward:  5
total reward:  7
total reward:  11
total reward:  9
total reward:  -2
total reward:  7
total reward:  12
total reward:  11
total reward:  11
total reward:  7
total reward:  17
total reward:  16
total reward:  9
total reward:  7
total reward:  6
total reward:  -5
total reward:  7
total reward:  11
total reward:  16
total reward:  -1
total reward:  8
total reward:  9
total reward:  8
total reward:  11
total reward:  11
total reward:  7
total reward:  7
total reward:  14
total reward:  12
total reward:  13
total reward:  5
total reward:  14
total reward:  13
total reward:  13
total reward:  4
total reward:  8
total reward:  5
total reward:  7
total reward:  15
total reward:  7
total reward:  12
total reward:  10
total reward:  11
total reward:  4
total reward:  11
total reward:  5
total reward:  14
total reward:  11
total reward:  8
total reward:  13
total reward:  7
total reward:  9
total reward:  6
total reward:  13
total reward:  8
total reward:  13
total reward:  3
total reward:  12
total reward:  15
total reward:  10
total reward:  14
total reward:  9
total reward:  6
total reward:  9
total reward:  9
total reward:  5
total reward:  13
total reward:  12
total reward:  13
total reward:  16
total reward:  7
total reward:  18
total reward:  10
total reward:  11
total reward:  8
total reward:  13
total reward:  13
total reward:  5
total reward:  10
total reward:  4
total reward:  14
total reward:  8
total reward:  7
total reward:  14
total reward:  7
total reward:  8
total reward:  12
total reward:  11
total reward:  5
total reward:  9
total reward:  11
total reward:  17
total reward:  10
total reward:  13
total reward:  9
total reward:  13
total reward:  10
total reward:  13
total reward:  8
total reward:  10
total reward:  13
total reward:  11
total reward:  4
total reward:  10
total reward:  -5
total reward:  9
total reward:  6
total reward:  11
total reward:  7
total reward:  10
total reward:  9
total reward:  6
total reward:  11
total reward:  14
total reward:  8
total reward:  5
total reward:  5
total reward:  12
total reward:  4
total reward:  -2
total reward:  5
total reward:  6
total reward:  12
total reward:  7
total reward:  11
total reward:  -4
total reward:  10
total reward:  8
total reward:  8
total reward:  9
total reward:  17
total reward:  7
total reward:  17
total reward:  17
total reward:  6
total reward:  10
total reward:  11
total reward:  12
total reward:  5
total reward:  4
total reward:  5
total reward:  8
total reward:  9
total reward:  6
total reward:  4
total reward:  12
total reward:  15
total reward:  9
total reward:  7
total reward:  14
total reward:  8
total reward:  8
total reward:  10
total reward:  15
total reward:  14
total reward:  -2
total reward:  0
total reward:  7
total reward:  11
total reward:  5
total reward:  12
total reward:  -1
total reward:  11
total reward:  10
total reward:  4
total reward:  19
total reward:  11
total reward:  13
total reward:  9
total reward:  4
total reward:  18
total reward:  7
total reward:  15
total reward:  11
total reward:  7
total reward:  19
total reward:  8
total reward:  12
total reward:  7
total reward:  8
total reward:  9
total reward:  10
total reward:  8
total reward:  0
total reward:  11
total reward:  15
total reward:  7
total reward:  9
total reward:  1
total reward:  16
total reward:  -3
total reward:  6
total reward:  16
total reward:  8
total reward:  11
total reward:  15
total reward:  -4
total reward:  6
total reward:  13
total reward:  11
total reward:  8
total reward:  13
total reward:  12
total reward:  11
total reward:  12
total reward:  8
total reward:  13
total reward:  10
total reward:  13
total reward:  12
total reward:  17
total reward:  8
total reward:  8
total reward:  -1
total reward:  2
total reward:  15
total reward:  9
total reward:  -3
total reward:  8
total reward:  11
total reward:  4
total reward:  4
total reward:  9
total reward:  10
total reward:  9
total reward:  9
total reward:  10
total reward:  17
total reward:  17
total reward:  6
total reward:  6
total reward:  7
total reward:  6
total reward:  -2
total reward:  8
total reward:  9
total reward:  12
total reward:  8
total reward:  18
total reward:  13
total reward:  19
total reward:  9
total reward:  12
total reward:  8
total reward:  10
total reward:  12
total reward:  8
total reward:  12
total reward:  5
total reward:  7
total reward:  12
total reward:  15
total reward:  15
total reward:  10
total reward:  6
total reward:  14
total reward:  7
total reward:  6
total reward:  12
total reward:  5
total reward:  8
total reward:  10
total reward:  18
total reward:  11
total reward:  18
total reward:  11
total reward:  12
total reward:  13
total reward:  11
total reward:  12
total reward:  7
total reward:  9
total reward:  19
total reward:  12
total reward:  3
total reward:  7
total reward:  10
total reward:  6
total reward:  13
total reward:  10
total reward:  7
total reward:  9
total reward:  12
total reward:  13
total reward:  5
total reward:  5
total reward:  8
total reward:  12
total reward:  13
total reward:  12
total reward:  14
total reward:  13
total reward:  6
total reward:  5
total reward:  12
total reward:  13
total reward:  13
total reward:  18
total reward:  8
total reward:  8
total reward:  13
total reward:  14
total reward:  11
total reward:  8
total reward:  6
total reward:  5
total reward:  9
total reward:  9
total reward:  11
total reward:  16
total reward:  12
total reward:  9
total reward:  4
total reward:  8
total reward:  -3
total reward:  15
total reward:  10
total reward:  8
total reward:  12
total reward:  5
total reward:  7
total reward:  8
total reward:  7
total reward:  13
total reward:  12
total reward:  12
total reward:  8
total reward:  9
total reward:  5
total reward:  15
total reward:  17
total reward:  7
total reward:  13
total reward:  10
total reward:  12
total reward:  11
total reward:  10
total reward:  7
total reward:  12
total reward:  15
total reward:  10
total reward:  6
total reward:  14
total reward:  15
total reward:  12
total reward:  9
total reward:  15
total reward:  10
total reward:  16
total reward:  -2
total reward:  13
total reward:  11
total reward:  8
total reward:  6
total reward:  14
total reward:  -3
total reward:  5
total reward:  18
total reward:  4
total reward:  6
total reward:  4
total reward:  17
total reward:  10
total reward:  12
total reward:  16
total reward:  7
total reward:  -3
total reward:  9
total reward:  12
total reward:  5
total reward:  7
total reward:  9
total reward:  9
total reward:  10
total reward:  5
total reward:  8
total reward:  16
total reward:  -4
total reward:  13
total reward:  13
total reward:  15
total reward:  9
total reward:  14
total reward:  8
total reward:  12
total reward:  7
total reward:  11
total reward:  9
total reward:  16
total reward:  4
total reward:  9
total reward:  7
total reward:  11
total reward:  7
total reward:  -16
total reward:  10
total reward:  5
total reward:  14
total reward:  8
total reward:  5
total reward:  8
total reward:  8
total reward:  8
total reward:  9
total reward:  6
total reward:  10
total reward:  -1
total reward:  9
total reward:  4
total reward:  13
total reward:  13
total reward:  3
total reward:  8
total reward:  11
total reward:  7
total reward:  13
total reward:  11
total reward:  9
total reward:  6
total reward:  15
total reward:  14
total reward:  7
total reward:  12
total reward:  8
total reward:  5
total reward:  10
total reward:  8
total reward:  12
total reward:  9
total reward:  12
total reward:  12
total reward:  13
total reward:  9
total reward:  9
total reward:  8
total reward:  9
total reward:  5
total reward:  14
total reward:  7
total reward:  14
total reward:  18
total reward:  14
total reward:  14
total reward:  16
total reward:  7
total reward:  9
total reward:  -3
total reward:  12
total reward:  6
total reward:  9
total reward:  10
total reward:  9
total reward:  17
total reward:  12
total reward:  8
total reward:  13
total reward:  12
total reward:  -2
total reward:  11
total reward:  8
total reward:  15
total reward:  8
total reward:  5
total reward:  10
total reward:  19
total reward:  7
total reward:  7
total reward:  11
total reward:  7
total reward:  -6
total reward:  8
total reward:  14
total reward:  13
total reward:  4
total reward:  3
total reward:  15
total reward:  7
total reward:  16
total reward:  10
total reward:  7
total reward:  7
total reward:  4
total reward:  12
total reward:  15
total reward:  12
total reward:  9
total reward:  7
total reward:  8
total reward:  14
total reward:  12
total reward:  7
total reward:  8
total reward:  14
total reward:  8
total reward:  8
total reward:  15
total reward:  9
total reward:  8
total reward:  8
total reward:  6
total reward:  4
total reward:  9
total reward:  9
total reward:  17
total reward:  10
total reward:  12
total reward:  8
total reward:  14
total reward:  9
total reward:  10
total reward:  9
total reward:  10
total reward:  8
total reward:  10
total reward:  15
total reward:  15
total reward:  8
total reward:  11
total reward:  4
total reward:  6
total reward:  8
total reward:  14
total reward:  13
total reward:  11
total reward:  14
total reward:  6
total reward:  6
total reward:  14
total reward:  8
total reward:  12
total reward:  4
total reward:  8
total reward:  7
total reward:  14
total reward:  5
total reward:  4
total reward:  9
total reward:  6
total reward:  16
total reward:  6
total reward:  9
total reward:  8
total reward:  6
total reward:  12
total reward:  11
total reward:  15
total reward:  16
total reward:  12
total reward:  15
total reward:  15
total reward:  8
total reward:  7
total reward:  7
total reward:  12
total reward:  11
total reward:  9
total reward:  -3
total reward:  7
total reward:  6
total reward:  7
total reward:  15
total reward:  4
total reward:  11
total reward:  12
total reward:  8
total reward:  10
total reward:  7
total reward:  15
total reward:  12
total reward:  14
total reward:  13
total reward:  4
total reward:  9
total reward:  6
total reward:  10
total reward:  14
total reward:  -2
total reward:  15
total reward:  12
total reward:  7
total reward:  3
total reward:  16
total reward:  16
total reward:  8
total reward:  12
total reward:  18
total reward:  11
total reward:  12
total reward:  14
total reward:  10
total reward:  12
total reward:  -6
total reward:  15
total reward:  7
total reward:  9
total reward:  15
total reward:  14
total reward:  15
total reward:  9
total reward:  7
total reward:  14
total reward:  12
total reward:  13
total reward:  11
total reward:  7
total reward:  3
total reward:  8
total reward:  10
total reward:  14
total reward:  15
total reward:  11
total reward:  15
total reward:  11
total reward:  11
total reward:  10
total reward:  9
total reward:  11
total reward:  8
total reward:  11
total reward:  17
total reward:  10
total reward:  8
total reward:  11
total reward:  6
total reward:  8
total reward:  8
total reward:  5
total reward:  14
total reward:  17
total reward:  10
total reward:  9
total reward:  2
total reward:  4
total reward:  11
total reward:  11
total reward:  6
total reward:  6
total reward:  5
total reward:  10
total reward:  12
total reward:  10
total reward:  5
total reward:  9
total reward:  11
total reward:  10
total reward:  5
total reward:  6
total reward:  8
total reward:  9
total reward:  14
total reward:  10
total reward:  5
total reward:  7
total reward:  6
total reward:  12
total reward:  13
total reward:  7
total reward:  8
total reward:  6
total reward:  12
total reward:  9
total reward:  13
total reward:  9
total reward:  15
total reward:  9
total reward:  16
total reward:  14
total reward:  5
total reward:  9
total reward:  5
total reward:  2
total reward:  10
total reward:  7
total reward:  9
total reward:  12
total reward:  6
total reward:  10
total reward:  13
total reward:  10
total reward:  1
total reward:  15
total reward:  8
total reward:  9
total reward:  7
total reward:  13
total reward:  6
total reward:  10
total reward:  11
total reward:  14
total reward:  8
total reward:  12
total reward:  10
total reward:  8
total reward:  9
total reward:  -1
total reward:  8
total reward:  14
total reward:  14
total reward:  15
total reward:  12
total reward:  8
total reward:  16
total reward:  17
total reward:  10
total reward:  15
total reward:  12
total reward:  9
total reward:  8
total reward:  5
total reward:  6
total reward:  10
total reward:  11
total reward:  13
total reward:  10
total reward:  12
total reward:  11
total reward:  9
total reward:  9
total reward:  15
total reward:  6
total reward:  12
total reward:  12
total reward:  10
total reward:  -2
total reward:  5
total reward:  9
total reward:  10
total reward:  7
total reward:  6
total reward:  14
total reward:  14
total reward:  5
total reward:  9
total reward:  13
total reward:  11
total reward:  8
total reward:  9
total reward:  11
total reward:  19
total reward:  13
total reward:  9
total reward:  11
total reward:  9
total reward:  8
total reward:  7
total reward:  -4
total reward:  9
total reward:  1
total reward:  12
total reward:  14
total reward:  10
total reward:  7
total reward:  16
total reward:  12
total reward:  9
total reward:  13
total reward:  9
total reward:  12
total reward:  6
total reward:  12
total reward:  7
total reward:  13
total reward:  6
total reward:  18
total reward:  12
total reward:  7
total reward:  10
total reward:  5
total reward:  -5
total reward:  11
total reward:  15
total reward:  13
total reward:  6
total reward:  15
total reward:  12
total reward:  16
total reward:  -2
total reward:  8
total reward:  12
total reward:  14
total reward:  6
total reward:  5
total reward:  11
total reward:  7
total reward:  8
total reward:  7
total reward:  8
total reward:  5
total reward:  3
total reward:  8
total reward:  8
total reward:  9
total reward:  17
total reward:  8
total reward:  10
total reward:  7
total reward:  8
total reward:  -1
total reward:  12
total reward:  13
total reward:  13
total reward:  14
total reward:  9
total reward:  14
total reward:  11
total reward:  7
total reward:  18
total reward:  -1
total reward:  13
total reward:  12
total reward:  7
total reward:  10
total reward:  8
total reward:  -2
total reward:  14
total reward:  10
total reward:  11
total reward:  12
total reward:  10
total reward:  12
total reward:  12
total reward:  8
total reward:  9
total reward:  6
total reward:  13
total reward:  13
total reward:  7
total reward:  3
total reward:  9
total reward:  8
total reward:  9
total reward:  16
total reward:  4
total reward:  10
total reward:  17
total reward:  9
total reward:  7
total reward:  11
total reward:  9
total reward:  12
total reward:  8
total reward:  6
total reward:  7
total reward:  11
total reward:  -3
total reward:  11
total reward:  7
total reward:  8
total reward:  14
total reward:  12
total reward:  11
total reward:  8
total reward:  4
total reward:  8
total reward:  7
total reward:  19
total reward:  15
total reward:  11
total reward:  15
total reward:  16
total reward:  5
total reward:  15
total reward:  17
total reward:  9
total reward:  10
total reward:  12
total reward:  6
total reward:  6
total reward:  9
total reward:  11
total reward:  14
total reward:  5
total reward:  -2
total reward:  13
total reward:  8
total reward:  8
total reward:  15
total reward:  12
total reward:  8
total reward:  12
total reward:  11
total reward:  10
total reward:  16
total reward:  9
total reward:  10
total reward:  9
total reward:  14
total reward:  10
total reward:  11
total reward:  8
total reward:  -2
total reward:  7
total reward:  6
total reward:  12
total reward:  4
total reward:  -2
total reward:  9
total reward:  9
total reward:  7
total reward:  14
total reward:  16
total reward:  12
total reward:  8
total reward:  8
total reward:  9
total reward:  9
total reward:  12
total reward:  16
total reward:  6
total reward:  0
total reward:  12
total reward:  5
total reward:  8
total reward:  9
total reward:  18
total reward:  13
total reward:  10
total reward:  -1
total reward:  12
total reward:  8
total reward:  6
total reward:  13
total reward:  15
total reward:  17
total reward:  9
total reward:  13
total reward:  11
total reward:  11
total reward:  8
total reward:  19
total reward:  10
total reward:  7
total reward:  13
total reward:  10
total reward:  7
total reward:  11
total reward:  9
total reward:  7
total reward:  12
total reward:  12
total reward:  6
total reward:  14
total reward:  17
total reward:  5
total reward:  14
total reward:  7
total reward:  11
total reward:  -3
total reward:  6
total reward:  -1
total reward:  15
total reward:  7
total reward:  7
total reward:  12
total reward:  12
total reward:  6
total reward:  8
total reward:  9
total reward:  13
total reward:  9
total reward:  12
total reward:  10
total reward:  11
total reward:  10
total reward:  8
total reward:  -2
total reward:  10
total reward:  9
total reward:  9
total reward:  3
total reward:  10
total reward:  13
total reward:  6
total reward:  13
total reward:  19
total reward:  10
total reward:  14
total reward:  9
total reward:  14
total reward:  14
total reward:  19
total reward:  7
total reward:  11
total reward:  15
total reward:  14
total reward:  10
total reward:  9
total reward:  12
total reward:  -2
total reward:  7
total reward:  9
total reward:  10
total reward:  10
total reward:  5
total reward:  18
total reward:  8
total reward:  6
total reward:  7
total reward:  12
total reward:  15
total reward:  15
total reward:  9
total reward:  11
total reward:  11
total reward:  13
total reward:  7
total reward:  -3
total reward:  8
total reward:  11
total reward:  14
total reward:  8
total reward:  10
total reward:  11
total reward:  17
total reward:  14
total reward:  13
total reward:  -2
total reward:  13
total reward:  10
total reward:  10
total reward:  8
total reward:  14
total reward:  7
total reward:  6
total reward:  9
total reward:  2
total reward:  11
total reward:  1
total reward:  12
total reward:  8
total reward:  -2
total reward:  10
total reward:  7
total reward:  12
total reward:  8
total reward:  8
total reward:  15
total reward:  9
total reward:  10
total reward:  14
total reward:  15
total reward:  10
total reward:  11
total reward:  9
total reward:  5
total reward:  7
total reward:  11
total reward:  14
total reward:  5
total reward:  7
total reward:  14
total reward:  9
total reward:  8
total reward:  11
total reward:  8
total reward:  14
total reward:  7
total reward:  8
total reward:  9
total reward:  10
total reward:  9
total reward:  7
total reward:  14
total reward:  17
total reward:  17
total reward:  8
total reward:  11
total reward:  16
total reward:  12
total reward:  8
total reward:  14
total reward:  19
total reward:  6
total reward:  11
total reward:  18
total reward:  10
total reward:  -1
total reward:  10
total reward:  13
total reward:  10
total reward:  11
total reward:  11
total reward:  8
total reward:  -1
total reward:  6
total reward:  7
total reward:  14
total reward:  10
total reward:  7
total reward:  8
total reward:  11
total reward:  7
total reward:  14
total reward:  -3
total reward:  4
total reward:  10
total reward:  17
total reward:  13
total reward:  14
total reward:  6
total reward:  12
total reward:  3
total reward:  11
total reward:  13
total reward:  13
total reward:  18
total reward:  10
total reward:  7
total reward:  19
total reward:  19
total reward:  10
total reward:  12
total reward:  7
total reward:  15
total reward:  6
total reward:  8
total reward:  10
total reward:  -3
total reward:  12
total reward:  15
total reward:  14
total reward:  8
total reward:  13
total reward:  11
total reward:  12
total reward:  12
total reward:  7
total reward:  12
total reward:  11
total reward:  10
total reward:  11
total reward:  14
total reward:  9
total reward:  12
total reward:  3
total reward:  13
total reward:  11
total reward:  10
total reward:  12
total reward:  5
total reward:  6
total reward:  14
total reward:  11
total reward:  10
total reward:  12
total reward:  10
total reward:  15
total reward:  11
total reward:  7
total reward:  9
total reward:  9
total reward:  8
total reward:  16
total reward:  5
total reward:  7
total reward:  16
total reward:  11
total reward:  8
total reward:  8
total reward:  7
total reward:  15
total reward:  6
total reward:  5
total reward:  8
total reward:  11
total reward:  16
total reward:  16
total reward:  7
total reward:  18
total reward:  12
total reward:  15
total reward:  1
total reward:  12
total reward:  11
total reward:  9
total reward:  15
total reward:  12
total reward:  16
total reward:  7
total reward:  9
total reward:  9
total reward:  11
total reward:  14
total reward:  10
total reward:  -1
total reward:  8
total reward:  7
total reward:  6
total reward:  6
total reward:  8
total reward:  13
total reward:  8
total reward:  11
total reward:  12
total reward:  12
total reward:  14
total reward:  8
total reward:  10
total reward:  -2
total reward:  14
total reward:  15
total reward:  12
total reward:  4
total reward:  9
total reward:  6
total reward:  6
total reward:  -1
total reward:  11
total reward:  -4
total reward:  11
total reward:  11
total reward:  13
total reward:  6
total reward:  11
total reward:  2
total reward:  7
total reward:  12
total reward:  9
total reward:  15
total reward:  -2
total reward:  17
total reward:  10
total reward:  8
total reward:  9
total reward:  5
total reward:  7
total reward:  8
total reward:  15
total reward:  11
total reward:  12
total reward:  14
total reward:  6
total reward:  9
total reward:  3
total reward:  13
total reward:  12
total reward:  10
total reward:  9
total reward:  14
total reward:  7
total reward:  11
total reward:  11
total reward:  13
total reward:  11
total reward:  17
total reward:  8
total reward:  8
total reward:  6
total reward:  14
total reward:  8
total reward:  4
total reward:  9
total reward:  9
total reward:  11
total reward:  11
total reward:  9
total reward:  8
total reward:  12
total reward:  9
total reward:  15
total reward:  11
total reward:  9
total reward:  7
total reward:  7
total reward:  12
total reward:  12
total reward:  6
total reward:  7
total reward:  6
total reward:  14
total reward:  9
total reward:  8
total reward:  8
total reward:  7
total reward:  10
total reward:  8
total reward:  11
total reward:  11
total reward:  18
total reward:  7
total reward:  9
total reward:  12
total reward:  9
total reward:  14
total reward:  9
total reward:  13
total reward:  9
total reward:  11
total reward:  3
total reward:  5
total reward:  0
total reward:  14
total reward:  9
total reward:  7
total reward:  6
total reward:  14
total reward:  5
total reward:  12
total reward:  17
total reward:  8
total reward:  9
total reward:  10
total reward:  11
total reward:  9
total reward:  12
total reward:  12
total reward:  12
total reward:  5
total reward:  19
total reward:  9
total reward:  9
total reward:  5
total reward:  15
total reward:  6
total reward:  6
total reward:  4
total reward:  12
total reward:  6
total reward:  12
total reward:  9
total reward:  5
total reward:  11
total reward:  10
total reward:  13
total reward:  13
total reward:  11
total reward:  12
total reward:  13
total reward:  6
total reward:  18
total reward:  13
total reward:  11
total reward:  7
total reward:  15
total reward:  9
total reward:  8
total reward:  9
total reward:  8
total reward:  11
total reward:  11
total reward:  11
total reward:  10
total reward:  8
total reward:  17
total reward:  8
total reward:  7
total reward:  10
total reward:  6
total reward:  6
total reward:  7
total reward:  6
total reward:  12
total reward:  17
total reward:  7
total reward:  9
total reward:  16
total reward:  9
total reward:  6
total reward:  6
total reward:  6
total reward:  11
total reward:  8
total reward:  12
total reward:  -4
total reward:  9
total reward:  16
total reward:  14
total reward:  8
total reward:  12
total reward:  14
total reward:  13
total reward:  17
total reward:  14
total reward:  6
total reward:  8
total reward:  8
total reward:  19
total reward:  11
total reward:  -7
total reward:  13
total reward:  17
total reward:  7
total reward:  15
total reward:  9
total reward:  10
total reward:  15
total reward:  9
total reward:  17
total reward:  1
total reward:  -3
total reward:  6
total reward:  16
total reward:  13
total reward:  15
total reward:  6
total reward:  13
total reward:  6
total reward:  13
total reward:  13
total reward:  0
total reward:  15
total reward:  14
total reward:  1
total reward:  12
total reward:  10
total reward:  7
total reward:  4
total reward:  11
total reward:  13
total reward:  5
total reward:  6
total reward:  6
total reward:  9
total reward:  5
total reward:  12
total reward:  -4
total reward:  -4
total reward:  5
total reward:  9
total reward:  17
total reward:  11
total reward:  13
total reward:  10
total reward:  3
total reward:  1
total reward:  -16
total reward:  10
total reward:  12
total reward:  2
total reward:  9
total reward:  8
total reward:  9
total reward:  9
total reward:  7
total reward:  13
total reward:  5
total reward:  11
total reward:  10
total reward:  12
total reward:  16
total reward:  16
total reward:  15
total reward:  5
total reward:  10
total reward:  4
total reward:  18
total reward:  13
total reward:  9
total reward:  8
total reward:  8
total reward:  9
total reward:  14
total reward:  14
total reward:  8
total reward:  12
total reward:  12
total reward:  9
total reward:  6
total reward:  14
total reward:  17
total reward:  7
total reward:  12
total reward:  16
total reward:  12
total reward:  8
total reward:  9
total reward:  11
total reward:  11
total reward:  14
total reward:  13
total reward:  5
total reward:  10
total reward:  8
total reward:  -4
total reward:  9
total reward:  10
total reward:  13
total reward:  4
total reward:  8
total reward:  14
total reward:  14
total reward:  18
total reward:  6
total reward:  3
total reward:  17
total reward:  8
total reward:  11
total reward:  11
total reward:  10
total reward:  10
total reward:  14
total reward:  3
total reward:  6
total reward:  15
total reward:  9
total reward:  14
total reward:  14
total reward:  14
total reward:  14
total reward:  12
total reward:  14
total reward:  12
total reward:  8
total reward:  7
total reward:  0
total reward:  5
total reward:  7
total reward:  16
total reward:  10
total reward:  8
total reward:  8
total reward:  14
total reward:  7
total reward:  7
total reward:  14
total reward:  9
total reward:  8
total reward:  8
total reward:  8
total reward:  14
total reward:  5
total reward:  -1
total reward:  8
total reward:  15
total reward:  15
total reward:  8
total reward:  8
total reward:  10
total reward:  10
total reward:  6
total reward:  11
total reward:  9
total reward:  10
total reward:  15
total reward:  10
total reward:  2
total reward:  10
total reward:  7
total reward:  12
total reward:  4
total reward:  14
total reward:  10
total reward:  7
total reward:  10
total reward:  16
total reward:  13
total reward:  7
total reward:  -1
total reward:  10
total reward:  8
total reward:  8
total reward:  6
total reward:  7
total reward:  10
total reward:  6
total reward:  12
total reward:  11
total reward:  10
total reward:  15
total reward:  12
total reward:  7
total reward:  9
total reward:  5
total reward:  10
total reward:  10
total reward:  9
total reward:  7
total reward:  12
total reward:  9
total reward:  4
total reward:  9
total reward:  12
total reward:  11
total reward:  5
total reward:  9
total reward:  7
total reward:  7
total reward:  5
total reward:  12
total reward:  11
total reward:  15
total reward:  9
total reward:  5
total reward:  18
total reward:  9
total reward:  10
total reward:  8
total reward:  6
total reward:  13
total reward:  13
total reward:  12
total reward:  11
total reward:  10
total reward:  5
total reward:  5
total reward:  10
total reward:  12
total reward:  5
total reward:  14
total reward:  18
total reward:  14
total reward:  5
total reward:  11
total reward:  6
total reward:  -2
total reward:  12
total reward:  15
total reward:  13
total reward:  13
total reward:  14
total reward:  -7
total reward:  11
total reward:  5
total reward:  10
total reward:  8
total reward:  8
total reward:  6
total reward:  6
total reward:  8
total reward:  9
total reward:  7
total reward:  7
total reward:  10
total reward:  6
total reward:  5
total reward:  10
total reward:  16
total reward:  6
total reward:  9
total reward:  10
total reward:  5
total reward:  7
total reward:  -3
total reward:  -4
total reward:  13
total reward:  14
total reward:  19
total reward:  15
total reward:  9
total reward:  10
total reward:  18
total reward:  11
total reward:  5
total reward:  11
total reward:  8
total reward:  11
total reward:  11
total reward:  9
total reward:  8
total reward:  16
total reward:  13
total reward:  8
total reward:  18
total reward:  8
total reward:  9

total reward:  9
total reward:  16
total reward:  7
total reward:  13
total reward:  5
total reward:  9
total reward:  15
total reward:  14
total reward:  13
total reward:  7
total reward:  7
total reward:  15
total reward:  17
total reward:  12
total reward:  13
total reward:  16
total reward:  8
total reward:  7
total reward:  8
total reward:  2
total reward:  3
total reward:  7
total reward:  11
total reward:  8
total reward:  18
total reward:  8
total reward:  5
total reward:  11
total reward:  5
total reward:  8
total reward:  9
total reward:  11
total reward:  6
total reward:  7
total reward:  14
total reward:  8
total reward:  13
total reward:  9
total reward:  12
total reward:  12
total reward:  6
total reward:  8
total reward:  13
total reward:  6
total reward:  13
total reward:  7
total reward:  6
total reward:  12
total reward:  8
total reward:  11
total reward:  7
total reward:  9
total reward:  10
total reward:  11
total reward:  13
total reward:  17
total reward:  10

total reward:  11
total reward:  11
total reward:  -2
total reward:  11
total reward:  9
total reward:  17
total reward:  -6
total reward:  14
total reward:  13
total reward:  10
total reward:  5
total reward:  7
total reward:  9
total reward:  11
total reward:  8
total reward:  6
total reward:  13
total reward:  10
total reward:  14
total reward:  6
total reward:  8
total reward:  8
total reward:  6
total reward:  13
total reward:  16
total reward:  15
total reward:  6
total reward:  11
total reward:  10
total reward:  6
total reward:  0
total reward:  13
total reward:  12
total reward:  11
total reward:  12
total reward:  17
total reward:  14
total reward:  10
total reward:  7
total reward:  5
total reward:  5
total reward:  19
total reward:  9
total reward:  6
total reward:  9
total reward:  6
total reward:  9
total reward:  10
total reward:  12
total reward:  7
total reward:  17
total reward:  11
total reward:  10
total reward:  15
total reward:  13
total reward:  8

total reward:  6
total reward:  9
total reward:  8
total reward:  8
total reward:  12
total reward:  13
total reward:  14
total reward:  14
total reward:  13
total reward:  13
total reward:  7
total reward:  7
total reward:  8
total reward:  13
total reward:  7
total reward:  13
total reward:  7
total reward:  7
total reward:  11
total reward:  8
total reward:  9
total reward:  12
total reward:  10
total reward:  6
total reward:  15
total reward:  16
total reward:  6
total reward:  7
total reward:  9
total reward:  4
total reward:  9
total reward:  7
total reward:  14
total reward:  14
total reward:  10
total reward:  -2
total reward:  10
total reward:  14
total reward:  11
total reward:  7
total reward:  6
total reward:  10
total reward:  8
total reward:  13
total reward:  14
total reward:  14
total reward:  5
total reward:  5
total reward:  16
total reward:  8
total reward:  13
total reward:  9
total reward:  5
total reward:  1
total reward:  8
total reward:  8
total reward:  18

total reward:  14
total reward:  11
total reward:  5
total reward:  13
total reward:  9
total reward:  8
total reward:  5
total reward:  15
total reward:  10
total reward:  7
total reward:  14
total reward:  10
total reward:  8
total reward:  13
total reward:  7
total reward:  9
total reward:  -13
total reward:  8
total reward:  8
total reward:  7
total reward:  12
total reward:  8
total reward:  11
total reward:  17
total reward:  12
total reward:  9
total reward:  10
total reward:  10
total reward:  8
total reward:  1
total reward:  9
total reward:  16
total reward:  10
total reward:  -1
total reward:  7
total reward:  6
total reward:  11
total reward:  12
total reward:  10
total reward:  12
total reward:  6
total reward:  14
total reward:  12
total reward:  14
total reward:  5
total reward:  7
total reward:  10
total reward:  12
total reward:  8
total reward:  15
total reward:  9
total reward:  14
total reward:  9
total reward:  7
total reward:  6
total reward:  16
total reward:  5


## Solving the taxi problem using SARSA
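
SARSA is on-policy: its update target uses the action the epsilon-greedy policy actually selects in the next state, whereas Q learning uses the best action available there. The sketch below contrasts the two targets on a single transition. The toy table, its values, and the helper names `sarsa_target` and `q_learning_target` are assumptions made for illustration; they are not part of the taxi run.

```python
def sarsa_target(Q, reward, next_state, next_action, gamma):
    """On-policy target: uses the action actually selected in next_state."""
    return reward + gamma * Q[(next_state, next_action)]


def q_learning_target(Q, reward, next_state, n_actions, gamma):
    """Off-policy target: uses the best action available in next_state."""
    return reward + gamma * max(Q[(next_state, a)] for a in range(n_actions))


# Toy table with 2 states and 2 actions (hypothetical values, not from the taxi run)
Q = {(0, 0): 0.0, (0, 1): 0.0, (1, 0): 2.0, (1, 1): 5.0}
alpha, gamma = 0.85, 0.90

# Transition: in state 0 take action 0, receive reward -1, land in state 1,
# where the policy happened to pick the exploratory action 0 instead of the best one.
state, action, reward, next_state, next_action = 0, 0, -1, 1, 0

td_sarsa = sarsa_target(Q, reward, next_state, next_action, gamma)
td_qlearn = q_learning_target(Q, reward, next_state, 2, gamma)

# Apply the SARSA update to the visited state-action pair
Q[(state, action)] += alpha * (td_sarsa - Q[(state, action)])
print(td_sarsa, td_qlearn, Q[(state, action)])
```

Because the exploratory action has a lower value than the greedy one, the SARSA target here is smaller than the Q learning target. With a large epsilon this happens on most steps.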

In [13]:
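alpha = 0.85    # learning rate
gamma = 0.90    # discount factor
epsilon = 0.8   # exploration rate: a random action is taken 80% of the time

# Initialize the Q table for SARSA as a dictionary keyed by (state, action)
Q = {}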
for s in range(env.nS):
    for a in range(env.nA):
        Q[(s,a)] = 0.0
        
# Define the epsilon-greedy policy: explore with probability epsilon,
# otherwise pick the action with the highest value in the SARSA table Q
def epsilon_greedy(state, epsilon):
    if random.uniform(0, 1) < epsilon:
        return env.action_space.sample()
    else:
        return max(list(range(env.nA)), key=lambda x: Q[(state, x)])
    
nb_ep = 4000
for i in range(nb_ep):
    r = 0
    state = env.reset()
    action = epsilon_greedy(state,epsilon)
    while True:
        next_state, reward, done, _ = env.step(action)
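        # Pick the next action with the same epsilon-greedy policy (on-policy)
        next_action = epsilon_greedy(next_state,epsilon)
        # SARSA update: the target uses the action actually chosen for the next state
        Q[(state,action)] += alpha*(reward + gamma*Q[(next_state,next_action)] - Q[(state,action)])
        # Move to the next state-action pair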
        action = next_action
        state = next_state
        r += reward

        if done:
            break
    print('total reward: ', r)
env.close()
    
    
    

total reward:  -12
total reward:  -154
total reward:  -30
total reward:  -34
total reward:  16
total reward:  -46
total reward:  -224
total reward:  -216
total reward:  -56
total reward:  -13
total reward:  -481
total reward:  -645
total reward:  -382
total reward:  -948
total reward:  -373
total reward:  -134
total reward:  -49
total reward:  -306
total reward:  -212
total reward:  -717
total reward:  6
total reward:  -22
total reward:  -76
total reward:  -203
total reward:  -55
total reward:  -260
total reward:  -241
total reward:  -445
total reward:  -187
total reward:  -426
total reward:  -386
total reward:  -427
total reward:  -50
total reward:  -60
total reward:  -22
total reward:  -94
total reward:  -277
total reward:  -63
total reward:  -164
total reward:  -100
total reward:  15
total reward:  -189
total reward:  -173
total reward:  -415
total reward:  -582
total reward:  -175
total reward:  -191
total reward:  -375
total reward:  -529
total reward:  -285
total reward:  -228

total reward:  -257
total reward:  -332
total reward:  -141
total reward:  8
total reward:  -92
total reward:  -384
total reward:  -149
total reward:  -240
total reward:  -146
total reward:  -127
total reward:  -101
total reward:  -72
total reward:  -548
total reward:  -100
total reward:  -157
total reward:  -221
total reward:  -70
total reward:  -435
total reward:  -706
total reward:  -425
total reward:  -110
total reward:  -288
total reward:  -247
total reward:  -265
total reward:  -173
total reward:  -95
total reward:  -382
total reward:  -38
total reward:  -227
total reward:  -107
total reward:  -288
total reward:  -101
total reward:  -755
total reward:  -150
total reward:  -77
total reward:  -164
total reward:  -33374
total reward:  -70
total reward:  -43
total reward:  -146
total reward:  -79
total reward:  18
total reward:  -59
total reward:  -550
total reward:  4
total reward:  -163
total reward:  -137
total reward:  -65
total reward:  -154
total reward:  -105
total reward:  -4

total reward:  -131
total reward:  -13
total reward:  -212
total reward:  -210
total reward:  -42
total reward:  -4
total reward:  -278
total reward:  -408
total reward:  -164
total reward:  -152
total reward:  -282
total reward:  -232
total reward:  -242
total reward:  -466
total reward:  -69
total reward:  -154
total reward:  2
total reward:  -400
total reward:  -328
total reward:  -315
total reward:  -204
total reward:  -373
total reward:  -99
total reward:  -307
total reward:  -128
total reward:  15
total reward:  -124
total reward:  -485
total reward:  -244
total reward:  -258
total reward:  -61
total reward:  -168
total reward:  -128
total reward:  -86
total reward:  -16
total reward:  -28
total reward:  -592
total reward:  -196
total reward:  -201
total reward:  8
total reward:  -217
total reward:  -193
total reward:  -15
total reward:  -377
total reward:  -98
total reward:  -109
total reward:  -102
total reward:  -244
total reward:  -223
total reward:  -51
total reward:  -422

total reward:  -254
total reward:  -68
total reward:  -150
total reward:  -379
total reward:  -84
total reward:  -112
total reward:  -196
total reward:  -559
total reward:  -261
total reward:  -2510
total reward:  -288
total reward:  -168
total reward:  -349
total reward:  -462
total reward:  -51
total reward:  -59
total reward:  -133
total reward:  -152
total reward:  -41
total reward:  -138
total reward:  -254
total reward:  6
total reward:  -787
total reward:  -3866
total reward:  -43
total reward:  -59
total reward:  -251
total reward:  -150
total reward:  -61
total reward:  -318
total reward:  -356
total reward:  -21
total reward:  -390
total reward:  -2567
total reward:  -166
total reward:  -42
total reward:  -88
total reward:  -792
total reward:  -199
total reward:  -206
total reward:  -260
total reward:  -240
total reward:  -39
total reward:  -184
total reward:  -182
total reward:  -139
total reward:  -301
total reward:  -178
total reward:  -23
total reward:  -179

total reward:  -360
total reward:  -309
total reward:  -148
total reward:  -53
total reward:  -57
total reward:  -811
total reward:  -465
total reward:  -145
total reward:  -76
total reward:  -201
total reward:  -76
total reward:  -204
total reward:  -185
total reward:  -305
total reward:  -21
total reward:  -350
total reward:  -206
total reward:  -197
total reward:  -54
total reward:  -254
total reward:  -154
total reward:  -125
total reward:  -93
total reward:  -105
total reward:  -331
total reward:  -17
total reward:  -245
total reward:  -484
total reward:  -295
total reward:  -177
total reward:  -64
total reward:  -110
total reward:  -280
total reward:  -195
total reward:  -181
total reward:  -106
total reward:  -4
total reward:  -97
total reward:  -269
total reward:  -279
total reward:  -294
total reward:  -236
total reward:  -234
total reward:  -233
total reward:  11
total reward:  -242
total reward:  -22
total reward:  -82
total reward:  -185
total reward:  -3
total reward:  -37

total reward:  -482
total reward:  -97
total reward:  -137
total reward:  -85
total reward:  -66
total reward:  -5748
total reward:  -141
total reward:  -185
total reward:  -26718
total reward:  -32
total reward:  -209
total reward:  -235
total reward:  -11
total reward:  -236
total reward:  -298
total reward:  -146
total reward:  -306
total reward:  -64
total reward:  -217
total reward:  -451
total reward:  -355
total reward:  -302
total reward:  -51
total reward:  -108
total reward:  -157
total reward:  -394
total reward:  -159
total reward:  -314
total reward:  -137
total reward:  -183
total reward:  -206
total reward:  -190
total reward:  -57
total reward:  -75
total reward:  -118
total reward:  -319
total reward:  -257
total reward:  -600
total reward:  -50
total reward:  -57
total reward:  -222
total reward:  -292
total reward:  -93
total reward:  -21
total reward:  -134
total reward:  -307
total reward:  -121
total reward:  -4
total reward:  -333
total reward:  -57

total reward:  -327
total reward:  -117
total reward:  -284
total reward:  -193
total reward:  -200
total reward:  -281
total reward:  -26
total reward:  -84
total reward:  -173
total reward:  -230
total reward:  -125
total reward:  -159
total reward:  -254
total reward:  -393
total reward:  -264
total reward:  -151
total reward:  -86
total reward:  -454
total reward:  -29
total reward:  -283
total reward:  -369
total reward:  -220
total reward:  -40209
total reward:  -33963
total reward:  -224
total reward:  -87
total reward:  5
total reward:  -462
total reward:  -214
total reward:  -155
total reward:  -257
total reward:  -390
total reward:  -128
total reward:  -30
total reward:  -70
total reward:  -114
total reward:  -67
total reward:  -399
total reward:  -305
total reward:  -65
total reward:  -232
total reward:  -247
total reward:  -222
total reward:  -401
total reward:  -412
total reward:  -145
total reward:  -28
total reward:  -80
total reward:  -147
total reward:  -14

total reward:  -18055
total reward:  -220
total reward:  -10086
total reward:  -43
total reward:  -188
total reward:  -157
total reward:  -448
total reward:  -257
total reward:  -14
total reward:  -292
total reward:  -195
total reward:  -222
total reward:  -376
total reward:  -243
total reward:  -433
total reward:  -256
total reward:  -450
total reward:  -157
total reward:  -30
total reward:  -245
total reward:  -48
total reward:  -447
total reward:  -424
total reward:  -442
total reward:  -95
total reward:  -32155
total reward:  -510
total reward:  -72
total reward:  -339
total reward:  -272
total reward:  -205
total reward:  -98
total reward:  -54
total reward:  -162
total reward:  -76
total reward:  -49
total reward:  -441
total reward:  -113
total reward:  -296
total reward:  -89
total reward:  -78
total reward:  -305
total reward:  -252
total reward:  -129
total reward:  -84
total reward:  -49
total reward:  -130
total reward:  -10
total reward:  -56
total reward:  -384

total reward:  -75
total reward:  -82
total reward:  -368
total reward:  -349
total reward:  -220
total reward:  -205
total reward:  -72
total reward:  -39
total reward:  -80
total reward:  -44
total reward:  -25
total reward:  -249
total reward:  -289
total reward:  -205
total reward:  -4665
total reward:  -413
total reward:  -154
total reward:  -106
total reward:  -294
total reward:  -101
total reward:  -152
total reward:  -189
total reward:  -459
total reward:  -167
total reward:  -461
total reward:  -603
total reward:  -130
total reward:  -618
total reward:  -290
total reward:  -61
total reward:  -37
total reward:  -181
total reward:  -145
total reward:  -300
total reward:  -149
total reward:  -380
total reward:  -273
total reward:  -42
total reward:  -87
total reward:  -201
total reward:  -827
total reward:  -190
total reward:  -51
total reward:  -11
total reward:  -513
total reward:  -318
total reward:  -277
total reward:  -399
total reward:  -66
total reward:  -290

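
The per-episode totals in the SARSA output above swing widely from one episode to the next, so a trend is hard to read off the raw printout. One option is to smooth them with a moving average. The sketch below assumes the totals have been collected in a list, here given the hypothetical name `episode_rewards` (the training loop only prints `r`, so appending it to a list after each episode would be an addition). The sample values are the first ten totals shown in the SARSA output.

```python
def moving_average(values, window):
    """Return the mean of each run of `window` consecutive values."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# First ten totals from the SARSA output; episode_rewards is a hypothetical name
episode_rewards = [-12, -154, -30, -34, 16, -46, -224, -216, -56, -13]

smoothed = moving_average(episode_rewards, window=5)
print(smoothed)
```

A larger window gives a smoother curve at the cost of reacting more slowly to real changes in performance.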