# Navigation

---

In this notebook, you will learn how to use the Unity ML-Agents environment for the first project of the [Deep Reinforcement Learning Nanodegree](https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893).

### 1. Start the Environment

We begin by importing some necessary packages.  If the code cell below returns an error, please revisit the project instructions to double-check that you have installed [Unity ML-Agents](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Installation.md) and [NumPy](http://www.numpy.org/).

In [1]:
from unityagents import UnityEnvironment
import numpy as np

Next, we will start the environment!  **_Before running the code cell below_**, change the `file_name` parameter to match the location of the Unity environment that you downloaded.

- **Mac**: `"path/to/Banana.app"`
- **Windows** (x86): `"path/to/Banana_Windows_x86/Banana.exe"`
- **Windows** (x86_64): `"path/to/Banana_Windows_x86_64/Banana.exe"`
- **Linux** (x86): `"path/to/Banana_Linux/Banana.x86"`
- **Linux** (x86_64): `"path/to/Banana_Linux/Banana.x86_64"`
- **Linux** (x86, headless): `"path/to/Banana_Linux_NoVis/Banana.x86"`
- **Linux** (x86_64, headless): `"path/to/Banana_Linux_NoVis/Banana.x86_64"`

For instance, if you are using a Mac, then you downloaded `Banana.app`.  If this file is in the same folder as the notebook, then the line below should appear as follows:
```python
env = UnityEnvironment(file_name="Banana.app")
```
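
The choice of `file_name` can also be made programmatically. The sketch below is only an illustration: the helper name `default_banana_path` and its `base_dir` argument are hypothetical (they are not part of `unityagents`), and it assumes the 64-bit, non-headless builds from the list above were unpacked directly under `base_dir`.

```python
import os
import platform

# Relative paths copied from the list above (64-bit builds with a visible window).
# This helper is hypothetical and not part of unityagents.
_BANANA_BUILDS = {
    "Darwin": "Banana.app",
    "Windows": os.path.join("Banana_Windows_x86_64", "Banana.exe"),
    "Linux": os.path.join("Banana_Linux", "Banana.x86_64"),
}


def default_banana_path(base_dir=".", system=None):
    """Return the expected location of the Banana build for an operating system."""
    system = system or platform.system()
    if system not in _BANANA_BUILDS:
        raise ValueError("No Banana build listed for platform: {}".format(system))
    return os.path.join(base_dir, _BANANA_BUILDS[system])


print(default_banana_path("path/to", system="Linux"))
```

The returned string could then be passed as the `file_name` argument of `UnityEnvironment`.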

In [2]:
# env = UnityEnvironment(file_name="/home/arasdar/VisualBanana_Linux/Banana.x86")
env = UnityEnvironment(file_name="/home/arasdar/Banana_Linux/Banana.x86_64")

INFO:unityagents:
'Academy' started successfully!
Unity Academy name: Academy
        Number of Brains: 1
        Number of External Brains : 1
        Lesson number : 0
        Reset Parameters :
		
Unity brain name: BananaBrain
        Number of Visual Observations (per agent): 0
        Vector Observation space type: continuous
        Vector Observation space size (per agent): 37
        Number of stacked Vector Observation: 1
        Vector Action space type: discrete
        Vector Action space size (per agent): 4
        Vector Action descriptions: , , , 


Environments contain **_brains_** which are responsible for deciding the actions of their associated agents. Here we check for the first brain available, and set it as the default brain we will be controlling from Python.

In [3]:
# get the default brain
brain_name = env.brain_names[0]
brain = env.brains[brain_name]

### 2. Examine the State and Action Spaces

The simulation contains a single agent that navigates a large environment.  At each time step, it has four actions at its disposal:
- `0` - walk forward 
- `1` - walk backward
- `2` - turn left
- `3` - turn right

The state space has `37` dimensions and contains the agent's velocity, along with ray-based perception of objects around the agent's forward direction.  A reward of `+1` is provided for collecting a yellow banana, and a reward of `-1` is provided for collecting a blue banana.
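
As a small worked example of the reward scheme just described, the sketch below tallies an episode score from a list of collected banana colors. The names `ACTION_NAMES`, `BANANA_REWARDS`, and `episode_score` are introduced here for illustration only and do not come from the environment's API.

```python
# Action indices and their meaning, as listed above.
ACTION_NAMES = {0: "walk forward", 1: "walk backward", 2: "turn left", 3: "turn right"}

# Reward per collected banana, as described above.
BANANA_REWARDS = {"yellow": 1.0, "blue": -1.0}


def episode_score(collected):
    """Sum the rewards for a sequence of collected banana colors."""
    return sum(BANANA_REWARDS[color] for color in collected)


# Three yellow bananas and one blue banana give 3 - 1 = 2.
print(episode_score(["yellow", "yellow", "blue", "yellow"]))  # 2.0
```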

Run the code cell below to print some information about the environment.

In [4]:
# reset the environment
env_info = env.reset(train_mode=True)[brain_name]

# number of agents in the environment
print('Number of agents:', len(env_info.agents))

# number of actions
action_size = brain.vector_action_space_size
print('Number of actions:', action_size)

# examine the state space 
state = env_info.vector_observations[0]
# print('States look like:', state)
state_size = len(state)
print('States have length:', state_size)
# print(state.shape, len(env_info.vector_observations), env_info.vector_observations.shape)

Number of agents: 1
Number of actions: 4
States have length: 37


### 3. Take Random Actions in the Environment

In the next code cell, you will learn how to use the Python API to control the agent and receive feedback from the environment.

Once this cell is executed, you can watch how the agent performs when it selects an action uniformly at random at each time step.  A window should pop up that lets you observe the agent as it moves through the environment.

Of course, as part of the project, you'll have to change the code so that the agent is able to use its experience to gradually choose better actions when interacting with the environment!
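
One common way to move from purely random actions toward better ones is epsilon-greedy selection. The project does not prescribe this method; the sketch below is just one option. It assumes a hypothetical list `q_values` holding one estimated value per action, which the agent would have to learn.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the highest-valued one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


# With epsilon = 0 the choice is purely greedy: action 1 has the largest value.
print(epsilon_greedy([0.1, 0.5, 0.2, 0.0], epsilon=0.0))  # 1
```

Starting with `epsilon` near 1 and decaying it over episodes shifts the agent gradually from exploration to exploitation.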

In [5]:
env_info = env.reset(train_mode=False)[brain_name] # reset the environment
state = env_info.vector_observations[0]            # get the current state
score = 0                                          # initialize the score
while True:
    action = np.random.randint(action_size)        # select an action
    env_info = env.step(action)[brain_name]        # send the action to the environment
    next_state = env_info.vector_observations[0]   # get the next state
    reward = env_info.rewards[0]                   # get the reward
    done = env_info.local_done[0]                  # see if episode has finished
    score += reward                                # update the score
    state = next_state                             # roll over the state to next time step
    if done:                                       # exit loop if episode finished
        print(state.shape)
        break
    
print("Score: {}".format(score))

(37,)
Score: 0.0


When finished, you can close the environment.

In [6]:
# env.close()

### 4. It's Your Turn!

Now it's your turn to train your own agent to solve the environment!  When training the agent, set `train_mode=True`, so that the line for resetting the environment looks like the following:
```python
env_info = env.reset(train_mode=True)[brain_name]
```
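
The reset-and-step loop used throughout this notebook can be wrapped in a function so that different policies are easy to compare. The sketch below is a hedged illustration: `run_episode` is a hypothetical helper, and `FakeEnv` is a stand-in that mimics only the attributes this notebook reads (`vector_observations`, `rewards`, `local_done`), so the code runs without Unity.

```python
import random


def run_episode(env, brain_name, policy, train_mode=True):
    """Run one episode and return the total reward.

    `policy` is any callable mapping a state to an action index.
    """
    env_info = env.reset(train_mode=train_mode)[brain_name]
    state = env_info.vector_observations[0]
    score = 0.0
    while True:
        action = policy(state)
        env_info = env.step(action)[brain_name]
        score += env_info.rewards[0]
        state = env_info.vector_observations[0]
        if env_info.local_done[0]:
            return score


# --- Stand-in environment (hypothetical), used only to make the sketch runnable ---
class _FakeInfo:
    def __init__(self, state, reward, done):
        self.vector_observations = [state]
        self.rewards = [reward]
        self.local_done = [done]


class FakeEnv:
    """Every step yields reward +1; the episode ends after `length` steps."""

    def __init__(self, length=5):
        self.length = length
        self.t = 0

    def reset(self, train_mode=True):
        self.t = 0
        return {"BananaBrain": _FakeInfo([0.0] * 37, 0.0, False)}

    def step(self, action):
        self.t += 1
        return {"BananaBrain": _FakeInfo([0.0] * 37, 1.0, self.t >= self.length)}


def random_policy(state):
    """Ignore the state and pick one of the four actions uniformly."""
    return random.randrange(4)


print(run_episode(FakeEnv(length=5), "BananaBrain", random_policy))  # 5.0
```

With the real environment, the call would take the `env` and `brain_name` objects created earlier in place of the stand-ins.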

In [7]:
env_info = env.reset(train_mode=True)[brain_name] # reset the environment
state = env_info.vector_observations[0]            # get the current state
score = 0                                          # initialize the score
while True:
    action = np.random.randint(action_size)        # select an action
    env_info = env.step(action)[brain_name]        # send the action to the environment
    next_state = env_info.vector_observations[0]   # get the next state
    reward = env_info.rewards[0]                   # get the reward
    done = env_info.local_done[0]                  # see if episode has finished
    score += reward                                # update the score
    state = next_state                             # roll over the state to next time step
    #print(state)
    if done:                                       # exit loop if episode finished
        break
    
print("Score: {}".format(score))

Score: 1.0


In [8]:
# Check whether TensorFlow can see a GPU; if not, it runs on the CPU
import tensorflow as tf

# Check TensorFlow Version
print('TensorFlow Version: {}'.format(tf.__version__))

# Check for a GPU
print('Default GPU Device: {}'.format(tf.test.gpu_device_name()))

TensorFlow Version: 1.7.1
Default GPU Device: 


In [9]:
env_info = env.reset(train_mode=True)[brain_name] # reset the environment
state = env_info.vector_observations[0]            # get the current state
score = 0                                          # initialize the score
batch = []
while True: # infinite number of steps
    action = np.random.randint(action_size)        # select an action
    env_info = env.step(action)[brain_name]        # send the action to the environment
    next_state = env_info.vector_observations[0]   # get the next state
    reward = env_info.rewards[0]                   # get the reward
    done = env_info.local_done[0]                  # see if episode has finished
    score += reward                                # update the score
    #print(state, action, reward, done)
    batch.append([action, state, reward, done])
    state = next_state                             # roll over the state to next time step
    if done:                                       # exit loop if episode finished
        break
    
# print("Score: {}".format(score))

In [10]:
batch[0], batch[0][1].shape

([0, array([1.        , 0.        , 0.        , 0.        , 0.35186431,
         1.        , 0.        , 0.        , 0.        , 0.37953866,
         1.        , 0.        , 0.        , 0.        , 0.11957462,
         1.        , 0.        , 0.        , 0.        , 0.43679786,
         0.        , 1.        , 0.        , 0.        , 0.7516005 ,
         0.        , 0.        , 1.        , 0.        , 0.6708644 ,
         0.        , 0.        , 1.        , 0.        , 0.36187497,
         0.        , 0.        ]), 0.0, False], (37,))

In [11]:
batch[0][1].shape

(37,)

In [12]:
batch[0]

[0, array([1.        , 0.        , 0.        , 0.        , 0.35186431,
        1.        , 0.        , 0.        , 0.        , 0.37953866,
        1.        , 0.        , 0.        , 0.        , 0.11957462,
        1.        , 0.        , 0.        , 0.        , 0.43679786,
        0.        , 1.        , 0.        , 0.        , 0.7516005 ,
        0.        , 0.        , 1.        , 0.        , 0.6708644 ,
        0.        , 0.        , 1.        , 0.        , 0.36187497,
        0.        , 0.        ]), 0.0, False]

In [13]:
actions = np.array([each[0] for each in batch])
states = np.array([each[1] for each in batch])
rewards = np.array([each[2] for each in batch])
dones = np.array([each[3] for each in batch])
# infos = np.array([each[4] for each in batch])
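
The list comprehensions above can also be written as a single transpose with `zip`. The sketch below uses a small made-up batch in the same `[action, state, reward, done]` layout; the `toy_` names are illustrative and keep the notebook's own variables untouched.

```python
import numpy as np

# A made-up batch of three transitions in the layout [action, state, reward, done].
toy_batch = [
    [0, np.zeros(37), 0.0, False],
    [3, np.ones(37), 1.0, False],
    [2, np.zeros(37), 0.0, True],
]

# zip(*toy_batch) groups all first items together, then all second items, and so on.
toy_actions, toy_states, toy_rewards, toy_dones = (
    np.array(column) for column in zip(*toy_batch)
)

print(toy_actions.shape, toy_states.shape, toy_rewards.shape, toy_dones.shape)
```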

In [14]:
# print(rewards[:])
print(np.array(rewards).shape, np.array(states).shape, np.array(actions).shape, np.array(dones).shape)
print(np.array(rewards).dtype, np.array(states).dtype, np.array(actions).dtype, np.array(dones).dtype)
print(np.max(np.array(actions)), np.min(np.array(actions)), 
      (np.max(np.array(actions)) - np.min(np.array(actions)))+1)
print(np.max(np.array(rewards)), np.min(np.array(rewards)))
print(np.max(np.array(states)), np.min(np.array(states)))

(300,) (300, 37) (300,) (300,)
float64 float64 int64 bool
3 0 4
1.0 0.0
10.869853973388672 -10.982420921325684


In [15]:
# Data of the model
def model_input(state_size):
    states = tf.placeholder(tf.float32, [None, state_size], name='states')
    actions = tf.placeholder(tf.int32, [None], name='actions')
    targetQs = tf.placeholder(tf.float32, [None], name='targetQs')
    reward = tf.placeholder(tf.float32, [], name='reward')
    return states, actions, targetQs, reward

In [16]:
# Generator/Controller: Generating/predicting the actions
def generator(states, action_size, hidden_size, reuse=False, alpha=0.1, training=False):
    with tf.variable_scope('generator', reuse=reuse):
        # First fully connected layer
        h1 = tf.layers.dense(inputs=states, units=hidden_size)
        bn1 = tf.layers.batch_normalization(h1, training=training)        
        nl1 = tf.maximum(alpha * bn1, bn1)
        
        # Second fully connected layer
        h2 = tf.layers.dense(inputs=nl1, units=hidden_size)
        bn2 = tf.layers.batch_normalization(h2, training=training)        
        nl2 = tf.maximum(alpha * bn2, bn2)
        
        # Output layer
        logits = tf.layers.dense(inputs=nl2, units=action_size)        
        #predictions = tf.nn.softmax(logits)

        # return actions logits
        return logits
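
The expression `tf.maximum(alpha * bn, bn)` used in both hidden layers is a leaky ReLU. The NumPy sketch below illustrates the same element-wise rule outside TensorFlow; the function name `leaky_relu` is introduced here for illustration.

```python
import numpy as np


def leaky_relu(x, alpha=0.1):
    """Element-wise max(alpha * x, x); for 0 < alpha < 1 this scales negatives by alpha."""
    return np.maximum(alpha * x, x)


x = np.array([-2.0, -0.5, 0.0, 1.5])
# Negative inputs shrink to -0.2 and -0.05; non-negative inputs pass through unchanged.
print(leaky_relu(x))
```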

In [17]:
# Discriminator/Dopamine: Reward function/planner/navigator/advisor/supervisor/cortical columns
def discriminator(states, actions, hidden_size, reuse=False, alpha=0.1, training=False):
    with tf.variable_scope('discriminator', reuse=reuse):
        # Fusion/merge states and actions/ SA/ SM
        x_fused = tf.concat(axis=1, values=[states, actions])
        
        # First fully connected layer
        h1 = tf.layers.dense(inputs=x_fused, units=hidden_size)
        bn1 = tf.layers.batch_normalization(h1, training=training)        
        nl1 = tf.maximum(alpha * bn1, bn1)
        
        # Second fully connected layer
        h2 = tf.layers.dense(inputs=nl1, units=hidden_size)
        bn2 = tf.layers.batch_normalization(h2, training=training)        
        nl2 = tf.maximum(alpha * bn2, bn2)
        
        # Output layer
        logits = tf.layers.dense(inputs=nl2, units=1)        
        #predictions = tf.nn.softmax(logits)

        # return rewards logits
        return logits
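
The `tf.concat(axis=1, ...)` step joins each state row with its action row, so the first dense layer of the discriminator sees `state_size + action_size` input features. The NumPy sketch below checks that shape arithmetic; the batch size of 5 is arbitrary, and the action input is assumed to be a float array of width `action_size`, as when the generator's logits are passed in.

```python
import numpy as np

batch_size, state_size, action_size = 5, 37, 4

states_demo = np.zeros((batch_size, state_size), dtype=np.float32)
action_logits_demo = np.zeros((batch_size, action_size), dtype=np.float32)

# Joining along axis=1 keeps one row per sample and appends the action columns.
x_fused_demo = np.concatenate([states_demo, action_logits_demo], axis=1)
print(x_fused_demo.shape)  # (5, 41)
```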

In [25]:
def model_loss(action_size, hidden_size, states, actions, targetQs, reward):
    # G
    actions_logits = generator(states=states, hidden_size=hidden_size, action_size=action_size)
    actions_labels = tf.one_hot(indices=actions, depth=action_size, dtype=actions_logits.dtype)
    neg_log_prob_actions = tf.nn.softmax_cross_entropy_with_logits_v2(logits=actions_logits, 
                                                                      labels=actions_labels)
    rewards = reward * tf.ones_like(targetQs)
    #Qs_labels = targetQs[1:]
    Qs_labels = rewards[:-1] + (0.99*targetQs[1:])
    Qs_labels = tf.concat(axis=0, values=[Qs_labels, tf.zeros([1])])
    g_loss = tf.reduce_mean(neg_log_prob_actions * Qs_labels)
    #g_loss = tf.reduce_mean(neg_log_prob_actions[:-1] * Qs_labels)
    
    # D
    Qs_logits = discriminator(actions=actions_logits, hidden_size=hidden_size, states=states)
    d_lossR = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(logits=tf.reshape(Qs_logits, [-1]),
                                                                     labels=rewards))
    d_lossQ = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(logits=tf.reshape(Qs_logits, [-1]),
                                                                     labels=tf.nn.sigmoid(Qs_labels)))
    # d_lossQ = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(logits=tf.reshape(Qs_logits[:-1], [-1]),
    #                                                                  labels=tf.nn.sigmoid(Qs_labels)))
    d_loss = d_lossR + d_lossQ

    return actions_logits, Qs_logits, g_loss, d_loss, d_lossR, d_lossQ
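
To make the target construction in `model_loss` easier to follow, the sketch below mirrors it in NumPy: each step's label is the episode-level reward plus `0.99` times the next step's Q value, the final step gets `0`, and the generator loss weights the negative log-probability of each action by that label. The helper names `q_targets` and `softmax_cross_entropy` are illustrative, and the inputs are made-up numbers.

```python
import numpy as np


def q_targets(reward, target_qs, gamma=0.99):
    """Mirror of Qs_labels: reward + gamma * Q[t+1], with 0 appended for the last step."""
    target_qs = np.asarray(target_qs, dtype=np.float64)
    rewards = reward * np.ones_like(target_qs)
    labels = rewards[:-1] + gamma * target_qs[1:]
    return np.concatenate([labels, np.zeros(1)])


def softmax_cross_entropy(logits, actions):
    """Negative log-probability of each chosen action under softmax(logits)."""
    logits = np.asarray(logits, dtype=np.float64)
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(actions)), actions]


labels = q_targets(reward=0.5, target_qs=[1.0, 2.0, 3.0])
# 0.5 + 0.99 * 2.0 = 2.48, 0.5 + 0.99 * 3.0 = 3.47, then 0 for the final step.
print(labels)

# Uniform logits over 4 actions give a negative log-probability of log(4) per step.
neg_log_prob = softmax_cross_entropy(np.zeros((3, 4)), np.array([0, 1, 2]))
g_loss_demo = np.mean(neg_log_prob * labels)
print(g_loss_demo)
```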

In [26]:
# Optimizing/training/learning G & D
def model_opt(g_loss, d_loss, learning_rate):
    """
    Get optimization operations in order
    :param g_loss: Generator loss Tensor for action prediction
    :param d_loss: Discriminator loss Tensor for reward prediction for generated/prob/logits action
    :param learning_rate: Learning rate for the Adam optimizers
    :return: A tuple of (generator training op, discriminator training op)
    """
    # Get weights and bias to update
    t_vars = tf.trainable_variables()
    g_vars = [var for var in t_vars if var.name.startswith('generator')]
    d_vars = [var for var in t_vars if var.name.startswith('discriminator')]

    # Optimize
    with tf.control_dependencies(tf.get_collection(tf.GraphKeys.UPDATE_OPS)): # Required for batchnorm (BN)
        g_opt = tf.train.AdamOptimizer(learning_rate).minimize(g_loss, var_list=g_vars)
        d_opt = tf.train.AdamOptimizer(learning_rate).minimize(d_loss, var_list=d_vars)

    return g_opt, d_opt
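
The `startswith` filters in `model_opt` work because variables created inside a `tf.variable_scope` carry the scope name as a prefix. The sketch below shows the same split on plain strings; the names are made up to resemble such variable names and are for illustration only.

```python
# Plain strings standing in for TensorFlow variable names (illustrative only).
names = [
    "generator/dense/kernel:0",
    "generator/dense/bias:0",
    "discriminator/dense/kernel:0",
    "discriminator/dense/bias:0",
]

g_names = [n for n in names if n.startswith("generator")]
d_names = [n for n in names if n.startswith("discriminator")]

# Each name lands in exactly one of the two groups.
print(g_names)
print(d_names)
```

Passing each group as `var_list` means that minimizing `g_loss` updates only the generator's weights and minimizing `d_loss` updates only the discriminator's.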

In [27]:
class Model:
    def __init__(self, state_size, action_size, hidden_size, learning_rate):

        # Data of the Model: make the data available inside the framework
        self.states, self.actions, self.targetQs, self.reward = model_input(state_size=state_size)

        # Create the Model: calculating the loss and forward pass
        self.actions_logits, self.Qs_logits, self.g_loss, self.d_loss, self.d_lossR, self.d_lossQ = model_loss(
            action_size=action_size, hidden_size=hidden_size, # model init parameters
            states=self.states, actions=self.actions, # model input
            targetQs=self.targetQs, reward=self.reward) # model input
        
        # Update the model: backward pass and backprop
        self.g_opt, self.d_opt = model_opt(g_loss=self.g_loss, d_loss=self.d_loss, learning_rate=learning_rate)

In [28]:
print('state size:{}'.format(states.shape), 
      'actions:{}'.format(actions.shape)) 
print('action size:{}'.format(np.max(actions) - np.min(actions)+1))

state size:(300, 37) actions:(300,)
action size:4


In [29]:
# Training parameters
# Network parameters
state_size = 37              # number of units for the input state/observation -- simulation
action_size = 4              # number of units for the output actions -- simulation
hidden_size = 37*16          # number of units in each Q-network hidden layer -- simulation
learning_rate = 0.001          # learning rate for adam
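
For a sense of scale, `hidden_size = 37*16` gives 592 units per hidden layer. The sketch below counts the trainable parameters of the generator under the assumption that the dense layers use biases and that each batch-normalization layer contributes one trainable scale and one trainable offset per unit (the defaults of `tf.layers`). The helper names are illustrative.

```python
def dense_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return n_in * n_out + n_out


def batchnorm_params(n_units):
    """Trainable scale (gamma) and offset (beta) per unit."""
    return 2 * n_units


state_size_demo, action_size_demo, hidden_size_demo = 37, 4, 37 * 16

generator_params = (
    dense_params(state_size_demo, hidden_size_demo) + batchnorm_params(hidden_size_demo)
    + dense_params(hidden_size_demo, hidden_size_demo) + batchnorm_params(hidden_size_demo)
    + dense_params(hidden_size_demo, action_size_demo)
)
print(hidden_size_demo, generator_params)  # 592 378292
```

Almost all of these parameters sit in the 592-by-592 second hidden layer.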

In [30]:
# Reset/init the graph/session
tf.reset_default_graph()  # returns None, so fetch the fresh default graph explicitly
graph = tf.get_default_graph()

# Init the model
model = Model(action_size=action_size, hidden_size=hidden_size, state_size=state_size, learning_rate=learning_rate)

In [31]:
env_info = env.reset(train_mode=True)[brain_name] # reset the environment

while True: # infinite number of steps
#for _ in range(batch_size):
    state = env_info.vector_observations[0]        # get the current state
    action = np.random.randint(action_size)        # select an action
    env_info = env.step(action)[brain_name]        # send the action to the environment
    reward = env_info.rewards[0]                   # get the reward
    done = env_info.local_done[0]                  # see if episode has finished
    #memory.buffer.append([action, state, done])
    if done:                                       # exit loop if episode finished
        break
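
The training loop further below relies on two small pieces of bookkeeping: an episode's total reward is divided by the target score of 13 and clipped to the range -1 to +1, and progress is tracked as a running mean over the most recent episodes. The sketch below illustrates both with made-up episode rewards; `clipped_rate` is a hypothetical helper, and the window is 3 instead of 100 to keep the output short.

```python
from collections import deque


def clipped_rate(total_reward, target=13.0):
    """Scale an episode's total reward by the target score and clip to [-1, 1]."""
    return max(-1.0, min(1.0, total_reward / target))


# Running mean over the most recent episodes (window of 3 here instead of 100).
recent = deque(maxlen=3)
for total_reward in [-1.0, 0.0, 6.0, 20.0]:
    recent.append(total_reward)
    mean_recent = sum(recent) / len(recent)
    print(total_reward, round(clipped_rate(total_reward), 4), round(mean_recent, 4))
```

A total reward of -1 maps to a rate of about -0.0769 and a total of 6 to about 0.4615, which are values that also appear in the training log.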

In [None]:
from collections import deque
episodes_total_reward = deque(maxlen=100) # 100 episodes average/running average/running mean/window
saver = tf.train.Saver()
rewards_list, g_loss_list, d_loss_list = [], [], []
d_lossR_list, d_lossQ_list = [], []

# TF session for training
with tf.Session(graph=graph) as sess:
    sess.run(tf.global_variables_initializer())
    #saver.restore(sess, 'checkpoints/model.ckpt')    
    #saver.restore(sess, tf.train.latest_checkpoint('checkpoints'))
    
    # Training episodes/epochs
    for ep in range(111111):
        batch = [] # every data batch
        total_reward = 0
        #state = env.reset() # env first state
        env_info = env.reset(train_mode=True)[brain_name] # reset the environment

        # Training steps/batches
        while True:
            state = env_info.vector_observations[0]   # get the current state
            action_logits, Q_logits = sess.run(fetches=[model.actions_logits, model.Qs_logits], 
                                               feed_dict={model.states: np.reshape(state, [1, -1])})
            action = np.argmax(action_logits)
            batch.append([state, action, Q_logits])
            #state, reward, done, _ = env.step(action)
            env_info = env.step(action)[brain_name]        # send the action to the environment
            reward = env_info.rewards[0]                   # get the reward
            done = env_info.local_done[0]                  # see if episode has finished
            total_reward += reward
            if done: # episode ended success/failure
                episodes_total_reward.append(total_reward) # stopping criteria
                #rate = total_reward/ 500 # success is 500 points, rate is between 0 and +1 ~ sigmoid
                rate = total_reward / 13 # success is +13; rate is between -1 and +1 ~ tanh
                rate = max(-1, min(+1, rate)) # clip to [-1, +1]
                break

        # Training using batches
        #batch = memory.buffer
        states = np.array([each[0] for each in batch])
        actions = np.array([each[1] for each in batch])
        targetQs = np.array([each[2] for each in batch])
        g_loss, d_loss, d_lossR, d_lossQ, _, _ = sess.run([model.g_loss, model.d_loss,
                                                           model.d_lossR, model.d_lossQ, 
                                                           model.g_opt, model.d_opt],
                                                          feed_dict = {model.states: states, 
                                                                       model.actions: actions,
                                                                       model.reward: rate,
                                                                       model.targetQs: targetQs.reshape([-1])})
        # Average 100 episode total reward
        # Print out
        print('Episode:{}'.format(ep),
              'meanR:{:.4f}'.format(np.mean(episodes_total_reward)),
              'rate:{:.4f}'.format(rate),
              'gloss:{:.4f}'.format(g_loss),
              'dloss:{:.4f}'.format(d_loss),
              'dlossR:{:.4f}'.format(d_lossR),
              'dlossQ:{:.4f}'.format(d_lossQ))
        # Plotting out
        rewards_list.append([ep, np.mean(episodes_total_reward)])
        g_loss_list.append([ep, g_loss])
        d_loss_list.append([ep, d_loss])
        d_lossR_list.append([ep, d_lossR])
        d_lossQ_list.append([ep, d_lossQ])
        # Break episode/epoch loop
        if np.mean(episodes_total_reward) >= +13:
            break
            
    # At the end of all training episodes/epochs
    saver.save(sess, 'checkpoints-nav/model.ckpt')

Episode:0 meanR:-1.0000 rate:-0.0769 gloss:-0.1316 dloss:1.3706 dlossR:0.6773 dlossQ:0.6933
Episode:1 meanR:-1.0000 rate:-0.0769 gloss:-0.7123 dloss:1.0514 dlossR:0.4117 dlossQ:0.6397
Episode:2 meanR:-0.6667 rate:0.0000 gloss:-2.2430 dloss:0.4708 dlossR:0.1167 dlossQ:0.3541
Episode:3 meanR:-0.5000 rate:0.0000 gloss:-1.9378 dloss:0.5582 dlossR:0.1505 dlossQ:0.4077
Episode:4 meanR:-0.4000 rate:0.0000 gloss:-2.5756 dloss:0.3596 dlossR:0.0808 dlossQ:0.2788
Episode:5 meanR:-0.3333 rate:0.0000 gloss:-3.3259 dloss:0.2474 dlossR:0.0481 dlossQ:0.1993
Episode:6 meanR:-0.1429 rate:0.0769 gloss:-3.8222 dloss:0.4762 dlossR:0.2936 dlossQ:0.1826
Episode:7 meanR:-0.1250 rate:0.0000 gloss:-4.5931 dloss:0.1158 dlossR:0.0177 dlossQ:0.0981
Episode:8 meanR:-0.1111 rate:0.0000 gloss:-5.3130 dloss:0.0846 dlossR:0.0118 dlossQ:0.0728
Episode:9 meanR:-0.1000 rate:0.0000 gloss:-6.1249 dloss:0.0456 dlossR:0.0050 dlossQ:0.0406
Episode:10 meanR:-0.0909 rate:0.0000 gloss:-6.6945 dloss:0.0358 dlossR:0.0033 dlossQ:0.0

Episode:89 meanR:0.1222 rate:0.0769 gloss:-4.9906 dloss:0.5519 dlossR:0.3447 dlossQ:0.2072
Episode:90 meanR:0.1099 rate:-0.0769 gloss:-6.2658 dloss:-0.0555 dlossR:-0.3289 dlossQ:0.2734
Episode:91 meanR:0.1196 rate:0.0769 gloss:-6.3559 dloss:0.5692 dlossR:0.4113 dlossQ:0.1578
Episode:92 meanR:0.1290 rate:0.0769 gloss:-7.2748 dloss:0.6754 dlossR:0.4671 dlossQ:0.2083
Episode:93 meanR:0.1170 rate:-0.0769 gloss:-5.8568 dloss:-0.1527 dlossR:-0.3167 dlossQ:0.1640
Episode:94 meanR:0.1158 rate:0.0000 gloss:-6.4807 dloss:0.2158 dlossR:0.0274 dlossQ:0.1884
Episode:95 meanR:0.1354 rate:0.1538 gloss:-7.1775 dloss:1.0732 dlossR:0.9109 dlossQ:0.1622
Episode:96 meanR:0.1546 rate:0.1538 gloss:-5.5053 dloss:0.8447 dlossR:0.7018 dlossQ:0.1429
Episode:97 meanR:0.1633 rate:0.0769 gloss:-5.0454 dloss:0.4961 dlossR:0.3340 dlossQ:0.1621
Episode:98 meanR:0.1616 rate:0.0000 gloss:-5.5479 dloss:0.1849 dlossR:0.0287 dlossQ:0.1562
Episode:99 meanR:0.1700 rate:0.0769 gloss:-5.6715 dloss:0.5481 dlossR:0.3738 dlossQ:

Episode:178 meanR:0.0700 rate:-0.1538 gloss:-6.4764 dloss:-0.6435 dlossR:-0.7165 dlossQ:0.0730
Episode:179 meanR:0.0600 rate:-0.0769 gloss:-7.1288 dloss:-0.3462 dlossR:-0.4007 dlossQ:0.0544
Episode:180 meanR:0.0800 rate:0.0769 gloss:-7.1074 dloss:0.4845 dlossR:0.4239 dlossQ:0.0607
Episode:181 meanR:0.0900 rate:-0.0769 gloss:-6.7778 dloss:-0.3072 dlossR:-0.3726 dlossQ:0.0655
Episode:182 meanR:0.1500 rate:0.4615 gloss:-8.5889 dloss:3.2731 dlossR:3.2229 dlossQ:0.0501
Episode:183 meanR:0.1500 rate:-0.0769 gloss:-8.1131 dloss:-0.4213 dlossR:-0.4557 dlossQ:0.0344
Episode:184 meanR:0.1500 rate:0.0000 gloss:-7.7196 dloss:0.0400 dlossR:0.0038 dlossQ:0.0362
Episode:185 meanR:0.1600 rate:0.0769 gloss:-7.7394 dloss:0.4918 dlossR:0.4555 dlossQ:0.0363
Episode:186 meanR:0.1600 rate:0.0000 gloss:-7.3096 dloss:0.0436 dlossR:0.0044 dlossQ:0.0392
Episode:187 meanR:0.1500 rate:0.0000 gloss:-5.8111 dloss:0.0962 dlossR:0.0138 dlossQ:0.0824
Episode:188 meanR:0.1500 rate:0.0000 gloss:-5.0896 dloss:0.1504 dlos

Episode:266 meanR:-0.0100 rate:0.0000 gloss:-8.6233 dloss:0.0720 dlossR:0.0079 dlossQ:0.0642
Episode:267 meanR:-0.0300 rate:-0.1538 gloss:-8.4223 dloss:-0.9064 dlossR:-0.9478 dlossQ:0.0414
Episode:268 meanR:-0.0300 rate:-0.0769 gloss:-8.3815 dloss:-0.4108 dlossR:-0.4715 dlossQ:0.0607
Episode:269 meanR:-0.0500 rate:-0.0769 gloss:-8.9644 dloss:-0.4479 dlossR:-0.5065 dlossQ:0.0585
Episode:270 meanR:-0.0700 rate:-0.1538 gloss:-7.6879 dloss:-0.7971 dlossR:-0.8621 dlossQ:0.0651
Episode:271 meanR:-0.0900 rate:-0.1538 gloss:-7.5284 dloss:-0.8080 dlossR:-0.8496 dlossQ:0.0415
Episode:272 meanR:-0.0800 rate:0.0769 gloss:-7.6809 dloss:0.5081 dlossR:0.4564 dlossQ:0.0517
Episode:273 meanR:-0.0300 rate:0.3846 gloss:-8.8949 dloss:2.7482 dlossR:2.7219 dlossQ:0.0263
Episode:274 meanR:-0.0100 rate:0.0000 gloss:-10.6412 dloss:0.0405 dlossR:0.0027 dlossQ:0.0378
Episode:275 meanR:0.0200 rate:0.0000 gloss:-11.2298 dloss:0.0571 dlossR:0.0027 dlossQ:0.0544
Episode:276 meanR:0.0100 rate:-0.0769 gloss:-14.1845 d

Episode:354 meanR:0.9200 rate:-0.0769 gloss:-2.2822 dloss:0.5966 dlossR:0.0803 dlossQ:0.5163
Episode:355 meanR:0.9300 rate:0.0769 gloss:-1.9830 dloss:0.8418 dlossR:0.3271 dlossQ:0.5148
Episode:356 meanR:0.9500 rate:0.0769 gloss:-1.5729 dloss:0.9319 dlossR:0.3660 dlossQ:0.5659
Episode:357 meanR:0.9500 rate:0.0000 gloss:-2.0637 dloss:0.6932 dlossR:0.2028 dlossQ:0.4905
Episode:358 meanR:0.9500 rate:0.0000 gloss:-1.6846 dloss:0.8003 dlossR:0.2617 dlossQ:0.5387
Episode:359 meanR:0.9500 rate:0.0000 gloss:-2.2903 dloss:0.6122 dlossR:0.1715 dlossQ:0.4407
Episode:360 meanR:0.9700 rate:0.0769 gloss:-1.6090 dloss:0.8975 dlossR:0.3466 dlossQ:0.5509
Episode:361 meanR:0.9600 rate:0.0000 gloss:-2.3196 dloss:0.6071 dlossR:0.1685 dlossQ:0.4385
Episode:362 meanR:0.9500 rate:0.0000 gloss:-2.4289 dloss:0.5863 dlossR:0.1616 dlossQ:0.4247
Episode:363 meanR:0.9300 rate:-0.0769 gloss:-3.1793 dloss:0.2504 dlossR:-0.0693 dlossQ:0.3197
Episode:364 meanR:0.9300 rate:0.0000 gloss:-3.9018 dloss:0.2915 dlossR:0.0592

Episode:443 meanR:0.2800 rate:0.0000 gloss:-11.0727 dloss:0.0181 dlossR:0.0005 dlossQ:0.0176
Episode:444 meanR:0.2700 rate:0.0000 gloss:-10.1782 dloss:0.0203 dlossR:0.0006 dlossQ:0.0196
Episode:445 meanR:0.2300 rate:0.0000 gloss:-12.1951 dloss:0.0157 dlossR:0.0002 dlossQ:0.0155
Episode:446 meanR:0.2000 rate:0.0000 gloss:-10.8834 dloss:0.0205 dlossR:0.0004 dlossQ:0.0201
Episode:447 meanR:0.2000 rate:-0.0769 gloss:-8.5757 dloss:-0.4512 dlossR:-0.4768 dlossQ:0.0255
Episode:448 meanR:0.2100 rate:0.0769 gloss:-9.4893 dloss:0.5648 dlossR:0.5448 dlossQ:0.0200
Episode:449 meanR:0.2100 rate:0.0000 gloss:-8.6463 dloss:0.0278 dlossR:0.0019 dlossQ:0.0259
Episode:450 meanR:0.1900 rate:0.0000 gloss:-10.5569 dloss:0.0176 dlossR:0.0005 dlossQ:0.0171
Episode:451 meanR:0.2000 rate:0.1538 gloss:-11.6203 dloss:1.3519 dlossR:1.3357 dlossQ:0.0162
Episode:452 meanR:0.1800 rate:0.0000 gloss:-12.2977 dloss:0.0182 dlossR:0.0001 dlossQ:0.0180
Episode:453 meanR:0.1500 rate:0.0000 gloss:-11.4191 dloss:0.0192 dloss

Episode:532 meanR:0.2500 rate:0.0000 gloss:-4.8003 dloss:0.1695 dlossR:0.0295 dlossQ:0.1400
Episode:533 meanR:0.2400 rate:0.0000 gloss:-3.8728 dloss:0.3034 dlossR:0.0650 dlossQ:0.2383
Episode:534 meanR:0.2300 rate:-0.1538 gloss:-4.6997 dloss:-0.3057 dlossR:-0.4653 dlossQ:0.1597
Episode:535 meanR:0.2500 rate:0.0769 gloss:-4.4173 dloss:0.4798 dlossR:0.2960 dlossQ:0.1838
Episode:536 meanR:0.2600 rate:0.0769 gloss:-4.2197 dloss:0.4964 dlossR:0.2913 dlossQ:0.2051
Episode:537 meanR:0.2700 rate:0.0000 gloss:-4.9824 dloss:0.1730 dlossR:0.0302 dlossQ:0.1428
Episode:538 meanR:0.2700 rate:0.0000 gloss:-4.5999 dloss:0.2048 dlossR:0.0373 dlossQ:0.1675
Episode:539 meanR:0.2800 rate:0.0000 gloss:-4.6257 dloss:0.2015 dlossR:0.0359 dlossQ:0.1656
Episode:540 meanR:0.2700 rate:-0.0769 gloss:-5.0526 dloss:-0.1178 dlossR:-0.2506 dlossQ:0.1328
Episode:541 meanR:0.2700 rate:0.0000 gloss:-4.0955 dloss:0.2615 dlossR:0.0524 dlossQ:0.2091
Episode:542 meanR:0.2500 rate:-0.1538 gloss:-5.3096 dloss:-0.4351 dlossR:-

Episode:620 meanR:-0.0800 rate:0.0000 gloss:-10.4572 dloss:0.0281 dlossR:0.0019 dlossQ:0.0263
Episode:621 meanR:-0.0400 rate:0.2308 gloss:-11.2947 dloss:2.0053 dlossR:1.9790 dlossQ:0.0262
Episode:622 meanR:0.0200 rate:0.5385 gloss:-8.4951 dloss:3.7202 dlossR:3.6631 dlossQ:0.0571
Episode:623 meanR:0.0200 rate:0.2308 gloss:-9.1687 dloss:1.6933 dlossR:1.6222 dlossQ:0.0711
Episode:624 meanR:0.0300 rate:0.0769 gloss:-11.5210 dloss:0.6999 dlossR:0.6638 dlossQ:0.0362
Episode:625 meanR:0.0100 rate:0.0000 gloss:-11.6532 dloss:0.0633 dlossR:0.0055 dlossQ:0.0579
Episode:626 meanR:0.0000 rate:0.0769 gloss:-14.3224 dloss:0.8758 dlossR:0.8251 dlossQ:0.0507
Episode:627 meanR:0.0300 rate:0.0769 gloss:-10.0044 dloss:0.6441 dlossR:0.5789 dlossQ:0.0653
Episode:628 meanR:0.0300 rate:0.0000 gloss:-8.1046 dloss:0.0786 dlossR:0.0093 dlossQ:0.0694
Episode:629 meanR:0.0200 rate:-0.0769 gloss:-10.1971 dloss:-0.5374 dlossR:-0.5659 dlossQ:0.0285
Episode:630 meanR:0.0300 rate:0.0769 gloss:-4.8995 dloss:0.6547 dlos

Episode:709 meanR:0.3200 rate:0.0000 gloss:-5.0954 dloss:0.1721 dlossR:0.0283 dlossQ:0.1438
Episode:710 meanR:0.3000 rate:-0.0769 gloss:-7.0114 dloss:-0.3178 dlossR:-0.3800 dlossQ:0.0622
Episode:711 meanR:0.2800 rate:-0.2308 gloss:-9.3161 dloss:-1.4886 dlossR:-1.5194 dlossQ:0.0307
Episode:712 meanR:0.2600 rate:0.0000 gloss:-6.7449 dloss:0.1032 dlossR:0.0131 dlossQ:0.0901
Episode:713 meanR:0.2200 rate:0.0000 gloss:-7.7545 dloss:0.0531 dlossR:0.0052 dlossQ:0.0479
Episode:714 meanR:0.2700 rate:0.3846 gloss:-6.7360 dloss:2.1282 dlossR:2.0560 dlossQ:0.0722
Episode:715 meanR:0.2500 rate:0.0000 gloss:-6.9531 dloss:0.0934 dlossR:0.0114 dlossQ:0.0820
Episode:716 meanR:0.2500 rate:0.0000 gloss:-7.0990 dloss:0.0788 dlossR:0.0094 dlossQ:0.0694
Episode:717 meanR:0.2500 rate:0.0000 gloss:-6.7777 dloss:0.0986 dlossR:0.0126 dlossQ:0.0861
Episode:718 meanR:0.2700 rate:0.0000 gloss:-7.8960 dloss:0.0805 dlossR:0.0078 dlossQ:0.0727
Episode:719 meanR:0.2700 rate:-0.0769 gloss:-6.4088 dloss:-0.2626 dlossR:-

Episode:797 meanR:-0.0400 rate:0.0000 gloss:-11.4510 dloss:0.0167 dlossR:0.0002 dlossQ:0.0165
Episode:798 meanR:-0.0300 rate:0.0000 gloss:-10.0290 dloss:0.0203 dlossR:0.0011 dlossQ:0.0192
Episode:799 meanR:-0.0100 rate:0.0000 gloss:-9.3520 dloss:0.0217 dlossR:0.0011 dlossQ:0.0206
Episode:800 meanR:0.0000 rate:0.0000 gloss:-10.3010 dloss:0.0178 dlossR:0.0006 dlossQ:0.0172
Episode:801 meanR:0.0100 rate:0.0000 gloss:-8.9199 dloss:0.0242 dlossR:0.0015 dlossQ:0.0227
Episode:802 meanR:0.0100 rate:0.0000 gloss:-9.5269 dloss:0.0245 dlossR:0.0014 dlossQ:0.0231
Episode:803 meanR:0.0100 rate:0.0000 gloss:-11.9282 dloss:0.0174 dlossR:0.0003 dlossQ:0.0171
Episode:804 meanR:0.0000 rate:0.0769 gloss:-11.6874 dloss:0.6859 dlossR:0.6662 dlossQ:0.0197
Episode:805 meanR:0.0000 rate:0.0000 gloss:-12.8093 dloss:0.0157 dlossR:0.0002 dlossQ:0.0155
Episode:806 meanR:0.0100 rate:0.0769 gloss:-12.4593 dloss:0.7295 dlossR:0.7094 dlossQ:0.0201
Episode:807 meanR:0.0100 rate:0.0000 gloss:-10.7480 dloss:0.0193 dloss

Episode:886 meanR:0.1300 rate:0.0000 gloss:-5.0896 dloss:0.1856 dlossR:0.0322 dlossQ:0.1534
Episode:887 meanR:0.1300 rate:-0.0769 gloss:-5.8377 dloss:-0.2002 dlossR:-0.3036 dlossQ:0.1034
Episode:888 meanR:0.1200 rate:0.0000 gloss:-5.6870 dloss:0.2069 dlossR:0.0287 dlossQ:0.1782
Episode:889 meanR:0.1100 rate:-0.0769 gloss:-5.1964 dloss:-0.1429 dlossR:-0.2625 dlossQ:0.1196
Episode:890 meanR:0.1000 rate:0.0000 gloss:-5.5803 dloss:0.1150 dlossR:0.0172 dlossQ:0.0978
Episode:891 meanR:0.1000 rate:0.0000 gloss:-5.3746 dloss:0.1322 dlossR:0.0211 dlossQ:0.1112
Episode:892 meanR:0.1000 rate:0.0000 gloss:-5.2187 dloss:0.1431 dlossR:0.0232 dlossQ:0.1199
Episode:893 meanR:0.1200 rate:0.1538 gloss:-5.6666 dloss:0.7837 dlossR:0.6806 dlossQ:0.1031
Episode:894 meanR:0.1200 rate:0.0000 gloss:-7.0863 dloss:0.0870 dlossR:0.0107 dlossQ:0.0763
Episode:895 meanR:0.1500 rate:0.1538 gloss:-6.3115 dloss:0.8805 dlossR:0.7551 dlossQ:0.1254
Episode:896 meanR:0.1400 rate:0.0000 gloss:-6.6712 dloss:0.1088 dlossR:0.0

Episode:975 meanR:0.0000 rate:-0.0769 gloss:-8.6987 dloss:-0.4545 dlossR:-0.4836 dlossQ:0.0291
Episode:976 meanR:0.0100 rate:0.0769 gloss:-8.9039 dloss:0.5392 dlossR:0.5122 dlossQ:0.0270
Episode:977 meanR:0.0400 rate:0.0769 gloss:-8.5632 dloss:0.5238 dlossR:0.4929 dlossQ:0.0309
Episode:978 meanR:0.0200 rate:-0.1538 gloss:-7.7741 dloss:-0.8089 dlossR:-0.8492 dlossQ:0.0402
Episode:979 meanR:0.0500 rate:0.2308 gloss:-7.0667 dloss:1.3217 dlossR:1.2583 dlossQ:0.0634
Episode:980 meanR:0.0400 rate:-0.0769 gloss:-7.7604 dloss:-0.3891 dlossR:-0.4278 dlossQ:0.0387
Episode:981 meanR:0.0300 rate:-0.0769 gloss:-7.2625 dloss:-0.3500 dlossR:-0.3983 dlossQ:0.0482
Episode:982 meanR:0.0400 rate:-0.0769 gloss:-6.9501 dloss:-0.3260 dlossR:-0.3788 dlossQ:0.0528
Episode:983 meanR:0.0300 rate:-0.0769 gloss:-6.8458 dloss:-0.3104 dlossR:-0.3715 dlossQ:0.0611
Episode:984 meanR:0.0300 rate:0.0000 gloss:-6.3803 dloss:0.0842 dlossR:0.0112 dlossQ:0.0729
Episode:985 meanR:0.0100 rate:-0.0769 gloss:-7.0499 dloss:-0.3

Episode:1063 meanR:0.1400 rate:0.0769 gloss:-4.0047 dloss:0.5045 dlossR:0.2828 dlossQ:0.2217
Episode:1064 meanR:0.1400 rate:0.0000 gloss:-4.8071 dloss:0.1774 dlossR:0.0310 dlossQ:0.1464
Episode:1065 meanR:0.1300 rate:-0.0769 gloss:-5.4311 dloss:-0.1609 dlossR:-0.2763 dlossQ:0.1153
Episode:1066 meanR:0.1000 rate:-0.0769 gloss:-4.5521 dloss:-0.0440 dlossR:-0.2108 dlossQ:0.1668
Episode:1067 meanR:0.0900 rate:0.0000 gloss:-4.5764 dloss:0.2023 dlossR:0.0369 dlossQ:0.1654
Episode:1068 meanR:0.0900 rate:0.0769 gloss:-4.9379 dloss:0.4610 dlossR:0.3145 dlossQ:0.1465
Episode:1069 meanR:0.1200 rate:0.0000 gloss:-5.5973 dloss:0.1178 dlossR:0.0179 dlossQ:0.1000
Episode:1070 meanR:0.1000 rate:-0.0769 gloss:-6.0453 dloss:-0.2410 dlossR:-0.3213 dlossQ:0.0803
Episode:1071 meanR:0.0700 rate:0.0000 gloss:-5.4217 dloss:0.1254 dlossR:0.0194 dlossQ:0.1060
Episode:1072 meanR:0.0600 rate:-0.0769 gloss:-6.0537 dloss:-0.2358 dlossR:-0.3200 dlossQ:0.0841
Episode:1073 meanR:0.0400 rate:-0.0769 gloss:-6.4832 dloss

Episode:1150 meanR:-0.0900 rate:-0.1538 gloss:-10.2168 dloss:-1.1096 dlossR:-1.1292 dlossQ:0.0195
Episode:1151 meanR:-0.0700 rate:0.1538 gloss:-9.4872 dloss:1.1173 dlossR:1.0957 dlossQ:0.0216
Episode:1152 meanR:-0.0700 rate:0.0000 gloss:-10.0695 dloss:0.0192 dlossR:0.0008 dlossQ:0.0185
Episode:1153 meanR:-0.0600 rate:0.0769 gloss:-8.2950 dloss:0.5106 dlossR:0.4773 dlossQ:0.0333
Episode:1154 meanR:-0.0600 rate:0.0000 gloss:-8.7688 dloss:0.0296 dlossR:0.0023 dlossQ:0.0273
Episode:1155 meanR:-0.0800 rate:-0.0769 gloss:-8.2906 dloss:-0.4288 dlossR:-0.4590 dlossQ:0.0302
Episode:1156 meanR:-0.0800 rate:0.0000 gloss:-7.7278 dloss:0.0530 dlossR:0.0058 dlossQ:0.0472
Episode:1157 meanR:-0.0900 rate:0.0000 gloss:-8.9097 dloss:0.0246 dlossR:0.0016 dlossQ:0.0230
Episode:1158 meanR:-0.0800 rate:0.0000 gloss:-7.4440 dloss:0.0538 dlossR:0.0062 dlossQ:0.0476
Episode:1159 meanR:-0.0700 rate:0.0769 gloss:-5.9123 dloss:0.4433 dlossR:0.3546 dlossQ:0.0887
Episode:1160 meanR:-0.0400 rate:0.0769 gloss:-6.0782

Episode:1238 meanR:0.2100 rate:0.0769 gloss:-3.9874 dloss:0.5271 dlossR:0.2897 dlossQ:0.2374
Episode:1239 meanR:0.2100 rate:-0.0769 gloss:-4.4233 dloss:0.0008 dlossR:-0.1929 dlossQ:0.1937
Episode:1240 meanR:0.2200 rate:0.0769 gloss:-4.1202 dloss:0.5212 dlossR:0.2932 dlossQ:0.2280
Episode:1241 meanR:0.2200 rate:0.0000 gloss:-4.0711 dloss:0.2610 dlossR:0.0521 dlossQ:0.2089
Episode:1242 meanR:0.2000 rate:-0.0769 gloss:-4.9875 dloss:-0.0922 dlossR:-0.2407 dlossQ:0.1485
Episode:1243 meanR:0.2100 rate:0.1538 gloss:-4.6352 dloss:0.7621 dlossR:0.5818 dlossQ:0.1803
Episode:1244 meanR:0.2700 rate:0.1538 gloss:-4.5668 dloss:0.7620 dlossR:0.5762 dlossQ:0.1858
Episode:1245 meanR:0.2900 rate:0.0769 gloss:-4.5590 dloss:0.4819 dlossR:0.3011 dlossQ:0.1808
Episode:1246 meanR:0.2600 rate:-0.1538 gloss:-4.9987 dloss:-0.3537 dlossR:-0.5023 dlossQ:0.1486
Episode:1247 meanR:0.2100 rate:0.0000 gloss:-5.3204 dloss:0.1513 dlossR:0.0245 dlossQ:0.1268
Episode:1248 meanR:0.2200 rate:0.0000 gloss:-4.4072 dloss:0.22

Episode:1325 meanR:-0.0300 rate:0.0769 gloss:-10.6586 dloss:0.6279 dlossR:0.6075 dlossQ:0.0204
Episode:1326 meanR:-0.0300 rate:0.0000 gloss:-10.3195 dloss:0.0191 dlossR:0.0008 dlossQ:0.0183
Episode:1327 meanR:-0.0100 rate:0.1538 gloss:-10.2992 dloss:1.2106 dlossR:1.1877 dlossQ:0.0229
Episode:1328 meanR:0.0400 rate:0.3846 gloss:-9.4573 dloss:2.8432 dlossR:2.8161 dlossQ:0.0271
Episode:1329 meanR:0.0300 rate:-0.0769 gloss:-10.1255 dloss:-0.5464 dlossR:-0.5643 dlossQ:0.0180
Episode:1330 meanR:0.0200 rate:0.0000 gloss:-7.8966 dloss:0.0349 dlossR:0.0031 dlossQ:0.0317
Episode:1331 meanR:0.0100 rate:0.0000 gloss:-8.2702 dloss:0.0315 dlossR:0.0026 dlossQ:0.0289
Episode:1332 meanR:0.0100 rate:0.0000 gloss:-7.3109 dloss:0.0452 dlossR:0.0048 dlossQ:0.0404
Episode:1333 meanR:0.0000 rate:-0.0769 gloss:-8.4702 dloss:-0.4373 dlossR:-0.4688 dlossQ:0.0315
Episode:1334 meanR:0.0100 rate:0.1538 gloss:-8.7415 dloss:1.0550 dlossR:1.0157 dlossQ:0.0393
Episode:1335 meanR:0.0200 rate:0.0000 gloss:-8.2824 dloss

Episode:1412 meanR:0.0400 rate:-0.1538 gloss:-6.8473 dloss:-0.6871 dlossR:-0.7386 dlossQ:0.0515
Episode:1413 meanR:0.0600 rate:0.0769 gloss:-6.6236 dloss:0.4442 dlossR:0.3863 dlossQ:0.0579
Episode:1414 meanR:0.0800 rate:0.1538 gloss:-8.1632 dloss:0.9851 dlossR:0.9480 dlossQ:0.0371
Episode:1415 meanR:0.0700 rate:0.0000 gloss:-7.7282 dloss:0.0404 dlossR:0.0039 dlossQ:0.0365
Episode:1416 meanR:0.0700 rate:0.0000 gloss:-7.2951 dloss:0.0483 dlossR:0.0051 dlossQ:0.0432
Episode:1417 meanR:0.0500 rate:-0.0769 gloss:-8.3168 dloss:-0.4289 dlossR:-0.4593 dlossQ:0.0304
Episode:1418 meanR:0.0500 rate:0.0000 gloss:-8.6572 dloss:0.0336 dlossR:0.0029 dlossQ:0.0307
Episode:1419 meanR:0.0400 rate:0.0000 gloss:-8.8907 dloss:0.0333 dlossR:0.0026 dlossQ:0.0307
Episode:1420 meanR:0.0300 rate:-0.0769 gloss:-9.1546 dloss:-0.4826 dlossR:-0.5071 dlossQ:0.0244
Episode:1421 meanR:0.0200 rate:-0.0769 gloss:-8.7578 dloss:-0.4540 dlossR:-0.4844 dlossQ:0.0304
Episode:1422 meanR:-0.0100 rate:-0.2308 gloss:-9.8989 dlos

Episode:1498 meanR:-0.2300 rate:-0.2308 gloss:-15.5154 dloss:-2.5638 dlossR:-2.5838 dlossQ:0.0201
Episode:1499 meanR:-0.2200 rate:0.0000 gloss:-17.9889 dloss:0.0228 dlossR:0.0000 dlossQ:0.0228
Episode:1500 meanR:-0.1800 rate:0.0000 gloss:-18.7726 dloss:0.0181 dlossR:0.0000 dlossQ:0.0181
Episode:1501 meanR:-0.1800 rate:0.0000 gloss:-13.0803 dloss:0.0176 dlossR:0.0001 dlossQ:0.0175
Episode:1502 meanR:-0.1700 rate:0.0000 gloss:-17.3395 dloss:0.0224 dlossR:0.0000 dlossQ:0.0224
Episode:1503 meanR:-0.1800 rate:-0.0769 gloss:-18.5011 dloss:-1.0141 dlossR:-1.0429 dlossQ:0.0288
Episode:1504 meanR:-0.1700 rate:0.0000 gloss:-21.7442 dloss:0.0244 dlossR:0.0000 dlossQ:0.0244
Episode:1505 meanR:-0.1700 rate:0.0000 gloss:-19.3694 dloss:0.0238 dlossR:0.0000 dlossQ:0.0238
Episode:1506 meanR:-0.1800 rate:-0.0769 gloss:-16.2237 dloss:-0.8952 dlossR:-0.9152 dlossQ:0.0199
Episode:1507 meanR:-0.1800 rate:0.0000 gloss:-25.8581 dloss:0.0311 dlossR:0.0000 dlossQ:0.0311
Episode:1508 meanR:-0.2100 rate:-0.0769 g

Episode:1584 meanR:0.0500 rate:0.0000 gloss:-11.1006 dloss:0.0440 dlossR:0.0036 dlossQ:0.0405
Episode:1585 meanR:0.0400 rate:-0.0769 gloss:-8.8875 dloss:-0.4447 dlossR:-0.4900 dlossQ:0.0453
Episode:1586 meanR:0.0400 rate:0.0000 gloss:-8.1793 dloss:0.1006 dlossR:0.0114 dlossQ:0.0892
Episode:1587 meanR:0.0200 rate:-0.1538 gloss:-9.3000 dloss:-0.9828 dlossR:-1.0240 dlossQ:0.0413
Episode:1588 meanR:0.0100 rate:0.0000 gloss:-22.4660 dloss:0.0330 dlossR:0.0001 dlossQ:0.0330
Episode:1589 meanR:-0.0100 rate:-0.0769 gloss:-23.4136 dloss:-1.2878 dlossR:-1.3118 dlossQ:0.0240
Episode:1590 meanR:-0.0600 rate:-0.3846 gloss:-10.1533 dloss:-2.6863 dlossR:-2.7163 dlossQ:0.0300
Episode:1591 meanR:-0.1200 rate:-0.4615 gloss:-12.5719 dloss:-4.0059 dlossR:-4.0456 dlossQ:0.0397
Episode:1592 meanR:-0.0900 rate:0.2308 gloss:-9.9106 dloss:1.7950 dlossR:1.7370 dlossQ:0.0579
Episode:1593 meanR:-0.0800 rate:0.0769 gloss:-17.9014 dloss:1.0461 dlossR:1.0164 dlossQ:0.0297
Episode:1594 meanR:-0.0700 rate:0.0769 gloss

Episode:1672 meanR:0.4100 rate:0.1538 gloss:-4.3308 dloss:1.0472 dlossR:0.6213 dlossQ:0.4259
Episode:1673 meanR:0.4400 rate:0.2308 gloss:-3.8680 dloss:1.2418 dlossR:0.8194 dlossQ:0.4224
Episode:1674 meanR:0.4300 rate:0.2308 gloss:-3.4952 dloss:1.2378 dlossR:0.7754 dlossQ:0.4623
Episode:1675 meanR:0.4300 rate:0.0769 gloss:-3.6622 dloss:0.7799 dlossR:0.3491 dlossQ:0.4308
Episode:1676 meanR:0.4400 rate:0.1538 gloss:-3.7319 dloss:0.9762 dlossR:0.5710 dlossQ:0.4052
Episode:1677 meanR:0.4400 rate:0.0000 gloss:-4.3111 dloss:0.3350 dlossR:0.0716 dlossQ:0.2634
Episode:1678 meanR:0.4400 rate:0.0000 gloss:-3.9635 dloss:0.3631 dlossR:0.0856 dlossQ:0.2775
Episode:1679 meanR:0.4300 rate:-0.0769 gloss:-4.6659 dloss:0.1053 dlossR:-0.1769 dlossQ:0.2823
Episode:1680 meanR:0.4200 rate:0.0000 gloss:-5.1425 dloss:0.3340 dlossR:0.0685 dlossQ:0.2655
Episode:1681 meanR:0.4200 rate:0.0000 gloss:-4.7388 dloss:0.3821 dlossR:0.0828 dlossQ:0.2993
Episode:1682 meanR:0.4200 rate:0.0000 gloss:-3.6490 dloss:0.4905 dlo

Episode:1760 meanR:0.4100 rate:0.0769 gloss:-6.1956 dloss:0.4927 dlossR:0.3733 dlossQ:0.1194
Episode:1761 meanR:0.4000 rate:0.0000 gloss:-6.4805 dloss:0.0847 dlossR:0.0116 dlossQ:0.0731
Episode:1762 meanR:0.3700 rate:0.0769 gloss:-5.7754 dloss:0.4419 dlossR:0.3461 dlossQ:0.0959
Episode:1763 meanR:0.3800 rate:0.0000 gloss:-6.0796 dloss:0.1034 dlossR:0.0150 dlossQ:0.0884
Episode:1764 meanR:0.3500 rate:0.0769 gloss:-6.9573 dloss:0.4930 dlossR:0.4086 dlossQ:0.0844
Episode:1765 meanR:0.3300 rate:0.0000 gloss:-5.0849 dloss:0.1883 dlossR:0.0309 dlossQ:0.1574
Episode:1766 meanR:0.2800 rate:-0.0769 gloss:-6.7212 dloss:-0.2868 dlossR:-0.3596 dlossQ:0.0728
Episode:1767 meanR:0.3000 rate:0.0769 gloss:-6.5405 dloss:0.4827 dlossR:0.3880 dlossQ:0.0947
Episode:1768 meanR:0.3100 rate:0.0769 gloss:-6.5558 dloss:0.4613 dlossR:0.3860 dlossQ:0.0754
Episode:1769 meanR:0.3300 rate:0.0769 gloss:-6.1174 dloss:0.4602 dlossR:0.3650 dlossQ:0.0952
Episode:1770 meanR:0.3400 rate:0.1538 gloss:-5.8996 dloss:0.8143 dl

Episode:1848 meanR:-0.1400 rate:0.0000 gloss:-24.0979 dloss:0.0295 dlossR:0.0000 dlossQ:0.0295
Episode:1849 meanR:-0.1400 rate:0.0000 gloss:-19.7683 dloss:0.0265 dlossR:0.0000 dlossQ:0.0265
Episode:1850 meanR:-0.1400 rate:0.0000 gloss:-21.4960 dloss:0.0274 dlossR:0.0000 dlossQ:0.0274
Episode:1851 meanR:-0.1400 rate:0.0769 gloss:-24.8428 dloss:1.4393 dlossR:1.4064 dlossQ:0.0329
Episode:1852 meanR:-0.1700 rate:-0.1538 gloss:-30.4940 dloss:-3.3817 dlossR:-3.4122 dlossQ:0.0304
Episode:1853 meanR:-0.1600 rate:0.0769 gloss:-30.4425 dloss:1.7521 dlossR:1.7195 dlossQ:0.0326
Episode:1854 meanR:-0.1400 rate:0.0769 gloss:-23.5431 dloss:1.3644 dlossR:1.3338 dlossQ:0.0306
Episode:1855 meanR:-0.1600 rate:0.0000 gloss:-26.1316 dloss:0.0338 dlossR:0.0000 dlossQ:0.0338
Episode:1856 meanR:-0.1700 rate:-0.0769 gloss:-26.3556 dloss:-1.4429 dlossR:-1.4793 dlossQ:0.0364
Episode:1857 meanR:-0.1400 rate:0.1538 gloss:-20.9209 dloss:2.4151 dlossR:2.3875 dlossQ:0.0276
Episode:1858 meanR:-0.1400 rate:0.0000 gloss

Episode:1934 meanR:-0.1200 rate:0.0000 gloss:-24.7729 dloss:0.0294 dlossR:0.0000 dlossQ:0.0294
Episode:1935 meanR:-0.1600 rate:-0.1538 gloss:-27.9422 dloss:-3.1213 dlossR:-3.1595 dlossQ:0.0382
Episode:1936 meanR:-0.1300 rate:0.1538 gloss:-33.2793 dloss:3.8436 dlossR:3.8076 dlossQ:0.0360
Episode:1937 meanR:-0.1200 rate:0.0000 gloss:-34.1199 dloss:0.0363 dlossR:0.0000 dlossQ:0.0363
Episode:1938 meanR:-0.1200 rate:0.0000 gloss:-39.3857 dloss:0.0541 dlossR:0.0000 dlossQ:0.0541
Episode:1939 meanR:-0.1400 rate:-0.1538 gloss:-34.1050 dloss:-3.7928 dlossR:-3.8388 dlossQ:0.0460
Episode:1940 meanR:-0.1700 rate:-0.1538 gloss:-38.0815 dloss:-4.2498 dlossR:-4.2858 dlossQ:0.0360
Episode:1941 meanR:-0.1600 rate:0.0000 gloss:-29.5557 dloss:0.0353 dlossR:0.0000 dlossQ:0.0353
Episode:1942 meanR:-0.0900 rate:0.0000 gloss:-35.7375 dloss:0.0449 dlossR:0.0000 dlossQ:0.0449
Episode:1943 meanR:-0.0800 rate:0.0000 gloss:-35.4049 dloss:0.0432 dlossR:0.0000 dlossQ:0.0432
Episode:1944 meanR:-0.0700 rate:0.0769 gl

Episode:2020 meanR:-0.1600 rate:-0.1538 gloss:-85.7985 dloss:-9.5301 dlossR:-9.6471 dlossQ:0.1170
Episode:2021 meanR:-0.1500 rate:0.0769 gloss:-80.9200 dloss:4.6735 dlossR:4.5700 dlossQ:0.1035
Episode:2022 meanR:-0.1600 rate:-0.0769 gloss:-86.2160 dloss:-4.7441 dlossR:-4.8516 dlossQ:0.1075
Episode:2023 meanR:-0.1700 rate:-0.0769 gloss:-78.6548 dloss:-4.3359 dlossR:-4.4304 dlossQ:0.0945
Episode:2024 meanR:-0.1600 rate:0.0769 gloss:-88.4179 dloss:5.1118 dlossR:4.9901 dlossQ:0.1217
Episode:2025 meanR:-0.1400 rate:0.1538 gloss:-98.2113 dloss:11.2449 dlossR:11.0971 dlossQ:0.1477
Episode:2026 meanR:-0.1600 rate:-0.1538 gloss:-95.1262 dloss:-10.5767 dlossR:-10.6953 dlossQ:0.1187
Episode:2027 meanR:-0.1800 rate:-0.1538 gloss:-95.1594 dloss:-10.5523 dlossR:-10.7005 dlossQ:0.1482
Episode:2028 meanR:-0.1900 rate:0.0000 gloss:-91.2840 dloss:0.1028 dlossR:0.0000 dlossQ:0.1028
Episode:2029 meanR:-0.1900 rate:0.0000 gloss:-94.5579 dloss:0.0973 dlossR:0.0000 dlossQ:0.0973
Episode:2030 meanR:-0.1800 ra

Episode:2105 meanR:-0.1000 rate:0.0769 gloss:-162.6706 dloss:9.3199 dlossR:9.1819 dlossQ:0.1380
Episode:2106 meanR:-0.1100 rate:0.0000 gloss:-111.8051 dloss:0.1051 dlossR:0.0000 dlossQ:0.1051
Episode:2107 meanR:-0.1100 rate:0.0000 gloss:-106.1728 dloss:0.1266 dlossR:0.0000 dlossQ:0.1266
Episode:2108 meanR:-0.1300 rate:-0.0769 gloss:-192.6159 dloss:-10.5842 dlossR:-10.8829 dlossQ:0.2987
Episode:2109 meanR:-0.1000 rate:0.0000 gloss:-160.1417 dloss:0.1909 dlossR:0.0000 dlossQ:0.1909
Episode:2110 meanR:-0.1100 rate:-0.0769 gloss:-150.0671 dloss:-8.2641 dlossR:-8.4819 dlossQ:0.2179
Episode:2111 meanR:-0.1100 rate:0.0769 gloss:-126.2272 dloss:7.2920 dlossR:7.1325 dlossQ:0.1596
Episode:2112 meanR:-0.0700 rate:0.0769 gloss:-126.1130 dloss:7.3085 dlossR:7.1220 dlossQ:0.1865
Episode:2113 meanR:-0.0900 rate:-0.0769 gloss:-124.1248 dloss:-6.8504 dlossR:-7.0024 dlossQ:0.1520
Episode:2114 meanR:-0.0900 rate:0.0000 gloss:-125.2106 dloss:0.2049 dlossR:0.0000 dlossQ:0.2049
Episode:2115 meanR:-0.0800 ra

Episode:2191 meanR:0.0700 rate:0.0000 gloss:-39.1597 dloss:0.0872 dlossR:0.0006 dlossQ:0.0866
Episode:2192 meanR:0.0600 rate:-0.0769 gloss:-38.7881 dloss:-2.1314 dlossR:-2.1936 dlossQ:0.0622
Episode:2193 meanR:0.0800 rate:0.1538 gloss:-59.8905 dloss:6.8405 dlossR:6.7897 dlossQ:0.0508
Episode:2194 meanR:0.0900 rate:0.0769 gloss:-95.2687 dloss:5.5119 dlossR:5.3752 dlossQ:0.1367
Episode:2195 meanR:0.0600 rate:-0.1538 gloss:-50.2118 dloss:-5.6036 dlossR:-5.6488 dlossQ:0.0451
Episode:2196 meanR:0.0600 rate:0.0000 gloss:-64.3478 dloss:0.1239 dlossR:0.0007 dlossQ:0.1231
Episode:2197 meanR:0.0500 rate:-0.0769 gloss:-175.6075 dloss:-9.4402 dlossR:-9.8628 dlossQ:0.4226
Episode:2198 meanR:0.0500 rate:0.0000 gloss:-137.8691 dloss:0.2096 dlossR:0.0000 dlossQ:0.2096
Episode:2199 meanR:0.0300 rate:0.0000 gloss:-61.8335 dloss:0.0633 dlossR:0.0000 dlossQ:0.0632
Episode:2200 meanR:0.0200 rate:-0.0769 gloss:-30.2191 dloss:-1.6841 dlossR:-1.7160 dlossQ:0.0319
Episode:2201 meanR:0.0000 rate:-0.2308 gloss:-

Episode:2278 meanR:0.7100 rate:0.2308 gloss:0.2792 dloss:1.4572 dlossR:0.7417 dlossQ:0.7155
Episode:2279 meanR:0.7300 rate:0.0769 gloss:-7.4489 dloss:1.6351 dlossR:0.8973 dlossQ:0.7378
Episode:2280 meanR:0.7200 rate:0.0000 gloss:-11.8971 dloss:1.0343 dlossR:0.3511 dlossQ:0.6832
Episode:2281 meanR:0.7300 rate:0.0769 gloss:-2.9697 dloss:1.2127 dlossR:0.5523 dlossQ:0.6604
Episode:2282 meanR:0.7100 rate:-0.1538 gloss:-0.4841 dloss:1.6214 dlossR:0.9753 dlossQ:0.6461
Episode:2283 meanR:0.7100 rate:0.0000 gloss:-1.2427 dloss:1.0530 dlossR:0.4262 dlossQ:0.6268
Episode:2284 meanR:0.7000 rate:-0.0769 gloss:-2.7427 dloss:0.5060 dlossR:0.0530 dlossQ:0.4530
Episode:2285 meanR:0.6900 rate:-0.1538 gloss:-3.6694 dloss:-0.0526 dlossR:-0.3083 dlossQ:0.2558
Episode:2286 meanR:0.6800 rate:0.0000 gloss:-4.8169 dloss:0.1721 dlossR:0.0296 dlossQ:0.1425
Episode:2287 meanR:0.6800 rate:0.0000 gloss:-5.9864 dloss:0.1206 dlossR:0.0186 dlossQ:0.1020
Episode:2288 meanR:0.7300 rate:0.3846 gloss:-4.9678 dloss:1.7392 

Episode:2366 meanR:0.8600 rate:0.0769 gloss:-5.5008 dloss:0.4875 dlossR:0.3418 dlossQ:0.1457
Episode:2367 meanR:0.8400 rate:0.1538 gloss:-5.2382 dloss:0.8036 dlossR:0.6440 dlossQ:0.1596
Episode:2368 meanR:0.8500 rate:0.1538 gloss:-4.8765 dloss:0.7918 dlossR:0.6073 dlossQ:0.1845
Episode:2369 meanR:0.8600 rate:0.1538 gloss:-4.5074 dloss:0.7473 dlossR:0.5719 dlossQ:0.1754
Episode:2370 meanR:0.8600 rate:0.0000 gloss:-5.1448 dloss:0.1592 dlossR:0.0262 dlossQ:0.1330
Episode:2371 meanR:0.8200 rate:0.0000 gloss:-4.4635 dloss:0.2404 dlossR:0.0443 dlossQ:0.1961
Episode:2372 meanR:0.8100 rate:0.1538 gloss:-4.8084 dloss:0.7770 dlossR:0.6033 dlossQ:0.1736
Episode:2373 meanR:0.7400 rate:0.0000 gloss:-10.5119 dloss:0.1542 dlossR:0.0252 dlossQ:0.1291
Episode:2374 meanR:0.7400 rate:0.2308 gloss:-10.4576 dloss:2.0212 dlossR:1.8433 dlossQ:0.1779
Episode:2375 meanR:0.6600 rate:-0.2308 gloss:-7.1356 dloss:-0.9645 dlossR:-1.1140 dlossQ:0.1494
Episode:2376 meanR:0.6500 rate:-0.0769 gloss:-9.2901 dloss:-0.341

Episode:2454 meanR:1.2600 rate:0.2308 gloss:-3.8248 dloss:1.1380 dlossR:0.7832 dlossQ:0.3548
Episode:2455 meanR:1.3100 rate:0.3077 gloss:-3.1870 dloss:1.2961 dlossR:0.9075 dlossQ:0.3886
Episode:2456 meanR:1.3700 rate:0.4615 gloss:-2.8564 dloss:1.7688 dlossR:1.2889 dlossQ:0.4799
Episode:2457 meanR:1.3800 rate:-0.0769 gloss:-4.6356 dloss:0.1944 dlossR:-0.1577 dlossQ:0.3521
Episode:2458 meanR:1.3500 rate:0.0769 gloss:-7.8297 dloss:1.3424 dlossR:0.5317 dlossQ:0.8106
Episode:2459 meanR:1.3500 rate:0.2308 gloss:-2.7659 dloss:1.1005 dlossR:0.6498 dlossQ:0.4506
Episode:2460 meanR:1.3700 rate:0.3846 gloss:-2.0027 dloss:1.4408 dlossR:0.8898 dlossQ:0.5510
Episode:2461 meanR:1.4100 rate:-0.0769 gloss:-2.8821 dloss:0.4167 dlossR:0.0024 dlossQ:0.4143
Episode:2462 meanR:1.4400 rate:0.2308 gloss:-2.1509 dloss:1.1057 dlossR:0.5909 dlossQ:0.5147
Episode:2463 meanR:1.4400 rate:0.2308 gloss:-1.7734 dloss:1.0862 dlossR:0.5536 dlossQ:0.5327
Episode:2464 meanR:1.4500 rate:0.0769 gloss:-1.9456 dloss:0.8507 dl

Episode:2542 meanR:0.5700 rate:0.0000 gloss:-4.5731 dloss:0.2359 dlossR:0.0410 dlossQ:0.1949
Episode:2543 meanR:0.5600 rate:0.0000 gloss:-5.5106 dloss:0.1645 dlossR:0.0262 dlossQ:0.1383
Episode:2544 meanR:0.5300 rate:-0.0769 gloss:-5.6588 dloss:-0.1799 dlossR:-0.2911 dlossQ:0.1112
Episode:2545 meanR:0.5400 rate:0.0000 gloss:-6.6591 dloss:0.1229 dlossR:0.0156 dlossQ:0.1073
Episode:2546 meanR:0.5300 rate:0.0000 gloss:-6.1909 dloss:0.1233 dlossR:0.0183 dlossQ:0.1050
Episode:2547 meanR:0.5400 rate:0.0000 gloss:-6.2607 dloss:0.1125 dlossR:0.0159 dlossQ:0.0966
Episode:2548 meanR:0.5200 rate:0.0000 gloss:-6.2805 dloss:0.1025 dlossR:0.0148 dlossQ:0.0877
Episode:2549 meanR:0.5400 rate:0.0000 gloss:-6.1256 dloss:0.1297 dlossR:0.0189 dlossQ:0.1109
Episode:2550 meanR:0.5300 rate:0.0000 gloss:-6.7455 dloss:0.0775 dlossR:0.0100 dlossQ:0.0676
Episode:2551 meanR:0.5300 rate:0.0769 gloss:-5.7498 dloss:0.4491 dlossR:0.3465 dlossQ:0.1025
Episode:2552 meanR:0.5400 rate:0.0769 gloss:-6.0833 dloss:0.4639 dl

Episode:2629 meanR:-0.0600 rate:0.0000 gloss:-14.1723 dloss:0.0163 dlossR:0.0001 dlossQ:0.0162
Episode:2630 meanR:-0.0500 rate:-0.0769 gloss:-14.7697 dloss:-0.8098 dlossR:-0.8246 dlossQ:0.0148
Episode:2631 meanR:-0.0500 rate:0.0769 gloss:-14.1368 dloss:0.8231 dlossR:0.8008 dlossQ:0.0223
Episode:2632 meanR:-0.0700 rate:0.1538 gloss:-13.9195 dloss:1.6098 dlossR:1.5897 dlossQ:0.0201
Episode:2633 meanR:-0.0700 rate:0.0000 gloss:-14.4188 dloss:0.0144 dlossR:0.0001 dlossQ:0.0143
Episode:2634 meanR:-0.0700 rate:0.0000 gloss:-12.9686 dloss:0.0202 dlossR:0.0002 dlossQ:0.0200
Episode:2635 meanR:-0.0500 rate:0.1538 gloss:-11.9755 dloss:1.3889 dlossR:1.3711 dlossQ:0.0178
Episode:2636 meanR:-0.0400 rate:0.0000 gloss:-13.6081 dloss:0.0168 dlossR:0.0002 dlossQ:0.0166
Episode:2637 meanR:-0.0300 rate:0.1538 gloss:-13.0778 dloss:1.5112 dlossR:1.4949 dlossQ:0.0163
Episode:2638 meanR:-0.0800 rate:-0.1538 gloss:-15.0074 dloss:-1.6474 dlossR:-1.6620 dlossQ:0.0147
Episode:2639 meanR:-0.0900 rate:-0.1538 glos

Episode:2716 meanR:0.3300 rate:0.0769 gloss:-4.8344 dloss:0.5128 dlossR:0.3195 dlossQ:0.1933
Episode:2717 meanR:0.3100 rate:-0.0769 gloss:-5.2752 dloss:-0.1260 dlossR:-0.2606 dlossQ:0.1346
Episode:2718 meanR:0.2600 rate:-0.3077 gloss:-5.7671 dloss:-1.0861 dlossR:-1.1795 dlossQ:0.0933
Episode:2719 meanR:0.2700 rate:0.0769 gloss:-6.5789 dloss:0.5128 dlossR:0.3963 dlossQ:0.1166
Episode:2720 meanR:0.2700 rate:0.0000 gloss:-7.8049 dloss:0.1126 dlossR:0.0160 dlossQ:0.0966
Episode:2721 meanR:0.2900 rate:0.0769 gloss:-5.4095 dloss:0.4983 dlossR:0.3416 dlossQ:0.1567
Episode:2722 meanR:0.3200 rate:0.0769 gloss:-5.3566 dloss:0.4370 dlossR:0.3261 dlossQ:0.1109
Episode:2723 meanR:0.3100 rate:0.0000 gloss:-6.5874 dloss:0.1411 dlossR:0.0204 dlossQ:0.1207
Episode:2724 meanR:0.3300 rate:0.0000 gloss:-7.3235 dloss:0.0934 dlossR:0.0105 dlossQ:0.0829
Episode:2725 meanR:0.3100 rate:0.0000 gloss:-7.3337 dloss:0.1224 dlossR:0.0150 dlossQ:0.1075
Episode:2726 meanR:0.3700 rate:0.1538 gloss:-7.8034 dloss:1.0285

Episode:2803 meanR:-0.0300 rate:0.0769 gloss:-8.0245 dloss:0.4974 dlossR:0.4608 dlossQ:0.0367
Episode:2804 meanR:-0.0400 rate:0.0000 gloss:-24.5108 dloss:0.0578 dlossR:0.0003 dlossQ:0.0575
Episode:2805 meanR:-0.0300 rate:-0.0769 gloss:-11.1760 dloss:-0.5971 dlossR:-0.6217 dlossQ:0.0246
Episode:2806 meanR:-0.0200 rate:0.0769 gloss:-9.1008 dloss:0.5533 dlossR:0.5212 dlossQ:0.0322
Episode:2807 meanR:-0.0100 rate:-0.0769 gloss:-7.8112 dloss:-0.3825 dlossR:-0.4270 dlossQ:0.0445
Episode:2808 meanR:0.0000 rate:0.0769 gloss:-6.8155 dloss:0.4510 dlossR:0.3969 dlossQ:0.0541
Episode:2809 meanR:0.0200 rate:0.2308 gloss:-8.7731 dloss:1.5904 dlossR:1.5431 dlossQ:0.0473
Episode:2810 meanR:0.0200 rate:0.0769 gloss:-14.3501 dloss:0.8396 dlossR:0.8141 dlossQ:0.0255
Episode:2811 meanR:0.0400 rate:0.2308 gloss:-13.1829 dloss:2.3205 dlossR:2.2762 dlossQ:0.0443
Episode:2812 meanR:0.0300 rate:0.0000 gloss:-7.2753 dloss:0.0568 dlossR:0.0064 dlossQ:0.0504
Episode:2813 meanR:0.0300 rate:-0.0769 gloss:-7.6667 dl

Episode:2891 meanR:0.2300 rate:0.0000 gloss:-7.4621 dloss:0.0735 dlossR:0.0073 dlossQ:0.0663
Episode:2892 meanR:0.2300 rate:0.0000 gloss:-5.6727 dloss:0.1292 dlossR:0.0207 dlossQ:0.1085
Episode:2893 meanR:0.2400 rate:0.0769 gloss:-6.2220 dloss:0.4519 dlossR:0.3691 dlossQ:0.0828
Episode:2894 meanR:0.2700 rate:0.2308 gloss:-5.5051 dloss:1.1421 dlossR:1.0068 dlossQ:0.1353
Episode:2895 meanR:0.2700 rate:0.0000 gloss:-5.7737 dloss:0.1519 dlossR:0.0237 dlossQ:0.1282
Episode:2896 meanR:0.2700 rate:0.0769 gloss:-7.5214 dloss:0.5357 dlossR:0.4422 dlossQ:0.0935
Episode:2897 meanR:0.2900 rate:0.0000 gloss:-6.4185 dloss:0.1029 dlossR:0.0138 dlossQ:0.0891
Episode:2898 meanR:0.2700 rate:-0.0769 gloss:-7.2520 dloss:-0.3145 dlossR:-0.3898 dlossQ:0.0753
Episode:2899 meanR:0.2800 rate:0.0000 gloss:-6.4181 dloss:0.0789 dlossR:0.0105 dlossQ:0.0684
Episode:2900 meanR:0.3000 rate:0.0000 gloss:-7.9249 dloss:0.0685 dlossR:0.0078 dlossQ:0.0607
Episode:2901 meanR:0.2700 rate:-0.0769 gloss:-6.4482 dloss:-0.2206 

Episode:2979 meanR:0.0300 rate:0.0769 gloss:-6.2951 dloss:0.4479 dlossR:0.3721 dlossQ:0.0758
Episode:2980 meanR:0.0200 rate:-0.0769 gloss:-4.9561 dloss:-0.0937 dlossR:-0.2394 dlossQ:0.1457
Episode:2981 meanR:0.0200 rate:0.0000 gloss:-7.9753 dloss:0.0573 dlossR:0.0071 dlossQ:0.0503
Episode:2982 meanR:0.0300 rate:0.0769 gloss:-6.5892 dloss:0.4565 dlossR:0.3865 dlossQ:0.0700
Episode:2983 meanR:0.0200 rate:-0.0769 gloss:-11.2666 dloss:-0.5436 dlossR:-0.6222 dlossQ:0.0786
Episode:2984 meanR:0.0300 rate:0.0769 gloss:-8.0762 dloss:0.5224 dlossR:0.4676 dlossQ:0.0548
Episode:2985 meanR:0.0200 rate:-0.2308 gloss:-8.5077 dloss:-1.3416 dlossR:-1.3775 dlossQ:0.0359
Episode:2986 meanR:0.0400 rate:0.0769 gloss:-6.5580 dloss:0.4560 dlossR:0.3856 dlossQ:0.0704
Episode:2987 meanR:0.0500 rate:0.0000 gloss:-12.9806 dloss:0.0187 dlossR:0.0003 dlossQ:0.0185
Episode:2988 meanR:0.0500 rate:-0.0769 gloss:-8.1920 dloss:-0.4062 dlossR:-0.4493 dlossQ:0.0431
Episode:2989 meanR:0.0100 rate:-0.2308 gloss:-12.5789 dl

Episode:3065 meanR:-0.2100 rate:-0.0769 gloss:-24.5523 dloss:-1.3490 dlossR:-1.3747 dlossQ:0.0257
Episode:3066 meanR:-0.2000 rate:0.0000 gloss:-13.3341 dloss:0.0177 dlossR:0.0001 dlossQ:0.0176
Episode:3067 meanR:-0.2400 rate:0.0000 gloss:-18.2520 dloss:0.0208 dlossR:0.0000 dlossQ:0.0208
Episode:3068 meanR:-0.2300 rate:0.0000 gloss:-19.7149 dloss:0.0248 dlossR:0.0000 dlossQ:0.0248
Episode:3069 meanR:-0.2200 rate:0.0769 gloss:-20.7349 dloss:1.1965 dlossR:1.1721 dlossQ:0.0244
Episode:3070 meanR:-0.2300 rate:-0.0769 gloss:-24.9003 dloss:-1.3615 dlossR:-1.3924 dlossQ:0.0309
Episode:3071 meanR:-0.2300 rate:0.0000 gloss:-23.5143 dloss:0.0307 dlossR:0.0000 dlossQ:0.0307
Episode:3072 meanR:-0.2400 rate:0.0000 gloss:-23.1809 dloss:0.0311 dlossR:0.0000 dlossQ:0.0311
Episode:3073 meanR:-0.2400 rate:0.0000 gloss:-17.6295 dloss:0.0212 dlossR:0.0000 dlossQ:0.0212
Episode:3074 meanR:-0.2700 rate:-0.1538 gloss:-23.1083 dloss:-2.5594 dlossR:-2.5805 dlossQ:0.0211
Episode:3075 meanR:-0.2400 rate:-0.1538 g

Episode:3151 meanR:-0.1500 rate:0.0769 gloss:-28.4983 dloss:1.6407 dlossR:1.6098 dlossQ:0.0309
Episode:3152 meanR:-0.1400 rate:0.0769 gloss:-31.5798 dloss:1.8214 dlossR:1.7846 dlossQ:0.0368
Episode:3153 meanR:-0.1400 rate:0.0000 gloss:-46.3246 dloss:0.0699 dlossR:0.0000 dlossQ:0.0699
Episode:3154 meanR:-0.1500 rate:0.0000 gloss:-40.4608 dloss:0.0458 dlossR:0.0000 dlossQ:0.0458
Episode:3155 meanR:-0.1600 rate:0.0000 gloss:-35.0691 dloss:0.0462 dlossR:0.0000 dlossQ:0.0462
Episode:3156 meanR:-0.1500 rate:0.0000 gloss:-34.3954 dloss:0.0404 dlossR:0.0000 dlossQ:0.0404
Episode:3157 meanR:-0.1300 rate:0.1538 gloss:-41.0499 dloss:4.6882 dlossR:4.6403 dlossQ:0.0479
Episode:3158 meanR:-0.1200 rate:0.0769 gloss:-67.0830 dloss:3.8800 dlossR:3.7777 dlossQ:0.1024
Episode:3159 meanR:-0.1300 rate:-0.0769 gloss:-37.9584 dloss:-2.0845 dlossR:-2.1296 dlossQ:0.0451
Episode:3160 meanR:-0.1300 rate:0.0000 gloss:-23.2524 dloss:0.0263 dlossR:0.0000 dlossQ:0.0263
Episode:3161 meanR:-0.1500 rate:-0.1538 gloss:-

Episode:3237 meanR:-0.0500 rate:0.0769 gloss:-41.0760 dloss:2.3644 dlossR:2.3163 dlossQ:0.0480
Episode:3238 meanR:-0.0500 rate:0.0000 gloss:-35.3024 dloss:0.0370 dlossR:0.0000 dlossQ:0.0370
Episode:3239 meanR:-0.0400 rate:0.0000 gloss:-34.3507 dloss:0.0510 dlossR:0.0000 dlossQ:0.0510
Episode:3240 meanR:-0.0300 rate:0.0000 gloss:-51.9182 dloss:0.0668 dlossR:0.0000 dlossQ:0.0668
Episode:3241 meanR:-0.0300 rate:0.0000 gloss:-35.0704 dloss:0.0338 dlossR:0.0000 dlossQ:0.0338
Episode:3242 meanR:-0.0200 rate:0.0769 gloss:-67.0501 dloss:3.8542 dlossR:3.7735 dlossQ:0.0807
Episode:3243 meanR:-0.0200 rate:0.0000 gloss:-41.4635 dloss:0.0595 dlossR:0.0000 dlossQ:0.0595
Episode:3244 meanR:-0.0200 rate:0.0000 gloss:-35.4745 dloss:0.0420 dlossR:0.0000 dlossQ:0.0420
Episode:3245 meanR:-0.0100 rate:0.0000 gloss:-41.2111 dloss:0.0545 dlossR:0.0000 dlossQ:0.0545
Episode:3246 meanR:-0.0100 rate:0.0000 gloss:-41.5015 dloss:0.0577 dlossR:0.0000 dlossQ:0.0577
Episode:3247 meanR:0.0000 rate:0.0769 gloss:-29.49

Episode:3323 meanR:-0.1500 rate:-0.0769 gloss:-96.8614 dloss:-5.2710 dlossR:-5.4396 dlossQ:0.1686
Episode:3324 meanR:-0.1400 rate:0.0000 gloss:-94.7557 dloss:0.0984 dlossR:0.0000 dlossQ:0.0984
Episode:3325 meanR:-0.1500 rate:-0.0769 gloss:-100.5717 dloss:-5.5326 dlossR:-5.6583 dlossQ:0.1258
Episode:3326 meanR:-0.1600 rate:0.0000 gloss:-92.2153 dloss:0.1150 dlossR:0.0000 dlossQ:0.1150
Episode:3327 meanR:-0.1700 rate:0.0000 gloss:-68.3830 dloss:0.0802 dlossR:0.0000 dlossQ:0.0802
Episode:3328 meanR:-0.1300 rate:0.2308 gloss:-111.5602 dloss:18.9765 dlossR:18.8923 dlossQ:0.0843
Episode:3329 meanR:-0.1300 rate:0.0000 gloss:-127.5262 dloss:0.0999 dlossR:0.0000 dlossQ:0.0999
Episode:3330 meanR:-0.1300 rate:0.0000 gloss:-98.8510 dloss:0.1184 dlossR:0.0000 dlossQ:0.1184
Episode:3331 meanR:-0.1300 rate:0.0000 gloss:-98.3648 dloss:0.1622 dlossR:0.0000 dlossQ:0.1622
Episode:3332 meanR:-0.1300 rate:0.0000 gloss:-105.9930 dloss:0.1745 dlossR:0.0000 dlossQ:0.1745
Episode:3333 meanR:-0.1500 rate:-0.076

Episode:3408 meanR:-0.1600 rate:0.1538 gloss:-227.3242 dloss:26.0370 dlossR:25.7424 dlossQ:0.2946
Episode:3409 meanR:-0.1600 rate:0.0000 gloss:-288.5386 dloss:0.2938 dlossR:0.0000 dlossQ:0.2938
Episode:3410 meanR:-0.1600 rate:0.0000 gloss:-482.4001 dloss:0.7586 dlossR:0.0000 dlossQ:0.7586
Episode:3411 meanR:-0.1700 rate:-0.0769 gloss:-217.1091 dloss:-11.9891 dlossR:-12.2779 dlossQ:0.2889
Episode:3412 meanR:-0.1700 rate:0.0000 gloss:-237.2742 dloss:0.3759 dlossR:0.0000 dlossQ:0.3759
Episode:3413 meanR:-0.1500 rate:0.0769 gloss:-256.4496 dloss:14.7757 dlossR:14.4779 dlossQ:0.2978
Episode:3414 meanR:-0.1400 rate:0.0769 gloss:-383.0065 dloss:21.9545 dlossR:21.6162 dlossQ:0.3383
Episode:3415 meanR:-0.1100 rate:0.1538 gloss:-305.8474 dloss:34.8890 dlossR:34.5385 dlossQ:0.3505
Episode:3416 meanR:-0.1000 rate:0.0769 gloss:-229.0205 dloss:13.2023 dlossR:12.9182 dlossQ:0.2840
Episode:3417 meanR:-0.1000 rate:0.0000 gloss:-163.9168 dloss:0.2038 dlossR:0.0000 dlossQ:0.2038
Episode:3418 meanR:-0.100

Episode:3493 meanR:-0.0700 rate:-0.1538 gloss:-226.6364 dloss:-25.3614 dlossR:-25.7016 dlossQ:0.3403
Episode:3494 meanR:-0.0600 rate:-0.0769 gloss:-251.6595 dloss:-13.9394 dlossR:-14.2557 dlossQ:0.3163
Episode:3495 meanR:-0.0500 rate:0.0769 gloss:-242.2744 dloss:14.0392 dlossR:13.7299 dlossQ:0.3094
Episode:3496 meanR:-0.0900 rate:-0.1538 gloss:-206.6779 dloss:-23.1210 dlossR:-23.3908 dlossQ:0.2698
Episode:3497 meanR:-0.0900 rate:0.0000 gloss:-404.2993 dloss:0.2858 dlossR:0.0000 dlossQ:0.2858
Episode:3498 meanR:-0.1000 rate:0.0000 gloss:-432.2366 dloss:0.3387 dlossR:0.0000 dlossQ:0.3387
Episode:3499 meanR:-0.1100 rate:0.0000 gloss:-302.3213 dloss:0.3384 dlossR:0.0000 dlossQ:0.3384
Episode:3500 meanR:-0.1100 rate:0.0000 gloss:-359.0581 dloss:0.4491 dlossR:0.0000 dlossQ:0.4491
Episode:3501 meanR:-0.1100 rate:0.0000 gloss:-313.8298 dloss:0.4070 dlossR:0.0000 dlossQ:0.4070
Episode:3502 meanR:-0.0900 rate:0.0000 gloss:-514.1290 dloss:0.6211 dlossR:0.0000 dlossQ:0.6211
Episode:3503 meanR:-0.0

Episode:3578 meanR:-0.0900 rate:0.0000 gloss:-311.2528 dloss:0.3643 dlossR:0.0000 dlossQ:0.3643
Episode:3579 meanR:-0.1300 rate:-0.3077 gloss:-334.9077 dloss:-74.8510 dlossR:-75.2846 dlossQ:0.4336
Episode:3580 meanR:-0.1300 rate:0.0000 gloss:-435.8096 dloss:0.3790 dlossR:0.0000 dlossQ:0.3790
Episode:3581 meanR:-0.1200 rate:0.0769 gloss:-408.1109 dloss:23.4425 dlossR:22.9647 dlossQ:0.4778
Episode:3582 meanR:-0.1200 rate:0.0000 gloss:-345.5548 dloss:0.3917 dlossR:0.0000 dlossQ:0.3917
Episode:3583 meanR:-0.1200 rate:0.0000 gloss:-427.6831 dloss:0.5051 dlossR:0.0000 dlossQ:0.5051
Episode:3584 meanR:-0.1400 rate:-0.3077 gloss:-437.7680 dloss:-97.9963 dlossR:-98.5470 dlossQ:0.5506
Episode:3585 meanR:-0.1600 rate:0.0000 gloss:-519.8116 dloss:0.7929 dlossR:0.0000 dlossQ:0.7929
Episode:3586 meanR:-0.1400 rate:0.1538 gloss:-698.2330 dloss:79.2823 dlossR:78.4979 dlossQ:0.7844
Episode:3587 meanR:-0.1400 rate:0.0000 gloss:-533.4645 dloss:0.7177 dlossR:0.0000 dlossQ:0.7177
Episode:3588 meanR:-0.1400

Episode:3662 meanR:-0.2400 rate:-0.0769 gloss:-1101.1816 dloss:-60.3029 dlossR:-61.9274 dlossQ:1.6246
Episode:3663 meanR:-0.2500 rate:-0.0769 gloss:-1113.6775 dloss:-61.1888 dlossR:-62.6417 dlossQ:1.4529
Episode:3664 meanR:-0.2600 rate:-0.0769 gloss:-1317.4617 dloss:-72.0951 dlossR:-74.1931 dlossQ:2.0980
Episode:3665 meanR:-0.2600 rate:0.0000 gloss:-1166.6864 dloss:1.3991 dlossR:0.0000 dlossQ:1.3991
Episode:3666 meanR:-0.2800 rate:0.0000 gloss:-1762.6536 dloss:2.2950 dlossR:0.0000 dlossQ:2.2950
Episode:3667 meanR:-0.2600 rate:0.1538 gloss:-1360.6241 dloss:154.8009 dlossR:153.0399 dlossQ:1.7610
Episode:3668 meanR:-0.2600 rate:0.0000 gloss:-1364.4061 dloss:2.4336 dlossR:0.0000 dlossQ:2.4336
Episode:3669 meanR:-0.2600 rate:0.0000 gloss:-1551.0339 dloss:2.0874 dlossR:0.0000 dlossQ:2.0874
Episode:3670 meanR:-0.2500 rate:0.0000 gloss:-1418.2474 dloss:1.4651 dlossR:0.0000 dlossQ:1.4651
Episode:3671 meanR:-0.2500 rate:0.0000 gloss:-1411.1541 dloss:1.3601 dlossR:0.0000 dlossQ:1.3601
Episode:367

Episode:3746 meanR:-0.1700 rate:0.0769 gloss:-1187.5032 dloss:68.2221 dlossR:67.0529 dlossQ:1.1691
Episode:3747 meanR:-0.1700 rate:0.0000 gloss:-1721.8030 dloss:1.9027 dlossR:0.0000 dlossQ:1.9027
Episode:3748 meanR:-0.1600 rate:0.0769 gloss:-1362.3154 dloss:78.4054 dlossR:76.9022 dlossQ:1.5032
Episode:3749 meanR:-0.1600 rate:0.0000 gloss:-1609.3145 dloss:1.7982 dlossR:0.0000 dlossQ:1.7982
Episode:3750 meanR:-0.1100 rate:0.3077 gloss:-1818.6899 dloss:412.6177 dlossR:410.6018 dlossQ:2.0159
Episode:3751 meanR:-0.1000 rate:0.0769 gloss:-1817.6893 dloss:104.9624 dlossR:102.4872 dlossQ:2.4752
Episode:3752 meanR:-0.1100 rate:-0.0769 gloss:-1961.5177 dloss:-107.8604 dlossR:-110.4826 dlossQ:2.6221
Episode:3753 meanR:-0.1100 rate:0.0000 gloss:-1699.0496 dloss:1.9484 dlossR:0.0000 dlossQ:1.9484
Episode:3754 meanR:-0.1000 rate:0.0000 gloss:-1622.9491 dloss:2.4935 dlossR:0.0000 dlossQ:2.4935
Episode:3755 meanR:-0.1000 rate:0.0000 gloss:-1685.9053 dloss:1.5140 dlossR:0.0000 dlossQ:1.5140
Episode:375

Episode:3830 meanR:0.0900 rate:-0.0769 gloss:-606.6840 dloss:-33.2451 dlossR:-34.2042 dlossQ:0.9592
Episode:3831 meanR:0.1000 rate:0.0769 gloss:-872.9731 dloss:50.2402 dlossR:49.1426 dlossQ:1.0976
Episode:3832 meanR:0.1000 rate:0.0000 gloss:-1036.6576 dloss:1.0571 dlossR:0.0000 dlossQ:1.0571
Episode:3833 meanR:0.0800 rate:-0.1538 gloss:-942.1390 dloss:-105.0293 dlossR:-105.8589 dlossQ:0.8296
Episode:3834 meanR:0.0900 rate:0.0000 gloss:-540.8279 dloss:0.7419 dlossR:0.0000 dlossQ:0.7419
Episode:3835 meanR:0.0900 rate:0.0000 gloss:-652.6827 dloss:0.6617 dlossR:0.0000 dlossQ:0.6617
Episode:3836 meanR:0.0900 rate:0.0000 gloss:-726.3717 dloss:0.7763 dlossR:0.0000 dlossQ:0.7763
Episode:3837 meanR:0.0800 rate:-0.0769 gloss:-724.9720 dloss:-39.8603 dlossR:-40.8100 dlossQ:0.9496
Episode:3838 meanR:0.0700 rate:0.0000 gloss:-755.3427 dloss:0.9626 dlossR:0.0000 dlossQ:0.9626
Episode:3839 meanR:0.0800 rate:0.0000 gloss:-666.1703 dloss:0.6362 dlossR:0.0000 dlossQ:0.6362
Episode:3840 meanR:0.1000 rate

Episode:3915 meanR:-0.0100 rate:0.0000 gloss:-756.2999 dloss:0.6775 dlossR:0.0000 dlossQ:0.6775
Episode:3916 meanR:-0.0100 rate:0.0000 gloss:-720.1682 dloss:0.6758 dlossR:0.0000 dlossQ:0.6758
Episode:3917 meanR:-0.0200 rate:0.0000 gloss:-800.5722 dloss:1.1804 dlossR:0.0000 dlossQ:1.1804
Episode:3918 meanR:-0.0200 rate:0.0000 gloss:-632.2345 dloss:0.7629 dlossR:0.0000 dlossQ:0.7629
Episode:3919 meanR:-0.0200 rate:0.0000 gloss:-616.0506 dloss:0.6072 dlossR:0.0000 dlossQ:0.6072
Episode:3920 meanR:-0.0200 rate:0.0000 gloss:-759.4114 dloss:0.7437 dlossR:0.0000 dlossQ:0.7437
Episode:3921 meanR:0.0000 rate:0.0769 gloss:-732.5185 dloss:42.3242 dlossR:41.2434 dlossQ:1.0807
Episode:3922 meanR:0.0000 rate:0.0000 gloss:-565.9202 dloss:0.4776 dlossR:0.0000 dlossQ:0.4776
Episode:3923 meanR:-0.0100 rate:-0.0769 gloss:-656.3813 dloss:-36.4292 dlossR:-36.9556 dlossQ:0.5264
Episode:3924 meanR:-0.0100 rate:0.0000 gloss:-340.5886 dloss:0.3832 dlossR:0.0000 dlossQ:0.3832
Episode:3925 meanR:0.0000 rate:-0.0

Episode:4000 meanR:-0.0200 rate:0.0000 gloss:-482.3443 dloss:0.5018 dlossR:0.0000 dlossQ:0.5018
Episode:4001 meanR:-0.0100 rate:0.0000 gloss:-742.0836 dloss:0.4480 dlossR:0.0000 dlossQ:0.4480
Episode:4002 meanR:-0.0100 rate:0.0000 gloss:-795.8262 dloss:0.7382 dlossR:0.0000 dlossQ:0.7382
Episode:4003 meanR:-0.0200 rate:-0.0769 gloss:-378.7446 dloss:-20.9797 dlossR:-21.3638 dlossQ:0.3841
Episode:4004 meanR:-0.0300 rate:-0.0769 gloss:-398.6634 dloss:-21.9275 dlossR:-22.4237 dlossQ:0.4962
Episode:4005 meanR:-0.0300 rate:-0.0769 gloss:-480.5507 dloss:-26.7163 dlossR:-27.0902 dlossQ:0.3739
Episode:4006 meanR:-0.0300 rate:0.0000 gloss:-701.0218 dloss:0.7381 dlossR:0.0000 dlossQ:0.7381
Episode:4007 meanR:-0.0300 rate:0.0000 gloss:-793.5082 dloss:0.5091 dlossR:0.0000 dlossQ:0.5091
Episode:4008 meanR:-0.0300 rate:0.0000 gloss:-1024.3048 dloss:1.2605 dlossR:0.0000 dlossQ:1.2605
Episode:4009 meanR:-0.0400 rate:-0.0769 gloss:-645.9487 dloss:-35.1681 dlossR:-36.3960 dlossQ:1.2278
Episode:4010 meanR:

Episode:4085 meanR:0.1300 rate:0.0769 gloss:-352.6862 dloss:20.3118 dlossR:19.8125 dlossQ:0.4992
Episode:4086 meanR:0.1400 rate:0.0000 gloss:-435.0818 dloss:0.5338 dlossR:0.0000 dlossQ:0.5338
Episode:4087 meanR:0.1200 rate:0.0769 gloss:-419.0141 dloss:24.1012 dlossR:23.5918 dlossQ:0.5094
Episode:4088 meanR:0.1300 rate:0.0769 gloss:-308.7171 dloss:19.2900 dlossR:17.6412 dlossQ:1.6488
Episode:4089 meanR:0.1400 rate:0.0769 gloss:-465.6166 dloss:26.7371 dlossR:26.2278 dlossQ:0.5093
Episode:4090 meanR:0.1700 rate:0.1538 gloss:-615.7029 dloss:69.8061 dlossR:69.1189 dlossQ:0.6872
Episode:4091 meanR:0.1800 rate:0.2308 gloss:-393.2377 dloss:67.4453 dlossR:66.4259 dlossQ:1.0194
Episode:4092 meanR:0.1900 rate:0.0769 gloss:-480.6640 dloss:27.5917 dlossR:27.0706 dlossQ:0.5211
Episode:4093 meanR:0.2200 rate:0.1538 gloss:-262.6134 dloss:30.5304 dlossR:29.6607 dlossQ:0.8697
Episode:4094 meanR:0.2700 rate:0.2308 gloss:-498.4100 dloss:84.9747 dlossR:84.0412 dlossQ:0.9336
Episode:4095 meanR:0.2800 rate:0

Episode:4172 meanR:1.3100 rate:0.2308 gloss:-63.1169 dloss:14.2161 dlossR:12.2873 dlossQ:1.9288
Episode:4173 meanR:1.3200 rate:0.0769 gloss:5.3326 dloss:4.1194 dlossR:3.6629 dlossQ:0.4564
Episode:4174 meanR:1.3700 rate:0.3077 gloss:4.1986 dloss:2.4786 dlossR:2.1203 dlossQ:0.3583
Episode:4175 meanR:1.3600 rate:0.0769 gloss:-6.4321 dloss:6.3242 dlossR:1.2333 dlossQ:5.0909
Episode:4176 meanR:1.3500 rate:-0.0769 gloss:0.0709 dloss:1.4984 dlossR:0.8154 dlossQ:0.6830
Episode:4177 meanR:1.3600 rate:0.0769 gloss:-7.1807 dloss:1.3331 dlossR:0.5462 dlossQ:0.7869
Episode:4178 meanR:1.3500 rate:0.0000 gloss:-6.0176 dloss:0.3457 dlossR:0.0490 dlossQ:0.2967
Episode:4179 meanR:1.3300 rate:0.0000 gloss:-7.1454 dloss:0.1075 dlossR:0.0164 dlossQ:0.0911
Episode:4180 meanR:1.3500 rate:0.0000 gloss:-7.9361 dloss:0.0489 dlossR:0.0057 dlossQ:0.0432
Episode:4181 meanR:1.3500 rate:0.0000 gloss:-11.5086 dloss:0.0222 dlossR:0.0015 dlossQ:0.0207
Episode:4182 meanR:1.3600 rate:0.0769 gloss:-14.2691 dloss:0.8255 dl

Episode:4260 meanR:1.3400 rate:0.0000 gloss:-6.5865 dloss:1.1587 dlossR:0.1639 dlossQ:0.9948
Episode:4261 meanR:1.3700 rate:0.2308 gloss:-1.7602 dloss:1.4008 dlossR:0.6388 dlossQ:0.7620
Episode:4262 meanR:1.3700 rate:0.0000 gloss:-3.2528 dloss:0.4627 dlossR:0.1122 dlossQ:0.3505
Episode:4263 meanR:1.3700 rate:0.1538 gloss:-3.1529 dloss:0.8056 dlossR:0.4693 dlossQ:0.3363
Episode:4264 meanR:1.3800 rate:0.2308 gloss:-3.6264 dloss:0.9984 dlossR:0.7247 dlossQ:0.2737
Episode:4265 meanR:1.3900 rate:0.0769 gloss:-2.7441 dloss:0.7887 dlossR:0.3449 dlossQ:0.4438
Episode:4266 meanR:1.3300 rate:0.0769 gloss:-4.5083 dloss:0.4687 dlossR:0.2965 dlossQ:0.1722
Episode:4267 meanR:1.3400 rate:0.0000 gloss:-4.0651 dloss:0.2579 dlossR:0.0510 dlossQ:0.2069
Episode:4268 meanR:1.2600 rate:0.0000 gloss:-4.5041 dloss:0.2437 dlossR:0.0450 dlossQ:0.1987
Episode:4269 meanR:1.2300 rate:0.0000 gloss:-5.1987 dloss:0.1560 dlossR:0.0263 dlossQ:0.1298
Episode:4270 meanR:1.2300 rate:0.0000 gloss:-4.6842 dloss:0.2144 dloss

Episode:4348 meanR:1.0600 rate:0.3077 gloss:-3.5757 dloss:1.4410 dlossR:1.0058 dlossQ:0.4352
Episode:4349 meanR:1.0400 rate:0.0000 gloss:-2.8861 dloss:0.7275 dlossR:0.1835 dlossQ:0.5440
Episode:4350 meanR:1.0200 rate:0.0000 gloss:-2.8501 dloss:0.5093 dlossR:0.1363 dlossQ:0.3730
Episode:4351 meanR:1.0100 rate:0.0769 gloss:-1.8490 dloss:0.9048 dlossR:0.3730 dlossQ:0.5318
Episode:4352 meanR:0.9700 rate:-0.0769 gloss:-2.4752 dloss:0.4940 dlossR:0.0544 dlossQ:0.4396
Episode:4353 meanR:0.9400 rate:0.0000 gloss:-4.1708 dloss:0.2647 dlossR:0.0562 dlossQ:0.2084
Episode:4354 meanR:0.9500 rate:0.1538 gloss:-3.5171 dloss:0.7974 dlossR:0.4987 dlossQ:0.2986
Episode:4355 meanR:0.9400 rate:0.0000 gloss:-4.1377 dloss:0.2927 dlossR:0.0579 dlossQ:0.2348
Episode:4356 meanR:0.9100 rate:0.1538 gloss:-4.5165 dloss:0.7668 dlossR:0.5710 dlossQ:0.1957
Episode:4357 meanR:0.8700 rate:0.0769 gloss:-4.4383 dloss:0.5159 dlossR:0.3019 dlossQ:0.2141
Episode:4358 meanR:0.8800 rate:0.1538 gloss:-4.5992 dloss:0.7688 dlos

Episode:4436 meanR:1.9500 rate:0.2308 gloss:-1.9305 dloss:1.3465 dlossR:0.6260 dlossQ:0.7205
Episode:4437 meanR:1.9700 rate:0.1538 gloss:-2.6037 dloss:1.3515 dlossR:0.5301 dlossQ:0.8214
Episode:4438 meanR:1.9600 rate:0.0769 gloss:-2.3624 dloss:0.8154 dlossR:0.3172 dlossQ:0.4983
Episode:4439 meanR:1.9900 rate:0.3077 gloss:-2.3361 dloss:1.2827 dlossR:0.7776 dlossQ:0.5051
Episode:4440 meanR:1.9900 rate:0.0000 gloss:-2.8125 dloss:0.5043 dlossR:0.1271 dlossQ:0.3772
Episode:4441 meanR:2.0000 rate:0.2308 gloss:-2.3383 dloss:1.2819 dlossR:0.6586 dlossQ:0.6233
Episode:4442 meanR:2.0400 rate:0.3846 gloss:-1.8961 dloss:1.4726 dlossR:0.8739 dlossQ:0.5987
Episode:4443 meanR:2.0500 rate:0.1538 gloss:-1.4032 dloss:1.1591 dlossR:0.4983 dlossQ:0.6608
Episode:4444 meanR:2.0800 rate:0.1538 gloss:-0.7893 dloss:1.7176 dlossR:0.6892 dlossQ:1.0284
Episode:4445 meanR:2.1000 rate:0.1538 gloss:-0.4293 dloss:1.2847 dlossR:0.5829 dlossQ:0.7018
Episode:4446 meanR:2.1400 rate:0.2308 gloss:-2.9683 dloss:2.1920 dloss

Episode:4525 meanR:2.2400 rate:0.0000 gloss:-4.1193 dloss:0.2702 dlossR:0.0538 dlossQ:0.2164
Episode:4526 meanR:2.2600 rate:0.2308 gloss:-3.8143 dloss:1.0374 dlossR:0.7596 dlossQ:0.2777
Episode:4527 meanR:2.2700 rate:0.3846 gloss:-3.1332 dloss:1.4963 dlossR:1.1144 dlossQ:0.3819
Episode:4528 meanR:2.2900 rate:0.2308 gloss:-2.6104 dloss:1.0716 dlossR:0.6264 dlossQ:0.4451
Episode:4529 meanR:2.2600 rate:0.0000 gloss:-2.4583 dloss:0.6039 dlossR:0.1665 dlossQ:0.4373
Episode:4530 meanR:2.2400 rate:0.2308 gloss:-0.9997 dloss:1.2277 dlossR:0.5728 dlossQ:0.6549
Episode:4531 meanR:2.2400 rate:0.2308 gloss:-0.0767 dloss:1.4848 dlossR:0.7232 dlossQ:0.7615
Episode:4532 meanR:2.2700 rate:0.1538 gloss:-0.1091 dloss:1.4196 dlossR:0.6916 dlossQ:0.7279
Episode:4533 meanR:2.2200 rate:0.0000 gloss:-0.9128 dloss:1.1154 dlossR:0.4463 dlossQ:0.6690
Episode:4534 meanR:2.2000 rate:0.0769 gloss:-1.8013 dloss:0.8941 dlossR:0.3554 dlossQ:0.5387
Episode:4535 meanR:2.1900 rate:0.0769 gloss:-2.5844 dloss:0.6961 dloss

Episode:4613 meanR:0.9300 rate:0.2308 gloss:-4.3006 dloss:1.0855 dlossR:0.8298 dlossQ:0.2557
Episode:4614 meanR:0.9500 rate:0.1538 gloss:-3.4344 dloss:0.8810 dlossR:0.5113 dlossQ:0.3697
Episode:4615 meanR:0.9300 rate:-0.0769 gloss:-3.2569 dloss:0.5798 dlossR:0.0147 dlossQ:0.5650
Episode:4616 meanR:0.9200 rate:0.0769 gloss:-4.1717 dloss:0.5813 dlossR:0.3060 dlossQ:0.2753
Episode:4617 meanR:0.9200 rate:0.0769 gloss:-2.9947 dloss:0.7890 dlossR:0.3170 dlossQ:0.4720
Episode:4618 meanR:0.8800 rate:-0.0769 gloss:-4.1209 dloss:0.1058 dlossR:-0.1532 dlossQ:0.2590
Episode:4619 meanR:0.8800 rate:0.0000 gloss:-3.7696 dloss:0.4173 dlossR:0.0931 dlossQ:0.3243
Episode:4620 meanR:0.8600 rate:0.0769 gloss:-3.8767 dloss:0.6540 dlossR:0.3193 dlossQ:0.3347
Episode:4621 meanR:0.8500 rate:0.0000 gloss:-5.4414 dloss:0.3672 dlossR:0.0571 dlossQ:0.3100
Episode:4622 meanR:0.8600 rate:0.0000 gloss:-5.4592 dloss:0.2567 dlossR:0.0468 dlossQ:0.2099
Episode:4623 meanR:0.8700 rate:0.0000 gloss:-5.2589 dloss:0.3215 dl

Episode:4701 meanR:0.8100 rate:0.1538 gloss:-3.5961 dloss:0.7671 dlossR:0.4934 dlossQ:0.2737
Episode:4702 meanR:0.8200 rate:0.0000 gloss:-3.0073 dloss:0.4659 dlossR:0.1147 dlossQ:0.3512
Episode:4703 meanR:0.8100 rate:0.0000 gloss:-2.5303 dloss:0.5844 dlossR:0.1595 dlossQ:0.4249
Episode:4704 meanR:0.8100 rate:0.0769 gloss:-1.8597 dloss:0.9270 dlossR:0.3540 dlossQ:0.5730
Episode:4705 meanR:0.8100 rate:0.1538 gloss:-1.4855 dloss:1.0751 dlossR:0.4706 dlossQ:0.6045
Episode:4706 meanR:0.8000 rate:0.0000 gloss:-2.0211 dloss:0.7355 dlossR:0.2212 dlossQ:0.5143
Episode:4707 meanR:0.7800 rate:-0.0769 gloss:-2.6302 dloss:0.4467 dlossR:0.0189 dlossQ:0.4278
Episode:4708 meanR:0.7700 rate:0.0000 gloss:-3.2236 dloss:0.4444 dlossR:0.1050 dlossQ:0.3394
Episode:4709 meanR:0.7700 rate:0.0000 gloss:-4.5567 dloss:0.3062 dlossR:0.0625 dlossQ:0.2438
Episode:4710 meanR:0.7900 rate:0.2308 gloss:-4.5842 dloss:1.0558 dlossR:0.8620 dlossQ:0.1939
Episode:4711 meanR:0.8100 rate:0.0769 gloss:-5.7027 dloss:0.4899 dlos

Episode:4789 meanR:0.8600 rate:0.1538 gloss:-3.4278 dloss:0.7841 dlossR:0.4847 dlossQ:0.2995
Episode:4790 meanR:0.8700 rate:0.1538 gloss:-4.8499 dloss:0.7515 dlossR:0.5972 dlossQ:0.1542
Episode:4791 meanR:0.8900 rate:0.2308 gloss:-4.5598 dloss:1.0510 dlossR:0.8583 dlossQ:0.1927
Episode:4792 meanR:0.8700 rate:0.0000 gloss:-5.5218 dloss:0.1386 dlossR:0.0216 dlossQ:0.1170
Episode:4793 meanR:0.8700 rate:0.0769 gloss:-5.9522 dloss:0.4525 dlossR:0.3561 dlossQ:0.0965
Episode:4794 meanR:0.8600 rate:0.0000 gloss:-5.6399 dloss:0.1325 dlossR:0.0202 dlossQ:0.1123
Episode:4795 meanR:0.8500 rate:0.0000 gloss:-6.1965 dloss:0.0872 dlossR:0.0119 dlossQ:0.0752
Episode:4796 meanR:0.8400 rate:-0.0769 gloss:-5.7400 dloss:-0.1809 dlossR:-0.2944 dlossQ:0.1135
Episode:4797 meanR:0.8400 rate:-0.0769 gloss:-6.5763 dloss:-0.2812 dlossR:-0.3520 dlossQ:0.0708
Episode:4798 meanR:0.8300 rate:0.0769 gloss:-7.4763 dloss:0.4840 dlossR:0.4321 dlossQ:0.0519
Episode:4799 meanR:0.8400 rate:0.0769 gloss:-7.8812 dloss:0.5067

Episode:4877 meanR:0.6600 rate:0.0000 gloss:-8.0410 dloss:0.0491 dlossR:0.0058 dlossQ:0.0433
Episode:4878 meanR:0.7000 rate:0.2308 gloss:-7.3415 dloss:1.3595 dlossR:1.2992 dlossQ:0.0603
Episode:4879 meanR:0.7300 rate:0.2308 gloss:-6.3138 dloss:1.2278 dlossR:1.1320 dlossQ:0.0958
Episode:4880 meanR:0.6600 rate:0.0769 gloss:-7.2528 dloss:0.5049 dlossR:0.4253 dlossQ:0.0796
Episode:4881 meanR:0.6500 rate:0.1538 gloss:-5.4974 dloss:0.8421 dlossR:0.6749 dlossQ:0.1672
Episode:4882 meanR:0.6400 rate:0.0000 gloss:-5.4732 dloss:0.2159 dlossR:0.0382 dlossQ:0.1777
Episode:4883 meanR:0.6000 rate:0.0769 gloss:-4.6748 dloss:0.6189 dlossR:0.3357 dlossQ:0.2832
Episode:4884 meanR:0.6000 rate:0.0000 gloss:-3.4924 dloss:0.5394 dlossR:0.1391 dlossQ:0.4004
Episode:4885 meanR:0.6000 rate:0.0000 gloss:-4.0852 dloss:0.4023 dlossR:0.0857 dlossQ:0.3167
Episode:4886 meanR:0.6000 rate:0.0769 gloss:-3.2244 dloss:0.7607 dlossR:0.3462 dlossQ:0.4145
Episode:4887 meanR:0.6300 rate:0.1538 gloss:-3.2322 dloss:0.8989 dloss

Episode:4965 meanR:0.5200 rate:0.1538 gloss:-5.8717 dloss:0.7904 dlossR:0.6980 dlossQ:0.0924
Episode:4966 meanR:0.5200 rate:0.0769 gloss:-6.3743 dloss:0.4464 dlossR:0.3748 dlossQ:0.0716
Episode:4967 meanR:0.5500 rate:0.0769 gloss:-6.1451 dloss:0.4550 dlossR:0.3647 dlossQ:0.0902
Episode:4968 meanR:0.5600 rate:0.1538 gloss:-5.5318 dloss:0.7674 dlossR:0.6623 dlossQ:0.1051
Episode:4969 meanR:0.5700 rate:0.0000 gloss:-4.9478 dloss:0.1679 dlossR:0.0288 dlossQ:0.1391
Episode:4970 meanR:0.5800 rate:0.0769 gloss:-4.1565 dloss:0.4904 dlossR:0.2861 dlossQ:0.2043
Episode:4971 meanR:0.5600 rate:0.0000 gloss:-4.2001 dloss:0.2480 dlossR:0.0482 dlossQ:0.1998
Episode:4972 meanR:0.5500 rate:0.0000 gloss:-3.2724 dloss:0.4644 dlossR:0.1122 dlossQ:0.3522
Episode:4973 meanR:0.5600 rate:0.0000 gloss:-3.8051 dloss:0.3209 dlossR:0.0681 dlossQ:0.2528
Episode:4974 meanR:0.5400 rate:-0.0769 gloss:-3.9903 dloss:0.1176 dlossR:-0.1435 dlossQ:0.2612
Episode:4975 meanR:0.5400 rate:0.0769 gloss:-3.1850 dloss:0.5985 dlo

Episode:5053 meanR:0.4500 rate:0.1538 gloss:-1.4501 dloss:1.0541 dlossR:0.4667 dlossQ:0.5873
Episode:5054 meanR:0.4700 rate:0.0769 gloss:-1.5717 dloss:0.9505 dlossR:0.3737 dlossQ:0.5768
Episode:5055 meanR:0.4900 rate:0.0769 gloss:-1.9476 dloss:0.8546 dlossR:0.3369 dlossQ:0.5177
Episode:5056 meanR:0.5000 rate:0.0769 gloss:-2.8204 dloss:0.6751 dlossR:0.2877 dlossQ:0.3874
Episode:5057 meanR:0.4800 rate:-0.0769 gloss:-3.3414 dloss:0.2466 dlossR:-0.0751 dlossQ:0.3217
Episode:5058 meanR:0.4900 rate:0.0769 gloss:-3.3528 dloss:0.6055 dlossR:0.2854 dlossQ:0.3202
Episode:5059 meanR:0.4700 rate:0.0000 gloss:-4.4370 dloss:0.2604 dlossR:0.0497 dlossQ:0.2108
Episode:5060 meanR:0.4300 rate:-0.2308 gloss:-4.9677 dloss:-0.5560 dlossR:-0.7363 dlossQ:0.1803
Episode:5061 meanR:0.4300 rate:0.0000 gloss:-5.9734 dloss:0.1360 dlossR:0.0202 dlossQ:0.1157
Episode:5062 meanR:0.4500 rate:0.0000 gloss:-6.1882 dloss:0.1133 dlossR:0.0155 dlossQ:0.0978
Episode:5063 meanR:0.4600 rate:0.0000 gloss:-7.2781 dloss:0.0607 

Episode:5141 meanR:0.6000 rate:0.0000 gloss:-5.3644 dloss:0.1754 dlossR:0.0285 dlossQ:0.1469
Episode:5142 meanR:0.6000 rate:0.0769 gloss:-4.3909 dloss:0.5114 dlossR:0.2992 dlossQ:0.2122
Episode:5143 meanR:0.5900 rate:0.0000 gloss:-5.5243 dloss:0.1377 dlossR:0.0219 dlossQ:0.1157
Episode:5144 meanR:0.5900 rate:0.0769 gloss:-5.2991 dloss:0.4609 dlossR:0.3282 dlossQ:0.1326
Episode:5145 meanR:0.5900 rate:0.0000 gloss:-6.4787 dloss:0.0977 dlossR:0.0136 dlossQ:0.0841
Episode:5146 meanR:0.5800 rate:-0.1538 gloss:-6.8159 dloss:-0.6695 dlossR:-0.7322 dlossQ:0.0627
Episode:5147 meanR:0.5300 rate:0.0000 gloss:-7.7295 dloss:0.0582 dlossR:0.0061 dlossQ:0.0521
Episode:5148 meanR:0.5000 rate:-0.0769 gloss:-7.9706 dloss:-0.3999 dlossR:-0.4375 dlossQ:0.0376
Episode:5149 meanR:0.5300 rate:0.2308 gloss:-7.2849 dloss:1.3447 dlossR:1.2881 dlossQ:0.0566
Episode:5150 meanR:0.5600 rate:0.3077 gloss:-7.5405 dloss:1.8490 dlossR:1.7956 dlossQ:0.0534
Episode:5151 meanR:0.5600 rate:0.3077 gloss:-7.4518 dloss:1.8298

Episode:5229 meanR:0.4600 rate:0.0000 gloss:-4.0584 dloss:0.2930 dlossR:0.0562 dlossQ:0.2367
Episode:5230 meanR:0.4600 rate:0.0769 gloss:-3.4768 dloss:0.5536 dlossR:0.2750 dlossQ:0.2786
Episode:5231 meanR:0.4300 rate:-0.0769 gloss:-3.7651 dloss:0.1261 dlossR:-0.1320 dlossQ:0.2581
Episode:5232 meanR:0.4300 rate:0.1538 gloss:-3.6475 dloss:0.7503 dlossR:0.4939 dlossQ:0.2564
Episode:5233 meanR:0.4300 rate:0.0769 gloss:-3.8284 dloss:0.5127 dlossR:0.2782 dlossQ:0.2345
Episode:5234 meanR:0.4100 rate:0.0000 gloss:-4.2696 dloss:0.2502 dlossR:0.0481 dlossQ:0.2021
Episode:5235 meanR:0.4300 rate:0.0000 gloss:-4.2364 dloss:0.2385 dlossR:0.0460 dlossQ:0.1924
Episode:5236 meanR:0.4400 rate:0.1538 gloss:-3.9601 dloss:0.7445 dlossR:0.5186 dlossQ:0.2260
Episode:5237 meanR:0.4300 rate:0.0000 gloss:-4.1103 dloss:0.2558 dlossR:0.0503 dlossQ:0.2055
Episode:5238 meanR:0.4300 rate:0.0000 gloss:-4.4551 dloss:0.2233 dlossR:0.0419 dlossQ:0.1814
Episode:5239 meanR:0.4100 rate:-0.0769 gloss:-4.6945 dloss:-0.0653 d

Episode:5317 meanR:0.3700 rate:0.0000 gloss:-3.5106 dloss:0.3523 dlossR:0.0778 dlossQ:0.2744
Episode:5318 meanR:0.3700 rate:0.0769 gloss:-4.8048 dloss:0.4933 dlossR:0.3127 dlossQ:0.1806
Episode:5319 meanR:0.3700 rate:0.0000 gloss:-5.5088 dloss:0.1521 dlossR:0.0236 dlossQ:0.1285
Episode:5320 meanR:0.3500 rate:-0.0769 gloss:-5.8038 dloss:-0.2093 dlossR:-0.3026 dlossQ:0.0933
Episode:5321 meanR:0.3400 rate:0.0000 gloss:-6.5982 dloss:0.0901 dlossR:0.0121 dlossQ:0.0780
Episode:5322 meanR:0.3500 rate:0.0769 gloss:-6.0274 dloss:0.4535 dlossR:0.3596 dlossQ:0.0939
Episode:5323 meanR:0.3400 rate:0.1538 gloss:-7.0238 dloss:0.8803 dlossR:0.8204 dlossQ:0.0599
Episode:5324 meanR:0.3300 rate:0.0000 gloss:-6.5619 dloss:0.0865 dlossR:0.0118 dlossQ:0.0747
Episode:5325 meanR:0.3300 rate:0.0000 gloss:-7.4748 dloss:0.0558 dlossR:0.0057 dlossQ:0.0501
Episode:5326 meanR:0.3400 rate:0.0000 gloss:-6.1237 dloss:0.1036 dlossR:0.0138 dlossQ:0.0898
Episode:5327 meanR:0.3400 rate:0.0000 gloss:-8.2637 dloss:0.0430 dl

Episode:5405 meanR:0.2300 rate:-0.0769 gloss:-14.3025 dloss:-0.7826 dlossR:-0.8000 dlossQ:0.0174
Episode:5406 meanR:0.2200 rate:-0.0769 gloss:-19.4385 dloss:-1.0515 dlossR:-1.0862 dlossQ:0.0346
Episode:5407 meanR:0.2200 rate:0.0000 gloss:-17.7300 dloss:0.0158 dlossR:0.0000 dlossQ:0.0158
Episode:5408 meanR:0.2200 rate:0.1538 gloss:-16.0899 dloss:1.8584 dlossR:1.8341 dlossQ:0.0243
Episode:5409 meanR:0.2000 rate:0.0769 gloss:-14.0265 dloss:0.8132 dlossR:0.7953 dlossQ:0.0179
Episode:5410 meanR:0.2200 rate:0.2308 gloss:-14.4745 dloss:2.5198 dlossR:2.5023 dlossQ:0.0175
Episode:5411 meanR:0.1900 rate:0.0000 gloss:-13.9894 dloss:0.0166 dlossR:0.0001 dlossQ:0.0165
Episode:5412 meanR:0.1900 rate:0.0000 gloss:-15.1593 dloss:0.0226 dlossR:0.0001 dlossQ:0.0226
Episode:5413 meanR:0.1800 rate:0.0000 gloss:-15.3262 dloss:0.0209 dlossR:0.0001 dlossQ:0.0208
Episode:5414 meanR:0.1700 rate:0.0000 gloss:-15.5328 dloss:0.0151 dlossR:0.0001 dlossQ:0.0149
Episode:5415 meanR:0.1800 rate:0.0000 gloss:-14.0574 d

Episode:5492 meanR:0.1400 rate:0.0000 gloss:-7.9849 dloss:0.0572 dlossR:0.0065 dlossQ:0.0506
Episode:5493 meanR:0.1400 rate:0.0000 gloss:-11.7415 dloss:0.0172 dlossR:0.0004 dlossQ:0.0168
Episode:5494 meanR:0.1500 rate:0.0769 gloss:-8.7281 dloss:0.5558 dlossR:0.5029 dlossQ:0.0529
Episode:5495 meanR:0.1600 rate:0.0769 gloss:-11.7641 dloss:0.6950 dlossR:0.6684 dlossQ:0.0267
Episode:5496 meanR:0.1700 rate:0.0000 gloss:-13.1937 dloss:0.0258 dlossR:0.0007 dlossQ:0.0251
Episode:5497 meanR:0.1700 rate:0.0000 gloss:-11.6684 dloss:0.0228 dlossR:0.0014 dlossQ:0.0214
Episode:5498 meanR:0.1800 rate:0.0000 gloss:-10.8901 dloss:0.0221 dlossR:0.0008 dlossQ:0.0213
Episode:5499 meanR:0.1900 rate:0.0000 gloss:-13.9429 dloss:0.0250 dlossR:0.0008 dlossQ:0.0242
Episode:5500 meanR:0.1900 rate:0.0769 gloss:-12.1886 dloss:0.7230 dlossR:0.6924 dlossQ:0.0306
Episode:5501 meanR:0.1600 rate:0.0000 gloss:-11.5894 dloss:0.0178 dlossR:0.0004 dlossQ:0.0174
Episode:5502 meanR:0.1600 rate:0.0000 gloss:-14.0054 dloss:0.0

Episode:5580 meanR:0.3400 rate:0.0769 gloss:-3.9244 dloss:0.5386 dlossR:0.2899 dlossQ:0.2488
Episode:5581 meanR:0.3100 rate:-0.0769 gloss:-4.3420 dloss:0.0085 dlossR:-0.1871 dlossQ:0.1957
Episode:5582 meanR:0.3000 rate:0.0769 gloss:-4.7568 dloss:0.4782 dlossR:0.3089 dlossQ:0.1693
Episode:5583 meanR:0.2800 rate:-0.0769 gloss:-4.5325 dloss:-0.0200 dlossR:-0.2031 dlossQ:0.1830
Episode:5584 meanR:0.2800 rate:0.0000 gloss:-5.0133 dloss:0.1718 dlossR:0.0314 dlossQ:0.1404
Episode:5585 meanR:0.2800 rate:0.0769 gloss:-5.2417 dloss:0.4637 dlossR:0.3264 dlossQ:0.1372
Episode:5586 meanR:0.2400 rate:-0.3077 gloss:-6.4040 dloss:-1.2378 dlossR:-1.3246 dlossQ:0.0868
Episode:5587 meanR:0.2300 rate:-0.0769 gloss:-9.0235 dloss:-0.4295 dlossR:-0.4910 dlossQ:0.0616
Episode:5588 meanR:0.2200 rate:-0.2308 gloss:-7.7957 dloss:-1.2018 dlossR:-1.2529 dlossQ:0.0511
Episode:5589 meanR:0.2100 rate:-0.0769 gloss:-8.3496 dloss:-0.4303 dlossR:-0.4612 dlossQ:0.0308
Episode:5590 meanR:0.2200 rate:0.1538 gloss:-9.6327 d

Episode:5668 meanR:0.1000 rate:0.0769 gloss:-12.5597 dloss:0.7294 dlossR:0.7123 dlossQ:0.0171
Episode:5669 meanR:0.0700 rate:0.0000 gloss:-10.4022 dloss:0.0202 dlossR:0.0006 dlossQ:0.0196
Episode:5670 meanR:0.0600 rate:0.0000 gloss:-15.1916 dloss:0.0227 dlossR:0.0002 dlossQ:0.0226
Episode:5671 meanR:0.0600 rate:0.0000 gloss:-14.8103 dloss:0.0181 dlossR:0.0001 dlossQ:0.0180
Episode:5672 meanR:0.0600 rate:0.0000 gloss:-11.5787 dloss:0.0177 dlossR:0.0003 dlossQ:0.0174
Episode:5673 meanR:0.0600 rate:0.0769 gloss:-11.7315 dloss:0.6846 dlossR:0.6654 dlossQ:0.0192
Episode:5674 meanR:0.0700 rate:0.0769 gloss:-12.7489 dloss:0.7408 dlossR:0.7232 dlossQ:0.0176
Episode:5675 meanR:0.0700 rate:-0.0769 gloss:-11.8536 dloss:-0.6437 dlossR:-0.6592 dlossQ:0.0155
Episode:5676 meanR:0.0500 rate:0.0000 gloss:-12.9357 dloss:0.0171 dlossR:0.0001 dlossQ:0.0170
Episode:5677 meanR:0.0500 rate:0.0000 gloss:-15.3660 dloss:0.0155 dlossR:0.0001 dlossQ:0.0154
Episode:5678 meanR:0.0400 rate:-0.0769 gloss:-13.7072 dlo

Episode:5755 meanR:-0.1500 rate:0.0000 gloss:-14.0145 dloss:0.0164 dlossR:0.0001 dlossQ:0.0163
Episode:5756 meanR:-0.1500 rate:-0.0769 gloss:-18.1272 dloss:-0.9955 dlossR:-1.0160 dlossQ:0.0205
Episode:5757 meanR:-0.1500 rate:0.0000 gloss:-13.4008 dloss:0.0162 dlossR:0.0001 dlossQ:0.0162
Episode:5758 meanR:-0.1300 rate:0.0769 gloss:-14.6850 dloss:0.8502 dlossR:0.8311 dlossQ:0.0191
Episode:5759 meanR:-0.1100 rate:0.0769 gloss:-14.9514 dloss:0.8689 dlossR:0.8478 dlossQ:0.0211
Episode:5760 meanR:-0.1200 rate:-0.1538 gloss:-15.3225 dloss:-1.6871 dlossR:-1.7024 dlossQ:0.0153
Episode:5761 meanR:-0.1300 rate:-0.0769 gloss:-18.3760 dloss:-1.0110 dlossR:-1.0262 dlossQ:0.0152
Episode:5762 meanR:-0.1200 rate:0.0000 gloss:-19.0169 dloss:0.0265 dlossR:0.0000 dlossQ:0.0265
Episode:5763 meanR:-0.1100 rate:0.0000 gloss:-18.2785 dloss:0.0231 dlossR:0.0000 dlossQ:0.0231
Episode:5764 meanR:-0.0900 rate:0.0000 gloss:-17.5352 dloss:0.0173 dlossR:0.0000 dlossQ:0.0173
Episode:5765 meanR:-0.0800 rate:0.0769 gl

Episode:5841 meanR:0.0600 rate:0.0000 gloss:-8.1460 dloss:0.2814 dlossR:0.0356 dlossQ:0.2458
Episode:5842 meanR:0.0600 rate:0.0000 gloss:-6.5705 dloss:0.3128 dlossR:0.0501 dlossQ:0.2627
Episode:5843 meanR:0.0500 rate:-0.0769 gloss:-6.9691 dloss:-0.2620 dlossR:-0.3633 dlossQ:0.1013
Episode:5844 meanR:0.0500 rate:-0.0769 gloss:-6.2289 dloss:-0.2485 dlossR:-0.3316 dlossQ:0.0831
Episode:5845 meanR:0.0300 rate:-0.1538 gloss:-9.5789 dloss:-0.9575 dlossR:-1.0329 dlossQ:0.0754
Episode:5846 meanR:0.0100 rate:-0.1538 gloss:-10.7568 dloss:-1.1000 dlossR:-1.1735 dlossQ:0.0735
Episode:5847 meanR:0.0100 rate:0.0000 gloss:-7.7557 dloss:0.0577 dlossR:0.0066 dlossQ:0.0511
Episode:5848 meanR:0.0100 rate:-0.0769 gloss:-10.8737 dloss:-0.5887 dlossR:-0.6082 dlossQ:0.0195
Episode:5849 meanR:-0.0100 rate:-0.1538 gloss:-11.7884 dloss:-1.2861 dlossR:-1.3059 dlossQ:0.0199
Episode:5850 meanR:0.0300 rate:0.1538 gloss:-9.9975 dloss:1.1937 dlossR:1.1545 dlossQ:0.0392
Episode:5851 meanR:0.0200 rate:-0.0769 gloss:-19

Episode:5928 meanR:0.1500 rate:0.0000 gloss:-18.8141 dloss:0.0182 dlossR:0.0001 dlossQ:0.0181
Episode:5929 meanR:0.1500 rate:0.0000 gloss:-10.8066 dloss:0.1319 dlossR:0.0192 dlossQ:0.1128
Episode:5930 meanR:0.1600 rate:0.1538 gloss:-6.6193 dloss:0.9921 dlossR:0.8062 dlossQ:0.1859
Episode:5931 meanR:0.1600 rate:0.0000 gloss:-6.8900 dloss:0.0635 dlossR:0.0074 dlossQ:0.0562
Episode:5932 meanR:0.1600 rate:0.0000 gloss:-4.4623 dloss:0.2378 dlossR:0.0434 dlossQ:0.1944
Episode:5933 meanR:0.1800 rate:0.1538 gloss:-3.0441 dloss:1.0147 dlossR:0.5059 dlossQ:0.5088
Episode:5934 meanR:0.1800 rate:0.0000 gloss:-4.3628 dloss:0.2276 dlossR:0.0437 dlossQ:0.1839
Episode:5935 meanR:0.1900 rate:0.0769 gloss:-4.2787 dloss:0.8807 dlossR:0.4662 dlossQ:0.4145
Episode:5936 meanR:0.2200 rate:0.2308 gloss:-4.1292 dloss:1.3068 dlossR:0.9065 dlossQ:0.4003
Episode:5937 meanR:0.2500 rate:0.1538 gloss:-2.5090 dloss:0.9273 dlossR:0.4640 dlossQ:0.4633
Episode:5938 meanR:0.2200 rate:-0.0769 gloss:-3.0883 dloss:0.5071 dl

Episode:6016 meanR:0.6200 rate:0.0769 gloss:-6.2644 dloss:0.4630 dlossR:0.3731 dlossQ:0.0899
Episode:6017 meanR:0.6200 rate:-0.0769 gloss:-7.1704 dloss:-0.3353 dlossR:-0.3896 dlossQ:0.0543
Episode:6018 meanR:0.6500 rate:0.0000 gloss:-5.1753 dloss:0.1679 dlossR:0.0296 dlossQ:0.1382
Episode:6019 meanR:0.6700 rate:0.1538 gloss:-7.1156 dloss:0.8837 dlossR:0.8302 dlossQ:0.0536
Episode:6020 meanR:0.7500 rate:0.4615 gloss:-6.2524 dloss:2.4238 dlossR:2.3330 dlossQ:0.0908
Episode:6021 meanR:0.7300 rate:0.0000 gloss:-6.8767 dloss:0.0708 dlossR:0.0090 dlossQ:0.0617
Episode:6022 meanR:0.7000 rate:0.1538 gloss:-5.3003 dloss:0.7750 dlossR:0.6420 dlossQ:0.1330
Episode:6023 meanR:0.7000 rate:0.1538 gloss:-4.9846 dloss:0.7758 dlossR:0.6155 dlossQ:0.1603
Episode:6024 meanR:0.7100 rate:0.0769 gloss:-4.3884 dloss:0.4689 dlossR:0.2906 dlossQ:0.1783
Episode:6025 meanR:0.7000 rate:0.0769 gloss:-3.3110 dloss:0.6093 dlossR:0.2873 dlossQ:0.3220
Episode:6026 meanR:0.6800 rate:-0.0769 gloss:-2.5135 dloss:0.4809 d

Episode:6104 meanR:0.3300 rate:0.0000 gloss:-4.8791 dloss:0.1819 dlossR:0.0321 dlossQ:0.1498
Episode:6105 meanR:0.3100 rate:-0.3077 gloss:-5.1726 dloss:-0.9078 dlossR:-1.0340 dlossQ:0.1262
Episode:6106 meanR:0.3100 rate:0.0000 gloss:-5.6932 dloss:0.1151 dlossR:0.0176 dlossQ:0.0975
Episode:6107 meanR:0.3200 rate:0.0769 gloss:-5.9804 dloss:0.4317 dlossR:0.3540 dlossQ:0.0777
Episode:6108 meanR:0.3000 rate:-0.0769 gloss:-7.2803 dloss:-0.3467 dlossR:-0.3961 dlossQ:0.0494
Episode:6109 meanR:0.2700 rate:-0.0769 gloss:-7.7552 dloss:-0.3838 dlossR:-0.4248 dlossQ:0.0410
Episode:6110 meanR:0.2700 rate:0.0000 gloss:-8.4470 dloss:0.0286 dlossR:0.0022 dlossQ:0.0264
Episode:6111 meanR:0.2600 rate:-0.0769 gloss:-8.5642 dloss:-0.4367 dlossR:-0.4714 dlossQ:0.0347
Episode:6112 meanR:0.2600 rate:0.0000 gloss:-8.1840 dloss:0.0397 dlossR:0.0042 dlossQ:0.0355
Episode:6113 meanR:0.2800 rate:0.0769 gloss:-9.4020 dloss:0.5750 dlossR:0.5371 dlossQ:0.0380
Episode:6114 meanR:0.2700 rate:0.0000 gloss:-8.7613 dloss:

Episode:6192 meanR:0.1600 rate:0.0000 gloss:-12.0009 dloss:0.0166 dlossR:0.0004 dlossQ:0.0162
Episode:6193 meanR:0.1300 rate:-0.0769 gloss:-13.6539 dloss:-0.7365 dlossR:-0.7610 dlossQ:0.0245
Episode:6194 meanR:0.1400 rate:0.0769 gloss:-12.8444 dloss:0.7462 dlossR:0.7281 dlossQ:0.0181
Episode:6195 meanR:0.1200 rate:-0.0769 gloss:-12.7401 dloss:-0.6849 dlossR:-0.7098 dlossQ:0.0250
Episode:6196 meanR:0.1200 rate:0.0000 gloss:-10.4987 dloss:0.0214 dlossR:0.0007 dlossQ:0.0208
Episode:6197 meanR:0.1200 rate:0.0000 gloss:-15.9206 dloss:0.0247 dlossR:0.0001 dlossQ:0.0247
Episode:6198 meanR:0.0800 rate:-0.3077 gloss:-14.9485 dloss:-3.2489 dlossR:-3.2645 dlossQ:0.0155
Episode:6199 meanR:0.1200 rate:0.1538 gloss:-14.5674 dloss:1.6820 dlossR:1.6618 dlossQ:0.0202
Episode:6200 meanR:0.1100 rate:0.0000 gloss:-17.4202 dloss:0.0226 dlossR:0.0000 dlossQ:0.0226
Episode:6201 meanR:0.0900 rate:-0.0769 gloss:-16.1787 dloss:-0.8764 dlossR:-0.9037 dlossQ:0.0273
Episode:6202 meanR:0.0700 rate:0.0000 gloss:-18.

Episode:6280 meanR:0.1600 rate:0.0000 gloss:-5.7303 dloss:0.1337 dlossR:0.0213 dlossQ:0.1124
Episode:6281 meanR:0.1800 rate:0.1538 gloss:-4.7761 dloss:0.7577 dlossR:0.5935 dlossQ:0.1643
Episode:6282 meanR:0.1900 rate:0.0000 gloss:-3.8255 dloss:0.3125 dlossR:0.0645 dlossQ:0.2479
Episode:6283 meanR:0.2000 rate:0.0769 gloss:-4.5246 dloss:0.5071 dlossR:0.3053 dlossQ:0.2018
Episode:6284 meanR:0.1700 rate:-0.3077 gloss:-5.3854 dloss:-0.9332 dlossR:-1.0727 dlossQ:0.1395
Episode:6285 meanR:0.1900 rate:0.0769 gloss:-4.6914 dloss:0.4995 dlossR:0.3100 dlossQ:0.1895
Episode:6286 meanR:0.2200 rate:0.2308 gloss:-4.5423 dloss:1.0400 dlossR:0.8537 dlossQ:0.1863
Episode:6287 meanR:0.2300 rate:0.0769 gloss:-5.0393 dloss:0.4791 dlossR:0.3208 dlossQ:0.1584
Episode:6288 meanR:0.2300 rate:0.0000 gloss:-4.8299 dloss:0.2035 dlossR:0.0371 dlossQ:0.1664
Episode:6289 meanR:0.2500 rate:0.0769 gloss:-4.6790 dloss:0.4931 dlossR:0.3077 dlossQ:0.1854
Episode:6290 meanR:0.2300 rate:-0.0769 gloss:-6.5007 dloss:-0.2869 

Episode:6368 meanR:0.2500 rate:0.0769 gloss:-5.4491 dloss:0.4643 dlossR:0.3363 dlossQ:0.1280
Episode:6369 meanR:0.2500 rate:0.0000 gloss:-5.7061 dloss:0.1430 dlossR:0.0226 dlossQ:0.1204
Episode:6370 meanR:0.2400 rate:0.0000 gloss:-4.4602 dloss:0.2151 dlossR:0.0397 dlossQ:0.1754
Episode:6371 meanR:0.2400 rate:0.0000 gloss:-5.9298 dloss:0.1324 dlossR:0.0188 dlossQ:0.1136
Episode:6372 meanR:0.2200 rate:0.0000 gloss:-4.8965 dloss:0.2588 dlossR:0.0506 dlossQ:0.2083
Episode:6373 meanR:0.1900 rate:-0.1538 gloss:-5.9632 dloss:-0.4023 dlossR:-0.5973 dlossQ:0.1950
Episode:6374 meanR:0.1900 rate:0.0000 gloss:-5.5709 dloss:0.2592 dlossR:0.0386 dlossQ:0.2207
Episode:6375 meanR:0.2000 rate:0.0769 gloss:-5.0817 dloss:0.5490 dlossR:0.3393 dlossQ:0.2097
Episode:6376 meanR:0.1600 rate:-0.3077 gloss:-5.8300 dloss:-1.0016 dlossR:-1.1678 dlossQ:0.1662
Episode:6377 meanR:0.1600 rate:-0.0769 gloss:-5.1315 dloss:-0.0834 dlossR:-0.2457 dlossQ:0.1624
Episode:6378 meanR:0.1500 rate:0.0000 gloss:-6.8341 dloss:0.0

Episode:6455 meanR:0.2300 rate:0.1538 gloss:-6.8050 dloss:0.9761 dlossR:0.8125 dlossQ:0.1636
Episode:6456 meanR:0.2500 rate:0.0000 gloss:-4.9052 dloss:0.2100 dlossR:0.0367 dlossQ:0.1734
Episode:6457 meanR:0.2300 rate:-0.0769 gloss:-6.9042 dloss:-0.2807 dlossR:-0.3691 dlossQ:0.0884
Episode:6458 meanR:0.2200 rate:-0.0769 gloss:-7.3671 dloss:-0.3267 dlossR:-0.3973 dlossQ:0.0706
Episode:6459 meanR:0.2000 rate:-0.0769 gloss:-6.1812 dloss:-0.2241 dlossR:-0.3225 dlossQ:0.0985
Episode:6460 meanR:0.1900 rate:0.0000 gloss:-6.0435 dloss:0.1310 dlossR:0.0201 dlossQ:0.1109
Episode:6461 meanR:0.2000 rate:0.0000 gloss:-5.3718 dloss:0.1546 dlossR:0.0241 dlossQ:0.1305
Episode:6462 meanR:0.1900 rate:0.0000 gloss:-5.7238 dloss:0.1464 dlossR:0.0221 dlossQ:0.1243
Episode:6463 meanR:0.1900 rate:0.0769 gloss:-7.7096 dloss:0.5037 dlossR:0.4455 dlossQ:0.0582
Episode:6464 meanR:0.1800 rate:-0.0769 gloss:-8.9101 dloss:-0.4657 dlossR:-0.4931 dlossQ:0.0274
Episode:6465 meanR:0.1800 rate:0.0000 gloss:-11.7670 dloss

Episode:6542 meanR:-0.0800 rate:0.0000 gloss:-17.4617 dloss:0.0186 dlossR:0.0000 dlossQ:0.0186
Episode:6543 meanR:-0.0800 rate:0.0000 gloss:-19.6299 dloss:0.0179 dlossR:0.0000 dlossQ:0.0179
Episode:6544 meanR:-0.0800 rate:0.0769 gloss:-21.3100 dloss:1.2259 dlossR:1.2037 dlossQ:0.0221
Episode:6545 meanR:-0.0700 rate:0.2308 gloss:-22.4963 dloss:3.8824 dlossR:3.8482 dlossQ:0.0342
Episode:6546 meanR:-0.1000 rate:0.0000 gloss:-23.7381 dloss:0.0272 dlossR:0.0000 dlossQ:0.0272
Episode:6547 meanR:-0.1000 rate:0.1538 gloss:-20.3259 dloss:2.3352 dlossR:2.3082 dlossQ:0.0269
Episode:6548 meanR:-0.1100 rate:-0.0769 gloss:-23.1865 dloss:-1.2667 dlossR:-1.2983 dlossQ:0.0317
Episode:6549 meanR:-0.1000 rate:0.0769 gloss:-21.0812 dloss:1.2156 dlossR:1.1905 dlossQ:0.0250
Episode:6550 meanR:-0.1000 rate:0.0000 gloss:-21.7589 dloss:0.0223 dlossR:0.0000 dlossQ:0.0223
Episode:6551 meanR:-0.1200 rate:0.0000 gloss:-17.9191 dloss:0.0178 dlossR:0.0000 dlossQ:0.0178
Episode:6552 meanR:-0.0700 rate:0.0000 gloss:-2

Episode:6628 meanR:-0.0700 rate:-0.0769 gloss:-22.6777 dloss:-1.2388 dlossR:-1.2703 dlossQ:0.0314
Episode:6629 meanR:-0.0600 rate:0.0000 gloss:-30.8198 dloss:0.0346 dlossR:0.0000 dlossQ:0.0346
Episode:6630 meanR:-0.0600 rate:0.0769 gloss:-20.7745 dloss:1.1959 dlossR:1.1744 dlossQ:0.0216
Episode:6631 meanR:-0.0300 rate:0.0769 gloss:-26.5606 dloss:1.5309 dlossR:1.4981 dlossQ:0.0329
Episode:6632 meanR:-0.0500 rate:-0.1538 gloss:-23.4442 dloss:-2.5870 dlossR:-2.6130 dlossQ:0.0260
Episode:6633 meanR:-0.0300 rate:0.0769 gloss:-23.7675 dloss:1.3718 dlossR:1.3413 dlossQ:0.0305
Episode:6634 meanR:-0.0300 rate:0.0000 gloss:-21.2264 dloss:0.0310 dlossR:0.0000 dlossQ:0.0310
Episode:6635 meanR:0.0000 rate:0.0000 gloss:-16.2000 dloss:0.0258 dlossR:0.0000 dlossQ:0.0257
Episode:6636 meanR:0.0200 rate:-0.0769 gloss:-22.8496 dloss:-1.2483 dlossR:-1.2784 dlossQ:0.0302
Episode:6637 meanR:0.0000 rate:-0.1538 gloss:-26.8319 dloss:-2.9648 dlossR:-2.9932 dlossQ:0.0284
Episode:6638 meanR:0.0100 rate:0.0769 glo

Episode:6714 meanR:0.0500 rate:0.0000 gloss:-16.9980 dloss:0.0193 dlossR:0.0000 dlossQ:0.0193
Episode:6715 meanR:0.0400 rate:0.0000 gloss:-19.0614 dloss:0.0192 dlossR:0.0000 dlossQ:0.0192
Episode:6716 meanR:0.0200 rate:0.0000 gloss:-21.8088 dloss:0.0343 dlossR:0.0000 dlossQ:0.0343
Episode:6717 meanR:0.0200 rate:0.0000 gloss:-16.5490 dloss:0.0176 dlossR:0.0000 dlossQ:0.0176
Episode:6718 meanR:0.0200 rate:0.0000 gloss:-12.5585 dloss:0.0183 dlossR:0.0002 dlossQ:0.0181
Episode:6719 meanR:0.0400 rate:0.0769 gloss:-13.9816 dloss:0.8078 dlossR:0.7921 dlossQ:0.0157
Episode:6720 meanR:0.0600 rate:0.0000 gloss:-13.4973 dloss:0.0185 dlossR:0.0001 dlossQ:0.0184
Episode:6721 meanR:0.0700 rate:0.0000 gloss:-11.9777 dloss:0.0216 dlossR:0.0004 dlossQ:0.0212
Episode:6722 meanR:0.0900 rate:0.0769 gloss:-13.6791 dloss:0.7978 dlossR:0.7748 dlossQ:0.0229
Episode:6723 meanR:0.1100 rate:0.1538 gloss:-11.2786 dloss:1.3159 dlossR:1.2922 dlossQ:0.0236
Episode:6724 meanR:0.1000 rate:-0.0769 gloss:-13.4631 dloss:

Episode:6802 meanR:0.2300 rate:0.0000 gloss:-6.1976 dloss:0.1260 dlossR:0.0174 dlossQ:0.1086
Episode:6803 meanR:0.2300 rate:0.0000 gloss:-3.4755 dloss:0.3477 dlossR:0.0762 dlossQ:0.2716
Episode:6804 meanR:0.2100 rate:-0.0769 gloss:-6.2271 dloss:-0.2305 dlossR:-0.3264 dlossQ:0.0959
Episode:6805 meanR:0.2100 rate:0.0000 gloss:-5.1874 dloss:0.1868 dlossR:0.0330 dlossQ:0.1537
Episode:6806 meanR:0.2100 rate:0.0769 gloss:-4.1638 dloss:0.4966 dlossR:0.2876 dlossQ:0.2090
Episode:6807 meanR:0.2100 rate:0.0000 gloss:-5.5722 dloss:0.2368 dlossR:0.0346 dlossQ:0.2022
Episode:6808 meanR:0.2100 rate:0.0000 gloss:-5.2073 dloss:0.1846 dlossR:0.0312 dlossQ:0.1534
Episode:6809 meanR:0.2200 rate:0.0769 gloss:-6.1472 dloss:0.4681 dlossR:0.3686 dlossQ:0.0995
Episode:6810 meanR:0.2200 rate:0.0000 gloss:-5.7644 dloss:0.1414 dlossR:0.0226 dlossQ:0.1188
Episode:6811 meanR:0.2000 rate:-0.1538 gloss:-6.1655 dloss:-0.5558 dlossR:-0.6513 dlossQ:0.0955
Episode:6812 meanR:0.1800 rate:0.0000 gloss:-6.0372 dloss:0.1042

Episode:6890 meanR:0.2200 rate:0.0000 gloss:-2.6868 dloss:0.6136 dlossR:0.1781 dlossQ:0.4355
Episode:6891 meanR:0.2100 rate:0.0000 gloss:-3.7627 dloss:0.3841 dlossR:0.0947 dlossQ:0.2894
Episode:6892 meanR:0.2000 rate:-0.0769 gloss:-3.7340 dloss:0.1716 dlossR:-0.1119 dlossQ:0.2835
Episode:6893 meanR:0.1700 rate:-0.1538 gloss:-3.6051 dloss:-0.0125 dlossR:-0.2914 dlossQ:0.2790
Episode:6894 meanR:0.1700 rate:0.0000 gloss:-4.5193 dloss:0.2548 dlossR:0.0503 dlossQ:0.2045
Episode:6895 meanR:0.1700 rate:0.0000 gloss:-5.5995 dloss:0.1270 dlossR:0.0200 dlossQ:0.1071
Episode:6896 meanR:0.1500 rate:0.0000 gloss:-7.0603 dloss:0.0599 dlossR:0.0073 dlossQ:0.0526
Episode:6897 meanR:0.1600 rate:0.0769 gloss:-7.2701 dloss:0.4758 dlossR:0.4213 dlossQ:0.0546
Episode:6898 meanR:0.1700 rate:0.0769 gloss:-7.8441 dloss:0.4909 dlossR:0.4510 dlossQ:0.0399
Episode:6899 meanR:0.1700 rate:0.0000 gloss:-8.8459 dloss:0.0272 dlossR:0.0018 dlossQ:0.0253
Episode:6900 meanR:0.1500 rate:0.0000 gloss:-9.2940 dloss:0.0256 

Episode:6977 meanR:0.0100 rate:0.0000 gloss:-18.0213 dloss:0.0167 dlossR:0.0000 dlossQ:0.0167
Episode:6978 meanR:0.0200 rate:0.0769 gloss:-16.4427 dloss:0.9520 dlossR:0.9309 dlossQ:0.0211
Episode:6979 meanR:0.0300 rate:0.1538 gloss:-15.9514 dloss:1.8364 dlossR:1.8197 dlossQ:0.0167
Episode:6980 meanR:0.0200 rate:0.0000 gloss:-15.7290 dloss:0.0246 dlossR:0.0001 dlossQ:0.0245
Episode:6981 meanR:0.0200 rate:-0.0769 gloss:-18.8810 dloss:-1.0345 dlossR:-1.0548 dlossQ:0.0203
Episode:6982 meanR:0.0200 rate:0.0000 gloss:-15.6932 dloss:0.0167 dlossR:0.0000 dlossQ:0.0166
Episode:6983 meanR:-0.0400 rate:0.0000 gloss:-17.3195 dloss:0.0212 dlossR:0.0000 dlossQ:0.0212
Episode:6984 meanR:-0.0300 rate:0.1538 gloss:-13.5407 dloss:1.5619 dlossR:1.5461 dlossQ:0.0159
Episode:6985 meanR:-0.0400 rate:0.0769 gloss:-15.3884 dloss:0.8877 dlossR:0.8711 dlossQ:0.0165
Episode:6986 meanR:-0.0300 rate:0.0000 gloss:-16.8261 dloss:0.0147 dlossR:0.0000 dlossQ:0.0147
Episode:6987 meanR:-0.0200 rate:0.0769 gloss:-15.3210

Episode:7064 meanR:0.2400 rate:0.0000 gloss:-12.0889 dloss:0.0259 dlossR:0.0010 dlossQ:0.0249
Episode:7065 meanR:0.2400 rate:-0.0769 gloss:-13.4845 dloss:-0.7337 dlossR:-0.7504 dlossQ:0.0167
Episode:7066 meanR:0.2400 rate:0.0000 gloss:-11.7915 dloss:0.0406 dlossR:0.0024 dlossQ:0.0382
Episode:7067 meanR:0.2300 rate:0.0000 gloss:-10.0180 dloss:0.0294 dlossR:0.0015 dlossQ:0.0279
Episode:7068 meanR:0.2300 rate:0.0000 gloss:-6.8191 dloss:0.0718 dlossR:0.0096 dlossQ:0.0623
Episode:7069 meanR:0.2400 rate:0.0000 gloss:-8.4439 dloss:0.0322 dlossR:0.0027 dlossQ:0.0296
Episode:7070 meanR:0.2500 rate:0.0769 gloss:-7.7362 dloss:0.4996 dlossR:0.4466 dlossQ:0.0530
Episode:7071 meanR:0.2400 rate:-0.0769 gloss:-6.7004 dloss:-0.2822 dlossR:-0.3569 dlossQ:0.0746
Episode:7072 meanR:0.2600 rate:0.0000 gloss:-6.9679 dloss:0.1443 dlossR:0.0285 dlossQ:0.1158
Episode:7073 meanR:0.2700 rate:0.0769 gloss:-5.3850 dloss:0.5455 dlossR:0.3442 dlossQ:0.2013
Episode:7074 meanR:0.2800 rate:0.0000 gloss:-7.9719 dloss:0.

Episode:7152 meanR:0.2700 rate:0.0000 gloss:-11.6212 dloss:0.0240 dlossR:0.0008 dlossQ:0.0232
Episode:7153 meanR:0.2400 rate:-0.0769 gloss:-10.9464 dloss:-0.5861 dlossR:-0.6085 dlossQ:0.0224
Episode:7154 meanR:0.2700 rate:0.0000 gloss:-10.4042 dloss:0.0234 dlossR:0.0008 dlossQ:0.0226
Episode:7155 meanR:0.2700 rate:0.0000 gloss:-8.2863 dloss:0.0307 dlossR:0.0024 dlossQ:0.0282
Episode:7156 meanR:0.2700 rate:0.0000 gloss:-7.3341 dloss:0.0454 dlossR:0.0048 dlossQ:0.0407
Episode:7157 meanR:0.2400 rate:0.0000 gloss:-6.8273 dloss:0.0597 dlossR:0.0071 dlossQ:0.0526
Episode:7158 meanR:0.2400 rate:0.0000 gloss:-8.5232 dloss:0.0315 dlossR:0.0023 dlossQ:0.0292
Episode:7159 meanR:0.2400 rate:0.0000 gloss:-8.6897 dloss:0.0335 dlossR:0.0024 dlossQ:0.0311
Episode:7160 meanR:0.2800 rate:0.3077 gloss:-11.2525 dloss:2.6511 dlossR:2.6248 dlossQ:0.0264
Episode:7161 meanR:0.2200 rate:0.0000 gloss:-9.7652 dloss:0.0436 dlossR:0.0027 dlossQ:0.0409
Episode:7162 meanR:0.2100 rate:-0.0769 gloss:-10.7866 dloss:-0.

Episode:7240 meanR:0.0000 rate:0.0000 gloss:-10.5517 dloss:0.0222 dlossR:0.0010 dlossQ:0.0212
Episode:7241 meanR:-0.0100 rate:-0.0769 gloss:-12.2973 dloss:-0.6699 dlossR:-0.6844 dlossQ:0.0145
Episode:7242 meanR:-0.0100 rate:0.0000 gloss:-10.4252 dloss:0.0265 dlossR:0.0011 dlossQ:0.0254
Episode:7243 meanR:0.0000 rate:0.0000 gloss:-12.1840 dloss:0.0165 dlossR:0.0005 dlossQ:0.0160
Episode:7244 meanR:0.0000 rate:0.0000 gloss:-11.1780 dloss:0.0248 dlossR:0.0007 dlossQ:0.0241
Episode:7245 meanR:0.0100 rate:0.0000 gloss:-10.8050 dloss:0.0180 dlossR:0.0006 dlossQ:0.0174
Episode:7246 meanR:0.0000 rate:-0.0769 gloss:-12.9782 dloss:-0.7031 dlossR:-0.7228 dlossQ:0.0197
Episode:7247 meanR:0.0100 rate:0.0769 gloss:-12.2016 dloss:0.7120 dlossR:0.6920 dlossQ:0.0199
Episode:7248 meanR:-0.0200 rate:0.0000 gloss:-14.0308 dloss:0.0216 dlossR:0.0001 dlossQ:0.0215
Episode:7249 meanR:-0.0500 rate:-0.1538 gloss:-15.6289 dloss:-1.7131 dlossR:-1.7318 dlossQ:0.0188
Episode:7250 meanR:-0.0700 rate:-0.1538 gloss:-

Episode:7327 meanR:0.0300 rate:0.0000 gloss:-11.8845 dloss:0.0390 dlossR:0.0020 dlossQ:0.0371
Episode:7328 meanR:0.0500 rate:0.0000 gloss:-7.9008 dloss:0.1063 dlossR:0.0138 dlossQ:0.0924
Episode:7329 meanR:0.0500 rate:0.0769 gloss:-8.5103 dloss:0.5721 dlossR:0.4953 dlossQ:0.0768
Episode:7330 meanR:0.0700 rate:0.0000 gloss:-7.3207 dloss:0.0737 dlossR:0.0091 dlossQ:0.0646
Episode:7331 meanR:0.0800 rate:0.0769 gloss:-6.1343 dloss:0.4802 dlossR:0.3681 dlossQ:0.1121
Episode:7332 meanR:0.0800 rate:0.0000 gloss:-7.0242 dloss:0.0956 dlossR:0.0120 dlossQ:0.0837
Episode:7333 meanR:0.0500 rate:-0.2308 gloss:-6.7612 dloss:-0.9681 dlossR:-1.0701 dlossQ:0.1020
Episode:7334 meanR:0.0200 rate:-0.0769 gloss:-6.0490 dloss:-0.1801 dlossR:-0.3093 dlossQ:0.1292
Episode:7335 meanR:0.0300 rate:0.0769 gloss:-7.3422 dloss:0.5324 dlossR:0.4327 dlossQ:0.0997
Episode:7336 meanR:0.0200 rate:-0.1538 gloss:-9.5900 dloss:-1.0157 dlossR:-1.0492 dlossQ:0.0336
Episode:7337 meanR:0.0400 rate:0.0769 gloss:-9.3778 dloss:0.

Episode:7414 meanR:-0.0600 rate:0.0000 gloss:-13.1491 dloss:0.0162 dlossR:0.0002 dlossQ:0.0161
Episode:7415 meanR:-0.0500 rate:0.0000 gloss:-10.7501 dloss:0.0184 dlossR:0.0006 dlossQ:0.0178
Episode:7416 meanR:-0.0500 rate:0.0000 gloss:-21.2778 dloss:0.0308 dlossR:0.0000 dlossQ:0.0308
Episode:7417 meanR:-0.0600 rate:0.0000 gloss:-13.0753 dloss:0.0171 dlossR:0.0001 dlossQ:0.0170
Episode:7418 meanR:-0.0500 rate:0.0000 gloss:-12.6981 dloss:0.0173 dlossR:0.0001 dlossQ:0.0172
Episode:7419 meanR:-0.0500 rate:0.0000 gloss:-12.8547 dloss:0.0285 dlossR:0.0005 dlossQ:0.0279
Episode:7420 meanR:-0.0400 rate:0.0769 gloss:-20.7555 dloss:1.1958 dlossR:1.1724 dlossQ:0.0234
Episode:7421 meanR:-0.0700 rate:-0.0769 gloss:-20.7164 dloss:-1.1290 dlossR:-1.1573 dlossQ:0.0283
Episode:7422 meanR:-0.0700 rate:0.0000 gloss:-13.6864 dloss:0.0174 dlossR:0.0003 dlossQ:0.0171
Episode:7423 meanR:-0.0700 rate:0.0000 gloss:-17.2105 dloss:0.0183 dlossR:0.0000 dlossQ:0.0183
Episode:7424 meanR:-0.0500 rate:0.2308 gloss:-1

Episode:7501 meanR:0.0400 rate:-0.0769 gloss:-12.3748 dloss:-0.6753 dlossR:-0.6921 dlossQ:0.0168
Episode:7502 meanR:0.0500 rate:0.0769 gloss:-15.0838 dloss:0.8705 dlossR:0.8531 dlossQ:0.0174
Episode:7503 meanR:0.0700 rate:0.0000 gloss:-13.1774 dloss:0.0206 dlossR:0.0003 dlossQ:0.0203
Episode:7504 meanR:0.0600 rate:0.0000 gloss:-16.7874 dloss:0.0231 dlossR:0.0000 dlossQ:0.0231
Episode:7505 meanR:0.0700 rate:0.0000 gloss:-14.8393 dloss:0.0184 dlossR:0.0001 dlossQ:0.0182
Episode:7506 meanR:0.0700 rate:0.0000 gloss:-16.0781 dloss:0.0160 dlossR:0.0001 dlossQ:0.0160
Episode:7507 meanR:0.0400 rate:-0.2308 gloss:-17.8393 dloss:-2.9289 dlossR:-2.9537 dlossQ:0.0248
Episode:7508 meanR:0.0300 rate:-0.0769 gloss:-19.2771 dloss:-1.0573 dlossR:-1.0801 dlossQ:0.0228
Episode:7509 meanR:0.0300 rate:0.0769 gloss:-17.4518 dloss:1.0099 dlossR:0.9872 dlossQ:0.0227
Episode:7510 meanR:0.0300 rate:0.0000 gloss:-13.8673 dloss:0.0185 dlossR:0.0001 dlossQ:0.0183
Episode:7511 meanR:0.0400 rate:0.0769 gloss:-19.561

Episode:7588 meanR:0.0100 rate:-0.0769 gloss:-11.2126 dloss:-0.5814 dlossR:-0.6194 dlossQ:0.0381
Episode:7589 meanR:0.0100 rate:0.0000 gloss:-10.5916 dloss:0.0320 dlossR:0.0019 dlossQ:0.0300
Episode:7590 meanR:0.0200 rate:0.0000 gloss:-8.7988 dloss:0.0310 dlossR:0.0021 dlossQ:0.0289
Episode:7591 meanR:0.0000 rate:0.0000 gloss:-10.9585 dloss:0.0203 dlossR:0.0005 dlossQ:0.0198
Episode:7592 meanR:-0.0100 rate:0.0000 gloss:-6.0821 dloss:0.1728 dlossR:0.0288 dlossQ:0.1440
Episode:7593 meanR:0.0400 rate:0.0000 gloss:-7.5334 dloss:0.0678 dlossR:0.0073 dlossQ:0.0605
Episode:7594 meanR:0.0600 rate:0.0000 gloss:-6.3797 dloss:0.0799 dlossR:0.0104 dlossQ:0.0695
Episode:7595 meanR:0.0700 rate:0.0000 gloss:-6.2360 dloss:0.0855 dlossR:0.0121 dlossQ:0.0734
Episode:7596 meanR:0.0700 rate:0.0000 gloss:-8.5189 dloss:0.0655 dlossR:0.0101 dlossQ:0.0554
Episode:7597 meanR:0.0700 rate:0.0000 gloss:-6.2628 dloss:0.0830 dlossR:0.0110 dlossQ:0.0720
Episode:7598 meanR:0.0800 rate:0.0000 gloss:-9.8248 dloss:0.030

Episode:7676 meanR:0.2500 rate:0.0000 gloss:-13.2782 dloss:0.0168 dlossR:0.0002 dlossQ:0.0166
Episode:7677 meanR:0.2600 rate:0.0769 gloss:-14.7478 dloss:0.8538 dlossR:0.8354 dlossQ:0.0184
Episode:7678 meanR:0.2600 rate:0.0000 gloss:-12.8856 dloss:0.0176 dlossR:0.0001 dlossQ:0.0175
Episode:7679 meanR:0.2500 rate:-0.0769 gloss:-15.2681 dloss:-0.8343 dlossR:-0.8526 dlossQ:0.0183
Episode:7680 meanR:0.2500 rate:0.0000 gloss:-15.0442 dloss:0.0176 dlossR:0.0000 dlossQ:0.0176
Episode:7681 meanR:0.2600 rate:0.0769 gloss:-14.6659 dloss:0.8500 dlossR:0.8296 dlossQ:0.0204
Episode:7682 meanR:0.2600 rate:0.0000 gloss:-17.2400 dloss:0.0211 dlossR:0.0000 dlossQ:0.0211
Episode:7683 meanR:0.2700 rate:0.0769 gloss:-17.3183 dloss:0.9980 dlossR:0.9798 dlossQ:0.0183
Episode:7684 meanR:0.2800 rate:0.0769 gloss:-18.0225 dloss:1.0427 dlossR:1.0185 dlossQ:0.0242
Episode:7685 meanR:0.2700 rate:0.0000 gloss:-13.9590 dloss:0.0208 dlossR:0.0001 dlossQ:0.0207
Episode:7686 meanR:0.2500 rate:-0.0769 gloss:-13.3779 dlo

Episode:7764 meanR:0.2000 rate:-0.0769 gloss:-7.8720 dloss:-0.3921 dlossR:-0.4325 dlossQ:0.0404
Episode:7765 meanR:0.1900 rate:-0.0769 gloss:-9.0333 dloss:-0.4756 dlossR:-0.4998 dlossQ:0.0242
Episode:7766 meanR:0.1900 rate:-0.0769 gloss:-8.9863 dloss:-0.4727 dlossR:-0.4980 dlossQ:0.0253
Episode:7767 meanR:0.2000 rate:0.0769 gloss:-10.4756 dloss:0.6133 dlossR:0.5958 dlossQ:0.0175
Episode:7768 meanR:0.1900 rate:-0.0769 gloss:-9.7559 dloss:-0.5227 dlossR:-0.5415 dlossQ:0.0188
Episode:7769 meanR:0.2100 rate:0.0769 gloss:-10.2808 dloss:0.6062 dlossR:0.5848 dlossQ:0.0214
Episode:7770 meanR:0.2100 rate:0.0000 gloss:-11.6299 dloss:0.0166 dlossR:0.0003 dlossQ:0.0163
Episode:7771 meanR:0.2300 rate:0.0769 gloss:-12.4839 dloss:0.7254 dlossR:0.7077 dlossQ:0.0176
Episode:7772 meanR:0.2300 rate:0.0000 gloss:-9.4909 dloss:0.0214 dlossR:0.0010 dlossQ:0.0204
Episode:7773 meanR:0.2300 rate:0.0000 gloss:-9.8661 dloss:0.0201 dlossR:0.0009 dlossQ:0.0192
Episode:7774 meanR:0.2300 rate:0.0000 gloss:-11.5635 d

Episode:7852 meanR:0.0800 rate:-0.0769 gloss:-13.3880 dloss:-0.7320 dlossR:-0.7467 dlossQ:0.0147
Episode:7853 meanR:0.1000 rate:0.0769 gloss:-11.0959 dloss:0.6525 dlossR:0.6306 dlossQ:0.0219
Episode:7854 meanR:0.0800 rate:-0.1538 gloss:-11.7444 dloss:-1.2744 dlossR:-1.2955 dlossQ:0.0211
Episode:7855 meanR:0.0900 rate:0.0769 gloss:-13.0289 dloss:0.7554 dlossR:0.7385 dlossQ:0.0170
Episode:7856 meanR:0.1100 rate:0.0769 gloss:-10.3445 dloss:0.6070 dlossR:0.5885 dlossQ:0.0185
Episode:7857 meanR:0.1100 rate:-0.0769 gloss:-10.8254 dloss:-0.5840 dlossR:-0.6020 dlossQ:0.0180
Episode:7858 meanR:0.1200 rate:0.0000 gloss:-8.9168 dloss:0.0247 dlossR:0.0015 dlossQ:0.0232
Episode:7859 meanR:0.1300 rate:0.1538 gloss:-13.2545 dloss:1.5293 dlossR:1.5156 dlossQ:0.0138
Episode:7860 meanR:0.1400 rate:-0.0769 gloss:-13.5604 dloss:-0.7285 dlossR:-0.7561 dlossQ:0.0276
Episode:7861 meanR:0.1200 rate:-0.0769 gloss:-12.6870 dloss:-0.6923 dlossR:-0.7068 dlossQ:0.0145
Episode:7862 meanR:0.1100 rate:-0.0769 gloss:-

Episode:7939 meanR:0.1200 rate:0.3846 gloss:-7.2817 dloss:2.2617 dlossR:2.2011 dlossQ:0.0606
Episode:7940 meanR:0.1300 rate:0.0000 gloss:-6.5741 dloss:0.0847 dlossR:0.0118 dlossQ:0.0730
Episode:7941 meanR:0.1500 rate:0.1538 gloss:-4.8437 dloss:0.7801 dlossR:0.6035 dlossQ:0.1766
Episode:7942 meanR:0.1400 rate:-0.1538 gloss:-5.3026 dloss:-0.4159 dlossR:-0.5436 dlossQ:0.1277
Episode:7943 meanR:0.1700 rate:0.2308 gloss:-3.8894 dloss:1.0574 dlossR:0.7709 dlossQ:0.2864
Episode:7944 meanR:0.1700 rate:0.0000 gloss:-4.3802 dloss:0.2164 dlossR:0.0407 dlossQ:0.1757
Episode:7945 meanR:0.2100 rate:0.0769 gloss:-3.0166 dloss:0.7356 dlossR:0.3117 dlossQ:0.4238
Episode:7946 meanR:0.2100 rate:0.0769 gloss:-2.9435 dloss:0.7287 dlossR:0.3014 dlossQ:0.4273
Episode:7947 meanR:0.2100 rate:-0.0769 gloss:-2.6789 dloss:0.5069 dlossR:0.0370 dlossQ:0.4699
Episode:7948 meanR:0.2200 rate:-0.0769 gloss:-2.8349 dloss:0.3981 dlossR:-0.0001 dlossQ:0.3982
Episode:7949 meanR:0.1900 rate:-0.1538 gloss:-3.8782 dloss:-0.07

Episode:8026 meanR:0.1500 rate:0.0000 gloss:-18.9913 dloss:0.0211 dlossR:0.0000 dlossQ:0.0211
Episode:8027 meanR:0.1500 rate:0.0000 gloss:-17.7033 dloss:0.0244 dlossR:0.0000 dlossQ:0.0244
Episode:8028 meanR:0.1300 rate:-0.1538 gloss:-18.3201 dloss:-2.0139 dlossR:-2.0349 dlossQ:0.0210
Episode:8029 meanR:0.1300 rate:0.0000 gloss:-23.4044 dloss:0.0229 dlossR:0.0000 dlossQ:0.0229
Episode:8030 meanR:0.1200 rate:-0.0769 gloss:-15.9921 dloss:-0.8724 dlossR:-0.8930 dlossQ:0.0205
Episode:8031 meanR:0.1200 rate:0.0000 gloss:-14.3186 dloss:0.0163 dlossR:0.0001 dlossQ:0.0162
Episode:8032 meanR:0.1200 rate:0.0000 gloss:-16.2585 dloss:0.0311 dlossR:0.0000 dlossQ:0.0310
Episode:8033 meanR:0.1100 rate:-0.0769 gloss:-15.6185 dloss:-0.8495 dlossR:-0.8714 dlossQ:0.0219
Episode:8034 meanR:0.0800 rate:-0.2308 gloss:-17.4592 dloss:-2.8699 dlossR:-2.8893 dlossQ:0.0194
Episode:8035 meanR:0.0800 rate:0.0000 gloss:-19.4802 dloss:0.0219 dlossR:0.0000 dlossQ:0.0219
Episode:8036 meanR:0.0800 rate:0.1538 gloss:-20.

Episode:8112 meanR:-0.0300 rate:-0.0769 gloss:-16.4343 dloss:-0.9009 dlossR:-0.9190 dlossQ:0.0181
Episode:8113 meanR:-0.0600 rate:-0.1538 gloss:-18.4824 dloss:-2.0362 dlossR:-2.0522 dlossQ:0.0160
Episode:8114 meanR:-0.0700 rate:0.0769 gloss:-16.5891 dloss:0.9617 dlossR:0.9396 dlossQ:0.0221
Episode:8115 meanR:-0.0500 rate:0.0769 gloss:-30.6852 dloss:1.7775 dlossR:1.7294 dlossQ:0.0481
Episode:8116 meanR:-0.0400 rate:0.0769 gloss:-16.0238 dloss:0.9336 dlossR:0.9086 dlossQ:0.0250
Episode:8117 meanR:-0.0300 rate:0.0000 gloss:-13.0233 dloss:0.0186 dlossR:0.0002 dlossQ:0.0183
Episode:8118 meanR:-0.0400 rate:-0.0769 gloss:-15.9601 dloss:-0.8741 dlossR:-0.8922 dlossQ:0.0181
Episode:8119 meanR:-0.0400 rate:0.0000 gloss:-12.9171 dloss:0.0170 dlossR:0.0001 dlossQ:0.0169
Episode:8120 meanR:-0.0400 rate:0.0000 gloss:-19.2542 dloss:0.0273 dlossR:0.0000 dlossQ:0.0273
Episode:8121 meanR:-0.0400 rate:0.0000 gloss:-18.1460 dloss:0.0188 dlossR:0.0000 dlossQ:0.0188
Episode:8122 meanR:-0.0300 rate:0.0769 gl

Episode:8199 meanR:0.4800 rate:-0.2308 gloss:-3.8782 dloss:-0.2443 dlossR:-0.5120 dlossQ:0.2677
Episode:8200 meanR:0.4900 rate:0.0769 gloss:-3.4826 dloss:0.6169 dlossR:0.2891 dlossQ:0.3278
Episode:8201 meanR:0.5000 rate:0.0769 gloss:-4.2324 dloss:0.5241 dlossR:0.2956 dlossQ:0.2286
Episode:8202 meanR:0.4600 rate:-0.2308 gloss:-4.3937 dloss:-0.3951 dlossR:-0.6194 dlossQ:0.2243
Episode:8203 meanR:0.4500 rate:-0.0769 gloss:-4.8232 dloss:-0.0295 dlossR:-0.2174 dlossQ:0.1879
Episode:8204 meanR:0.4300 rate:-0.0769 gloss:-5.3927 dloss:-0.1363 dlossR:-0.2696 dlossQ:0.1333
Episode:8205 meanR:0.4500 rate:0.1538 gloss:-6.6724 dloss:0.8624 dlossR:0.7839 dlossQ:0.0786
Episode:8206 meanR:0.4600 rate:0.0769 gloss:-6.0008 dloss:0.4534 dlossR:0.3584 dlossQ:0.0950
Episode:8207 meanR:0.4600 rate:0.0769 gloss:-6.1691 dloss:0.4574 dlossR:0.3665 dlossQ:0.0909
Episode:8208 meanR:0.4700 rate:0.0769 gloss:-6.7958 dloss:0.4572 dlossR:0.3961 dlossQ:0.0611
Episode:8209 meanR:0.4600 rate:0.1538 gloss:-6.9197 dloss:

Episode:8287 meanR:0.2100 rate:0.0000 gloss:-4.9828 dloss:0.1750 dlossR:0.0301 dlossQ:0.1449
Episode:8288 meanR:0.2000 rate:0.0000 gloss:-5.3162 dloss:0.1439 dlossR:0.0234 dlossQ:0.1206
Episode:8289 meanR:0.2000 rate:0.0000 gloss:-5.6225 dloss:0.1290 dlossR:0.0207 dlossQ:0.1083
Episode:8290 meanR:0.1900 rate:-0.0769 gloss:-5.5059 dloss:-0.1614 dlossR:-0.2793 dlossQ:0.1180
Episode:8291 meanR:0.1900 rate:-0.0769 gloss:-6.3815 dloss:-0.2576 dlossR:-0.3381 dlossQ:0.0805
Episode:8292 meanR:0.1600 rate:0.0000 gloss:-6.2572 dloss:0.0953 dlossR:0.0136 dlossQ:0.0817
Episode:8293 meanR:0.1300 rate:0.0000 gloss:-6.5537 dloss:0.0802 dlossR:0.0108 dlossQ:0.0694
Episode:8294 meanR:0.1200 rate:-0.0769 gloss:-6.9371 dloss:-0.3113 dlossR:-0.3742 dlossQ:0.0629
Episode:8295 meanR:0.1400 rate:0.0769 gloss:-7.5511 dloss:0.4869 dlossR:0.4373 dlossQ:0.0496
Episode:8296 meanR:0.1400 rate:0.0769 gloss:-6.5491 dloss:0.4528 dlossR:0.3838 dlossQ:0.0691
Episode:8297 meanR:0.1100 rate:-0.0769 gloss:-7.4785 dloss:-0

Episode:8373 meanR:-0.2700 rate:0.0000 gloss:-24.0816 dloss:0.0298 dlossR:0.0000 dlossQ:0.0298
Episode:8374 meanR:-0.2700 rate:0.0000 gloss:-24.5261 dloss:0.0290 dlossR:0.0000 dlossQ:0.0290
Episode:8375 meanR:-0.2800 rate:-0.0769 gloss:-35.4663 dloss:-1.9578 dlossR:-1.9884 dlossQ:0.0306
Episode:8376 meanR:-0.3100 rate:0.0000 gloss:-27.9529 dloss:0.0444 dlossR:0.0000 dlossQ:0.0444
Episode:8377 meanR:-0.3400 rate:0.0000 gloss:-32.1892 dloss:0.0439 dlossR:0.0000 dlossQ:0.0439
Episode:8378 meanR:-0.3500 rate:0.0000 gloss:-25.0131 dloss:0.0245 dlossR:0.0000 dlossQ:0.0245
Episode:8379 meanR:-0.3600 rate:-0.0769 gloss:-28.9063 dloss:-1.5887 dlossR:-1.6189 dlossQ:0.0302
Episode:8380 meanR:-0.3600 rate:0.0000 gloss:-35.6940 dloss:0.0379 dlossR:0.0000 dlossQ:0.0379
Episode:8381 meanR:-0.3600 rate:0.0769 gloss:-30.2708 dloss:1.7432 dlossR:1.7082 dlossQ:0.0350
Episode:8382 meanR:-0.3500 rate:0.0000 gloss:-35.4699 dloss:0.0505 dlossR:0.0000 dlossQ:0.0505
Episode:8383 meanR:-0.3300 rate:0.0000 gloss

Episode:8459 meanR:-0.1300 rate:0.0000 gloss:-43.0938 dloss:0.0600 dlossR:0.0000 dlossQ:0.0600
Episode:8460 meanR:-0.1400 rate:0.0000 gloss:-47.7889 dloss:0.0542 dlossR:0.0000 dlossQ:0.0542
Episode:8461 meanR:-0.1300 rate:0.0000 gloss:-48.0019 dloss:0.0718 dlossR:0.0000 dlossQ:0.0718
Episode:8462 meanR:-0.1400 rate:-0.0769 gloss:-51.0977 dloss:-2.8221 dlossR:-2.8678 dlossQ:0.0457
Episode:8463 meanR:-0.1400 rate:0.0000 gloss:-51.1116 dloss:0.0510 dlossR:0.0000 dlossQ:0.0510
Episode:8464 meanR:-0.1400 rate:0.0000 gloss:-43.4898 dloss:0.0510 dlossR:0.0000 dlossQ:0.0510
Episode:8465 meanR:-0.1300 rate:-0.0769 gloss:-42.2276 dloss:-2.3164 dlossR:-2.3712 dlossQ:0.0548
Episode:8466 meanR:-0.1500 rate:0.0000 gloss:-40.2999 dloss:0.0580 dlossR:0.0000 dlossQ:0.0580
Episode:8467 meanR:-0.1200 rate:0.0000 gloss:-38.5017 dloss:0.0487 dlossR:0.0000 dlossQ:0.0487
Episode:8468 meanR:-0.1200 rate:0.0000 gloss:-45.8383 dloss:0.0584 dlossR:0.0000 dlossQ:0.0584
Episode:8469 meanR:-0.1100 rate:0.0769 gloss

Episode:8546 meanR:0.0500 rate:-0.0769 gloss:-44.2139 dloss:-2.4502 dlossR:-2.4783 dlossQ:0.0281
Episode:8547 meanR:0.0500 rate:0.0000 gloss:-32.3057 dloss:0.0402 dlossR:0.0000 dlossQ:0.0402
Episode:8548 meanR:0.0500 rate:0.0000 gloss:-35.8345 dloss:0.0437 dlossR:0.0000 dlossQ:0.0437
Episode:8549 meanR:0.0700 rate:0.1538 gloss:-43.1585 dloss:4.9470 dlossR:4.8757 dlossQ:0.0713
Episode:8550 meanR:0.0600 rate:0.0000 gloss:-38.1064 dloss:0.0754 dlossR:0.0000 dlossQ:0.0754
Episode:8551 meanR:0.0500 rate:-0.1538 gloss:-34.5578 dloss:-3.8287 dlossR:-3.8620 dlossQ:0.0333
Episode:8552 meanR:-0.0100 rate:-0.0769 gloss:-31.7071 dloss:-1.7413 dlossR:-1.7793 dlossQ:0.0380
Episode:8553 meanR:-0.0100 rate:0.0000 gloss:-45.5093 dloss:0.0480 dlossR:0.0000 dlossQ:0.0480
Episode:8554 meanR:-0.0100 rate:0.0000 gloss:-40.3308 dloss:0.0448 dlossR:0.0000 dlossQ:0.0448
Episode:8555 meanR:-0.0300 rate:-0.0769 gloss:-51.2349 dloss:-2.8157 dlossR:-2.8767 dlossQ:0.0610
Episode:8556 meanR:-0.0300 rate:0.0000 gloss

Episode:8632 meanR:-0.0500 rate:-0.0769 gloss:-30.9442 dloss:-1.6937 dlossR:-1.7344 dlossQ:0.0407
Episode:8633 meanR:-0.0400 rate:0.0769 gloss:-30.5334 dloss:1.7593 dlossR:1.7234 dlossQ:0.0358
Episode:8634 meanR:-0.0400 rate:-0.1538 gloss:-33.2757 dloss:-3.6802 dlossR:-3.7168 dlossQ:0.0366
Episode:8635 meanR:-0.0400 rate:0.0000 gloss:-33.7408 dloss:0.0428 dlossR:0.0000 dlossQ:0.0428
Episode:8636 meanR:-0.0100 rate:0.0769 gloss:-35.0348 dloss:2.0225 dlossR:1.9756 dlossQ:0.0469
Episode:8637 meanR:0.0000 rate:0.0769 gloss:-34.1366 dloss:1.9812 dlossR:1.9250 dlossQ:0.0562
Episode:8638 meanR:0.0000 rate:0.0000 gloss:-33.9274 dloss:0.0453 dlossR:0.0000 dlossQ:0.0453
Episode:8639 meanR:-0.0100 rate:-0.0769 gloss:-39.0108 dloss:-2.1424 dlossR:-2.1859 dlossQ:0.0435
Episode:8640 meanR:-0.0500 rate:-0.3077 gloss:-31.3509 dloss:-6.9131 dlossR:-6.9523 dlossQ:0.0392
Episode:8641 meanR:-0.0400 rate:0.0000 gloss:-34.7450 dloss:0.0383 dlossR:0.0000 dlossQ:0.0383
Episode:8642 meanR:-0.0400 rate:0.0000 g

Episode:8718 meanR:-0.1100 rate:-0.0769 gloss:-29.9541 dloss:-1.6591 dlossR:-1.6889 dlossQ:0.0299
Episode:8719 meanR:-0.1300 rate:-0.1538 gloss:-43.8497 dloss:-4.8594 dlossR:-4.9075 dlossQ:0.0481
Episode:8720 meanR:-0.1300 rate:0.0000 gloss:-47.4660 dloss:0.0609 dlossR:0.0000 dlossQ:0.0609
Episode:8721 meanR:-0.1400 rate:-0.0769 gloss:-52.8253 dloss:-2.8910 dlossR:-2.9624 dlossQ:0.0714
Episode:8722 meanR:-0.1700 rate:-0.2308 gloss:-44.6518 dloss:-7.4374 dlossR:-7.4787 dlossQ:0.0414
Episode:8723 meanR:-0.1600 rate:0.0000 gloss:-44.6982 dloss:0.0459 dlossR:0.0000 dlossQ:0.0459
Episode:8724 meanR:-0.1700 rate:0.0000 gloss:-46.9107 dloss:0.0688 dlossR:0.0000 dlossQ:0.0688
Episode:8725 meanR:-0.1800 rate:-0.0769 gloss:-49.2251 dloss:-2.6997 dlossR:-2.7609 dlossQ:0.0611
Episode:8726 meanR:-0.1800 rate:0.0000 gloss:-57.5962 dloss:0.0579 dlossR:0.0000 dlossQ:0.0579
Episode:8727 meanR:-0.1600 rate:0.0769 gloss:-58.7983 dloss:3.3663 dlossR:3.3094 dlossQ:0.0569
Episode:8728 meanR:-0.1800 rate:0.0

Episode:8804 meanR:-0.2300 rate:0.0000 gloss:-127.5999 dloss:0.1615 dlossR:0.0000 dlossQ:0.1615
Episode:8805 meanR:-0.2300 rate:0.0000 gloss:-92.9250 dloss:0.1095 dlossR:0.0000 dlossQ:0.1095
Episode:8806 meanR:-0.2400 rate:0.0000 gloss:-118.7441 dloss:0.1566 dlossR:0.0000 dlossQ:0.1566
Episode:8807 meanR:-0.2300 rate:0.0000 gloss:-175.6652 dloss:0.1794 dlossR:0.0000 dlossQ:0.1794
Episode:8808 meanR:-0.2400 rate:-0.0769 gloss:-115.6276 dloss:-6.3410 dlossR:-6.4946 dlossQ:0.1536
Episode:8809 meanR:-0.2400 rate:0.0000 gloss:-98.5836 dloss:0.1358 dlossR:0.0000 dlossQ:0.1358
Episode:8810 meanR:-0.2700 rate:-0.1538 gloss:-99.4583 dloss:-11.0378 dlossR:-11.1542 dlossQ:0.1164
Episode:8811 meanR:-0.2700 rate:0.0000 gloss:-125.8384 dloss:0.1344 dlossR:0.0000 dlossQ:0.1344
Episode:8812 meanR:-0.2700 rate:-0.0769 gloss:-122.6748 dloss:-6.7655 dlossR:-6.8893 dlossQ:0.1238
Episode:8813 meanR:-0.2700 rate:-0.0769 gloss:-84.4679 dloss:-4.6355 dlossR:-4.7419 dlossQ:0.1064
Episode:8814 meanR:-0.3000 rat

Episode:8889 meanR:-0.1400 rate:0.0000 gloss:-188.7096 dloss:0.2114 dlossR:0.0000 dlossQ:0.2114
Episode:8890 meanR:-0.1400 rate:0.0000 gloss:-118.9696 dloss:0.1504 dlossR:0.0000 dlossQ:0.1504
Episode:8891 meanR:-0.1200 rate:0.1538 gloss:-164.9827 dloss:18.7347 dlossR:18.5814 dlossQ:0.1533
Episode:8892 meanR:-0.1300 rate:-0.0769 gloss:-191.6679 dloss:-10.4087 dlossR:-10.7799 dlossQ:0.3712
Episode:8893 meanR:-0.1300 rate:0.0000 gloss:-251.1096 dloss:0.2481 dlossR:0.0000 dlossQ:0.2481
Episode:8894 meanR:-0.1300 rate:0.0000 gloss:-175.5724 dloss:0.1529 dlossR:0.0000 dlossQ:0.1529
Episode:8895 meanR:-0.1300 rate:0.0000 gloss:-246.8637 dloss:0.1906 dlossR:0.0000 dlossQ:0.1906
Episode:8896 meanR:-0.1100 rate:0.0000 gloss:-160.3811 dloss:0.2262 dlossR:0.0000 dlossQ:0.2262
Episode:8897 meanR:-0.1000 rate:0.0000 gloss:-201.5091 dloss:0.2668 dlossR:0.0000 dlossQ:0.2668
Episode:8898 meanR:-0.1000 rate:0.0000 gloss:-186.6858 dloss:0.2271 dlossR:0.0000 dlossQ:0.2271
Episode:8899 meanR:-0.1000 rate:0

Episode:8975 meanR:0.3100 rate:0.0000 gloss:-37.7887 dloss:0.0538 dlossR:0.0055 dlossQ:0.0482
Episode:8976 meanR:0.3200 rate:0.0769 gloss:-70.1204 dloss:4.0728 dlossR:3.9562 dlossQ:0.1166
Episode:8977 meanR:0.3200 rate:0.0000 gloss:-92.2727 dloss:0.0738 dlossR:0.0003 dlossQ:0.0735
Episode:8978 meanR:0.3100 rate:-0.0769 gloss:-86.3041 dloss:-4.4836 dlossR:-4.8183 dlossQ:0.3347
Episode:8979 meanR:0.3000 rate:-0.0769 gloss:-20.9667 dloss:-0.3940 dlossR:-0.8600 dlossQ:0.4660
Episode:8980 meanR:0.3000 rate:-0.1538 gloss:-9.8288 dloss:0.5447 dlossR:-0.0582 dlossQ:0.6029
Episode:8981 meanR:0.2900 rate:-0.0769 gloss:-15.4679 dloss:-0.2329 dlossR:-0.5114 dlossQ:0.2786
Episode:8982 meanR:0.2500 rate:-0.2308 gloss:-23.3272 dloss:-3.8317 dlossR:-3.8838 dlossQ:0.0521
Episode:8983 meanR:0.2500 rate:-0.0769 gloss:-32.2208 dloss:-1.7799 dlossR:-1.8161 dlossQ:0.0363
Episode:8984 meanR:0.2500 rate:0.0000 gloss:-51.7121 dloss:0.0725 dlossR:0.0000 dlossQ:0.0725
Episode:8985 meanR:0.2300 rate:0.0000 gloss:

Episode:9062 meanR:0.0800 rate:-0.1538 gloss:-30.2573 dloss:-3.3435 dlossR:-3.3796 dlossQ:0.0361
Episode:9063 meanR:0.0800 rate:0.0000 gloss:-25.0935 dloss:0.0344 dlossR:0.0000 dlossQ:0.0344
Episode:9064 meanR:0.0800 rate:0.0769 gloss:-23.5157 dloss:1.3551 dlossR:1.3304 dlossQ:0.0247
Episode:9065 meanR:0.0700 rate:-0.0769 gloss:-29.5225 dloss:-1.6279 dlossR:-1.6658 dlossQ:0.0379
Episode:9066 meanR:0.0900 rate:0.0000 gloss:-24.0336 dloss:0.0265 dlossR:0.0000 dlossQ:0.0265
Episode:9067 meanR:0.1100 rate:0.1538 gloss:-26.9042 dloss:3.0980 dlossR:3.0697 dlossQ:0.0282
Episode:9068 meanR:0.1000 rate:-0.1538 gloss:-60.4880 dloss:-6.7613 dlossR:-6.8055 dlossQ:0.0441
Episode:9069 meanR:0.0800 rate:0.0000 gloss:-44.4229 dloss:0.0432 dlossR:0.0000 dlossQ:0.0432
Episode:9070 meanR:0.0900 rate:0.0000 gloss:-103.7422 dloss:0.3441 dlossR:0.0000 dlossQ:0.3441
Episode:9071 meanR:0.0700 rate:0.0000 gloss:-128.1250 dloss:0.1109 dlossR:0.0000 dlossQ:0.1109
Episode:9072 meanR:0.0500 rate:0.0000 gloss:-74.0

Episode:9149 meanR:0.3500 rate:0.4615 gloss:-11.3317 dloss:4.2407 dlossR:4.0734 dlossQ:0.1673
Episode:9150 meanR:0.3600 rate:0.0769 gloss:-11.4376 dloss:0.7224 dlossR:0.6560 dlossQ:0.0664
Episode:9151 meanR:0.3500 rate:0.0769 gloss:-13.4537 dloss:0.7990 dlossR:0.7640 dlossQ:0.0350
Episode:9152 meanR:0.3600 rate:0.0000 gloss:-6.8072 dloss:0.0645 dlossR:0.0075 dlossQ:0.0569
Episode:9153 meanR:0.3800 rate:0.0769 gloss:-18.3793 dloss:1.0614 dlossR:1.0379 dlossQ:0.0235
Episode:9154 meanR:0.3800 rate:0.0000 gloss:-17.6423 dloss:0.1347 dlossR:0.0094 dlossQ:0.1253
Episode:9155 meanR:0.3700 rate:-0.0769 gloss:-34.4079 dloss:-1.5247 dlossR:-1.9195 dlossQ:0.3948
Episode:9156 meanR:0.3700 rate:-0.0769 gloss:-19.8936 dloss:-0.9625 dlossR:-1.1083 dlossQ:0.1458
Episode:9157 meanR:0.4000 rate:0.0000 gloss:-11.9753 dloss:0.1140 dlossR:0.0141 dlossQ:0.0998
Episode:9158 meanR:0.4200 rate:0.1538 gloss:-10.9010 dloss:1.3926 dlossR:1.2747 dlossQ:0.1179
Episode:9159 meanR:0.4400 rate:-0.0769 gloss:-8.2200 dl

Episode:9237 meanR:0.7200 rate:0.0769 gloss:-4.6381 dloss:0.5281 dlossR:0.3136 dlossQ:0.2145
Episode:9238 meanR:0.7600 rate:0.3077 gloss:-5.0306 dloss:1.5119 dlossR:1.2682 dlossQ:0.2437
Episode:9239 meanR:0.7700 rate:0.0000 gloss:-4.0200 dloss:0.3051 dlossR:0.0625 dlossQ:0.2426
Episode:9240 meanR:0.7700 rate:0.0000 gloss:-7.9406 dloss:0.2381 dlossR:0.0435 dlossQ:0.1947
Episode:9241 meanR:0.7800 rate:0.0769 gloss:-5.3617 dloss:0.6125 dlossR:0.3648 dlossQ:0.2477
Episode:9242 meanR:0.7700 rate:0.0000 gloss:-4.7118 dloss:0.2972 dlossR:0.0590 dlossQ:0.2383
Episode:9243 meanR:0.7700 rate:0.0000 gloss:-5.0484 dloss:0.3158 dlossR:0.0600 dlossQ:0.2558
Episode:9244 meanR:0.7800 rate:0.0000 gloss:-3.5751 dloss:0.3288 dlossR:0.0709 dlossQ:0.2579
Episode:9245 meanR:0.7800 rate:0.0000 gloss:-4.3250 dloss:0.2265 dlossR:0.0430 dlossQ:0.1835
Episode:9246 meanR:0.8000 rate:0.1538 gloss:-4.4220 dloss:0.8079 dlossR:0.5905 dlossQ:0.2174
Episode:9247 meanR:0.8100 rate:0.0769 gloss:-4.3793 dloss:0.5346 dloss

Episode:9325 meanR:0.5900 rate:0.0000 gloss:-7.1851 dloss:0.1070 dlossR:0.0138 dlossQ:0.0931
Episode:9326 meanR:0.5700 rate:0.0000 gloss:-7.9031 dloss:0.0437 dlossR:0.0045 dlossQ:0.0392
Episode:9327 meanR:0.5600 rate:0.0000 gloss:-10.3490 dloss:0.0642 dlossR:0.0064 dlossQ:0.0578
Episode:9328 meanR:0.5500 rate:-0.0769 gloss:-9.1104 dloss:-0.4570 dlossR:-0.5027 dlossQ:0.0457
Episode:9329 meanR:0.5100 rate:-0.1538 gloss:-9.9143 dloss:-1.0418 dlossR:-1.0854 dlossQ:0.0436
Episode:9330 meanR:0.5100 rate:0.0769 gloss:-24.7471 dloss:1.4721 dlossR:1.3963 dlossQ:0.0758
Episode:9331 meanR:0.5200 rate:0.0769 gloss:-8.9401 dloss:0.5326 dlossR:0.5095 dlossQ:0.0231
Episode:9332 meanR:0.5300 rate:0.0769 gloss:-9.6830 dloss:0.5744 dlossR:0.5512 dlossQ:0.0232
Episode:9333 meanR:0.5300 rate:0.0000 gloss:-9.8519 dloss:0.0266 dlossR:0.0017 dlossQ:0.0250
Episode:9334 meanR:0.5500 rate:0.1538 gloss:-9.9054 dloss:1.1651 dlossR:1.1385 dlossQ:0.0266
Episode:9335 meanR:0.5300 rate:-0.0769 gloss:-9.4757 dloss:-0.

Episode:9413 meanR:0.6900 rate:0.3077 gloss:-5.7063 dloss:1.5084 dlossR:1.3933 dlossQ:0.1151
Episode:9414 meanR:0.7000 rate:0.0769 gloss:-5.4988 dloss:0.4375 dlossR:0.3331 dlossQ:0.1044
Episode:9415 meanR:0.7000 rate:0.0000 gloss:-6.8951 dloss:0.1649 dlossR:0.0215 dlossQ:0.1434
Episode:9416 meanR:0.7200 rate:0.0769 gloss:-5.3353 dloss:0.4820 dlossR:0.3337 dlossQ:0.1483
Episode:9417 meanR:0.7300 rate:0.0769 gloss:-5.2431 dloss:0.6070 dlossR:0.3473 dlossQ:0.2598
Episode:9418 meanR:0.7300 rate:0.0769 gloss:-4.3793 dloss:0.4875 dlossR:0.2941 dlossQ:0.1935
Episode:9419 meanR:0.7400 rate:0.0000 gloss:-3.8345 dloss:0.3873 dlossR:0.0815 dlossQ:0.3057
Episode:9420 meanR:0.7300 rate:0.0000 gloss:-3.8817 dloss:0.3135 dlossR:0.0649 dlossQ:0.2486
Episode:9421 meanR:0.7400 rate:0.1538 gloss:-4.0254 dloss:0.9290 dlossR:0.5616 dlossQ:0.3674
Episode:9422 meanR:0.7500 rate:0.0769 gloss:-3.7263 dloss:0.6271 dlossR:0.2981 dlossQ:0.3290
Episode:9423 meanR:0.7600 rate:0.0769 gloss:-3.3601 dloss:0.5772 dloss

Episode:9501 meanR:0.7300 rate:-0.1538 gloss:-6.1174 dloss:-0.5246 dlossR:-0.6408 dlossQ:0.1163
Episode:9502 meanR:0.7400 rate:0.0769 gloss:-4.7170 dloss:0.5200 dlossR:0.3154 dlossQ:0.2045
Episode:9503 meanR:0.7200 rate:-0.1538 gloss:-5.8362 dloss:-0.4531 dlossR:-0.5990 dlossQ:0.1458
Episode:9504 meanR:0.7500 rate:0.2308 gloss:-3.8502 dloss:1.0672 dlossR:0.7721 dlossQ:0.2952
Episode:9505 meanR:0.7500 rate:0.1538 gloss:-5.2868 dloss:0.8254 dlossR:0.6534 dlossQ:0.1720
Episode:9506 meanR:0.7800 rate:0.2308 gloss:-3.3362 dloss:1.0216 dlossR:0.6977 dlossQ:0.3239
Episode:9507 meanR:0.7700 rate:-0.0769 gloss:-3.4695 dloss:0.2242 dlossR:-0.0857 dlossQ:0.3098
Episode:9508 meanR:0.8000 rate:0.3077 gloss:-2.8488 dloss:1.2525 dlossR:0.8497 dlossQ:0.4028
Episode:9509 meanR:0.7900 rate:-0.0769 gloss:-3.2335 dloss:0.3038 dlossR:-0.0493 dlossQ:0.3531
Episode:9510 meanR:0.8000 rate:0.0769 gloss:-4.0432 dloss:0.5489 dlossR:0.3017 dlossQ:0.2472
Episode:9511 meanR:0.7900 rate:0.0000 gloss:-3.7073 dloss:0.

Episode:9588 meanR:0.0000 rate:0.0000 gloss:-21.2242 dloss:0.0231 dlossR:0.0000 dlossQ:0.0231
Episode:9589 meanR:-0.0200 rate:-0.0769 gloss:-22.3089 dloss:-1.2224 dlossR:-1.2477 dlossQ:0.0252
Episode:9590 meanR:-0.0200 rate:0.0000 gloss:-21.6473 dloss:0.0269 dlossR:0.0000 dlossQ:0.0269
Episode:9591 meanR:-0.0200 rate:0.0000 gloss:-21.3112 dloss:0.0257 dlossR:0.0000 dlossQ:0.0257
Episode:9592 meanR:-0.0400 rate:-0.1538 gloss:-23.0728 dloss:-2.5451 dlossR:-2.5709 dlossQ:0.0258
Episode:9593 meanR:-0.0900 rate:-0.0769 gloss:-24.5774 dloss:-1.3500 dlossR:-1.3793 dlossQ:0.0292
Episode:9594 meanR:-0.1000 rate:0.0000 gloss:-21.1900 dloss:0.0239 dlossR:0.0000 dlossQ:0.0239
Episode:9595 meanR:-0.1100 rate:0.0000 gloss:-22.3028 dloss:0.0302 dlossR:0.0000 dlossQ:0.0302
Episode:9596 meanR:-0.1700 rate:-0.1538 gloss:-23.7404 dloss:-2.6234 dlossR:-2.6473 dlossQ:0.0239
Episode:9597 meanR:-0.1600 rate:0.0769 gloss:-26.7337 dloss:1.5378 dlossR:1.5082 dlossQ:0.0295
Episode:9598 meanR:-0.1500 rate:0.0769 

Episode:9674 meanR:-0.1600 rate:-0.2308 gloss:-44.1005 dloss:-7.3332 dlossR:-7.3827 dlossQ:0.0495
Episode:9675 meanR:-0.1600 rate:0.0000 gloss:-45.3319 dloss:0.0924 dlossR:0.0000 dlossQ:0.0924
Episode:9676 meanR:-0.1700 rate:0.0000 gloss:-42.2084 dloss:0.0552 dlossR:0.0000 dlossQ:0.0552
Episode:9677 meanR:-0.2000 rate:-0.0769 gloss:-45.2193 dloss:-2.4988 dlossR:-2.5342 dlossQ:0.0354
Episode:9678 meanR:-0.1600 rate:0.3077 gloss:-25.2148 dloss:5.8068 dlossR:5.7741 dlossQ:0.0327
Episode:9679 meanR:-0.1600 rate:-0.0769 gloss:-28.1074 dloss:-1.5434 dlossR:-1.5756 dlossQ:0.0322
Episode:9680 meanR:-0.1600 rate:0.0000 gloss:-30.1821 dloss:0.0359 dlossR:0.0000 dlossQ:0.0359
Episode:9681 meanR:-0.1600 rate:-0.2308 gloss:-28.9624 dloss:-4.8362 dlossR:-4.8640 dlossQ:0.0278
Episode:9682 meanR:-0.2000 rate:-0.3077 gloss:-29.1039 dloss:-6.4284 dlossR:-6.4670 dlossQ:0.0386
Episode:9683 meanR:-0.1800 rate:0.0000 gloss:-29.5374 dloss:0.0347 dlossR:0.0000 dlossQ:0.0347
Episode:9684 meanR:-0.1900 rate:-0.

Episode:9760 meanR:-0.0700 rate:-0.0769 gloss:-33.5847 dloss:-1.8225 dlossR:-1.8773 dlossQ:0.0547
Episode:9761 meanR:-0.0300 rate:0.1538 gloss:-55.1957 dloss:6.3635 dlossR:6.2435 dlossQ:0.1199
Episode:9762 meanR:-0.0400 rate:-0.0769 gloss:-51.6048 dloss:-2.8639 dlossR:-2.8935 dlossQ:0.0296
Episode:9763 meanR:-0.0600 rate:-0.1538 gloss:-36.2039 dloss:-3.9565 dlossR:-4.0443 dlossQ:0.0878
Episode:9764 meanR:-0.0700 rate:-0.0769 gloss:-30.0396 dloss:-1.6398 dlossR:-1.6834 dlossQ:0.0435
Episode:9765 meanR:-0.0200 rate:0.2308 gloss:-15.2694 dloss:2.7276 dlossR:2.6430 dlossQ:0.0846
Episode:9766 meanR:0.0200 rate:0.0769 gloss:-20.0347 dloss:1.1550 dlossR:1.1351 dlossQ:0.0199
Episode:9767 meanR:0.0300 rate:0.1538 gloss:-23.7041 dloss:2.8349 dlossR:2.6989 dlossQ:0.1361
Episode:9768 meanR:0.0200 rate:0.0769 gloss:-31.0492 dloss:1.7885 dlossR:1.7553 dlossQ:0.0332
Episode:9769 meanR:0.0300 rate:0.0000 gloss:-24.6719 dloss:0.0387 dlossR:0.0004 dlossQ:0.0384
Episode:9770 meanR:0.0400 rate:0.3077 glos

Episode:9847 meanR:0.6300 rate:0.0000 gloss:-2.2430 dloss:1.3269 dlossR:0.4379 dlossQ:0.8891
Episode:9848 meanR:0.6700 rate:0.3077 gloss:-1.9031 dloss:1.5929 dlossR:0.8551 dlossQ:0.7377
Episode:9849 meanR:0.6400 rate:-0.0769 gloss:-3.4018 dloss:0.5510 dlossR:0.0286 dlossQ:0.5224
Episode:9850 meanR:0.6700 rate:0.1538 gloss:-3.8942 dloss:0.8893 dlossR:0.5504 dlossQ:0.3389
Episode:9851 meanR:0.7200 rate:0.5385 gloss:-5.1016 dloss:2.8925 dlossR:2.3566 dlossQ:0.5359
Episode:9852 meanR:0.7100 rate:0.0769 gloss:-6.8252 dloss:1.7835 dlossR:0.5812 dlossQ:1.2022
Episode:9853 meanR:0.7100 rate:0.0000 gloss:-8.7604 dloss:1.8651 dlossR:0.2877 dlossQ:1.5774
Episode:9854 meanR:0.7000 rate:0.0000 gloss:-5.9707 dloss:1.6105 dlossR:0.3240 dlossQ:1.2865
Episode:9855 meanR:0.7100 rate:0.0769 gloss:-2.7940 dloss:0.9468 dlossR:0.3722 dlossQ:0.5746
Episode:9856 meanR:0.6900 rate:0.0000 gloss:-4.8131 dloss:0.4609 dlossR:0.1008 dlossQ:0.3601
Episode:9857 meanR:0.7200 rate:0.0769 gloss:-3.8897 dloss:0.7440 dlos

Episode:9935 meanR:0.6400 rate:0.0000 gloss:-6.1011 dloss:0.1257 dlossR:0.0182 dlossQ:0.1075
Episode:9936 meanR:0.6500 rate:0.0000 gloss:-4.4786 dloss:0.2086 dlossR:0.0385 dlossQ:0.1701
Episode:9937 meanR:0.6300 rate:0.0769 gloss:-4.8932 dloss:0.4535 dlossR:0.3088 dlossQ:0.1447
Episode:9938 meanR:0.6100 rate:-0.0769 gloss:-5.4483 dloss:-0.1292 dlossR:-0.2706 dlossQ:0.1415
Episode:9939 meanR:0.5900 rate:0.0769 gloss:-6.0639 dloss:0.4983 dlossR:0.3746 dlossQ:0.1237
Episode:9940 meanR:0.5800 rate:-0.1538 gloss:-10.4646 dloss:-1.0424 dlossR:-1.1419 dlossQ:0.0995
Episode:9941 meanR:0.6100 rate:0.0000 gloss:-11.0599 dloss:0.2458 dlossR:0.0161 dlossQ:0.2297
Episode:9942 meanR:0.5400 rate:0.1538 gloss:-6.2311 dloss:0.8342 dlossR:0.7385 dlossQ:0.0957
Episode:9943 meanR:0.5600 rate:0.1538 gloss:-6.2926 dloss:0.8479 dlossR:0.7463 dlossQ:0.1016
Episode:9944 meanR:0.5700 rate:0.0000 gloss:-10.9441 dloss:0.0347 dlossR:0.0027 dlossQ:0.0320
Episode:9945 meanR:0.5700 rate:0.3077 gloss:-5.8424 dloss:1.5

Episode:10023 meanR:0.4900 rate:0.0000 gloss:-3.7039 dloss:0.3472 dlossR:0.0785 dlossQ:0.2687
Episode:10024 meanR:0.5000 rate:0.0769 gloss:-4.1794 dloss:0.5208 dlossR:0.2935 dlossQ:0.2273
Episode:10025 meanR:0.4900 rate:-0.0769 gloss:-4.8051 dloss:-0.0289 dlossR:-0.2194 dlossQ:0.1906
Episode:10026 meanR:0.5100 rate:0.1538 gloss:-3.5821 dloss:0.8027 dlossR:0.5027 dlossQ:0.3000
Episode:10027 meanR:0.5200 rate:0.1538 gloss:-3.8640 dloss:0.7878 dlossR:0.5209 dlossQ:0.2669
Episode:10028 meanR:0.5500 rate:0.2308 gloss:-3.5031 dloss:1.0564 dlossR:0.7221 dlossQ:0.3343
Episode:10029 meanR:0.5400 rate:-0.0769 gloss:-4.0377 dloss:0.1480 dlossR:-0.1400 dlossQ:0.2881
Episode:10030 meanR:0.5100 rate:0.0000 gloss:-3.1955 dloss:0.5112 dlossR:0.1241 dlossQ:0.3872
Episode:10031 meanR:0.4700 rate:-0.1538 gloss:-4.3639 dloss:-0.0137 dlossR:-0.3685 dlossQ:0.3549
Episode:10032 meanR:0.4700 rate:0.0000 gloss:-11.4931 dloss:0.7648 dlossR:0.0540 dlossQ:0.7108
Episode:10033 meanR:0.5100 rate:0.0000 gloss:-4.335

Episode:10110 meanR:0.2400 rate:0.0000 gloss:-12.1107 dloss:0.0194 dlossR:0.0006 dlossQ:0.0188
Episode:10111 meanR:0.2400 rate:0.0000 gloss:-13.7017 dloss:0.0383 dlossR:0.0003 dlossQ:0.0380
Episode:10112 meanR:0.2000 rate:-0.1538 gloss:-13.9550 dloss:-1.5261 dlossR:-1.5440 dlossQ:0.0179
Episode:10113 meanR:0.1900 rate:0.0000 gloss:-15.1144 dloss:0.0177 dlossR:0.0003 dlossQ:0.0175
Episode:10114 meanR:0.2100 rate:0.1538 gloss:-13.7342 dloss:1.5855 dlossR:1.5693 dlossQ:0.0162
Episode:10115 meanR:0.2100 rate:-0.0769 gloss:-14.6583 dloss:-0.8005 dlossR:-0.8182 dlossQ:0.0177
Episode:10116 meanR:0.2000 rate:-0.2308 gloss:-15.2266 dloss:-2.4912 dlossR:-2.5127 dlossQ:0.0214
Episode:10117 meanR:0.2000 rate:0.2308 gloss:-15.2987 dloss:2.6549 dlossR:2.6330 dlossQ:0.0219
Episode:10118 meanR:0.2100 rate:0.0769 gloss:-13.0324 dloss:0.7580 dlossR:0.7388 dlossQ:0.0191
Episode:10119 meanR:0.1900 rate:0.0000 gloss:-13.5438 dloss:0.0193 dlossR:0.0001 dlossQ:0.0191
Episode:10120 meanR:0.1900 rate:0.0769 gl

Episode:10196 meanR:0.0900 rate:0.3077 gloss:-8.1305 dloss:1.9748 dlossR:1.9293 dlossQ:0.0455
Episode:10197 meanR:0.1100 rate:0.1538 gloss:-8.4361 dloss:1.0181 dlossR:0.9777 dlossQ:0.0404
Episode:10198 meanR:0.1500 rate:0.1538 gloss:-8.3196 dloss:1.0180 dlossR:0.9653 dlossQ:0.0528
Episode:10199 meanR:0.1800 rate:0.1538 gloss:-10.0535 dloss:1.2097 dlossR:1.1610 dlossQ:0.0487
Episode:10200 meanR:0.2200 rate:0.1538 gloss:-9.6058 dloss:1.1868 dlossR:1.1111 dlossQ:0.0757
Episode:10201 meanR:0.2000 rate:-0.0769 gloss:-8.8905 dloss:-0.3764 dlossR:-0.4779 dlossQ:0.1015
Episode:10202 meanR:0.1900 rate:0.0000 gloss:-8.2239 dloss:0.1644 dlossR:0.0205 dlossQ:0.1439
Episode:10203 meanR:0.1800 rate:-0.0769 gloss:-10.1240 dloss:-0.4106 dlossR:-0.5445 dlossQ:0.1339
Episode:10204 meanR:0.1900 rate:0.0000 gloss:-7.2420 dloss:0.2083 dlossR:0.0287 dlossQ:0.1796
Episode:10205 meanR:0.2300 rate:0.0769 gloss:-6.6556 dloss:0.5303 dlossR:0.4023 dlossQ:0.1280
Episode:10206 meanR:0.2600 rate:0.0000 gloss:-5.2063

Episode:10283 meanR:0.8100 rate:0.0000 gloss:-5.6583 dloss:0.1475 dlossR:0.0240 dlossQ:0.1235
Episode:10284 meanR:0.7900 rate:-0.1538 gloss:-6.6916 dloss:-0.6255 dlossR:-0.7107 dlossQ:0.0852
Episode:10285 meanR:0.8000 rate:0.0769 gloss:-5.1634 dloss:0.4919 dlossR:0.3289 dlossQ:0.1630
Episode:10286 meanR:0.7900 rate:-0.0769 gloss:-5.4821 dloss:-0.0858 dlossR:-0.2690 dlossQ:0.1831
Episode:10287 meanR:0.7900 rate:0.0000 gloss:-4.9784 dloss:0.1760 dlossR:0.0306 dlossQ:0.1454
Episode:10288 meanR:0.7700 rate:0.1538 gloss:-5.1423 dloss:0.7731 dlossR:0.6282 dlossQ:0.1449
Episode:10289 meanR:0.7900 rate:0.1538 gloss:-5.3095 dloss:0.7811 dlossR:0.6454 dlossQ:0.1356
Episode:10290 meanR:0.7900 rate:0.0000 gloss:-6.4917 dloss:0.1138 dlossR:0.0162 dlossQ:0.0976
Episode:10291 meanR:0.8200 rate:0.0769 gloss:-6.7000 dloss:0.5003 dlossR:0.3986 dlossQ:0.1018
Episode:10292 meanR:0.8100 rate:0.1538 gloss:-6.0169 dloss:0.8470 dlossR:0.7203 dlossQ:0.1267
Episode:10293 meanR:0.8200 rate:0.0000 gloss:-5.6538 d

Episode:10370 meanR:0.1200 rate:0.0000 gloss:-12.2790 dloss:0.0232 dlossR:0.0004 dlossQ:0.0228
Episode:10371 meanR:0.1100 rate:-0.0769 gloss:-14.3097 dloss:-0.7754 dlossR:-0.7981 dlossQ:0.0228
Episode:10372 meanR:0.1000 rate:-0.0769 gloss:-11.3522 dloss:-0.6133 dlossR:-0.6315 dlossQ:0.0182
Episode:10373 meanR:0.1000 rate:0.0000 gloss:-10.9019 dloss:0.0194 dlossR:0.0005 dlossQ:0.0189
Episode:10374 meanR:0.1100 rate:0.0769 gloss:-10.6056 dloss:0.6205 dlossR:0.6027 dlossQ:0.0178
Episode:10375 meanR:0.1200 rate:0.0000 gloss:-10.5218 dloss:0.0178 dlossR:0.0007 dlossQ:0.0171
Episode:10376 meanR:0.0900 rate:-0.1538 gloss:-10.6855 dloss:-1.1568 dlossR:-1.1766 dlossQ:0.0197
Episode:10377 meanR:0.0900 rate:0.0000 gloss:-10.0624 dloss:0.0208 dlossR:0.0009 dlossQ:0.0199
Episode:10378 meanR:0.0800 rate:-0.0769 gloss:-12.7435 dloss:-0.6964 dlossR:-0.7117 dlossQ:0.0153
Episode:10379 meanR:0.0700 rate:0.0000 gloss:-9.6309 dloss:0.0207 dlossR:0.0010 dlossQ:0.0197
Episode:10380 meanR:0.0400 rate:0.0000 

Episode:10455 meanR:-0.1400 rate:0.1538 gloss:-16.3249 dloss:1.8951 dlossR:1.8596 dlossQ:0.0355
Episode:10456 meanR:-0.1300 rate:0.0000 gloss:-16.1188 dloss:0.0187 dlossR:0.0000 dlossQ:0.0186
Episode:10457 meanR:-0.1500 rate:-0.0769 gloss:-13.1272 dloss:-0.7135 dlossR:-0.7330 dlossQ:0.0194
Episode:10458 meanR:-0.1400 rate:0.0769 gloss:-18.3063 dloss:1.0718 dlossR:1.0359 dlossQ:0.0359
Episode:10459 meanR:-0.1300 rate:0.0000 gloss:-15.7849 dloss:0.0160 dlossR:0.0001 dlossQ:0.0159
Episode:10460 meanR:-0.1300 rate:0.0000 gloss:-19.0119 dloss:0.0293 dlossR:0.0000 dlossQ:0.0292
Episode:10461 meanR:-0.1300 rate:0.0000 gloss:-13.9105 dloss:0.0147 dlossR:0.0002 dlossQ:0.0145
Episode:10462 meanR:-0.1300 rate:0.0000 gloss:-14.1283 dloss:0.0173 dlossR:0.0000 dlossQ:0.0172
Episode:10463 meanR:-0.1300 rate:0.0000 gloss:-17.4515 dloss:0.0210 dlossR:0.0000 dlossQ:0.0210
Episode:10464 meanR:-0.1300 rate:-0.1538 gloss:-21.0524 dloss:-2.3063 dlossR:-2.3414 dlossQ:0.0351
Episode:10465 meanR:-0.1400 rate:-

Episode:10540 meanR:0.0200 rate:-0.0769 gloss:-12.9324 dloss:-0.6978 dlossR:-0.7201 dlossQ:0.0223
Episode:10541 meanR:0.0200 rate:0.0000 gloss:-12.5815 dloss:0.0197 dlossR:0.0005 dlossQ:0.0192
Episode:10542 meanR:0.0300 rate:0.0769 gloss:-16.3171 dloss:0.9716 dlossR:0.9246 dlossQ:0.0471
Episode:10543 meanR:0.0400 rate:0.0769 gloss:-14.7492 dloss:0.9170 dlossR:0.8409 dlossQ:0.0761
Episode:10544 meanR:0.0400 rate:0.0000 gloss:-13.8904 dloss:0.0567 dlossR:0.0035 dlossQ:0.0532
Episode:10545 meanR:0.0600 rate:0.0769 gloss:-7.3962 dloss:0.5242 dlossR:0.4351 dlossQ:0.0891
Episode:10546 meanR:0.0800 rate:0.1538 gloss:-6.4832 dloss:0.8825 dlossR:0.7727 dlossQ:0.1097
Episode:10547 meanR:0.0700 rate:-0.0769 gloss:-7.9556 dloss:-0.3841 dlossR:-0.4348 dlossQ:0.0507
Episode:10548 meanR:0.0600 rate:-0.0769 gloss:-10.5957 dloss:-0.5543 dlossR:-0.5871 dlossQ:0.0329
Episode:10549 meanR:0.1000 rate:0.0769 gloss:-8.0990 dloss:0.5605 dlossR:0.4747 dlossQ:0.0858
Episode:10550 meanR:0.1000 rate:0.0000 gloss:

Episode:10627 meanR:0.6000 rate:0.1538 gloss:-3.4648 dloss:0.9541 dlossR:0.5324 dlossQ:0.4217
Episode:10628 meanR:0.6000 rate:0.0000 gloss:-3.2482 dloss:0.5610 dlossR:0.1428 dlossQ:0.4182
Episode:10629 meanR:0.5800 rate:-0.1538 gloss:-4.0162 dloss:-0.0114 dlossR:-0.3270 dlossQ:0.3156
Episode:10630 meanR:0.6000 rate:0.0769 gloss:-2.9122 dloss:0.7736 dlossR:0.3270 dlossQ:0.4465
Episode:10631 meanR:0.6300 rate:0.1538 gloss:-2.2131 dloss:0.9763 dlossR:0.4634 dlossQ:0.5129
Episode:10632 meanR:0.6400 rate:0.0769 gloss:-2.3662 dloss:0.7955 dlossR:0.3161 dlossQ:0.4793
Episode:10633 meanR:0.6600 rate:0.1538 gloss:-2.5461 dloss:0.9426 dlossR:0.4736 dlossQ:0.4690
Episode:10634 meanR:0.6800 rate:0.0000 gloss:-2.6195 dloss:0.5781 dlossR:0.1541 dlossQ:0.4241
Episode:10635 meanR:0.6900 rate:0.1538 gloss:-1.7008 dloss:1.0163 dlossR:0.4497 dlossQ:0.5666
Episode:10636 meanR:0.7000 rate:0.0000 gloss:-3.2753 dloss:0.3946 dlossR:0.0901 dlossQ:0.3045
Episode:10637 meanR:0.7100 rate:0.0000 gloss:-3.7005 dlos

Episode:10714 meanR:0.5500 rate:-0.0769 gloss:-9.9124 dloss:-0.5127 dlossR:-0.5481 dlossQ:0.0355
Episode:10715 meanR:0.5500 rate:0.0000 gloss:-8.3604 dloss:0.0431 dlossR:0.0041 dlossQ:0.0390
Episode:10716 meanR:0.5300 rate:0.0000 gloss:-7.9235 dloss:0.0443 dlossR:0.0044 dlossQ:0.0400
Episode:10717 meanR:0.5300 rate:0.0000 gloss:-6.8540 dloss:0.0591 dlossR:0.0071 dlossQ:0.0520
Episode:10718 meanR:0.5100 rate:0.0000 gloss:-7.1327 dloss:0.0567 dlossR:0.0061 dlossQ:0.0506
Episode:10719 meanR:0.5200 rate:0.0769 gloss:-6.9780 dloss:0.4578 dlossR:0.4048 dlossQ:0.0530
Episode:10720 meanR:0.4800 rate:0.0000 gloss:-9.3186 dloss:0.0326 dlossR:0.0021 dlossQ:0.0305
Episode:10721 meanR:0.4500 rate:0.0000 gloss:-8.9855 dloss:0.0426 dlossR:0.0030 dlossQ:0.0396
Episode:10722 meanR:0.3900 rate:-0.0769 gloss:-8.0116 dloss:-0.3968 dlossR:-0.4394 dlossQ:0.0426
Episode:10723 meanR:0.3700 rate:-0.0769 gloss:-6.7151 dloss:-0.3019 dlossR:-0.3618 dlossQ:0.0600
Episode:10724 meanR:0.3700 rate:0.0000 gloss:-5.800

Episode:10800 meanR:-0.1100 rate:-0.2308 gloss:-10.5238 dloss:-1.7064 dlossR:-1.7252 dlossQ:0.0188
Episode:10801 meanR:-0.1300 rate:-0.0769 gloss:-12.0694 dloss:-0.6537 dlossR:-0.6720 dlossQ:0.0183
Episode:10802 meanR:-0.1400 rate:-0.0769 gloss:-12.0895 dloss:-0.6556 dlossR:-0.6729 dlossQ:0.0174
Episode:10803 meanR:-0.1200 rate:0.0769 gloss:-26.5769 dloss:1.5269 dlossR:1.4972 dlossQ:0.0297
Episode:10804 meanR:-0.1200 rate:0.0000 gloss:-14.0270 dloss:0.0204 dlossR:0.0002 dlossQ:0.0202
Episode:10805 meanR:-0.1400 rate:0.0000 gloss:-12.1653 dloss:0.0161 dlossR:0.0002 dlossQ:0.0159
Episode:10806 meanR:-0.1600 rate:-0.0769 gloss:-12.5254 dloss:-0.6797 dlossR:-0.6983 dlossQ:0.0186
Episode:10807 meanR:-0.1700 rate:0.0000 gloss:-13.4941 dloss:0.0136 dlossR:0.0001 dlossQ:0.0134
Episode:10808 meanR:-0.1600 rate:0.0000 gloss:-10.6895 dloss:0.0242 dlossR:0.0009 dlossQ:0.0233
Episode:10809 meanR:-0.1600 rate:0.0769 gloss:-14.3640 dloss:0.8329 dlossR:0.8135 dlossQ:0.0195
Episode:10810 meanR:-0.1500 

Episode:10885 meanR:-0.2200 rate:0.0769 gloss:-56.9506 dloss:3.3024 dlossR:3.2032 dlossQ:0.0993
Episode:10886 meanR:-0.2200 rate:-0.0769 gloss:-44.8951 dloss:-2.4910 dlossR:-2.5157 dlossQ:0.0248
Episode:10887 meanR:-0.2200 rate:0.0000 gloss:-44.5757 dloss:0.0784 dlossR:0.0000 dlossQ:0.0784
Episode:10888 meanR:-0.2100 rate:0.0000 gloss:-22.6674 dloss:0.0299 dlossR:0.0000 dlossQ:0.0299
Episode:10889 meanR:-0.2000 rate:0.0000 gloss:-22.1238 dloss:0.0265 dlossR:0.0000 dlossQ:0.0265
Episode:10890 meanR:-0.1800 rate:0.0769 gloss:-25.3399 dloss:1.4631 dlossR:1.4316 dlossQ:0.0314
Episode:10891 meanR:-0.1900 rate:-0.0769 gloss:-34.9683 dloss:-1.9075 dlossR:-1.9587 dlossQ:0.0512
Episode:10892 meanR:-0.1800 rate:0.0000 gloss:-25.2647 dloss:0.0304 dlossR:0.0000 dlossQ:0.0304
Episode:10893 meanR:-0.1600 rate:0.0769 gloss:-18.8397 dloss:1.0886 dlossR:1.0673 dlossQ:0.0213
Episode:10894 meanR:-0.1700 rate:-0.1538 gloss:-17.7615 dloss:-1.9563 dlossR:-1.9785 dlossQ:0.0223
Episode:10895 meanR:-0.1800 rat

Episode:10970 meanR:-0.1100 rate:-0.0769 gloss:-28.7250 dloss:-1.5782 dlossR:-1.6092 dlossQ:0.0310
Episode:10971 meanR:-0.1100 rate:0.0000 gloss:-43.2138 dloss:0.0332 dlossR:0.0000 dlossQ:0.0332
Episode:10972 meanR:-0.1300 rate:-0.1538 gloss:-33.3702 dloss:-3.6264 dlossR:-3.7303 dlossQ:0.1039
Episode:10973 meanR:-0.1300 rate:0.0000 gloss:-30.6643 dloss:0.0374 dlossR:0.0000 dlossQ:0.0374
Episode:10974 meanR:-0.1100 rate:0.0000 gloss:-46.2532 dloss:0.0338 dlossR:0.0000 dlossQ:0.0338
Episode:10975 meanR:-0.1100 rate:0.0000 gloss:-37.7566 dloss:0.0327 dlossR:0.0000 dlossQ:0.0327
Episode:10976 meanR:-0.1100 rate:0.0000 gloss:-39.3699 dloss:0.0394 dlossR:0.0000 dlossQ:0.0394
Episode:10977 meanR:-0.1200 rate:0.0000 gloss:-47.1828 dloss:0.0584 dlossR:0.0000 dlossQ:0.0584
Episode:10978 meanR:-0.1000 rate:0.1538 gloss:-34.0649 dloss:3.8973 dlossR:3.8600 dlossQ:0.0373
Episode:10979 meanR:-0.1000 rate:0.0000 gloss:-52.9448 dloss:0.0873 dlossR:0.0000 dlossQ:0.0873
Episode:10980 meanR:-0.1000 rate:0

Episode:11055 meanR:-0.1800 rate:0.0769 gloss:-224.7198 dloss:13.0390 dlossR:12.7210 dlossQ:0.3179
Episode:11056 meanR:-0.1700 rate:0.0769 gloss:-206.6104 dloss:11.9618 dlossR:11.6791 dlossQ:0.2827
Episode:11057 meanR:-0.1600 rate:0.0000 gloss:-224.1403 dloss:0.2632 dlossR:0.0000 dlossQ:0.2632
Episode:11058 meanR:-0.1700 rate:0.0000 gloss:-229.3289 dloss:0.2833 dlossR:0.0000 dlossQ:0.2833
Episode:11059 meanR:-0.1700 rate:-0.0769 gloss:-220.7737 dloss:-12.2148 dlossR:-12.4632 dlossQ:0.2485
Episode:11060 meanR:-0.1700 rate:0.0000 gloss:-241.1699 dloss:0.4929 dlossR:0.0000 dlossQ:0.4929
Episode:11061 meanR:-0.1800 rate:-0.0769 gloss:-228.0427 dloss:-12.6216 dlossR:-12.8995 dlossQ:0.2779
Episode:11062 meanR:-0.1800 rate:0.0000 gloss:-176.2261 dloss:0.2442 dlossR:0.0000 dlossQ:0.2442
Episode:11063 meanR:-0.1700 rate:0.0000 gloss:-222.2569 dloss:0.2128 dlossR:0.0000 dlossQ:0.2128
Episode:11064 meanR:-0.1500 rate:0.0000 gloss:-369.0221 dloss:0.2510 dlossR:0.0000 dlossQ:0.2510
Episode:11065 me

Episode:11140 meanR:0.2600 rate:0.0000 gloss:-4.6514 dloss:0.4286 dlossR:0.1007 dlossQ:0.3279
Episode:11141 meanR:0.2500 rate:-0.1538 gloss:-3.9331 dloss:0.0983 dlossR:-0.2689 dlossQ:0.3672
Episode:11142 meanR:0.2700 rate:0.0000 gloss:-4.6250 dloss:0.3708 dlossR:0.0807 dlossQ:0.2901
Episode:11143 meanR:0.2700 rate:0.0000 gloss:-4.9858 dloss:0.3049 dlossR:0.0713 dlossQ:0.2336
Episode:11144 meanR:0.2700 rate:0.0000 gloss:-6.4668 dloss:0.2125 dlossR:0.0338 dlossQ:0.1787
Episode:11145 meanR:0.3100 rate:0.0769 gloss:-5.7769 dloss:0.5295 dlossR:0.3704 dlossQ:0.1591
Episode:11146 meanR:0.3000 rate:0.0000 gloss:-22.8986 dloss:0.0468 dlossR:0.0029 dlossQ:0.0439
Episode:11147 meanR:0.3000 rate:0.0000 gloss:-30.1364 dloss:0.0474 dlossR:0.0008 dlossQ:0.0467
Episode:11148 meanR:0.3100 rate:0.0769 gloss:-22.5778 dloss:1.3220 dlossR:1.2833 dlossQ:0.0387
Episode:11149 meanR:0.3400 rate:0.1538 gloss:-12.4083 dloss:1.4772 dlossR:1.4330 dlossQ:0.0441
Episode:11150 meanR:0.3600 rate:0.1538 gloss:-18.1011 

Episode:11226 meanR:0.1100 rate:0.0000 gloss:-23.9781 dloss:0.0260 dlossR:0.0000 dlossQ:0.0260
Episode:11227 meanR:0.1200 rate:0.0000 gloss:-25.8218 dloss:0.0197 dlossR:0.0000 dlossQ:0.0197
Episode:11228 meanR:0.1200 rate:0.0000 gloss:-21.2126 dloss:0.0180 dlossR:0.0000 dlossQ:0.0180
Episode:11229 meanR:0.1300 rate:0.1538 gloss:-20.9474 dloss:2.4182 dlossR:2.3888 dlossQ:0.0293
Episode:11230 meanR:0.1400 rate:0.2308 gloss:-25.9916 dloss:4.4689 dlossR:4.4422 dlossQ:0.0267
Episode:11231 meanR:0.1300 rate:0.0000 gloss:-33.5917 dloss:0.0170 dlossR:0.0000 dlossQ:0.0170
Episode:11232 meanR:0.1200 rate:0.0000 gloss:-32.0886 dloss:0.0237 dlossR:0.0000 dlossQ:0.0237
Episode:11233 meanR:0.0900 rate:-0.0769 gloss:-33.5788 dloss:-1.8615 dlossR:-1.8821 dlossQ:0.0206
Episode:11234 meanR:0.0700 rate:0.0000 gloss:-45.2438 dloss:0.0218 dlossR:0.0000 dlossQ:0.0218
Episode:11235 meanR:0.1000 rate:0.1538 gloss:-34.8896 dloss:4.0542 dlossR:3.9488 dlossQ:0.1055
Episode:11236 meanR:0.0700 rate:-0.2308 gloss:-

Episode:11312 meanR:-0.0400 rate:0.0000 gloss:-51.6549 dloss:0.0182 dlossR:0.0000 dlossQ:0.0182
Episode:11313 meanR:-0.0400 rate:0.0000 gloss:-35.1262 dloss:0.0246 dlossR:0.0000 dlossQ:0.0246
Episode:11314 meanR:-0.0400 rate:0.0000 gloss:-29.7356 dloss:0.0248 dlossR:0.0000 dlossQ:0.0248
Episode:11315 meanR:-0.0500 rate:-0.0769 gloss:-44.6391 dloss:-2.4545 dlossR:-2.5023 dlossQ:0.0478
Episode:11316 meanR:-0.0400 rate:0.0769 gloss:-28.7573 dloss:1.7012 dlossR:1.6249 dlossQ:0.0763
Episode:11317 meanR:-0.0400 rate:0.0000 gloss:-24.4598 dloss:0.0305 dlossR:0.0000 dlossQ:0.0305
Episode:11318 meanR:-0.0500 rate:-0.0769 gloss:-24.0293 dloss:-1.3163 dlossR:-1.3463 dlossQ:0.0300
Episode:11319 meanR:-0.0500 rate:0.0000 gloss:-31.2955 dloss:0.0374 dlossR:0.0000 dlossQ:0.0374
Episode:11320 meanR:-0.0400 rate:0.0000 gloss:-24.9561 dloss:0.0353 dlossR:0.0000 dlossQ:0.0353
Episode:11321 meanR:-0.0500 rate:-0.0769 gloss:-24.6553 dloss:-1.3557 dlossR:-1.3806 dlossQ:0.0250
Episode:11322 meanR:-0.0800 rat

Episode:11398 meanR:0.1500 rate:0.0000 gloss:-11.3179 dloss:0.0226 dlossR:0.0009 dlossQ:0.0217
Episode:11399 meanR:0.1500 rate:0.0000 gloss:-32.0610 dloss:0.1735 dlossR:0.0023 dlossQ:0.1712
Episode:11400 meanR:0.1600 rate:0.0000 gloss:-11.1613 dloss:0.0208 dlossR:0.0007 dlossQ:0.0201
Episode:11401 meanR:0.1600 rate:0.0000 gloss:-10.3960 dloss:0.0273 dlossR:0.0014 dlossQ:0.0259
Episode:11402 meanR:0.1600 rate:0.0000 gloss:-10.2612 dloss:0.0203 dlossR:0.0009 dlossQ:0.0194
Episode:11403 meanR:0.1600 rate:0.0769 gloss:-11.5032 dloss:0.6798 dlossR:0.6539 dlossQ:0.0259
Episode:11404 meanR:0.1500 rate:-0.0769 gloss:-8.1906 dloss:-0.3993 dlossR:-0.4489 dlossQ:0.0495
Episode:11405 meanR:0.1700 rate:0.0769 gloss:-7.7648 dloss:0.5031 dlossR:0.4493 dlossQ:0.0537
Episode:11406 meanR:0.1500 rate:-0.1538 gloss:-9.7514 dloss:-1.0426 dlossR:-1.0724 dlossQ:0.0298
Episode:11407 meanR:0.2100 rate:0.4615 gloss:-8.8077 dloss:3.2474 dlossR:3.1989 dlossQ:0.0485
Episode:11408 meanR:0.1700 rate:-0.3077 gloss:-1

Episode:11484 meanR:-0.0800 rate:0.0000 gloss:-27.5375 dloss:0.1208 dlossR:0.0002 dlossQ:0.1206
Episode:11485 meanR:-0.0700 rate:0.0000 gloss:-25.7586 dloss:0.0196 dlossR:0.0003 dlossQ:0.0193
Episode:11486 meanR:-0.0900 rate:0.0000 gloss:-18.5149 dloss:0.0856 dlossR:0.0002 dlossQ:0.0854
Episode:11487 meanR:-0.1000 rate:0.0000 gloss:-13.5632 dloss:0.0168 dlossR:0.0001 dlossQ:0.0167
Episode:11488 meanR:-0.0500 rate:0.2308 gloss:-14.3089 dloss:2.4855 dlossR:2.4674 dlossQ:0.0181
Episode:11489 meanR:-0.0500 rate:-0.0769 gloss:-14.1170 dloss:-0.7703 dlossR:-0.7882 dlossQ:0.0179
Episode:11490 meanR:-0.0300 rate:0.0769 gloss:-14.0982 dloss:0.8163 dlossR:0.7986 dlossQ:0.0177
Episode:11491 meanR:-0.0500 rate:-0.0769 gloss:-16.6337 dloss:-0.9100 dlossR:-0.9302 dlossQ:0.0202
Episode:11492 meanR:-0.0400 rate:0.0000 gloss:-18.5138 dloss:0.0188 dlossR:0.0002 dlossQ:0.0186
Episode:11493 meanR:-0.0500 rate:0.0000 gloss:-15.2865 dloss:0.0172 dlossR:0.0001 dlossQ:0.0170
Episode:11494 meanR:-0.0300 rate:0

Episode:11569 meanR:-0.0600 rate:0.0000 gloss:-27.2259 dloss:0.1207 dlossR:0.0000 dlossQ:0.1206
Episode:11570 meanR:-0.0800 rate:-0.1538 gloss:-26.2683 dloss:-2.8937 dlossR:-2.9288 dlossQ:0.0350
Episode:11571 meanR:-0.1000 rate:0.0000 gloss:-30.0846 dloss:0.0207 dlossR:0.0000 dlossQ:0.0206
Episode:11572 meanR:-0.0900 rate:0.0000 gloss:-40.8247 dloss:0.0265 dlossR:0.0000 dlossQ:0.0265
Episode:11573 meanR:-0.1100 rate:0.0000 gloss:-31.2812 dloss:0.0279 dlossR:0.0000 dlossQ:0.0279
Episode:11574 meanR:-0.1200 rate:0.0000 gloss:-24.3083 dloss:0.0186 dlossR:0.0000 dlossQ:0.0186
Episode:11575 meanR:-0.1200 rate:0.0000 gloss:-15.0665 dloss:0.0182 dlossR:0.0000 dlossQ:0.0182
Episode:11576 meanR:-0.1300 rate:-0.0769 gloss:-27.7724 dloss:-1.4642 dlossR:-1.5540 dlossQ:0.0899
Episode:11577 meanR:-0.1700 rate:-0.1538 gloss:-24.7685 dloss:-2.7257 dlossR:-2.7598 dlossQ:0.0341
Episode:11578 meanR:-0.1600 rate:0.0000 gloss:-26.4114 dloss:0.0200 dlossR:0.0000 dlossQ:0.0200
Episode:11579 meanR:-0.1200 rat

Episode:11655 meanR:-0.0800 rate:0.0000 gloss:-35.6463 dloss:0.0306 dlossR:0.0000 dlossQ:0.0306
Episode:11656 meanR:-0.0700 rate:0.0769 gloss:-31.3290 dloss:1.7879 dlossR:1.7653 dlossQ:0.0225
Episode:11657 meanR:-0.0600 rate:0.0000 gloss:-30.9773 dloss:0.0409 dlossR:0.0000 dlossQ:0.0409
Episode:11658 meanR:-0.0600 rate:-0.0769 gloss:-33.2645 dloss:-1.8349 dlossR:-1.8626 dlossQ:0.0277
Episode:11659 meanR:-0.0600 rate:-0.0769 gloss:-29.5632 dloss:-1.6349 dlossR:-1.6551 dlossQ:0.0202
Episode:11660 meanR:-0.0600 rate:0.0000 gloss:-31.6605 dloss:0.0492 dlossR:0.0000 dlossQ:0.0491
Episode:11661 meanR:-0.0700 rate:0.0000 gloss:-35.5285 dloss:0.0396 dlossR:0.0000 dlossQ:0.0396
Episode:11662 meanR:-0.0800 rate:-0.0769 gloss:-38.1251 dloss:-2.0848 dlossR:-2.1366 dlossQ:0.0517
Episode:11663 meanR:-0.0900 rate:-0.0769 gloss:-30.9929 dloss:-1.7159 dlossR:-1.7362 dlossQ:0.0204
Episode:11664 meanR:-0.0900 rate:0.0000 gloss:-21.4192 dloss:0.0276 dlossR:0.0000 dlossQ:0.0276
Episode:11665 meanR:-0.0800 

Episode:11740 meanR:-0.0500 rate:0.1538 gloss:-47.7406 dloss:5.4774 dlossR:5.3860 dlossQ:0.0914
Episode:11741 meanR:-0.0500 rate:0.0000 gloss:-34.4956 dloss:0.0365 dlossR:0.0000 dlossQ:0.0365
Episode:11742 meanR:-0.0400 rate:0.0769 gloss:-27.3528 dloss:1.5799 dlossR:1.5453 dlossQ:0.0346
Episode:11743 meanR:-0.0400 rate:0.0000 gloss:-20.2841 dloss:0.0239 dlossR:0.0000 dlossQ:0.0239
Episode:11744 meanR:-0.0400 rate:0.0000 gloss:-22.8728 dloss:0.0272 dlossR:0.0000 dlossQ:0.0272
Episode:11745 meanR:-0.0500 rate:-0.0769 gloss:-51.9784 dloss:-2.8369 dlossR:-2.9155 dlossQ:0.0786
Episode:11746 meanR:-0.0600 rate:-0.0769 gloss:-23.4918 dloss:-1.2878 dlossR:-1.3156 dlossQ:0.0278
Episode:11747 meanR:-0.0700 rate:-0.0769 gloss:-49.3337 dloss:-2.7347 dlossR:-2.7649 dlossQ:0.0302
Episode:11748 meanR:-0.0700 rate:0.0000 gloss:-44.4203 dloss:0.0427 dlossR:0.0000 dlossQ:0.0427
Episode:11749 meanR:-0.0500 rate:0.0769 gloss:-33.7365 dloss:1.9368 dlossR:1.9037 dlossQ:0.0332
Episode:11750 meanR:-0.0600 rat

Episode:11826 meanR:0.0200 rate:0.0769 gloss:-22.8762 dloss:1.3221 dlossR:1.2922 dlossQ:0.0299
Episode:11827 meanR:0.0100 rate:0.0000 gloss:-17.5192 dloss:0.0209 dlossR:0.0000 dlossQ:0.0209
Episode:11828 meanR:0.0300 rate:0.0000 gloss:-23.3748 dloss:0.0254 dlossR:0.0000 dlossQ:0.0254
Episode:11829 meanR:0.0400 rate:0.0769 gloss:-16.1147 dloss:0.9345 dlossR:0.9154 dlossQ:0.0191
Episode:11830 meanR:0.0400 rate:0.0000 gloss:-35.1240 dloss:0.0398 dlossR:0.0000 dlossQ:0.0398
Episode:11831 meanR:0.0300 rate:0.0000 gloss:-19.9418 dloss:0.0233 dlossR:0.0000 dlossQ:0.0233
Episode:11832 meanR:0.0200 rate:-0.0769 gloss:-19.0689 dloss:-1.0426 dlossR:-1.0678 dlossQ:0.0252
Episode:11833 meanR:0.0300 rate:0.0000 gloss:-54.8264 dloss:0.0559 dlossR:0.0000 dlossQ:0.0559
Episode:11834 meanR:0.0400 rate:0.0769 gloss:-43.4940 dloss:2.4805 dlossR:2.4510 dlossQ:0.0295
Episode:11835 meanR:0.0400 rate:-0.0769 gloss:-20.8053 dloss:-1.1397 dlossR:-1.1638 dlossQ:0.0241
Episode:11836 meanR:0.0300 rate:-0.0769 glos

Episode:11912 meanR:-0.0500 rate:0.0000 gloss:-25.4896 dloss:0.0326 dlossR:0.0000 dlossQ:0.0326
Episode:11913 meanR:-0.0400 rate:0.0000 gloss:-48.8024 dloss:0.0309 dlossR:0.0000 dlossQ:0.0309
Episode:11914 meanR:-0.0500 rate:-0.0769 gloss:-25.0735 dloss:-1.3791 dlossR:-1.4068 dlossQ:0.0277
Episode:11915 meanR:-0.0400 rate:0.0769 gloss:-34.5888 dloss:1.9943 dlossR:1.9507 dlossQ:0.0436
Episode:11916 meanR:-0.0400 rate:0.0000 gloss:-41.7251 dloss:0.1397 dlossR:0.0000 dlossQ:0.1397
Episode:11917 meanR:-0.0400 rate:0.0000 gloss:-48.1127 dloss:0.1332 dlossR:0.0000 dlossQ:0.1332
Episode:11918 meanR:-0.0500 rate:0.0000 gloss:-26.2944 dloss:0.0379 dlossR:0.0000 dlossQ:0.0379
Episode:11919 meanR:-0.0500 rate:0.0000 gloss:-26.0040 dloss:0.0419 dlossR:0.0000 dlossQ:0.0419
Episode:11920 meanR:-0.0500 rate:0.0000 gloss:-24.3129 dloss:0.0300 dlossR:0.0000 dlossQ:0.0300
Episode:11921 meanR:-0.0900 rate:0.0000 gloss:-27.7218 dloss:0.0347 dlossR:0.0000 dlossQ:0.0347
Episode:11922 meanR:-0.0900 rate:0.00

Episode:11997 meanR:0.0100 rate:0.0000 gloss:-38.3332 dloss:0.0255 dlossR:0.0000 dlossQ:0.0255
Episode:11998 meanR:0.0100 rate:0.0000 gloss:-49.3619 dloss:0.0255 dlossR:0.0000 dlossQ:0.0255
Episode:11999 meanR:0.0100 rate:0.0000 gloss:-37.0658 dloss:0.0266 dlossR:0.0000 dlossQ:0.0266
Episode:12000 meanR:0.0100 rate:0.0000 gloss:-22.6585 dloss:0.0236 dlossR:0.0000 dlossQ:0.0236
Episode:12001 meanR:0.0100 rate:0.0000 gloss:-24.4799 dloss:0.0272 dlossR:0.0000 dlossQ:0.0272
Episode:12002 meanR:0.0000 rate:-0.0769 gloss:-21.7253 dloss:-1.1968 dlossR:-1.2174 dlossQ:0.0206
Episode:12003 meanR:-0.0100 rate:-0.0769 gloss:-16.8687 dloss:-0.9313 dlossR:-0.9516 dlossQ:0.0203
Episode:12004 meanR:-0.0300 rate:-0.1538 gloss:-23.1024 dloss:-2.5390 dlossR:-2.5751 dlossQ:0.0361
Episode:12005 meanR:-0.0300 rate:0.0000 gloss:-21.9449 dloss:0.0240 dlossR:0.0000 dlossQ:0.0240
Episode:12006 meanR:-0.0300 rate:0.0769 gloss:-24.0650 dloss:1.3887 dlossR:1.3593 dlossQ:0.0294
Episode:12007 meanR:-0.0400 rate:-0.0

Episode:12082 meanR:-0.1100 rate:0.0000 gloss:-48.1262 dloss:0.0748 dlossR:0.0000 dlossQ:0.0748
Episode:12083 meanR:-0.1000 rate:0.0769 gloss:-52.1714 dloss:3.0365 dlossR:2.9434 dlossQ:0.0931
Episode:12084 meanR:-0.1200 rate:-0.0769 gloss:-44.1735 dloss:-2.4070 dlossR:-2.4796 dlossQ:0.0726
Episode:12085 meanR:-0.1200 rate:0.0000 gloss:-50.2477 dloss:0.0440 dlossR:0.0000 dlossQ:0.0440
Episode:12086 meanR:-0.1400 rate:-0.0769 gloss:-57.6419 dloss:-3.0924 dlossR:-3.2330 dlossQ:0.1406
Episode:12087 meanR:-0.1400 rate:0.0000 gloss:-85.7241 dloss:0.0727 dlossR:0.0000 dlossQ:0.0727
Episode:12088 meanR:-0.1400 rate:0.0000 gloss:-44.6342 dloss:0.0485 dlossR:0.0000 dlossQ:0.0485
Episode:12089 meanR:-0.1500 rate:0.0000 gloss:-45.5112 dloss:0.0628 dlossR:0.0000 dlossQ:0.0628
Episode:12090 meanR:-0.1900 rate:-0.2308 gloss:-48.1623 dloss:-8.0345 dlossR:-8.0741 dlossQ:0.0395
Episode:12091 meanR:-0.1700 rate:0.1538 gloss:-56.5926 dloss:6.4562 dlossR:6.3859 dlossQ:0.0702
Episode:12092 meanR:-0.1700 rat

Episode:12168 meanR:0.0600 rate:0.0000 gloss:-36.2655 dloss:0.0322 dlossR:0.0000 dlossQ:0.0322
Episode:12169 meanR:0.0700 rate:0.0000 gloss:-45.3306 dloss:0.0532 dlossR:0.0000 dlossQ:0.0532
Episode:12170 meanR:0.0500 rate:-0.0769 gloss:-49.5721 dloss:-2.6380 dlossR:-2.7779 dlossQ:0.1399
Episode:12171 meanR:0.0400 rate:0.0000 gloss:-29.1060 dloss:0.0394 dlossR:0.0000 dlossQ:0.0394
Episode:12172 meanR:0.0400 rate:0.0000 gloss:-36.1856 dloss:0.0339 dlossR:0.0000 dlossQ:0.0339
Episode:12173 meanR:0.0400 rate:0.0000 gloss:-25.1276 dloss:0.0261 dlossR:0.0000 dlossQ:0.0261
Episode:12174 meanR:0.0400 rate:0.0000 gloss:-36.5298 dloss:0.0281 dlossR:0.0000 dlossQ:0.0281
Episode:12175 meanR:0.0400 rate:0.0000 gloss:-36.0968 dloss:0.0247 dlossR:0.0000 dlossQ:0.0247
Episode:12176 meanR:0.0500 rate:0.0000 gloss:-36.1156 dloss:0.0309 dlossR:0.0000 dlossQ:0.0309
Episode:12177 meanR:0.0400 rate:0.0000 gloss:-31.9011 dloss:0.0392 dlossR:0.0000 dlossQ:0.0392
Episode:12178 meanR:0.0500 rate:0.0000 gloss:-2

Episode:12254 meanR:-0.0500 rate:0.0769 gloss:-31.8659 dloss:1.8339 dlossR:1.7988 dlossQ:0.0352
Episode:12255 meanR:-0.0500 rate:0.0000 gloss:-30.2899 dloss:0.0337 dlossR:0.0000 dlossQ:0.0337
Episode:12256 meanR:-0.0600 rate:0.0000 gloss:-37.9673 dloss:0.0455 dlossR:0.0000 dlossQ:0.0455
Episode:12257 meanR:-0.0500 rate:0.0769 gloss:-64.1862 dloss:3.6662 dlossR:3.6109 dlossQ:0.0553
Episode:12258 meanR:-0.0500 rate:0.0000 gloss:-78.5880 dloss:0.0427 dlossR:0.0000 dlossQ:0.0427
Episode:12259 meanR:-0.0600 rate:0.0000 gloss:-41.2191 dloss:0.0338 dlossR:0.0000 dlossQ:0.0338
Episode:12260 meanR:-0.0400 rate:-0.1538 gloss:-30.4053 dloss:-3.3585 dlossR:-3.4001 dlossQ:0.0417
Episode:12261 meanR:-0.0600 rate:-0.0769 gloss:-31.8380 dloss:-1.7433 dlossR:-1.7860 dlossQ:0.0427
Episode:12262 meanR:-0.0800 rate:-0.1538 gloss:-29.3217 dloss:-3.2481 dlossR:-3.2832 dlossQ:0.0351
Episode:12263 meanR:-0.0700 rate:0.0769 gloss:-32.2048 dloss:1.8561 dlossR:1.8176 dlossQ:0.0385
Episode:12264 meanR:-0.0800 rat

Episode:12339 meanR:-0.0900 rate:0.0000 gloss:-60.5998 dloss:0.0772 dlossR:0.0000 dlossQ:0.0772
Episode:12340 meanR:-0.0800 rate:0.0000 gloss:-47.5227 dloss:0.0359 dlossR:0.0000 dlossQ:0.0359
Episode:12341 meanR:-0.0600 rate:0.1538 gloss:-54.3600 dloss:6.2064 dlossR:6.1435 dlossQ:0.0629
Episode:12342 meanR:-0.0500 rate:0.0000 gloss:-49.1730 dloss:0.0485 dlossR:0.0000 dlossQ:0.0485
Episode:12343 meanR:-0.0500 rate:0.0769 gloss:-125.8164 dloss:7.1634 dlossR:7.0708 dlossQ:0.0926
Episode:12344 meanR:-0.0500 rate:0.0000 gloss:-82.7853 dloss:0.1435 dlossR:0.0000 dlossQ:0.1435
Episode:12345 meanR:-0.0600 rate:-0.0769 gloss:-71.7068 dloss:-3.8013 dlossR:-4.0208 dlossQ:0.2195
Episode:12346 meanR:-0.0600 rate:0.0000 gloss:-36.6252 dloss:0.0433 dlossR:0.0000 dlossQ:0.0433
Episode:12347 meanR:-0.0600 rate:0.0000 gloss:-47.0534 dloss:0.0618 dlossR:0.0000 dlossQ:0.0618
Episode:12348 meanR:-0.0700 rate:-0.0769 gloss:-51.3344 dloss:-2.8148 dlossR:-2.8837 dlossQ:0.0689
Episode:12349 meanR:-0.0700 rate:

Episode:12424 meanR:-0.0300 rate:0.0000 gloss:-106.9184 dloss:0.0583 dlossR:0.0000 dlossQ:0.0583
Episode:12425 meanR:-0.0400 rate:-0.0769 gloss:-49.2920 dloss:-2.7389 dlossR:-2.7723 dlossQ:0.0334
Episode:12426 meanR:-0.0500 rate:-0.0769 gloss:-50.3159 dloss:-2.7854 dlossR:-2.8231 dlossQ:0.0377
Episode:12427 meanR:-0.0300 rate:0.1538 gloss:-52.3385 dloss:6.0125 dlossR:5.9131 dlossQ:0.0995
Episode:12428 meanR:-0.0100 rate:0.0769 gloss:-44.0366 dloss:2.5238 dlossR:2.4852 dlossQ:0.0386
Episode:12429 meanR:-0.0200 rate:0.0000 gloss:-53.2552 dloss:0.0700 dlossR:0.0000 dlossQ:0.0700
Episode:12430 meanR:0.0000 rate:0.0769 gloss:-63.5313 dloss:3.6508 dlossR:3.5817 dlossQ:0.0691
Episode:12431 meanR:-0.0100 rate:0.0000 gloss:-56.2307 dloss:0.0437 dlossR:0.0000 dlossQ:0.0437
Episode:12432 meanR:-0.0400 rate:-0.2308 gloss:-55.9561 dloss:-9.3476 dlossR:-9.4246 dlossQ:0.0771
Episode:12433 meanR:-0.0600 rate:-0.1538 gloss:-57.5541 dloss:-6.4014 dlossR:-6.4438 dlossQ:0.0424
Episode:12434 meanR:-0.0500 

Episode:12509 meanR:-0.1000 rate:0.0000 gloss:-75.6487 dloss:0.0719 dlossR:0.0000 dlossQ:0.0719
Episode:12510 meanR:-0.0900 rate:0.0000 gloss:-63.1750 dloss:0.0900 dlossR:0.0000 dlossQ:0.0900
Episode:12511 meanR:-0.0800 rate:0.0000 gloss:-80.7235 dloss:0.0807 dlossR:0.0000 dlossQ:0.0807
Episode:12512 meanR:-0.0500 rate:0.0000 gloss:-77.4846 dloss:0.0769 dlossR:0.0000 dlossQ:0.0769
Episode:12513 meanR:-0.0700 rate:-0.1538 gloss:-159.2017 dloss:-17.7196 dlossR:-17.9069 dlossQ:0.1873
Episode:12514 meanR:-0.0600 rate:0.0000 gloss:-102.4936 dloss:0.1335 dlossR:0.0000 dlossQ:0.1335
Episode:12515 meanR:-0.0500 rate:0.0000 gloss:-75.7556 dloss:0.0979 dlossR:0.0000 dlossQ:0.0979
Episode:12516 meanR:-0.0500 rate:0.0000 gloss:-75.6151 dloss:0.0928 dlossR:0.0000 dlossQ:0.0928
Episode:12517 meanR:-0.0600 rate:0.0000 gloss:-76.4523 dloss:0.0701 dlossR:0.0000 dlossQ:0.0701
Episode:12518 meanR:-0.0700 rate:-0.0769 gloss:-50.0606 dloss:-2.7625 dlossR:-2.8136 dlossQ:0.0511
Episode:12519 meanR:-0.0800 ra

Episode:12594 meanR:-0.0300 rate:-0.0769 gloss:-104.6157 dloss:-5.7874 dlossR:-5.8766 dlossQ:0.0891
Episode:12595 meanR:-0.0400 rate:-0.0769 gloss:-99.7963 dloss:-5.5215 dlossR:-5.6074 dlossQ:0.0859
Episode:12596 meanR:-0.0400 rate:0.0000 gloss:-75.3338 dloss:0.1345 dlossR:0.0000 dlossQ:0.1345
Episode:12597 meanR:-0.0400 rate:0.0000 gloss:-142.1227 dloss:0.1874 dlossR:0.0000 dlossQ:0.1874
Episode:12598 meanR:-0.0500 rate:-0.0769 gloss:-153.9365 dloss:-8.4897 dlossR:-8.6590 dlossQ:0.1693
Episode:12599 meanR:-0.0400 rate:0.0769 gloss:-56.8278 dloss:3.2639 dlossR:3.2070 dlossQ:0.0569
Episode:12600 meanR:-0.0500 rate:0.0000 gloss:-120.3113 dloss:0.1328 dlossR:0.0000 dlossQ:0.1328
Episode:12601 meanR:-0.0600 rate:-0.0769 gloss:-128.0655 dloss:-7.0757 dlossR:-7.1920 dlossQ:0.1163
Episode:12602 meanR:-0.0500 rate:0.0769 gloss:-108.7732 dloss:6.2803 dlossR:6.1347 dlossQ:0.1456
Episode:12603 meanR:-0.0800 rate:0.0000 gloss:-108.6047 dloss:0.0908 dlossR:0.0000 dlossQ:0.0908
Episode:12604 meanR:-

Episode:12680 meanR:0.3300 rate:0.0000 gloss:-45.6061 dloss:0.0627 dlossR:0.0000 dlossQ:0.0627
Episode:12681 meanR:0.3300 rate:0.0000 gloss:-52.6914 dloss:0.0225 dlossR:0.0000 dlossQ:0.0225
Episode:12682 meanR:0.3100 rate:-0.0769 gloss:-34.6478 dloss:-1.8862 dlossR:-1.9453 dlossQ:0.0591
Episode:12683 meanR:0.3100 rate:0.0000 gloss:-28.5274 dloss:0.0332 dlossR:0.0000 dlossQ:0.0332
Episode:12684 meanR:0.2900 rate:-0.1538 gloss:-35.1745 dloss:-3.8385 dlossR:-3.9283 dlossQ:0.0897
Episode:12685 meanR:0.2900 rate:0.0000 gloss:-21.1568 dloss:0.0252 dlossR:0.0000 dlossQ:0.0252
Episode:12686 meanR:0.2900 rate:0.0000 gloss:-18.4903 dloss:0.0204 dlossR:0.0000 dlossQ:0.0203
Episode:12687 meanR:0.3100 rate:0.0769 gloss:-19.0043 dloss:1.0948 dlossR:1.0757 dlossQ:0.0191
Episode:12688 meanR:0.3600 rate:0.2308 gloss:-23.1874 dloss:3.9816 dlossR:3.9639 dlossQ:0.0177
Episode:12689 meanR:0.3600 rate:0.0000 gloss:-28.4952 dloss:0.0298 dlossR:0.0000 dlossQ:0.0298
Episode:12690 meanR:0.3500 rate:0.0000 gloss

Episode:12767 meanR:0.7300 rate:0.0769 gloss:-3.5796 dloss:0.6219 dlossR:0.2978 dlossQ:0.3241
Episode:12768 meanR:0.7300 rate:0.0000 gloss:-3.8622 dloss:0.3106 dlossR:0.0639 dlossQ:0.2467
Episode:12769 meanR:0.7400 rate:0.0769 gloss:-3.7965 dloss:0.5245 dlossR:0.2798 dlossQ:0.2448
Episode:12770 meanR:0.7400 rate:0.0000 gloss:-4.9103 dloss:0.2080 dlossR:0.0370 dlossQ:0.1710
Episode:12771 meanR:0.7400 rate:0.0000 gloss:-9.0341 dloss:0.2974 dlossR:0.0297 dlossQ:0.2676
Episode:12772 meanR:0.7500 rate:0.1538 gloss:-5.1897 dloss:0.7926 dlossR:0.6453 dlossQ:0.1474
Episode:12773 meanR:0.7500 rate:0.0000 gloss:-8.5566 dloss:0.1352 dlossR:0.0157 dlossQ:0.1195
Episode:12774 meanR:0.7900 rate:0.2308 gloss:-6.0747 dloss:1.2285 dlossR:1.1018 dlossQ:0.1266
Episode:12775 meanR:0.7900 rate:0.0769 gloss:-12.1260 dloss:0.9628 dlossR:0.7119 dlossQ:0.2509
Episode:12776 meanR:0.8100 rate:0.2308 gloss:-7.0659 dloss:1.4304 dlossR:1.2643 dlossQ:0.1661
Episode:12777 meanR:0.8100 rate:0.0000 gloss:-5.0715 dloss:

Episode:12855 meanR:1.0600 rate:0.0000 gloss:-3.3692 dloss:0.4502 dlossR:0.1035 dlossQ:0.3467
Episode:12856 meanR:1.0300 rate:0.0000 gloss:-3.2101 dloss:0.4570 dlossR:0.1093 dlossQ:0.3476
Episode:12857 meanR:1.0500 rate:0.2308 gloss:-2.4371 dloss:1.0992 dlossR:0.6278 dlossQ:0.4714
Episode:12858 meanR:1.0200 rate:0.0000 gloss:-4.1101 dloss:0.3700 dlossR:0.0760 dlossQ:0.2940
Episode:12859 meanR:1.0600 rate:0.3846 gloss:-2.8763 dloss:1.5010 dlossR:1.0726 dlossQ:0.4284
Episode:12860 meanR:1.0800 rate:0.0769 gloss:-2.7750 dloss:0.6575 dlossR:0.2810 dlossQ:0.3765
Episode:12861 meanR:1.0300 rate:0.0000 gloss:-3.2566 dloss:0.5066 dlossR:0.1175 dlossQ:0.3890
Episode:12862 meanR:1.0100 rate:0.0769 gloss:-3.4285 dloss:0.7576 dlossR:0.3182 dlossQ:0.4394
Episode:12863 meanR:1.0300 rate:0.0769 gloss:-3.1233 dloss:0.6444 dlossR:0.2876 dlossQ:0.3568
Episode:12864 meanR:1.0200 rate:0.0000 gloss:-3.0903 dloss:0.4892 dlossR:0.1203 dlossQ:0.3689
Episode:12865 meanR:1.0000 rate:0.0769 gloss:-3.2470 dloss:0

Episode:12943 meanR:1.5300 rate:0.0769 gloss:-1.8145 dloss:0.9572 dlossR:0.3719 dlossQ:0.5853
Episode:12944 meanR:1.4600 rate:0.0000 gloss:-2.0924 dloss:0.7556 dlossR:0.2351 dlossQ:0.5205
Episode:12945 meanR:1.5300 rate:0.5385 gloss:-1.5148 dloss:1.6685 dlossR:1.0711 dlossQ:0.5974
Episode:12946 meanR:1.5100 rate:0.0769 gloss:-2.0219 dloss:0.8277 dlossR:0.3214 dlossQ:0.5063
Episode:12947 meanR:1.4900 rate:0.0769 gloss:-2.1317 dloss:0.7815 dlossR:0.3071 dlossQ:0.4744
Episode:12948 meanR:1.4900 rate:0.2308 gloss:-1.9702 dloss:1.1201 dlossR:0.5848 dlossQ:0.5354
Episode:12949 meanR:1.4900 rate:0.0000 gloss:-2.2070 dloss:0.6531 dlossR:0.1876 dlossQ:0.4655
Episode:12950 meanR:1.4800 rate:0.0000 gloss:-2.3244 dloss:0.6301 dlossR:0.1777 dlossQ:0.4524
Episode:12951 meanR:1.4600 rate:0.2308 gloss:-2.1856 dloss:1.0587 dlossR:0.5790 dlossQ:0.4797
Episode:12952 meanR:1.4600 rate:0.0000 gloss:-2.7121 dloss:0.5140 dlossR:0.1322 dlossQ:0.3819
Episode:12953 meanR:1.4700 rate:0.0769 gloss:-2.7667 dloss:0

Episode:13030 meanR:1.2300 rate:0.3077 gloss:-4.0451 dloss:1.3099 dlossR:1.0546 dlossQ:0.2553
Episode:13031 meanR:1.2200 rate:0.1538 gloss:-4.5789 dloss:0.7721 dlossR:0.5783 dlossQ:0.1938
Episode:13032 meanR:1.2200 rate:0.2308 gloss:-4.3178 dloss:1.0548 dlossR:0.8282 dlossQ:0.2266
Episode:13033 meanR:1.2600 rate:0.3077 gloss:-3.7254 dloss:1.2745 dlossR:0.9925 dlossQ:0.2821
Episode:13034 meanR:1.2900 rate:0.2308 gloss:-4.1043 dloss:1.0169 dlossR:0.7929 dlossQ:0.2240
Episode:13035 meanR:1.2500 rate:-0.0769 gloss:-4.4080 dloss:-0.0026 dlossR:-0.1929 dlossQ:0.1903
Episode:13036 meanR:1.2700 rate:0.2308 gloss:-3.3714 dloss:1.0631 dlossR:0.7133 dlossQ:0.3498
Episode:13037 meanR:1.2700 rate:0.0769 gloss:-2.8685 dloss:0.6935 dlossR:0.3024 dlossQ:0.3911
Episode:13038 meanR:1.2600 rate:0.0000 gloss:-2.8250 dloss:0.5382 dlossR:0.1432 dlossQ:0.3949
Episode:13039 meanR:1.2200 rate:0.0000 gloss:-3.3071 dloss:0.4241 dlossR:0.0988 dlossQ:0.3253
Episode:13040 meanR:1.2400 rate:0.1538 gloss:-2.0603 dlos

Episode:13117 meanR:0.8900 rate:0.1538 gloss:-3.0339 dloss:0.7942 dlossR:0.4580 dlossQ:0.3362
Episode:13118 meanR:0.8900 rate:0.0769 gloss:-2.9247 dloss:0.6278 dlossR:0.2773 dlossQ:0.3504
Episode:13119 meanR:0.8800 rate:0.0769 gloss:-2.8524 dloss:0.6512 dlossR:0.2821 dlossQ:0.3691
Episode:13120 meanR:0.8600 rate:0.0000 gloss:-3.0545 dloss:0.4391 dlossR:0.1051 dlossQ:0.3340
Episode:13121 meanR:0.8600 rate:0.0000 gloss:-3.0217 dloss:0.4411 dlossR:0.1064 dlossQ:0.3347
Episode:13122 meanR:0.8700 rate:0.0769 gloss:-2.8990 dloss:0.6319 dlossR:0.2772 dlossQ:0.3547
Episode:13123 meanR:0.9000 rate:0.0000 gloss:-3.1294 dloss:0.4243 dlossR:0.1005 dlossQ:0.3238
Episode:13124 meanR:0.9200 rate:0.0769 gloss:-3.2721 dloss:0.5873 dlossR:0.2766 dlossQ:0.3107
Episode:13125 meanR:0.9000 rate:-0.0769 gloss:-3.5035 dloss:0.1876 dlossR:-0.1013 dlossQ:0.2889
Episode:13126 meanR:0.9000 rate:-0.1538 gloss:-4.5301 dloss:-0.2082 dlossR:-0.4235 dlossQ:0.2154
Episode:13127 meanR:0.9100 rate:0.2308 gloss:-3.6450 dl

Episode:13204 meanR:0.3000 rate:0.0769 gloss:-8.3102 dloss:0.5044 dlossR:0.4757 dlossQ:0.0287
Episode:13205 meanR:0.2700 rate:-0.1538 gloss:-8.6202 dloss:-0.9152 dlossR:-0.9427 dlossQ:0.0275
Episode:13206 meanR:0.2500 rate:0.1538 gloss:-8.5086 dloss:1.0114 dlossR:0.9827 dlossQ:0.0287
Episode:13207 meanR:0.2400 rate:0.0000 gloss:-7.8782 dloss:0.0388 dlossR:0.0035 dlossQ:0.0353
Episode:13208 meanR:0.2400 rate:-0.0769 gloss:-8.0860 dloss:-0.4144 dlossR:-0.4453 dlossQ:0.0309
Episode:13209 meanR:0.2500 rate:0.0000 gloss:-7.6627 dloss:0.0443 dlossR:0.0047 dlossQ:0.0397
Episode:13210 meanR:0.2500 rate:0.0769 gloss:-8.4971 dloss:0.5142 dlossR:0.4857 dlossQ:0.0285
Episode:13211 meanR:0.2400 rate:-0.0769 gloss:-8.7421 dloss:-0.4557 dlossR:-0.4833 dlossQ:0.0276
Episode:13212 meanR:0.2300 rate:0.0000 gloss:-8.9076 dloss:0.0272 dlossR:0.0018 dlossQ:0.0254
Episode:13213 meanR:0.2000 rate:-0.1538 gloss:-8.3147 dloss:-0.8784 dlossR:-0.9076 dlossQ:0.0293
Episode:13214 meanR:0.1800 rate:0.0000 gloss:-8.

Episode:13290 meanR:-0.1200 rate:0.0769 gloss:-13.5502 dloss:0.7880 dlossR:0.7689 dlossQ:0.0191
Episode:13291 meanR:-0.1200 rate:0.0000 gloss:-14.7672 dloss:0.0208 dlossR:0.0001 dlossQ:0.0207
Episode:13292 meanR:-0.1200 rate:0.0000 gloss:-15.0937 dloss:0.0182 dlossR:0.0000 dlossQ:0.0182
Episode:13293 meanR:-0.1200 rate:0.0000 gloss:-15.8281 dloss:0.0228 dlossR:0.0000 dlossQ:0.0227
Episode:13294 meanR:-0.1000 rate:0.0769 gloss:-17.3126 dloss:1.0025 dlossR:0.9789 dlossQ:0.0236
Episode:13295 meanR:-0.1300 rate:-0.0769 gloss:-14.5908 dloss:-0.7942 dlossR:-0.8141 dlossQ:0.0199
Episode:13296 meanR:-0.1500 rate:-0.0769 gloss:-15.5106 dloss:-0.8509 dlossR:-0.8670 dlossQ:0.0161
Episode:13297 meanR:-0.1200 rate:0.2308 gloss:-15.1823 dloss:2.6376 dlossR:2.6142 dlossQ:0.0234
Episode:13298 meanR:-0.1100 rate:0.0000 gloss:-15.5893 dloss:0.0190 dlossR:0.0000 dlossQ:0.0190
Episode:13299 meanR:-0.1000 rate:0.0000 gloss:-18.3631 dloss:0.0200 dlossR:0.0000 dlossQ:0.0200
Episode:13300 meanR:-0.1000 rate:0

Episode:13376 meanR:0.3400 rate:0.3077 gloss:-4.8136 dloss:1.3891 dlossR:1.2089 dlossQ:0.1802
Episode:13377 meanR:0.3900 rate:0.2308 gloss:-4.4424 dloss:1.0282 dlossR:0.8393 dlossQ:0.1889
Episode:13378 meanR:0.4100 rate:0.1538 gloss:-5.3653 dloss:0.7829 dlossR:0.6503 dlossQ:0.1326
Episode:13379 meanR:0.4300 rate:0.1538 gloss:-5.4285 dloss:0.8006 dlossR:0.6594 dlossQ:0.1412
Episode:13380 meanR:0.4200 rate:-0.0769 gloss:-5.3167 dloss:-0.1292 dlossR:-0.2645 dlossQ:0.1353
Episode:13381 meanR:0.4200 rate:0.0000 gloss:-4.7581 dloss:0.1938 dlossR:0.0347 dlossQ:0.1591
Episode:13382 meanR:0.4400 rate:0.0769 gloss:-3.9427 dloss:0.5153 dlossR:0.2841 dlossQ:0.2312
Episode:13383 meanR:0.4500 rate:0.0769 gloss:-4.2880 dloss:0.5168 dlossR:0.2987 dlossQ:0.2182
Episode:13384 meanR:0.4200 rate:-0.2308 gloss:-4.1831 dloss:-0.3741 dlossR:-0.5853 dlossQ:0.2113
Episode:13385 meanR:0.4300 rate:0.0769 gloss:-4.0001 dloss:0.5033 dlossR:0.2825 dlossQ:0.2209
Episode:13386 meanR:0.4300 rate:0.0000 gloss:-3.9738 d

Episode:13462 meanR:0.0100 rate:0.0000 gloss:-10.5774 dloss:0.0194 dlossR:0.0006 dlossQ:0.0189
Episode:13463 meanR:0.0200 rate:0.0769 gloss:-12.1821 dloss:0.7072 dlossR:0.6914 dlossQ:0.0158
Episode:13464 meanR:0.0200 rate:0.0000 gloss:-12.8777 dloss:0.0236 dlossR:0.0003 dlossQ:0.0233
Episode:13465 meanR:0.0200 rate:0.0000 gloss:-11.9699 dloss:0.0145 dlossR:0.0004 dlossQ:0.0142
Episode:13466 meanR:0.0200 rate:0.0000 gloss:-11.8325 dloss:0.0193 dlossR:0.0005 dlossQ:0.0188
Episode:13467 meanR:0.0400 rate:0.0769 gloss:-9.2781 dloss:0.5511 dlossR:0.5291 dlossQ:0.0220
Episode:13468 meanR:0.0400 rate:0.0000 gloss:-9.2827 dloss:0.0284 dlossR:0.0021 dlossQ:0.0263
Episode:13469 meanR:0.0500 rate:0.0769 gloss:-9.0247 dloss:0.5413 dlossR:0.5154 dlossQ:0.0259
Episode:13470 meanR:0.0500 rate:0.0769 gloss:-9.3233 dloss:0.5592 dlossR:0.5335 dlossQ:0.0257
Episode:13471 meanR:0.0700 rate:0.0769 gloss:-9.1362 dloss:0.5491 dlossR:0.5212 dlossQ:0.0279
Episode:13472 meanR:0.0700 rate:0.0000 gloss:-9.4043 dl

Episode:13548 meanR:0.1500 rate:0.0769 gloss:-5.2218 dloss:0.4543 dlossR:0.3244 dlossQ:0.1299
Episode:13549 meanR:0.1600 rate:0.1538 gloss:-5.7794 dloss:0.8010 dlossR:0.6931 dlossQ:0.1080
Episode:13550 meanR:0.1600 rate:0.0000 gloss:-3.7461 dloss:0.3088 dlossR:0.0653 dlossQ:0.2435
Episode:13551 meanR:0.1800 rate:0.0769 gloss:-6.1333 dloss:0.4779 dlossR:0.3706 dlossQ:0.1072
Episode:13552 meanR:0.1900 rate:0.0769 gloss:-6.2204 dloss:0.4700 dlossR:0.3722 dlossQ:0.0978
Episode:13553 meanR:0.1900 rate:0.0000 gloss:-7.5627 dloss:0.0609 dlossR:0.0073 dlossQ:0.0536
Episode:13554 meanR:0.1800 rate:0.0000 gloss:-7.8696 dloss:0.0755 dlossR:0.0091 dlossQ:0.0663
Episode:13555 meanR:0.1800 rate:0.2308 gloss:-5.9811 dloss:1.2391 dlossR:1.0859 dlossQ:0.1532
Episode:13556 meanR:0.1700 rate:0.0000 gloss:-5.8736 dloss:0.1171 dlossR:0.0180 dlossQ:0.0991
Episode:13557 meanR:0.2000 rate:0.1538 gloss:-6.1170 dloss:0.8240 dlossR:0.7270 dlossQ:0.0970
Episode:13558 meanR:0.2100 rate:0.0000 gloss:-6.0556 dloss:0

Episode:13635 meanR:0.7600 rate:0.0769 gloss:-4.6332 dloss:0.4805 dlossR:0.3016 dlossQ:0.1789
Episode:13636 meanR:0.7900 rate:0.0769 gloss:-4.2091 dloss:0.4858 dlossR:0.2872 dlossQ:0.1987
Episode:13637 meanR:0.8000 rate:0.0000 gloss:-3.6900 dloss:0.3235 dlossR:0.0694 dlossQ:0.2541
Episode:13638 meanR:0.8100 rate:0.0769 gloss:-4.1275 dloss:0.4964 dlossR:0.2867 dlossQ:0.2097
Episode:13639 meanR:0.8100 rate:0.0769 gloss:-4.4931 dloss:0.4662 dlossR:0.2945 dlossQ:0.1716
Episode:13640 meanR:0.8100 rate:0.0769 gloss:-4.4497 dloss:0.4735 dlossR:0.2939 dlossQ:0.1796
Episode:13641 meanR:0.8300 rate:0.0769 gloss:-5.4918 dloss:0.4506 dlossR:0.3354 dlossQ:0.1153
Episode:13642 meanR:0.8400 rate:0.0000 gloss:-5.2254 dloss:0.1644 dlossR:0.0280 dlossQ:0.1364
Episode:13643 meanR:0.8100 rate:-0.0769 gloss:-4.8546 dloss:-0.0789 dlossR:-0.2314 dlossQ:0.1524
Episode:13644 meanR:0.8200 rate:0.0769 gloss:-4.4993 dloss:0.4712 dlossR:0.2959 dlossQ:0.1753
Episode:13645 meanR:0.8300 rate:0.0769 gloss:-4.6759 dlos

Episode:13722 meanR:0.0200 rate:0.0000 gloss:-10.2099 dloss:0.0186 dlossR:0.0007 dlossQ:0.0179
Episode:13723 meanR:0.0200 rate:0.0000 gloss:-9.0626 dloss:0.0289 dlossR:0.0018 dlossQ:0.0271
Episode:13724 meanR:0.0300 rate:0.0769 gloss:-9.6226 dloss:0.5700 dlossR:0.5481 dlossQ:0.0218
Episode:13725 meanR:0.0600 rate:0.2308 gloss:-9.5050 dloss:1.6823 dlossR:1.6571 dlossQ:0.0252
Episode:13726 meanR:0.0500 rate:-0.1538 gloss:-9.7372 dloss:-1.0477 dlossR:-1.0696 dlossQ:0.0219
Episode:13727 meanR:0.0300 rate:0.0000 gloss:-12.3961 dloss:0.0195 dlossR:0.0004 dlossQ:0.0192
Episode:13728 meanR:0.0600 rate:0.2308 gloss:-9.8117 dloss:1.7318 dlossR:1.7098 dlossQ:0.0220
Episode:13729 meanR:0.0600 rate:-0.0769 gloss:-9.2468 dloss:-0.4902 dlossR:-0.5121 dlossQ:0.0219
Episode:13730 meanR:0.0500 rate:0.0000 gloss:-9.8993 dloss:0.0219 dlossR:0.0009 dlossQ:0.0209
Episode:13731 meanR:0.0400 rate:-0.0769 gloss:-8.8685 dloss:-0.4654 dlossR:-0.4904 dlossQ:0.0250
Episode:13732 meanR:0.0400 rate:0.0769 gloss:-9.7

Episode:13809 meanR:0.2800 rate:0.0000 gloss:-4.7220 dloss:0.1870 dlossR:0.0331 dlossQ:0.1539
Episode:13810 meanR:0.2800 rate:0.0000 gloss:-4.9497 dloss:0.1786 dlossR:0.0319 dlossQ:0.1467
Episode:13811 meanR:0.3000 rate:0.1538 gloss:-4.4048 dloss:0.7521 dlossR:0.5594 dlossQ:0.1927
Episode:13812 meanR:0.3200 rate:0.0769 gloss:-5.0284 dloss:0.4481 dlossR:0.3144 dlossQ:0.1338
Episode:13813 meanR:0.3300 rate:0.0000 gloss:-5.0426 dloss:0.1583 dlossR:0.0268 dlossQ:0.1315
Episode:13814 meanR:0.3600 rate:0.2308 gloss:-4.3858 dloss:1.0211 dlossR:0.8299 dlossQ:0.1912
Episode:13815 meanR:0.3600 rate:0.0000 gloss:-6.1973 dloss:0.0939 dlossR:0.0131 dlossQ:0.0808
Episode:13816 meanR:0.3500 rate:-0.0769 gloss:-5.0487 dloss:-0.1201 dlossR:-0.2494 dlossQ:0.1293
Episode:13817 meanR:0.3500 rate:0.0000 gloss:-6.6916 dloss:0.0930 dlossR:0.0131 dlossQ:0.0798
Episode:13818 meanR:0.3500 rate:0.0000 gloss:-5.9824 dloss:0.1188 dlossR:0.0183 dlossQ:0.1005
Episode:13819 meanR:0.3500 rate:0.0769 gloss:-4.7960 dlos

Episode:13896 meanR:0.0900 rate:0.2308 gloss:-8.3889 dloss:1.4997 dlossR:1.4708 dlossQ:0.0290
Episode:13897 meanR:0.0900 rate:0.0000 gloss:-8.8187 dloss:0.0254 dlossR:0.0017 dlossQ:0.0237
Episode:13898 meanR:0.0800 rate:0.0000 gloss:-8.2266 dloss:0.0369 dlossR:0.0035 dlossQ:0.0335
Episode:13899 meanR:0.0700 rate:0.0000 gloss:-8.6604 dloss:0.0298 dlossR:0.0023 dlossQ:0.0275
Episode:13900 meanR:0.0800 rate:0.0769 gloss:-7.5103 dloss:0.4734 dlossR:0.4324 dlossQ:0.0410
Episode:13901 meanR:0.1000 rate:0.0000 gloss:-7.8925 dloss:0.0359 dlossR:0.0035 dlossQ:0.0324
Episode:13902 meanR:0.1200 rate:0.0000 gloss:-7.5514 dloss:0.0435 dlossR:0.0044 dlossQ:0.0391
Episode:13903 meanR:0.1300 rate:0.0769 gloss:-7.6979 dloss:0.4775 dlossR:0.4425 dlossQ:0.0350
Episode:13904 meanR:0.1400 rate:0.0769 gloss:-7.1889 dloss:0.4611 dlossR:0.4155 dlossQ:0.0456
Episode:13905 meanR:0.1500 rate:0.0769 gloss:-7.1620 dloss:0.4678 dlossR:0.4150 dlossQ:0.0528
Episode:13906 meanR:0.1400 rate:0.0000 gloss:-7.6968 dloss:0

Episode:13983 meanR:0.3400 rate:0.0769 gloss:-4.6314 dloss:0.6574 dlossR:0.3592 dlossQ:0.2982
Episode:13984 meanR:0.3900 rate:0.3846 gloss:-1.3478 dloss:1.4444 dlossR:0.8190 dlossQ:0.6254
Episode:13985 meanR:0.4200 rate:0.2308 gloss:-2.6647 dloss:1.1062 dlossR:0.6562 dlossQ:0.4500
Episode:13986 meanR:0.4100 rate:-0.0769 gloss:-2.7191 dloss:0.4787 dlossR:0.0287 dlossQ:0.4500
Episode:13987 meanR:0.4200 rate:0.0769 gloss:-2.6255 dloss:0.7779 dlossR:0.3199 dlossQ:0.4580
Episode:13988 meanR:0.4400 rate:0.1538 gloss:-2.8814 dloss:0.8987 dlossR:0.4755 dlossQ:0.4231
Episode:13989 meanR:0.4300 rate:-0.0769 gloss:-4.6183 dloss:-0.0639 dlossR:-0.2188 dlossQ:0.1549
Episode:13990 meanR:0.4400 rate:0.0769 gloss:-3.0607 dloss:0.6623 dlossR:0.2938 dlossQ:0.3686
Episode:13991 meanR:0.4700 rate:0.3077 gloss:-2.8262 dloss:1.2254 dlossR:0.8363 dlossQ:0.3891
Episode:13992 meanR:0.4900 rate:0.1538 gloss:-4.5401 dloss:0.8146 dlossR:0.5872 dlossQ:0.2274
Episode:13993 meanR:0.5000 rate:0.0769 gloss:-5.2972 dlo

Episode:14070 meanR:0.5600 rate:-0.0769 gloss:-5.8763 dloss:-0.2171 dlossR:-0.3072 dlossQ:0.0901
Episode:14071 meanR:0.4900 rate:-0.0769 gloss:-6.0491 dloss:-0.2408 dlossR:-0.3198 dlossQ:0.0790
Episode:14072 meanR:0.4500 rate:-0.0769 gloss:-5.8697 dloss:-0.2224 dlossR:-0.3084 dlossQ:0.0860
Episode:14073 meanR:0.4000 rate:-0.0769 gloss:-6.6762 dloss:-0.2970 dlossR:-0.3593 dlossQ:0.0623
Episode:14074 meanR:0.3900 rate:0.0000 gloss:-8.0158 dloss:0.0506 dlossR:0.0048 dlossQ:0.0458
Episode:14075 meanR:0.4000 rate:0.0769 gloss:-6.5884 dloss:0.4539 dlossR:0.3859 dlossQ:0.0680
Episode:14076 meanR:0.4000 rate:0.1538 gloss:-6.2215 dloss:0.8185 dlossR:0.7349 dlossQ:0.0837
Episode:14077 meanR:0.3900 rate:0.0000 gloss:-7.4248 dloss:0.0482 dlossR:0.0051 dlossQ:0.0430
Episode:14078 meanR:0.4000 rate:0.0000 gloss:-7.0144 dloss:0.0598 dlossR:0.0073 dlossQ:0.0525
Episode:14079 meanR:0.4200 rate:0.0769 gloss:-7.1800 dloss:0.4697 dlossR:0.4158 dlossQ:0.0539
Episode:14080 meanR:0.4200 rate:0.0000 gloss:-7.

Episode:14157 meanR:0.1600 rate:0.0000 gloss:-6.8838 dloss:0.0678 dlossR:0.0089 dlossQ:0.0589
Episode:14158 meanR:0.1500 rate:0.0000 gloss:-5.7259 dloss:0.1107 dlossR:0.0165 dlossQ:0.0942
Episode:14159 meanR:0.1500 rate:0.0000 gloss:-6.4583 dloss:0.0722 dlossR:0.0097 dlossQ:0.0625
Episode:14160 meanR:0.1100 rate:0.0000 gloss:-7.3013 dloss:0.0560 dlossR:0.0068 dlossQ:0.0492
Episode:14161 meanR:0.0900 rate:-0.1538 gloss:-8.2622 dloss:-0.8678 dlossR:-0.9013 dlossQ:0.0335
Episode:14162 meanR:0.0800 rate:-0.0769 gloss:-6.9196 dloss:-0.3149 dlossR:-0.3738 dlossQ:0.0590
Episode:14163 meanR:0.1000 rate:0.0769 gloss:-5.7648 dloss:0.4452 dlossR:0.3472 dlossQ:0.0981
Episode:14164 meanR:0.1600 rate:0.2308 gloss:-5.6566 dloss:1.1270 dlossR:1.0244 dlossQ:0.1026
Episode:14165 meanR:0.1800 rate:0.0769 gloss:-6.5121 dloss:0.4625 dlossR:0.3844 dlossQ:0.0781
Episode:14166 meanR:0.1900 rate:0.0769 gloss:-6.9790 dloss:0.4711 dlossR:0.4072 dlossQ:0.0639
Episode:14167 meanR:0.1700 rate:-0.0769 gloss:-8.0077 

Episode:14244 meanR:0.1000 rate:-0.2308 gloss:-7.4847 dloss:-1.1569 dlossR:-1.2019 dlossQ:0.0450
Episode:14245 meanR:0.0900 rate:0.0000 gloss:-9.5905 dloss:0.0278 dlossR:0.0019 dlossQ:0.0259
Episode:14246 meanR:0.0900 rate:-0.0769 gloss:-7.7868 dloss:-0.3919 dlossR:-0.4275 dlossQ:0.0357
Episode:14247 meanR:0.0900 rate:0.0000 gloss:-6.9992 dloss:0.0578 dlossR:0.0070 dlossQ:0.0508
Episode:14248 meanR:0.0800 rate:0.0000 gloss:-6.7081 dloss:0.0714 dlossR:0.0091 dlossQ:0.0623
Episode:14249 meanR:0.1100 rate:0.1538 gloss:-6.7895 dloss:0.8530 dlossR:0.7951 dlossQ:0.0578
Episode:14250 meanR:0.1100 rate:0.0000 gloss:-7.7620 dloss:0.0415 dlossR:0.0041 dlossQ:0.0374
Episode:14251 meanR:0.1300 rate:0.0000 gloss:-7.6663 dloss:0.0407 dlossR:0.0039 dlossQ:0.0368
Episode:14252 meanR:0.1500 rate:0.0769 gloss:-12.0843 dloss:0.7018 dlossR:0.6855 dlossQ:0.0163
Episode:14253 meanR:0.1300 rate:0.0000 gloss:-10.2940 dloss:0.0238 dlossR:0.0012 dlossQ:0.0226
Episode:14254 meanR:0.1100 rate:0.0769 gloss:-7.3082

Episode:14331 meanR:-0.0500 rate:0.0000 gloss:-8.7362 dloss:0.0277 dlossR:0.0019 dlossQ:0.0258
Episode:14332 meanR:-0.0500 rate:0.0000 gloss:-8.7889 dloss:0.0257 dlossR:0.0018 dlossQ:0.0239
Episode:14333 meanR:-0.0700 rate:-0.0769 gloss:-9.2447 dloss:-0.4914 dlossR:-0.5123 dlossQ:0.0209
Episode:14334 meanR:-0.0700 rate:0.0769 gloss:-9.7201 dloss:0.5753 dlossR:0.5534 dlossQ:0.0219
Episode:14335 meanR:-0.0600 rate:0.0000 gloss:-10.0663 dloss:0.0193 dlossR:0.0008 dlossQ:0.0186
Episode:14336 meanR:-0.0800 rate:-0.1538 gloss:-9.5039 dloss:-1.0231 dlossR:-1.0430 dlossQ:0.0200
Episode:14337 meanR:-0.0600 rate:0.1538 gloss:-8.7091 dloss:1.0305 dlossR:1.0055 dlossQ:0.0251
Episode:14338 meanR:-0.0500 rate:0.0000 gloss:-9.3307 dloss:0.0246 dlossR:0.0013 dlossQ:0.0233
Episode:14339 meanR:-0.0500 rate:-0.0769 gloss:-10.0510 dloss:-0.5390 dlossR:-0.5582 dlossQ:0.0192
Episode:14340 meanR:-0.0500 rate:-0.0769 gloss:-10.5867 dloss:-0.5714 dlossR:-0.5885 dlossQ:0.0170
Episode:14341 meanR:-0.0500 rate:0.

Episode:14418 meanR:-0.0400 rate:-0.0769 gloss:-11.1989 dloss:-0.6058 dlossR:-0.6226 dlossQ:0.0169
Episode:14419 meanR:-0.0500 rate:-0.1538 gloss:-10.7060 dloss:-1.1623 dlossR:-1.1797 dlossQ:0.0174
Episode:14420 meanR:-0.0600 rate:0.0000 gloss:-12.9134 dloss:0.0183 dlossR:0.0002 dlossQ:0.0182
Episode:14421 meanR:-0.0600 rate:0.0000 gloss:-10.3224 dloss:0.0243 dlossR:0.0009 dlossQ:0.0234
Episode:14422 meanR:-0.0600 rate:0.0000 gloss:-10.0870 dloss:0.0191 dlossR:0.0008 dlossQ:0.0184
Episode:14423 meanR:-0.0700 rate:0.0000 gloss:-9.7375 dloss:0.0208 dlossR:0.0010 dlossQ:0.0197
Episode:14424 meanR:-0.0700 rate:0.0000 gloss:-10.4719 dloss:0.0215 dlossR:0.0007 dlossQ:0.0208
Episode:14425 meanR:-0.0700 rate:0.0000 gloss:-10.6544 dloss:0.0182 dlossR:0.0005 dlossQ:0.0176
Episode:14426 meanR:-0.0500 rate:0.0000 gloss:-12.0979 dloss:0.0161 dlossR:0.0003 dlossQ:0.0158
Episode:14427 meanR:-0.0300 rate:0.1538 gloss:-13.5079 dloss:1.5588 dlossR:1.5435 dlossQ:0.0153
Episode:14428 meanR:-0.0400 rate:-0

Episode:14504 meanR:0.0500 rate:0.0000 gloss:-7.1535 dloss:0.0608 dlossR:0.0071 dlossQ:0.0538
Episode:14505 meanR:0.0400 rate:0.0000 gloss:-6.9663 dloss:0.0614 dlossR:0.0073 dlossQ:0.0541
Episode:14506 meanR:0.0700 rate:0.0000 gloss:-7.0027 dloss:0.0573 dlossR:0.0068 dlossQ:0.0506
Episode:14507 meanR:0.0700 rate:0.0000 gloss:-6.2549 dloss:0.0826 dlossR:0.0113 dlossQ:0.0713
Episode:14508 meanR:0.1000 rate:0.1538 gloss:-5.4150 dloss:0.7583 dlossR:0.6504 dlossQ:0.1079
Episode:14509 meanR:0.1100 rate:0.0000 gloss:-5.5269 dloss:0.1162 dlossR:0.0179 dlossQ:0.0983
Episode:14510 meanR:0.1100 rate:0.0000 gloss:-6.5529 dloss:0.0687 dlossR:0.0088 dlossQ:0.0599
Episode:14511 meanR:0.1000 rate:-0.0769 gloss:-7.0684 dloss:-0.3204 dlossR:-0.3818 dlossQ:0.0615
Episode:14512 meanR:0.1100 rate:0.0000 gloss:-6.6477 dloss:0.0827 dlossR:0.0113 dlossQ:0.0714
Episode:14513 meanR:0.1100 rate:0.0000 gloss:-5.9556 dloss:0.0910 dlossR:0.0128 dlossQ:0.0782
Episode:14514 meanR:0.1100 rate:0.0000 gloss:-5.7790 dlos

Episode:14591 meanR:-0.0200 rate:0.0000 gloss:-10.2455 dloss:0.0195 dlossR:0.0008 dlossQ:0.0186
Episode:14592 meanR:-0.0200 rate:0.0000 gloss:-9.8164 dloss:0.0242 dlossR:0.0011 dlossQ:0.0230
Episode:14593 meanR:-0.0300 rate:0.0000 gloss:-11.2092 dloss:0.0169 dlossR:0.0003 dlossQ:0.0166
Episode:14594 meanR:-0.0300 rate:0.0000 gloss:-10.4628 dloss:0.0209 dlossR:0.0009 dlossQ:0.0200
Episode:14595 meanR:-0.0300 rate:0.0000 gloss:-8.4633 dloss:0.0350 dlossR:0.0032 dlossQ:0.0318
Episode:14596 meanR:-0.0300 rate:0.0000 gloss:-8.0517 dloss:0.0365 dlossR:0.0031 dlossQ:0.0333
Episode:14597 meanR:-0.0200 rate:0.0769 gloss:-9.4402 dloss:0.5628 dlossR:0.5384 dlossQ:0.0244
Episode:14598 meanR:-0.0400 rate:-0.0769 gloss:-7.9001 dloss:-0.3964 dlossR:-0.4337 dlossQ:0.0373
Episode:14599 meanR:-0.0200 rate:0.0000 gloss:-9.2097 dloss:0.0261 dlossR:0.0015 dlossQ:0.0246
Episode:14600 meanR:-0.0300 rate:-0.0769 gloss:-9.4376 dloss:-0.4977 dlossR:-0.5223 dlossQ:0.0246
Episode:14601 meanR:-0.0400 rate:-0.0769 

Episode:14678 meanR:0.2600 rate:0.0000 gloss:-5.8827 dloss:0.1097 dlossR:0.0155 dlossQ:0.0942
Episode:14679 meanR:0.2600 rate:0.0000 gloss:-6.8051 dloss:0.0669 dlossR:0.0081 dlossQ:0.0588
Episode:14680 meanR:0.2800 rate:0.1538 gloss:-5.2116 dloss:0.7641 dlossR:0.6333 dlossQ:0.1308
Episode:14681 meanR:0.2900 rate:0.0000 gloss:-5.9406 dloss:0.1090 dlossR:0.0167 dlossQ:0.0924
Episode:14682 meanR:0.3100 rate:0.1538 gloss:-7.1869 dloss:0.8914 dlossR:0.8380 dlossQ:0.0534
Episode:14683 meanR:0.3100 rate:-0.1538 gloss:-6.7368 dloss:-0.6610 dlossR:-0.7228 dlossQ:0.0618
Episode:14684 meanR:0.2800 rate:-0.2308 gloss:-5.6546 dloss:-0.7713 dlossR:-0.8757 dlossQ:0.1044
Episode:14685 meanR:0.3700 rate:0.6154 gloss:-3.9466 dloss:2.4417 dlossR:2.1939 dlossQ:0.2478
Episode:14686 meanR:0.3800 rate:0.0769 gloss:-5.0425 dloss:0.4530 dlossR:0.3161 dlossQ:0.1369
Episode:14687 meanR:0.3600 rate:-0.0769 gloss:-5.6369 dloss:-0.1907 dlossR:-0.2911 dlossQ:0.1004
Episode:14688 meanR:0.3500 rate:0.0000 gloss:-6.138

Episode:14765 meanR:0.2100 rate:0.0769 gloss:-5.5027 dloss:0.4419 dlossR:0.3347 dlossQ:0.1072
Episode:14766 meanR:0.2000 rate:-0.0769 gloss:-5.5530 dloss:-0.1803 dlossR:-0.2852 dlossQ:0.1049
Episode:14767 meanR:0.2100 rate:0.0000 gloss:-5.8193 dloss:0.1125 dlossR:0.0166 dlossQ:0.0958
Episode:14768 meanR:0.2100 rate:0.0000 gloss:-5.7494 dloss:0.1118 dlossR:0.0169 dlossQ:0.0949
Episode:14769 meanR:0.2100 rate:0.0000 gloss:-6.2453 dloss:0.0922 dlossR:0.0130 dlossQ:0.0793
Episode:14770 meanR:0.1800 rate:-0.0769 gloss:-6.3486 dloss:-0.2652 dlossR:-0.3384 dlossQ:0.0733
Episode:14771 meanR:0.1900 rate:0.0769 gloss:-5.5264 dloss:0.4409 dlossR:0.3355 dlossQ:0.1054
Episode:14772 meanR:0.2000 rate:0.0769 gloss:-5.7363 dloss:0.4413 dlossR:0.3453 dlossQ:0.0960
Episode:14773 meanR:0.1900 rate:0.0769 gloss:-5.8838 dloss:0.4402 dlossR:0.3512 dlossQ:0.0890
Episode:14774 meanR:0.1700 rate:-0.1538 gloss:-6.2998 dloss:-0.5978 dlossR:-0.6711 dlossQ:0.0734
Episode:14775 meanR:0.1500 rate:-0.1538 gloss:-7.12

Episode:14852 meanR:0.2000 rate:-0.1538 gloss:-5.4780 dloss:-0.4617 dlossR:-0.5689 dlossQ:0.1072
Episode:14853 meanR:0.2000 rate:0.0000 gloss:-5.6399 dloss:0.1240 dlossR:0.0191 dlossQ:0.1048
Episode:14854 meanR:0.2000 rate:0.0000 gloss:-5.9857 dloss:0.0990 dlossR:0.0145 dlossQ:0.0845
Episode:14855 meanR:0.2000 rate:-0.0769 gloss:-6.7378 dloss:-0.3036 dlossR:-0.3633 dlossQ:0.0597
Episode:14856 meanR:0.2200 rate:0.1538 gloss:-4.6708 dloss:0.7333 dlossR:0.5783 dlossQ:0.1550
Episode:14857 meanR:0.2100 rate:0.0000 gloss:-5.4201 dloss:0.1383 dlossR:0.0228 dlossQ:0.1155
Episode:14858 meanR:0.2000 rate:-0.0769 gloss:-5.5172 dloss:-0.1816 dlossR:-0.2835 dlossQ:0.1020
Episode:14859 meanR:0.2000 rate:0.0000 gloss:-4.9121 dloss:0.1614 dlossR:0.0275 dlossQ:0.1339
Episode:14860 meanR:0.1900 rate:0.0000 gloss:-5.1847 dloss:0.1447 dlossR:0.0240 dlossQ:0.1207
Episode:14861 meanR:0.1700 rate:-0.0769 gloss:-5.7822 dloss:-0.2015 dlossR:-0.3005 dlossQ:0.0989
Episode:14862 meanR:0.1600 rate:0.0000 gloss:-6.

Episode:14939 meanR:0.0700 rate:-0.0769 gloss:-7.2173 dloss:-0.3371 dlossR:-0.3916 dlossQ:0.0545
Episode:14940 meanR:0.0800 rate:0.0769 gloss:-7.6779 dloss:0.5052 dlossR:0.4448 dlossQ:0.0604
Episode:14941 meanR:0.0700 rate:0.0000 gloss:-7.3384 dloss:0.0631 dlossR:0.0076 dlossQ:0.0556
Episode:14942 meanR:0.0700 rate:0.0000 gloss:-7.2973 dloss:0.0651 dlossR:0.0076 dlossQ:0.0575
Episode:14943 meanR:0.0700 rate:0.0000 gloss:-6.2138 dloss:0.1017 dlossR:0.0151 dlossQ:0.0867
Episode:14944 meanR:0.0700 rate:0.0000 gloss:-6.8508 dloss:0.0713 dlossR:0.0096 dlossQ:0.0617
Episode:14945 meanR:0.0800 rate:0.0000 gloss:-6.5904 dloss:0.0701 dlossR:0.0092 dlossQ:0.0609
Episode:14946 meanR:-0.0100 rate:-0.3077 gloss:-7.0792 dloss:-1.4270 dlossR:-1.4911 dlossQ:0.0641
Episode:14947 meanR:-0.0100 rate:0.0000 gloss:-9.2955 dloss:0.0518 dlossR:0.0036 dlossQ:0.0482
Episode:14948 meanR:-0.0200 rate:-0.0769 gloss:-10.2902 dloss:-0.5418 dlossR:-0.5703 dlossQ:0.0285
Episode:14949 meanR:-0.0100 rate:0.1538 gloss:-

Episode:15025 meanR:0.0100 rate:0.0000 gloss:-10.6677 dloss:0.0171 dlossR:0.0007 dlossQ:0.0164
Episode:15026 meanR:0.0200 rate:0.0000 gloss:-9.4307 dloss:0.0288 dlossR:0.0021 dlossQ:0.0267
Episode:15027 meanR:0.0100 rate:-0.0769 gloss:-8.6107 dloss:-0.4479 dlossR:-0.4760 dlossQ:0.0280
Episode:15028 meanR:0.0100 rate:0.1538 gloss:-8.7933 dloss:1.0484 dlossR:1.0150 dlossQ:0.0334
Episode:15029 meanR:0.0000 rate:-0.0769 gloss:-10.1835 dloss:-0.5363 dlossR:-0.5648 dlossQ:0.0285
Episode:15030 meanR:0.0100 rate:0.0000 gloss:-9.2963 dloss:0.0361 dlossR:0.0025 dlossQ:0.0336
Episode:15031 meanR:0.0100 rate:0.0000 gloss:-8.8710 dloss:0.0336 dlossR:0.0030 dlossQ:0.0306
Episode:15032 meanR:-0.0100 rate:-0.1538 gloss:-8.5714 dloss:-0.9005 dlossR:-0.9366 dlossQ:0.0361
Episode:15033 meanR:0.0000 rate:0.1538 gloss:-7.7017 dloss:0.9467 dlossR:0.8953 dlossQ:0.0514
Episode:15034 meanR:0.0300 rate:0.2308 gloss:-6.6398 dloss:1.2658 dlossR:1.1876 dlossQ:0.0782
Episode:15035 meanR:0.0200 rate:0.0000 gloss:-8.

Episode:15112 meanR:0.1900 rate:0.0000 gloss:-6.6096 dloss:0.0761 dlossR:0.0099 dlossQ:0.0661
Episode:15113 meanR:0.2100 rate:0.1538 gloss:-7.7208 dloss:0.9450 dlossR:0.8960 dlossQ:0.0490
Episode:15114 meanR:0.1900 rate:0.0000 gloss:-7.5892 dloss:0.0537 dlossR:0.0063 dlossQ:0.0474
Episode:15115 meanR:0.1900 rate:0.0000 gloss:-6.4412 dloss:0.0779 dlossR:0.0109 dlossQ:0.0670
Episode:15116 meanR:0.1900 rate:0.0000 gloss:-6.5017 dloss:0.0772 dlossR:0.0102 dlossQ:0.0669
Episode:15117 meanR:0.1800 rate:0.0000 gloss:-6.2312 dloss:0.0837 dlossR:0.0115 dlossQ:0.0722
Episode:15118 meanR:0.1900 rate:0.0769 gloss:-7.0470 dloss:0.4701 dlossR:0.4092 dlossQ:0.0609
Episode:15119 meanR:0.1900 rate:0.0000 gloss:-6.6822 dloss:0.0850 dlossR:0.0109 dlossQ:0.0741
Episode:15120 meanR:0.1900 rate:0.0000 gloss:-7.1639 dloss:0.0643 dlossR:0.0080 dlossQ:0.0562
Episode:15121 meanR:0.1400 rate:0.0000 gloss:-6.4140 dloss:0.0883 dlossR:0.0124 dlossQ:0.0759
Episode:15122 meanR:0.1300 rate:0.0000 gloss:-7.3118 dloss:0

Episode:15199 meanR:0.1000 rate:-0.0769 gloss:-8.6229 dloss:-0.4228 dlossR:-0.4728 dlossQ:0.0500
Episode:15200 meanR:0.1000 rate:0.0000 gloss:-10.7458 dloss:0.0442 dlossR:0.0034 dlossQ:0.0408
Episode:15201 meanR:0.0900 rate:-0.0769 gloss:-7.0543 dloss:-0.2893 dlossR:-0.3748 dlossQ:0.0855
Episode:15202 meanR:0.0600 rate:-0.2308 gloss:-7.6046 dloss:-1.1650 dlossR:-1.2197 dlossQ:0.0547
Episode:15203 meanR:0.0700 rate:0.1538 gloss:-8.8638 dloss:1.0791 dlossR:1.0259 dlossQ:0.0532
Episode:15204 meanR:0.0900 rate:0.0769 gloss:-7.8570 dloss:0.5040 dlossR:0.4537 dlossQ:0.0503
Episode:15205 meanR:0.0600 rate:-0.2308 gloss:-10.2747 dloss:-1.6401 dlossR:-1.6763 dlossQ:0.0362
Episode:15206 meanR:0.0600 rate:0.0000 gloss:-8.9081 dloss:0.0376 dlossR:0.0035 dlossQ:0.0341
Episode:15207 meanR:0.0500 rate:-0.0769 gloss:-11.1134 dloss:-0.5782 dlossR:-0.6155 dlossQ:0.0373
Episode:15208 meanR:0.0300 rate:0.0000 gloss:-7.2258 dloss:0.0842 dlossR:0.0092 dlossQ:0.0751
Episode:15209 meanR:0.0300 rate:0.0000 glo

Episode:15285 meanR:-0.0300 rate:-0.0769 gloss:-9.0854 dloss:-0.4610 dlossR:-0.5015 dlossQ:0.0405
Episode:15286 meanR:-0.0400 rate:0.0000 gloss:-8.7383 dloss:0.0300 dlossR:0.0020 dlossQ:0.0280
Episode:15287 meanR:-0.0400 rate:0.0000 gloss:-7.0910 dloss:0.0524 dlossR:0.0058 dlossQ:0.0466
Episode:15288 meanR:-0.0300 rate:0.0769 gloss:-7.9934 dloss:0.4913 dlossR:0.4583 dlossQ:0.0330
Episode:15289 meanR:-0.0300 rate:0.0000 gloss:-12.8010 dloss:0.0195 dlossR:0.0005 dlossQ:0.0190
Episode:15290 meanR:-0.0200 rate:0.1538 gloss:-8.9201 dloss:1.0797 dlossR:1.0310 dlossQ:0.0487
Episode:15291 meanR:-0.0400 rate:0.0000 gloss:-9.8403 dloss:0.0418 dlossR:0.0042 dlossQ:0.0376
Episode:15292 meanR:-0.0400 rate:-0.0769 gloss:-8.2995 dloss:-0.4265 dlossR:-0.4580 dlossQ:0.0315
Episode:15293 meanR:-0.0500 rate:-0.0769 gloss:-12.5241 dloss:-0.6644 dlossR:-0.6939 dlossQ:0.0295
Episode:15294 meanR:-0.0500 rate:0.0000 gloss:-13.7244 dloss:0.0162 dlossR:0.0001 dlossQ:0.0161
Episode:15295 meanR:-0.0600 rate:-0.07

Episode:15371 meanR:-0.0200 rate:0.0000 gloss:-8.9593 dloss:0.0369 dlossR:0.0035 dlossQ:0.0335
Episode:15372 meanR:-0.0300 rate:0.0000 gloss:-10.3128 dloss:0.0312 dlossR:0.0018 dlossQ:0.0294
Episode:15373 meanR:-0.0600 rate:-0.0769 gloss:-9.5023 dloss:-0.4855 dlossR:-0.5233 dlossQ:0.0378
Episode:15374 meanR:-0.0800 rate:0.0000 gloss:-10.1867 dloss:0.0357 dlossR:0.0029 dlossQ:0.0328
Episode:15375 meanR:-0.0500 rate:0.0000 gloss:-15.3077 dloss:0.0311 dlossR:0.0017 dlossQ:0.0294
Episode:15376 meanR:-0.0600 rate:0.0000 gloss:-8.1993 dloss:0.0518 dlossR:0.0056 dlossQ:0.0462
Episode:15377 meanR:-0.0600 rate:0.0000 gloss:-8.9428 dloss:0.0297 dlossR:0.0022 dlossQ:0.0275
Episode:15378 meanR:-0.0500 rate:0.0000 gloss:-8.4258 dloss:0.0330 dlossR:0.0025 dlossQ:0.0305
Episode:15379 meanR:-0.0300 rate:-0.0769 gloss:-7.4151 dloss:-0.3623 dlossR:-0.4051 dlossQ:0.0428
Episode:15380 meanR:-0.0300 rate:0.0000 gloss:-7.8304 dloss:0.0430 dlossR:0.0044 dlossQ:0.0386
Episode:15381 meanR:0.0000 rate:0.1538 gl

Episode:15458 meanR:0.3100 rate:0.0000 gloss:-5.3690 dloss:0.1645 dlossR:0.0283 dlossQ:0.1363
Episode:15459 meanR:0.3900 rate:0.5385 gloss:-4.5210 dloss:2.2843 dlossR:2.0974 dlossQ:0.1870
Episode:15460 meanR:0.4000 rate:0.0000 gloss:-4.1821 dloss:0.2395 dlossR:0.0459 dlossQ:0.1936
Episode:15461 meanR:0.4100 rate:0.0769 gloss:-4.2749 dloss:0.4741 dlossR:0.2872 dlossQ:0.1869
Episode:15462 meanR:0.3900 rate:-0.1538 gloss:-5.3052 dloss:-0.4145 dlossR:-0.5437 dlossQ:0.1292
Episode:15463 meanR:0.3700 rate:-0.1538 gloss:-6.5518 dloss:-0.5761 dlossR:-0.6890 dlossQ:0.1129
Episode:15464 meanR:0.3700 rate:0.0000 gloss:-5.9365 dloss:0.1429 dlossR:0.0230 dlossQ:0.1199
Episode:15465 meanR:0.3700 rate:0.0000 gloss:-4.7851 dloss:0.2064 dlossR:0.0367 dlossQ:0.1698
Episode:15466 meanR:0.3700 rate:0.0000 gloss:-5.2094 dloss:0.1867 dlossR:0.0336 dlossQ:0.1531
Episode:15467 meanR:0.3700 rate:-0.0769 gloss:-5.4101 dloss:-0.1327 dlossR:-0.2705 dlossQ:0.1379
Episode:15468 meanR:0.3700 rate:0.0000 gloss:-4.999

Episode:15545 meanR:0.3400 rate:0.0769 gloss:-6.1498 dloss:0.4460 dlossR:0.3640 dlossQ:0.0821
Episode:15546 meanR:0.3400 rate:0.0000 gloss:-6.2605 dloss:0.0788 dlossR:0.0105 dlossQ:0.0683
Episode:15547 meanR:0.3400 rate:0.0000 gloss:-7.1080 dloss:0.0506 dlossR:0.0056 dlossQ:0.0450
Episode:15548 meanR:0.3100 rate:-0.1538 gloss:-7.1399 dloss:-0.7242 dlossR:-0.7719 dlossQ:0.0477
Episode:15549 meanR:0.2900 rate:-0.0769 gloss:-6.9039 dloss:-0.3174 dlossR:-0.3738 dlossQ:0.0564
Episode:15550 meanR:0.2900 rate:0.0000 gloss:-9.0210 dloss:0.0477 dlossR:0.0050 dlossQ:0.0427
Episode:15551 meanR:0.3200 rate:0.1538 gloss:-8.0730 dloss:0.9894 dlossR:0.9373 dlossQ:0.0521
Episode:15552 meanR:0.3000 rate:0.0000 gloss:-7.7564 dloss:0.0460 dlossR:0.0047 dlossQ:0.0413
Episode:15553 meanR:0.3100 rate:0.0000 gloss:-6.2355 dloss:0.0861 dlossR:0.0119 dlossQ:0.0742
Episode:15554 meanR:0.3400 rate:0.2308 gloss:-6.1549 dloss:1.1812 dlossR:1.1024 dlossQ:0.0787
Episode:15555 meanR:0.3500 rate:0.0769 gloss:-6.9051 d

Episode:15632 meanR:0.0000 rate:-0.1538 gloss:-7.2011 dloss:-0.7264 dlossR:-0.7808 dlossQ:0.0544
Episode:15633 meanR:-0.0100 rate:-0.0769 gloss:-9.3081 dloss:-0.4769 dlossR:-0.5131 dlossQ:0.0362
Episode:15634 meanR:-0.0200 rate:0.0000 gloss:-7.9253 dloss:0.0409 dlossR:0.0040 dlossQ:0.0369
Episode:15635 meanR:-0.0400 rate:-0.1538 gloss:-8.4477 dloss:-0.8876 dlossR:-0.9223 dlossQ:0.0346
Episode:15636 meanR:-0.0100 rate:0.1538 gloss:-7.7688 dloss:0.9446 dlossR:0.9014 dlossQ:0.0432
Episode:15637 meanR:0.0000 rate:0.0769 gloss:-7.7065 dloss:0.4861 dlossR:0.4436 dlossQ:0.0425
Episode:15638 meanR:0.0000 rate:0.0000 gloss:-7.4108 dloss:0.0689 dlossR:0.0093 dlossQ:0.0596
Episode:15639 meanR:0.0100 rate:0.0769 gloss:-7.8191 dloss:0.4922 dlossR:0.4496 dlossQ:0.0426
Episode:15640 meanR:0.0100 rate:0.0000 gloss:-8.7250 dloss:0.0340 dlossR:0.0025 dlossQ:0.0316
Episode:15641 meanR:0.0100 rate:0.0000 gloss:-7.8680 dloss:0.0412 dlossR:0.0038 dlossQ:0.0374
Episode:15642 meanR:0.0100 rate:0.0000 gloss:-8

Episode:15718 meanR:-0.0600 rate:0.1538 gloss:-17.5299 dloss:2.0174 dlossR:1.9935 dlossQ:0.0239
Episode:15719 meanR:-0.0600 rate:0.0000 gloss:-15.1987 dloss:0.0217 dlossR:0.0000 dlossQ:0.0217
Episode:15720 meanR:-0.0500 rate:0.0000 gloss:-11.2809 dloss:0.0180 dlossR:0.0006 dlossQ:0.0175
Episode:15721 meanR:-0.0600 rate:0.0000 gloss:-10.7120 dloss:0.0208 dlossR:0.0006 dlossQ:0.0202
Episode:15722 meanR:-0.0700 rate:0.0000 gloss:-10.9284 dloss:0.0190 dlossR:0.0005 dlossQ:0.0185
Episode:15723 meanR:-0.0800 rate:0.0000 gloss:-10.7213 dloss:0.0311 dlossR:0.0014 dlossQ:0.0296
Episode:15724 meanR:-0.0700 rate:0.0769 gloss:-7.8304 dloss:0.4926 dlossR:0.4509 dlossQ:0.0417
Episode:15725 meanR:-0.0500 rate:0.1538 gloss:-7.6346 dloss:0.9351 dlossR:0.8878 dlossQ:0.0473
Episode:15726 meanR:-0.0600 rate:0.0000 gloss:-10.4483 dloss:0.0310 dlossR:0.0021 dlossQ:0.0290
Episode:15727 meanR:-0.0600 rate:0.0000 gloss:-12.1973 dloss:0.0231 dlossR:0.0009 dlossQ:0.0222
Episode:15728 meanR:-0.0500 rate:0.0769 gl

Episode:15804 meanR:0.1000 rate:0.0000 gloss:-8.0868 dloss:0.0358 dlossR:0.0033 dlossQ:0.0326
Episode:15805 meanR:0.1000 rate:-0.0769 gloss:-8.1140 dloss:-0.3937 dlossR:-0.4440 dlossQ:0.0503
Episode:15806 meanR:0.1100 rate:0.0769 gloss:-5.6742 dloss:0.4538 dlossR:0.3448 dlossQ:0.1090
Episode:15807 meanR:0.0800 rate:0.0000 gloss:-6.5214 dloss:0.0817 dlossR:0.0110 dlossQ:0.0707
Episode:15808 meanR:0.0800 rate:0.0000 gloss:-7.5133 dloss:0.0687 dlossR:0.0085 dlossQ:0.0602
Episode:15809 meanR:0.0900 rate:0.0000 gloss:-9.6423 dloss:0.0214 dlossR:0.0010 dlossQ:0.0204
Episode:15810 meanR:0.1100 rate:0.0769 gloss:-11.7829 dloss:0.6893 dlossR:0.6682 dlossQ:0.0211
Episode:15811 meanR:0.1100 rate:0.0000 gloss:-8.9126 dloss:0.0378 dlossR:0.0028 dlossQ:0.0350
Episode:15812 meanR:0.1200 rate:0.0769 gloss:-7.8030 dloss:0.4957 dlossR:0.4502 dlossQ:0.0455
Episode:15813 meanR:0.1200 rate:0.0000 gloss:-6.8607 dloss:0.0818 dlossR:0.0105 dlossQ:0.0713
Episode:15814 meanR:0.1000 rate:-0.1538 gloss:-6.4502 dl

Episode:15890 meanR:-0.1900 rate:0.0000 gloss:-20.5035 dloss:0.0223 dlossR:0.0000 dlossQ:0.0223
Episode:15891 meanR:-0.1800 rate:0.1538 gloss:-23.7314 dloss:2.7207 dlossR:2.6926 dlossQ:0.0281
Episode:15892 meanR:-0.1700 rate:0.0769 gloss:-23.7156 dloss:1.3647 dlossR:1.3396 dlossQ:0.0251
Episode:15893 meanR:-0.1700 rate:0.0000 gloss:-24.6085 dloss:0.0271 dlossR:0.0000 dlossQ:0.0271
Episode:15894 meanR:-0.1800 rate:-0.0769 gloss:-22.0409 dloss:-1.2062 dlossR:-1.2341 dlossQ:0.0279
Episode:15895 meanR:-0.1700 rate:0.0769 gloss:-21.7516 dloss:1.2565 dlossR:1.2299 dlossQ:0.0265
Episode:15896 meanR:-0.1800 rate:-0.1538 gloss:-22.2214 dloss:-2.4490 dlossR:-2.4759 dlossQ:0.0269
Episode:15897 meanR:-0.1900 rate:0.0000 gloss:-23.1985 dloss:0.0310 dlossR:0.0000 dlossQ:0.0310
Episode:15898 meanR:-0.2100 rate:-0.0769 gloss:-23.2125 dloss:-1.2676 dlossR:-1.2997 dlossQ:0.0321
Episode:15899 meanR:-0.2100 rate:0.0000 gloss:-23.2178 dloss:0.0333 dlossR:0.0000 dlossQ:0.0333
Episode:15900 meanR:-0.2100 rat

Episode:15974 meanR:-0.2900 rate:0.0000 gloss:-310.8654 dloss:0.3570 dlossR:0.0000 dlossQ:0.3570
Episode:15975 meanR:-0.2900 rate:0.0000 gloss:-260.3756 dloss:0.3799 dlossR:0.0000 dlossQ:0.3799
Episode:15976 meanR:-0.2900 rate:0.0000 gloss:-287.7749 dloss:0.3593 dlossR:0.0000 dlossQ:0.3593
Episode:15977 meanR:-0.2900 rate:0.0000 gloss:-287.4550 dloss:0.3764 dlossR:0.0000 dlossQ:0.3764
Episode:15978 meanR:-0.2800 rate:0.0000 gloss:-277.9135 dloss:0.3646 dlossR:0.0000 dlossQ:0.3646
Episode:15979 meanR:-0.3200 rate:0.0000 gloss:-285.8695 dloss:0.1892 dlossR:0.0000 dlossQ:0.1892
Episode:15980 meanR:-0.3300 rate:-0.0769 gloss:-361.2931 dloss:-19.9165 dlossR:-20.3154 dlossQ:0.3989
Episode:15981 meanR:-0.3200 rate:0.0769 gloss:-307.0757 dloss:17.6554 dlossR:17.3046 dlossQ:0.3507
Episode:15982 meanR:-0.3100 rate:0.0000 gloss:-328.3768 dloss:0.4001 dlossR:0.0000 dlossQ:0.4001
Episode:15983 meanR:-0.3000 rate:0.0769 gloss:-283.0690 dloss:16.1626 dlossR:15.9720 dlossQ:0.1906
Episode:15984 meanR:-

Episode:16059 meanR:0.1900 rate:0.0000 gloss:-59.7639 dloss:0.0744 dlossR:0.0000 dlossQ:0.0744
Episode:16060 meanR:0.1700 rate:0.0000 gloss:-54.4413 dloss:0.0673 dlossR:0.0000 dlossQ:0.0673
Episode:16061 meanR:0.1700 rate:0.0000 gloss:-63.9662 dloss:0.0968 dlossR:0.0000 dlossQ:0.0968
Episode:16062 meanR:0.1700 rate:0.0000 gloss:-67.8596 dloss:0.0736 dlossR:0.0000 dlossQ:0.0736
Episode:16063 meanR:0.1600 rate:-0.0769 gloss:-63.3570 dloss:-3.5080 dlossR:-3.5547 dlossQ:0.0467
Episode:16064 meanR:0.1400 rate:-0.1538 gloss:-65.9059 dloss:-7.3222 dlossR:-7.3797 dlossQ:0.0575
Episode:16065 meanR:0.1500 rate:0.0769 gloss:-53.1725 dloss:3.0878 dlossR:2.9975 dlossQ:0.0903
Episode:16066 meanR:0.1500 rate:0.0000 gloss:-50.3437 dloss:0.0768 dlossR:0.0000 dlossQ:0.0768
Episode:16067 meanR:0.1100 rate:-0.3077 gloss:-34.9171 dloss:-7.7435 dlossR:-7.7748 dlossQ:0.0313
Episode:16068 meanR:0.1300 rate:0.1538 gloss:-32.6543 dloss:3.7533 dlossR:3.7053 dlossQ:0.0480
Episode:16069 meanR:0.1500 rate:0.0769 gl

Episode:16145 meanR:-0.1200 rate:0.0769 gloss:-45.4496 dloss:2.5977 dlossR:2.5668 dlossQ:0.0309
Episode:16146 meanR:-0.1200 rate:0.0000 gloss:-80.4961 dloss:0.0823 dlossR:0.0000 dlossQ:0.0823
Episode:16147 meanR:-0.1200 rate:0.0000 gloss:-78.7329 dloss:0.1177 dlossR:0.0000 dlossQ:0.1177
Episode:16148 meanR:-0.1000 rate:0.0000 gloss:-73.5205 dloss:0.0943 dlossR:0.0000 dlossQ:0.0943
Episode:16149 meanR:-0.0600 rate:0.0000 gloss:-68.3152 dloss:0.0660 dlossR:0.0000 dlossQ:0.0660
Episode:16150 meanR:-0.0600 rate:0.0000 gloss:-54.0802 dloss:0.0297 dlossR:0.0000 dlossQ:0.0297
Episode:16151 meanR:-0.0600 rate:0.0769 gloss:-48.2264 dloss:2.7896 dlossR:2.7194 dlossQ:0.0702
Episode:16152 meanR:-0.0800 rate:-0.0769 gloss:-75.8153 dloss:-4.2007 dlossR:-4.2558 dlossQ:0.0551
Episode:16153 meanR:-0.0600 rate:0.0769 gloss:-81.5360 dloss:4.6442 dlossR:4.5853 dlossQ:0.0589
Episode:16154 meanR:-0.0600 rate:0.0000 gloss:-61.4553 dloss:0.0877 dlossR:0.0000 dlossQ:0.0877
Episode:16155 meanR:-0.0600 rate:0.00

Episode:16231 meanR:0.1000 rate:0.0000 gloss:-26.9012 dloss:0.0640 dlossR:0.0002 dlossQ:0.0638
Episode:16232 meanR:0.1200 rate:0.0769 gloss:-24.2723 dloss:1.4244 dlossR:1.3743 dlossQ:0.0500
Episode:16233 meanR:0.1200 rate:0.0000 gloss:-17.1360 dloss:0.0409 dlossR:0.0001 dlossQ:0.0408
Episode:16234 meanR:0.0900 rate:-0.2308 gloss:-18.0089 dloss:-2.9811 dlossR:-2.9977 dlossQ:0.0166
Episode:16235 meanR:0.1000 rate:0.1538 gloss:-19.4068 dloss:2.2271 dlossR:2.2099 dlossQ:0.0172
Episode:16236 meanR:0.0800 rate:-0.1538 gloss:-18.7474 dloss:-2.0698 dlossR:-2.0881 dlossQ:0.0183
Episode:16237 meanR:0.0900 rate:0.0769 gloss:-19.1833 dloss:1.1102 dlossR:1.0872 dlossQ:0.0230
Episode:16238 meanR:0.0800 rate:-0.0769 gloss:-24.4227 dloss:-1.3024 dlossR:-1.3675 dlossQ:0.0651
Episode:16239 meanR:0.0800 rate:0.0000 gloss:-25.1634 dloss:0.0232 dlossR:0.0000 dlossQ:0.0232
Episode:16240 meanR:0.0900 rate:0.0000 gloss:-43.7084 dloss:0.0904 dlossR:0.0000 dlossQ:0.0904
Episode:16241 meanR:0.0900 rate:0.0000 gl

Episode:16317 meanR:-0.2000 rate:0.0000 gloss:-85.4922 dloss:0.0548 dlossR:0.0000 dlossQ:0.0548
Episode:16318 meanR:-0.2100 rate:0.0000 gloss:-68.3930 dloss:0.0724 dlossR:0.0000 dlossQ:0.0724
Episode:16319 meanR:-0.2100 rate:0.0769 gloss:-64.2978 dloss:3.7229 dlossR:3.6245 dlossQ:0.0984
Episode:16320 meanR:-0.2000 rate:0.0769 gloss:-63.3886 dloss:3.6265 dlossR:3.5723 dlossQ:0.0542
Episode:16321 meanR:-0.2100 rate:0.0000 gloss:-69.0043 dloss:0.0904 dlossR:0.0000 dlossQ:0.0904
Episode:16322 meanR:-0.2200 rate:-0.0769 gloss:-60.9757 dloss:-3.3578 dlossR:-3.4252 dlossQ:0.0674
Episode:16323 meanR:-0.2300 rate:0.0000 gloss:-49.6103 dloss:0.0765 dlossR:0.0000 dlossQ:0.0765
Episode:16324 meanR:-0.2200 rate:0.0000 gloss:-54.8733 dloss:0.0687 dlossR:0.0000 dlossQ:0.0687
Episode:16325 meanR:-0.2100 rate:0.0769 gloss:-48.8504 dloss:2.8089 dlossR:2.7569 dlossQ:0.0520
Episode:16326 meanR:-0.2500 rate:-0.3077 gloss:-56.7304 dloss:-12.5814 dlossR:-12.6776 dlossQ:0.0963
Episode:16327 meanR:-0.2000 rate

Episode:16402 meanR:-0.0200 rate:0.1538 gloss:-52.2071 dloss:5.9408 dlossR:5.8947 dlossQ:0.0460
Episode:16403 meanR:-0.0700 rate:-0.3846 gloss:-45.6232 dloss:-12.6121 dlossR:-12.6750 dlossQ:0.0629
Episode:16404 meanR:-0.0600 rate:0.0000 gloss:-49.5764 dloss:0.0635 dlossR:0.0000 dlossQ:0.0635
Episode:16405 meanR:-0.0400 rate:0.0000 gloss:-49.0579 dloss:0.0589 dlossR:0.0000 dlossQ:0.0589
Episode:16406 meanR:-0.0200 rate:0.0000 gloss:-48.9095 dloss:0.0832 dlossR:0.0000 dlossQ:0.0832
Episode:16407 meanR:-0.0100 rate:0.0000 gloss:-58.0040 dloss:0.1038 dlossR:0.0000 dlossQ:0.1038
Episode:16408 meanR:0.0000 rate:0.0769 gloss:-59.9647 dloss:3.4379 dlossR:3.3771 dlossQ:0.0608
Episode:16409 meanR:0.0300 rate:0.1538 gloss:-59.5697 dloss:6.8028 dlossR:6.7265 dlossQ:0.0763
Episode:16410 meanR:0.0100 rate:0.0000 gloss:-59.2736 dloss:0.1146 dlossR:0.0000 dlossQ:0.1146
Episode:16411 meanR:-0.0100 rate:-0.0769 gloss:-69.6601 dloss:-3.7986 dlossR:-3.9110 dlossQ:0.1124
Episode:16412 meanR:-0.0100 rate:0.

Episode:16487 meanR:-0.1300 rate:-0.2308 gloss:-84.3799 dloss:-14.0209 dlossR:-14.1873 dlossQ:0.1663
Episode:16488 meanR:-0.1400 rate:-0.0769 gloss:-68.1535 dloss:-3.7571 dlossR:-3.8293 dlossQ:0.0722
Episode:16489 meanR:-0.1500 rate:0.0000 gloss:-69.1278 dloss:0.0464 dlossR:0.0000 dlossQ:0.0464
Episode:16490 meanR:-0.1400 rate:0.0000 gloss:-64.1356 dloss:0.0946 dlossR:0.0000 dlossQ:0.0946
Episode:16491 meanR:-0.1300 rate:0.0000 gloss:-91.5954 dloss:0.1180 dlossR:0.0000 dlossQ:0.1180
Episode:16492 meanR:-0.1200 rate:0.0000 gloss:-79.0559 dloss:0.1307 dlossR:0.0000 dlossQ:0.1307
Episode:16493 meanR:-0.1100 rate:0.0000 gloss:-71.5891 dloss:0.0950 dlossR:0.0000 dlossQ:0.0950
Episode:16494 meanR:-0.1100 rate:0.0000 gloss:-71.6995 dloss:0.0801 dlossR:0.0000 dlossQ:0.0801
Episode:16495 meanR:-0.1100 rate:0.0000 gloss:-86.6451 dloss:0.1050 dlossR:0.0000 dlossQ:0.1050
Episode:16496 meanR:-0.1300 rate:-0.0769 gloss:-79.7665 dloss:-4.3746 dlossR:-4.4798 dlossQ:0.1052
Episode:16497 meanR:-0.1200 r

Episode:16573 meanR:-0.0400 rate:0.0769 gloss:-85.1612 dloss:4.9039 dlossR:4.7978 dlossQ:0.1061
Episode:16574 meanR:-0.0400 rate:0.0000 gloss:-74.7087 dloss:0.1016 dlossR:0.0000 dlossQ:0.1016
Episode:16575 meanR:-0.0300 rate:0.0769 gloss:-90.4243 dloss:5.2353 dlossR:5.0888 dlossQ:0.1465
Episode:16576 meanR:-0.0200 rate:0.0769 gloss:-84.4154 dloss:4.8426 dlossR:4.7524 dlossQ:0.0902
Episode:16577 meanR:0.0000 rate:0.1538 gloss:-108.7519 dloss:12.3785 dlossR:12.2526 dlossQ:0.1258
Episode:16578 meanR:0.0100 rate:0.0000 gloss:-77.1789 dloss:0.0990 dlossR:0.0000 dlossQ:0.0990
Episode:16579 meanR:0.0000 rate:-0.0769 gloss:-99.5306 dloss:-5.4402 dlossR:-5.5946 dlossQ:0.1544
Episode:16580 meanR:0.0000 rate:0.0000 gloss:-101.7566 dloss:0.1543 dlossR:0.0000 dlossQ:0.1543
Episode:16581 meanR:0.0200 rate:0.0769 gloss:-63.4018 dloss:3.6444 dlossR:3.5718 dlossQ:0.0725
Episode:16582 meanR:0.0300 rate:-0.0769 gloss:-50.5313 dloss:-2.7801 dlossR:-2.8370 dlossQ:0.0568
Episode:16583 meanR:0.0500 rate:0.15

Episode:16659 meanR:0.1400 rate:0.0000 gloss:-40.0398 dloss:0.0481 dlossR:0.0000 dlossQ:0.0481
Episode:16660 meanR:0.1500 rate:-0.1538 gloss:-58.2147 dloss:-6.4457 dlossR:-6.5226 dlossQ:0.0768
Episode:16661 meanR:0.1600 rate:0.0769 gloss:-50.8888 dloss:2.9300 dlossR:2.8750 dlossQ:0.0551
Episode:16662 meanR:0.1600 rate:0.0000 gloss:-48.4107 dloss:0.0664 dlossR:0.0000 dlossQ:0.0664
Episode:16663 meanR:0.1800 rate:0.0000 gloss:-45.2366 dloss:0.0203 dlossR:0.0000 dlossQ:0.0203
Episode:16664 meanR:0.1800 rate:0.0000 gloss:-27.2754 dloss:0.0204 dlossR:0.0000 dlossQ:0.0204
Episode:16665 meanR:0.1700 rate:-0.0769 gloss:-21.5206 dloss:-1.1707 dlossR:-1.2051 dlossQ:0.0344
Episode:16666 meanR:0.1700 rate:-0.0769 gloss:-28.6667 dloss:-1.5449 dlossR:-1.6115 dlossQ:0.0666
Episode:16667 meanR:0.1700 rate:0.0000 gloss:-30.1852 dloss:0.0530 dlossR:0.0000 dlossQ:0.0530
Episode:16668 meanR:0.1700 rate:0.0000 gloss:-31.2584 dloss:0.0528 dlossR:0.0000 dlossQ:0.0527
Episode:16669 meanR:0.1700 rate:0.0000 gl

Episode:16745 meanR:-0.0600 rate:-0.1538 gloss:-32.8363 dloss:-3.6167 dlossR:-3.6714 dlossQ:0.0548
Episode:16746 meanR:-0.0600 rate:0.0000 gloss:-33.5041 dloss:0.0577 dlossR:0.0000 dlossQ:0.0577
Episode:16747 meanR:-0.0400 rate:0.1538 gloss:-36.4738 dloss:4.1864 dlossR:4.1262 dlossQ:0.0602
Episode:16748 meanR:-0.0300 rate:0.0000 gloss:-40.3951 dloss:0.0814 dlossR:0.0000 dlossQ:0.0814
Episode:16749 meanR:-0.0300 rate:0.0000 gloss:-29.9333 dloss:0.0316 dlossR:0.0000 dlossQ:0.0316
Episode:16750 meanR:-0.0300 rate:0.0769 gloss:-31.9651 dloss:1.8623 dlossR:1.8019 dlossQ:0.0604
Episode:16751 meanR:-0.0300 rate:0.0000 gloss:-34.8791 dloss:0.0444 dlossR:0.0000 dlossQ:0.0444
Episode:16752 meanR:-0.0300 rate:0.0000 gloss:-35.6737 dloss:0.0641 dlossR:0.0000 dlossQ:0.0641
Episode:16753 meanR:-0.0300 rate:0.0000 gloss:-50.2445 dloss:0.0348 dlossR:0.0000 dlossQ:0.0348
Episode:16754 meanR:-0.0300 rate:0.0000 gloss:-37.1330 dloss:0.0221 dlossR:0.0000 dlossQ:0.0221
Episode:16755 meanR:-0.0400 rate:0.07

Episode:16830 meanR:-0.2200 rate:0.0000 gloss:-59.6338 dloss:0.0783 dlossR:0.0000 dlossQ:0.0783
Episode:16831 meanR:-0.1800 rate:0.0769 gloss:-54.9756 dloss:3.2097 dlossR:3.0964 dlossQ:0.1133
Episode:16832 meanR:-0.1500 rate:0.1538 gloss:-38.0582 dloss:4.3729 dlossR:4.3173 dlossQ:0.0556
Episode:16833 meanR:-0.1600 rate:-0.1538 gloss:-37.4456 dloss:-4.1387 dlossR:-4.1895 dlossQ:0.0508
Episode:16834 meanR:-0.1600 rate:0.0000 gloss:-59.5226 dloss:0.0775 dlossR:0.0000 dlossQ:0.0775
Episode:16835 meanR:-0.1600 rate:-0.0769 gloss:-55.4946 dloss:-3.1072 dlossR:-3.1307 dlossQ:0.0235
Episode:16836 meanR:-0.1500 rate:-0.0769 gloss:-58.4148 dloss:-3.1690 dlossR:-3.2984 dlossQ:0.1294
Episode:16837 meanR:-0.1400 rate:0.0000 gloss:-63.2989 dloss:0.0796 dlossR:0.0000 dlossQ:0.0796
Episode:16838 meanR:-0.1000 rate:0.0769 gloss:-55.7979 dloss:3.2290 dlossR:3.1416 dlossQ:0.0874
Episode:16839 meanR:-0.1000 rate:0.0000 gloss:-54.3245 dloss:0.0786 dlossR:0.0000 dlossQ:0.0786
Episode:16840 meanR:-0.1400 rat

Episode:16915 meanR:-0.0700 rate:0.0769 gloss:-127.3683 dloss:7.3863 dlossR:7.1548 dlossQ:0.2315
Episode:16916 meanR:-0.0600 rate:0.0000 gloss:-62.4012 dloss:0.0685 dlossR:0.0000 dlossQ:0.0685
Episode:16917 meanR:-0.0600 rate:-0.0769 gloss:-54.1866 dloss:-2.9926 dlossR:-3.0425 dlossQ:0.0498
Episode:16918 meanR:-0.0600 rate:0.0000 gloss:-59.7997 dloss:0.0411 dlossR:0.0000 dlossQ:0.0411
Episode:16919 meanR:-0.0500 rate:0.0000 gloss:-58.2955 dloss:0.1041 dlossR:0.0000 dlossQ:0.1041
Episode:16920 meanR:-0.0500 rate:0.0000 gloss:-55.0491 dloss:0.1134 dlossR:0.0000 dlossQ:0.1134
Episode:16921 meanR:-0.0300 rate:0.0000 gloss:-53.5569 dloss:0.0466 dlossR:0.0000 dlossQ:0.0466
Episode:16922 meanR:0.0000 rate:0.1538 gloss:-47.5515 dloss:5.4022 dlossR:5.3739 dlossQ:0.0284
Episode:16923 meanR:0.0000 rate:0.0000 gloss:-45.4493 dloss:0.0321 dlossR:0.0000 dlossQ:0.0321
Episode:16924 meanR:-0.0100 rate:0.0000 gloss:-106.4584 dloss:0.1838 dlossR:0.0000 dlossQ:0.1838
Episode:16925 meanR:-0.0100 rate:0.00

Episode:17001 meanR:0.0200 rate:-0.0769 gloss:-50.1116 dloss:-2.7783 dlossR:-2.8223 dlossQ:0.0440
Episode:17002 meanR:0.0100 rate:0.0769 gloss:-28.3272 dloss:1.6407 dlossR:1.6015 dlossQ:0.0392
Episode:17003 meanR:0.0300 rate:0.0000 gloss:-58.1618 dloss:0.0534 dlossR:0.0000 dlossQ:0.0534
Episode:17004 meanR:0.0400 rate:0.0769 gloss:-38.9028 dloss:2.2396 dlossR:2.1989 dlossQ:0.0407
Episode:17005 meanR:0.0900 rate:0.3077 gloss:-38.7206 dloss:8.8897 dlossR:8.8084 dlossQ:0.0813
Episode:17006 meanR:0.0900 rate:0.0000 gloss:-43.4305 dloss:0.0911 dlossR:0.0000 dlossQ:0.0911
Episode:17007 meanR:0.0900 rate:0.0769 gloss:-47.3026 dloss:2.7491 dlossR:2.6746 dlossQ:0.0745
Episode:17008 meanR:0.0800 rate:0.0000 gloss:-37.5314 dloss:0.0420 dlossR:0.0000 dlossQ:0.0420
Episode:17009 meanR:0.0900 rate:0.0769 gloss:-32.6763 dloss:1.8885 dlossR:1.8442 dlossQ:0.0442
Episode:17010 meanR:0.0900 rate:0.0000 gloss:-32.9217 dloss:0.0252 dlossR:0.0000 dlossQ:0.0252
Episode:17011 meanR:0.0700 rate:-0.1538 gloss:-

Episode:17086 meanR:-0.1500 rate:0.0000 gloss:-80.8939 dloss:0.1791 dlossR:0.0000 dlossQ:0.1791
Episode:17087 meanR:-0.1500 rate:-0.0769 gloss:-65.5633 dloss:-3.5999 dlossR:-3.6771 dlossQ:0.0772
Episode:17088 meanR:-0.1400 rate:-0.1538 gloss:-66.6200 dloss:-7.3666 dlossR:-7.4680 dlossQ:0.1014
Episode:17089 meanR:-0.1500 rate:-0.0769 gloss:-85.1583 dloss:-4.7033 dlossR:-4.7993 dlossQ:0.0959
Episode:17090 meanR:-0.1600 rate:-0.0769 gloss:-74.7820 dloss:-4.1631 dlossR:-4.2024 dlossQ:0.0393
Episode:17091 meanR:-0.1500 rate:0.0000 gloss:-42.9467 dloss:0.0425 dlossR:0.0000 dlossQ:0.0425
Episode:17092 meanR:-0.1500 rate:0.0000 gloss:-66.1047 dloss:0.0881 dlossR:0.0000 dlossQ:0.0881
Episode:17093 meanR:-0.1700 rate:0.0000 gloss:-76.4853 dloss:0.0849 dlossR:0.0000 dlossQ:0.0849
Episode:17094 meanR:-0.1900 rate:-0.1538 gloss:-78.3505 dloss:-8.7348 dlossR:-8.7763 dlossQ:0.0415
Episode:17095 meanR:-0.1900 rate:0.0000 gloss:-80.7696 dloss:0.0486 dlossR:0.0000 dlossQ:0.0486
Episode:17096 meanR:-0.20

Episode:17171 meanR:-0.2900 rate:0.0000 gloss:-171.3446 dloss:0.1373 dlossR:0.0000 dlossQ:0.1373
Episode:17172 meanR:-0.3300 rate:-0.2308 gloss:-150.1889 dloss:-25.2467 dlossR:-25.3422 dlossQ:0.0954
Episode:17173 meanR:-0.3100 rate:0.0000 gloss:-162.9586 dloss:0.2143 dlossR:0.0000 dlossQ:0.2143
Episode:17174 meanR:-0.2800 rate:0.0769 gloss:-175.2652 dloss:10.0685 dlossR:9.8859 dlossQ:0.1826
Episode:17175 meanR:-0.2700 rate:0.0000 gloss:-121.3330 dloss:0.1294 dlossR:0.0000 dlossQ:0.1294
Episode:17176 meanR:-0.2700 rate:0.0000 gloss:-132.9282 dloss:0.1509 dlossR:0.0000 dlossQ:0.1509
Episode:17177 meanR:-0.3000 rate:0.0000 gloss:-145.6970 dloss:0.1955 dlossR:0.0000 dlossQ:0.1955
Episode:17178 meanR:-0.3100 rate:-0.1538 gloss:-139.9938 dloss:-15.5661 dlossR:-15.7462 dlossQ:0.1801
Episode:17179 meanR:-0.3000 rate:0.0000 gloss:-174.3366 dloss:0.2264 dlossR:0.0000 dlossQ:0.2264
Episode:17180 meanR:-0.2900 rate:0.0000 gloss:-134.6828 dloss:0.1544 dlossR:0.0000 dlossQ:0.1544
Episode:17181 meanR

Episode:17255 meanR:0.0100 rate:-0.1538 gloss:-131.0128 dloss:-14.7025 dlossR:-14.7771 dlossQ:0.0746
Episode:17256 meanR:0.0100 rate:0.0000 gloss:-101.5026 dloss:0.1466 dlossR:0.0000 dlossQ:0.1466
Episode:17257 meanR:0.0000 rate:-0.0769 gloss:-110.8887 dloss:-6.1579 dlossR:-6.2646 dlossQ:0.1067
Episode:17258 meanR:-0.0100 rate:-0.0769 gloss:-181.7953 dloss:-10.0141 dlossR:-10.2426 dlossQ:0.2285
Episode:17259 meanR:-0.0300 rate:-0.1538 gloss:-137.6801 dloss:-15.3987 dlossR:-15.4827 dlossQ:0.0841
Episode:17260 meanR:-0.0400 rate:0.0000 gloss:-107.6315 dloss:0.1132 dlossR:0.0000 dlossQ:0.1132
Episode:17261 meanR:-0.0500 rate:-0.0769 gloss:-222.1206 dloss:-12.1700 dlossR:-12.4869 dlossQ:0.3168
Episode:17262 meanR:-0.0400 rate:0.0000 gloss:-137.6064 dloss:0.1476 dlossR:0.0000 dlossQ:0.1476
Episode:17263 meanR:-0.0400 rate:0.0000 gloss:-170.6732 dloss:0.2101 dlossR:0.0000 dlossQ:0.2101
Episode:17264 meanR:-0.0400 rate:0.0000 gloss:-157.0228 dloss:0.1135 dlossR:0.0000 dlossQ:0.1135
Episode:17

Episode:17340 meanR:0.0000 rate:0.0000 gloss:-101.8031 dloss:0.0710 dlossR:0.0000 dlossQ:0.0710
Episode:17341 meanR:-0.0200 rate:-0.1538 gloss:-133.3046 dloss:-14.8095 dlossR:-14.9525 dlossQ:0.1430
Episode:17342 meanR:-0.0300 rate:-0.0769 gloss:-124.8865 dloss:-6.8948 dlossR:-7.0095 dlossQ:0.1146
Episode:17343 meanR:0.0000 rate:0.2308 gloss:-118.3898 dloss:20.2329 dlossR:20.0224 dlossQ:0.2105
Episode:17344 meanR:0.0000 rate:0.0000 gloss:-138.7069 dloss:0.2468 dlossR:0.0000 dlossQ:0.2468
Episode:17345 meanR:-0.0200 rate:-0.1538 gloss:-123.2072 dloss:-13.5959 dlossR:-13.8702 dlossQ:0.2744
Episode:17346 meanR:-0.0200 rate:0.0000 gloss:-114.8554 dloss:0.0800 dlossR:0.0000 dlossQ:0.0800
Episode:17347 meanR:-0.0200 rate:0.0000 gloss:-160.0276 dloss:0.1800 dlossR:0.0000 dlossQ:0.1800
Episode:17348 meanR:-0.0300 rate:0.0000 gloss:-104.3926 dloss:0.1284 dlossR:0.0000 dlossQ:0.1284
Episode:17349 meanR:-0.0400 rate:0.0000 gloss:-99.2995 dloss:0.0793 dlossR:0.0000 dlossQ:0.0793
Episode:17350 meanR

Episode:17425 meanR:-0.0700 rate:0.0000 gloss:-79.2155 dloss:0.0994 dlossR:0.0000 dlossQ:0.0994
Episode:17426 meanR:-0.0600 rate:0.0000 gloss:-100.8462 dloss:0.1110 dlossR:0.0000 dlossQ:0.1110
Episode:17427 meanR:-0.0600 rate:0.0000 gloss:-120.3324 dloss:0.1031 dlossR:0.0000 dlossQ:0.1031
Episode:17428 meanR:-0.0500 rate:0.0000 gloss:-86.4581 dloss:0.1029 dlossR:0.0000 dlossQ:0.1029
Episode:17429 meanR:-0.0400 rate:0.0000 gloss:-109.4176 dloss:0.2397 dlossR:0.0000 dlossQ:0.2397
Episode:17430 meanR:-0.0400 rate:0.0000 gloss:-147.3278 dloss:0.2702 dlossR:0.0000 dlossQ:0.2702
Episode:17431 meanR:-0.0600 rate:-0.1538 gloss:-127.2657 dloss:-14.1872 dlossR:-14.3093 dlossQ:0.1221
Episode:17432 meanR:-0.0500 rate:0.0769 gloss:-114.8551 dloss:6.5807 dlossR:6.4803 dlossQ:0.1004
Episode:17433 meanR:-0.0500 rate:0.0000 gloss:-139.8807 dloss:0.3093 dlossR:0.0000 dlossQ:0.3093
Episode:17434 meanR:-0.0600 rate:-0.0769 gloss:-121.9003 dloss:-6.7813 dlossR:-6.8540 dlossQ:0.0727
Episode:17435 meanR:-0.0

Episode:17509 meanR:-0.0400 rate:0.1538 gloss:-95.7601 dloss:10.9216 dlossR:10.7946 dlossQ:0.1270
Episode:17510 meanR:-0.0400 rate:0.0769 gloss:-76.1080 dloss:4.3607 dlossR:4.2809 dlossQ:0.0798
Episode:17511 meanR:-0.0300 rate:0.0769 gloss:-113.4296 dloss:6.5454 dlossR:6.3746 dlossQ:0.1707
Episode:17512 meanR:-0.0200 rate:0.0000 gloss:-113.8079 dloss:0.1522 dlossR:0.0000 dlossQ:0.1522
Episode:17513 meanR:-0.0200 rate:0.0000 gloss:-127.5207 dloss:0.2840 dlossR:0.0000 dlossQ:0.2840
Episode:17514 meanR:-0.0200 rate:0.0000 gloss:-246.3746 dloss:0.4321 dlossR:0.0000 dlossQ:0.4321
Episode:17515 meanR:-0.0300 rate:0.0000 gloss:-93.7859 dloss:0.1267 dlossR:0.0000 dlossQ:0.1267
Episode:17516 meanR:-0.0300 rate:0.0000 gloss:-119.6103 dloss:0.1957 dlossR:0.0000 dlossQ:0.1957
Episode:17517 meanR:-0.0200 rate:0.0000 gloss:-181.3807 dloss:0.2314 dlossR:0.0000 dlossQ:0.2314
Episode:17518 meanR:-0.0100 rate:0.0769 gloss:-133.0040 dloss:7.6225 dlossR:7.4757 dlossQ:0.1468
Episode:17519 meanR:-0.0200 rat

Episode:17594 meanR:0.1400 rate:0.1538 gloss:-83.1045 dloss:9.4301 dlossR:9.3773 dlossQ:0.0528
Episode:17595 meanR:0.1400 rate:0.0000 gloss:-47.8421 dloss:0.0535 dlossR:0.0000 dlossQ:0.0535
Episode:17596 meanR:0.1500 rate:0.0000 gloss:-71.1113 dloss:0.0783 dlossR:0.0000 dlossQ:0.0783
Episode:17597 meanR:0.1400 rate:0.0000 gloss:-135.8910 dloss:0.2366 dlossR:0.0000 dlossQ:0.2366
Episode:17598 meanR:0.1400 rate:0.0000 gloss:-90.3777 dloss:0.0420 dlossR:0.0000 dlossQ:0.0420
Episode:17599 meanR:0.1500 rate:0.0769 gloss:-156.3610 dloss:9.0204 dlossR:8.7953 dlossQ:0.2251
Episode:17600 meanR:0.1500 rate:-0.0769 gloss:-73.6872 dloss:-4.0817 dlossR:-4.1331 dlossQ:0.0514
Episode:17601 meanR:0.1700 rate:0.1538 gloss:-39.2525 dloss:4.4888 dlossR:4.4436 dlossQ:0.0452
Episode:17602 meanR:0.1600 rate:0.0000 gloss:-43.2921 dloss:0.0515 dlossR:0.0000 dlossQ:0.0515
Episode:17603 meanR:0.1600 rate:0.1538 gloss:-56.7940 dloss:6.4606 dlossR:6.4039 dlossQ:0.0567
Episode:17604 meanR:0.2000 rate:0.1538 gloss:

Episode:17680 meanR:0.4900 rate:0.0000 gloss:-7.1785 dloss:0.1209 dlossR:0.0151 dlossQ:0.1059
Episode:17681 meanR:0.5000 rate:0.0769 gloss:-10.5677 dloss:0.6514 dlossR:0.6030 dlossQ:0.0484
Episode:17682 meanR:0.4900 rate:0.0000 gloss:-6.2515 dloss:0.1125 dlossR:0.0147 dlossQ:0.0978
Episode:17683 meanR:0.4900 rate:-0.0769 gloss:-7.2342 dloss:-0.3022 dlossR:-0.3881 dlossQ:0.0860
Episode:17684 meanR:0.5000 rate:0.0000 gloss:-8.9857 dloss:0.0583 dlossR:0.0079 dlossQ:0.0504
Episode:17685 meanR:0.5000 rate:0.0769 gloss:-11.1106 dloss:0.6722 dlossR:0.6344 dlossQ:0.0378
Episode:17686 meanR:0.5100 rate:0.0769 gloss:-7.9118 dloss:0.5068 dlossR:0.4564 dlossQ:0.0504
Episode:17687 meanR:0.5000 rate:-0.0769 gloss:-13.4828 dloss:-0.7268 dlossR:-0.7499 dlossQ:0.0231
Episode:17688 meanR:0.5000 rate:0.0000 gloss:-13.9131 dloss:0.0299 dlossR:0.0020 dlossQ:0.0280
Episode:17689 meanR:0.4800 rate:-0.1538 gloss:-15.3502 dloss:-1.6653 dlossR:-1.6966 dlossQ:0.0313
Episode:17690 meanR:0.4800 rate:0.0000 gloss:-

Episode:17766 meanR:0.4100 rate:0.0000 gloss:-8.6767 dloss:0.2268 dlossR:0.0349 dlossQ:0.1919
Episode:17767 meanR:0.4000 rate:0.0000 gloss:-9.4317 dloss:0.0454 dlossR:0.0039 dlossQ:0.0415
Episode:17768 meanR:0.4100 rate:0.1538 gloss:-5.8748 dloss:0.8682 dlossR:0.7135 dlossQ:0.1547
Episode:17769 meanR:0.4000 rate:0.0000 gloss:-7.0121 dloss:0.1054 dlossR:0.0150 dlossQ:0.0904
Episode:17770 meanR:0.3900 rate:-0.0769 gloss:-9.7247 dloss:-0.5102 dlossR:-0.5385 dlossQ:0.0283
Episode:17771 meanR:0.3700 rate:0.0000 gloss:-7.5916 dloss:0.0544 dlossR:0.0060 dlossQ:0.0484
Episode:17772 meanR:0.3200 rate:-0.0769 gloss:-10.4747 dloss:-0.5221 dlossR:-0.5893 dlossQ:0.0672
Episode:17773 meanR:0.3100 rate:0.0000 gloss:-7.4130 dloss:0.0712 dlossR:0.0078 dlossQ:0.0635
Episode:17774 meanR:0.2900 rate:0.1538 gloss:-16.3506 dloss:2.0251 dlossR:1.8851 dlossQ:0.1400
Episode:17775 meanR:0.3100 rate:0.1538 gloss:-7.9189 dloss:1.0197 dlossR:0.9371 dlossQ:0.0826
Episode:17776 meanR:0.3100 rate:0.0000 gloss:-13.546

Episode:17853 meanR:0.4700 rate:-0.0769 gloss:-3.7370 dloss:0.4275 dlossR:-0.0303 dlossQ:0.4578
Episode:17854 meanR:0.4800 rate:0.0769 gloss:-4.2105 dloss:0.7063 dlossR:0.3507 dlossQ:0.3556
Episode:17855 meanR:0.5000 rate:0.1538 gloss:-3.3174 dloss:0.8459 dlossR:0.4949 dlossQ:0.3509
Episode:17856 meanR:0.5100 rate:0.0769 gloss:-4.3664 dloss:0.5133 dlossR:0.2993 dlossQ:0.2141
Episode:17857 meanR:0.5000 rate:-0.0769 gloss:-6.6425 dloss:-0.1044 dlossR:-0.3176 dlossQ:0.2132
Episode:17858 meanR:0.5000 rate:0.0769 gloss:-4.8870 dloss:0.5719 dlossR:0.3320 dlossQ:0.2400
Episode:17859 meanR:0.4800 rate:-0.1538 gloss:-4.4260 dloss:-0.2407 dlossR:-0.4248 dlossQ:0.1840
Episode:17860 meanR:0.4700 rate:-0.0769 gloss:-7.0530 dloss:-0.2650 dlossR:-0.3827 dlossQ:0.1177
Episode:17861 meanR:0.4600 rate:0.0000 gloss:-7.3874 dloss:0.1644 dlossR:0.0199 dlossQ:0.1445
Episode:17862 meanR:0.4800 rate:0.0769 gloss:-6.8159 dloss:0.4732 dlossR:0.3996 dlossQ:0.0735
Episode:17863 meanR:0.4900 rate:0.0000 gloss:-7.3

Episode:17940 meanR:0.4500 rate:-0.1538 gloss:-9.0537 dloss:-0.8496 dlossR:-0.9803 dlossQ:0.1306
Episode:17941 meanR:0.4500 rate:0.0000 gloss:-6.7931 dloss:0.1276 dlossR:0.0182 dlossQ:0.1094
Episode:17942 meanR:0.4400 rate:-0.0769 gloss:-11.2875 dloss:-0.3869 dlossR:-0.6027 dlossQ:0.2159
Episode:17943 meanR:0.4100 rate:-0.0769 gloss:-9.4332 dloss:-0.3276 dlossR:-0.5057 dlossQ:0.1782
Episode:17944 meanR:0.4000 rate:-0.0769 gloss:-4.5644 dloss:0.1509 dlossR:-0.1633 dlossQ:0.3143
Episode:17945 meanR:0.3500 rate:-0.1538 gloss:-9.5423 dloss:-0.6374 dlossR:-0.9715 dlossQ:0.3341
Episode:17946 meanR:0.3800 rate:0.2308 gloss:-3.7181 dloss:1.1040 dlossR:0.7653 dlossQ:0.3387
Episode:17947 meanR:0.3800 rate:0.0000 gloss:-7.5554 dloss:0.2769 dlossR:0.0464 dlossQ:0.2305
Episode:17948 meanR:0.3700 rate:0.0769 gloss:-5.8423 dloss:0.5665 dlossR:0.3769 dlossQ:0.1896
Episode:17949 meanR:0.3700 rate:0.0000 gloss:-4.7559 dloss:0.2248 dlossR:0.0445 dlossQ:0.1803
Episode:17950 meanR:0.3400 rate:0.0000 gloss:

Episode:18027 meanR:0.5700 rate:0.1538 gloss:-3.0686 dloss:0.9987 dlossR:0.5073 dlossQ:0.4914
Episode:18028 meanR:0.5600 rate:0.0000 gloss:-7.0370 dloss:0.6125 dlossR:0.1136 dlossQ:0.4989
Episode:18029 meanR:0.5600 rate:-0.0769 gloss:-4.5255 dloss:0.2976 dlossR:-0.1476 dlossQ:0.4452
Episode:18030 meanR:0.5400 rate:0.1538 gloss:-4.0479 dloss:0.8607 dlossR:0.5524 dlossQ:0.3082
Episode:18031 meanR:0.5200 rate:-0.0769 gloss:-5.4360 dloss:0.1515 dlossR:-0.2132 dlossQ:0.3647
Episode:18032 meanR:0.5100 rate:0.0000 gloss:-3.9404 dloss:0.3915 dlossR:0.0798 dlossQ:0.3117
Episode:18033 meanR:0.4900 rate:0.0000 gloss:-4.3931 dloss:0.2683 dlossR:0.0500 dlossQ:0.2182
Episode:18034 meanR:0.4900 rate:0.0000 gloss:-4.0198 dloss:0.3725 dlossR:0.0813 dlossQ:0.2912
Episode:18035 meanR:0.4900 rate:0.0000 gloss:-4.9086 dloss:0.3035 dlossR:0.0556 dlossQ:0.2480
Episode:18036 meanR:0.4900 rate:0.0000 gloss:-4.7510 dloss:0.2013 dlossR:0.0356 dlossQ:0.1658
Episode:18037 meanR:0.5000 rate:0.0769 gloss:-5.0908 dlo

Episode:18113 meanR:0.1600 rate:-0.0769 gloss:-12.1529 dloss:-0.5416 dlossR:-0.6601 dlossQ:0.1185
Episode:18114 meanR:0.1700 rate:0.0769 gloss:-8.4879 dloss:0.5722 dlossR:0.4929 dlossQ:0.0793
Episode:18115 meanR:0.1700 rate:0.0000 gloss:-8.4011 dloss:0.1606 dlossR:0.0208 dlossQ:0.1398
Episode:18116 meanR:0.1600 rate:-0.0769 gloss:-8.3628 dloss:-0.3261 dlossR:-0.4449 dlossQ:0.1187
Episode:18117 meanR:0.1800 rate:0.1538 gloss:-4.7745 dloss:1.0177 dlossR:0.6549 dlossQ:0.3628
Episode:18118 meanR:0.1900 rate:0.0769 gloss:-8.4158 dloss:0.7445 dlossR:0.5260 dlossQ:0.2185
Episode:18119 meanR:0.1900 rate:-0.0769 gloss:-11.2597 dloss:-0.4588 dlossR:-0.5982 dlossQ:0.1394
Episode:18120 meanR:0.1600 rate:-0.0769 gloss:-9.5804 dloss:-0.2943 dlossR:-0.4886 dlossQ:0.1943
Episode:18121 meanR:0.1300 rate:-0.1538 gloss:-8.4313 dloss:-0.7771 dlossR:-0.8992 dlossQ:0.1221
Episode:18122 meanR:0.1000 rate:0.0000 gloss:-12.7640 dloss:0.1054 dlossR:0.0125 dlossQ:0.0929
Episode:18123 meanR:0.1200 rate:0.3077 glo

Episode:18199 meanR:0.2000 rate:0.0769 gloss:-12.0883 dloss:0.9083 dlossR:0.7247 dlossQ:0.1836
Episode:18200 meanR:0.1900 rate:0.0000 gloss:-5.8974 dloss:0.2377 dlossR:0.0443 dlossQ:0.1935
Episode:18201 meanR:0.2000 rate:0.0000 gloss:-5.3917 dloss:0.2118 dlossR:0.0389 dlossQ:0.1729
Episode:18202 meanR:0.2000 rate:0.0000 gloss:-6.5646 dloss:0.4580 dlossR:0.1248 dlossQ:0.3333
Episode:18203 meanR:0.2000 rate:0.0769 gloss:-5.2106 dloss:0.7440 dlossR:0.4012 dlossQ:0.3428
Episode:18204 meanR:0.2000 rate:-0.0769 gloss:-5.5808 dloss:-0.0594 dlossR:-0.2595 dlossQ:0.2001
Episode:18205 meanR:0.2100 rate:-0.0769 gloss:-7.2935 dloss:-0.1544 dlossR:-0.3569 dlossQ:0.2024
Episode:18206 meanR:0.1900 rate:-0.0769 gloss:-10.0171 dloss:-0.4476 dlossR:-0.5468 dlossQ:0.0992
Episode:18207 meanR:0.1900 rate:0.0000 gloss:-24.1756 dloss:0.0612 dlossR:0.0075 dlossQ:0.0537
Episode:18208 meanR:0.1800 rate:0.0769 gloss:-14.7202 dloss:0.9680 dlossR:0.8406 dlossQ:0.1274
Episode:18209 meanR:0.1600 rate:-0.1538 gloss:-

Episode:18286 meanR:0.1900 rate:0.1538 gloss:-7.7940 dloss:1.0668 dlossR:0.9231 dlossQ:0.1438
Episode:18287 meanR:0.1800 rate:0.0000 gloss:-6.0219 dloss:0.1534 dlossR:0.0205 dlossQ:0.1329
Episode:18288 meanR:0.1600 rate:0.0000 gloss:-11.5805 dloss:0.1061 dlossR:0.0063 dlossQ:0.0998
Episode:18289 meanR:0.1600 rate:0.0000 gloss:-8.0327 dloss:0.0547 dlossR:0.0060 dlossQ:0.0487
Episode:18290 meanR:0.1400 rate:0.0000 gloss:-12.3500 dloss:0.0535 dlossR:0.0039 dlossQ:0.0496
Episode:18291 meanR:0.1300 rate:0.0000 gloss:-3.9672 dloss:0.3329 dlossR:0.0700 dlossQ:0.2629
Episode:18292 meanR:0.1200 rate:-0.0769 gloss:-15.0319 dloss:-0.6966 dlossR:-0.8208 dlossQ:0.1242
Episode:18293 meanR:0.1300 rate:0.0769 gloss:-18.6515 dloss:1.2263 dlossR:1.0769 dlossQ:0.1495
Episode:18294 meanR:0.1300 rate:0.0000 gloss:-4.9339 dloss:0.3430 dlossR:0.0616 dlossQ:0.2813
Episode:18295 meanR:0.1900 rate:0.4615 gloss:-2.2379 dloss:1.8788 dlossR:1.2082 dlossQ:0.6706
Episode:18296 meanR:0.2300 rate:0.3077 gloss:-2.0338 

Episode:18373 meanR:1.0400 rate:0.0000 gloss:-4.2323 dloss:0.3847 dlossR:0.0758 dlossQ:0.3089
Episode:18374 meanR:1.0400 rate:0.0000 gloss:-4.5288 dloss:0.3159 dlossR:0.0615 dlossQ:0.2544
Episode:18375 meanR:1.0300 rate:-0.0769 gloss:-5.0580 dloss:0.1475 dlossR:-0.2027 dlossQ:0.3501
Episode:18376 meanR:1.0300 rate:0.0000 gloss:-5.9375 dloss:0.3559 dlossR:0.0589 dlossQ:0.2970
Episode:18377 meanR:1.0300 rate:0.0000 gloss:-4.3178 dloss:0.3976 dlossR:0.0816 dlossQ:0.3160
Episode:18378 meanR:1.0400 rate:0.0000 gloss:-4.3519 dloss:0.3557 dlossR:0.0748 dlossQ:0.2809
Episode:18379 meanR:1.0400 rate:0.0000 gloss:-7.2875 dloss:0.4966 dlossR:0.0903 dlossQ:0.4063
Episode:18380 meanR:1.0500 rate:0.0769 gloss:-3.9011 dloss:0.6414 dlossR:0.3059 dlossQ:0.3355
Episode:18381 meanR:1.0600 rate:0.1538 gloss:-3.1726 dloss:0.9003 dlossR:0.4966 dlossQ:0.4036
Episode:18382 meanR:1.1000 rate:0.2308 gloss:-4.0292 dloss:1.3791 dlossR:0.8727 dlossQ:0.5063
Episode:18383 meanR:1.0800 rate:0.0769 gloss:-3.0588 dloss

Episode:18460 meanR:0.4000 rate:-0.1538 gloss:-5.0477 dloss:-0.3723 dlossR:-0.5107 dlossQ:0.1384
Episode:18461 meanR:0.4200 rate:0.1538 gloss:-4.4987 dloss:0.8399 dlossR:0.5884 dlossQ:0.2515
Episode:18462 meanR:0.4200 rate:0.0000 gloss:-4.8749 dloss:0.2670 dlossR:0.0476 dlossQ:0.2194
Episode:18463 meanR:0.4100 rate:0.0769 gloss:-4.0367 dloss:0.5025 dlossR:0.2836 dlossQ:0.2188
Episode:18464 meanR:0.4200 rate:0.0769 gloss:-4.0406 dloss:0.4982 dlossR:0.2836 dlossQ:0.2146
Episode:18465 meanR:0.4100 rate:0.0769 gloss:-3.8841 dloss:0.5263 dlossR:0.2832 dlossQ:0.2431
Episode:18466 meanR:0.4200 rate:0.0000 gloss:-4.0545 dloss:0.2784 dlossR:0.0548 dlossQ:0.2235
Episode:18467 meanR:0.4400 rate:0.0769 gloss:-3.7233 dloss:0.5285 dlossR:0.2779 dlossQ:0.2505
Episode:18468 meanR:0.4100 rate:0.0769 gloss:-4.3804 dloss:0.5597 dlossR:0.3053 dlossQ:0.2545
Episode:18469 meanR:0.3900 rate:-0.1538 gloss:-4.5445 dloss:-0.1348 dlossR:-0.4149 dlossQ:0.2801
Episode:18470 meanR:0.3500 rate:0.0769 gloss:-5.9732 d

Episode:18547 meanR:0.0700 rate:-0.2308 gloss:-10.8298 dloss:-1.7497 dlossR:-1.7702 dlossQ:0.0205
Episode:18548 meanR:0.0700 rate:-0.0769 gloss:-9.5978 dloss:-0.4983 dlossR:-0.5305 dlossQ:0.0322
Episode:18549 meanR:0.0700 rate:0.0000 gloss:-8.9417 dloss:0.0374 dlossR:0.0033 dlossQ:0.0342
Episode:18550 meanR:0.0600 rate:-0.0769 gloss:-10.7853 dloss:-0.5764 dlossR:-0.5993 dlossQ:0.0228
Episode:18551 meanR:0.0700 rate:0.0769 gloss:-10.1430 dloss:0.6090 dlossR:0.5775 dlossQ:0.0315
Episode:18552 meanR:0.0600 rate:-0.1538 gloss:-12.0263 dloss:-1.3052 dlossR:-1.3258 dlossQ:0.0206
Episode:18553 meanR:0.0700 rate:0.0769 gloss:-10.1880 dloss:0.5970 dlossR:0.5794 dlossQ:0.0176
Episode:18554 meanR:0.0500 rate:-0.0769 gloss:-12.8461 dloss:-0.6948 dlossR:-0.7153 dlossQ:0.0205
Episode:18555 meanR:0.0600 rate:0.1538 gloss:-10.0192 dloss:1.1810 dlossR:1.1521 dlossQ:0.0288
Episode:18556 meanR:0.0600 rate:0.0000 gloss:-11.0911 dloss:0.0187 dlossR:0.0006 dlossQ:0.0181
Episode:18557 meanR:0.0500 rate:0.000

Episode:18634 meanR:0.2100 rate:0.2308 gloss:-3.9050 dloss:1.0960 dlossR:0.7860 dlossQ:0.3100
Episode:18635 meanR:0.2700 rate:0.3846 gloss:-3.1320 dloss:1.5235 dlossR:1.1429 dlossQ:0.3806
Episode:18636 meanR:0.2500 rate:-0.1538 gloss:-5.4100 dloss:-0.2787 dlossR:-0.5330 dlossQ:0.2543
Episode:18637 meanR:0.2600 rate:0.0769 gloss:-3.7904 dloss:0.6337 dlossR:0.3055 dlossQ:0.3281
Episode:18638 meanR:0.2700 rate:0.1538 gloss:-3.9847 dloss:0.9107 dlossR:0.5551 dlossQ:0.3556
Episode:18639 meanR:0.2800 rate:0.0769 gloss:-3.8690 dloss:0.6867 dlossR:0.3183 dlossQ:0.3683
Episode:18640 meanR:0.3200 rate:0.0000 gloss:-3.8565 dloss:0.3885 dlossR:0.0843 dlossQ:0.3042
Episode:18641 meanR:0.3600 rate:0.3846 gloss:-2.9970 dloss:1.5667 dlossR:1.1139 dlossQ:0.4527
Episode:18642 meanR:0.3600 rate:0.0000 gloss:-3.3109 dloss:0.5251 dlossR:0.1221 dlossQ:0.4030
Episode:18643 meanR:0.4100 rate:0.2308 gloss:-2.9710 dloss:1.1365 dlossR:0.6857 dlossQ:0.4509
Episode:18644 meanR:0.4100 rate:0.0000 gloss:-3.6116 dlos

Episode:18721 meanR:0.8200 rate:0.0000 gloss:-9.7429 dloss:0.0213 dlossR:0.0013 dlossQ:0.0200
Episode:18722 meanR:0.8100 rate:-0.0769 gloss:-9.4332 dloss:-0.4998 dlossR:-0.5232 dlossQ:0.0233
Episode:18723 meanR:0.8200 rate:0.2308 gloss:-8.9032 dloss:1.5866 dlossR:1.5580 dlossQ:0.0287
Episode:18724 meanR:0.8000 rate:-0.0769 gloss:-10.2181 dloss:-0.5405 dlossR:-0.5665 dlossQ:0.0260
Episode:18725 meanR:0.7800 rate:0.0000 gloss:-12.1117 dloss:0.0235 dlossR:0.0012 dlossQ:0.0223
Episode:18726 meanR:0.7700 rate:-0.0769 gloss:-14.4217 dloss:-0.7424 dlossR:-0.8031 dlossQ:0.0607
Episode:18727 meanR:0.7700 rate:0.0000 gloss:-11.6961 dloss:0.0260 dlossR:0.0015 dlossQ:0.0246
Episode:18728 meanR:0.7500 rate:0.0000 gloss:-9.9853 dloss:0.0274 dlossR:0.0019 dlossQ:0.0255
Episode:18729 meanR:0.7300 rate:-0.0769 gloss:-10.0549 dloss:-0.5335 dlossR:-0.5579 dlossQ:0.0244
Episode:18730 meanR:0.7200 rate:-0.0769 gloss:-11.5249 dloss:-0.5833 dlossR:-0.6406 dlossQ:0.0573
Episode:18731 meanR:0.7100 rate:0.0000 

Episode:18807 meanR:0.1400 rate:0.0000 gloss:-7.0112 dloss:0.1972 dlossR:0.0310 dlossQ:0.1661
Episode:18808 meanR:0.1600 rate:0.1538 gloss:-4.0312 dloss:0.8396 dlossR:0.5459 dlossQ:0.2937
Episode:18809 meanR:0.1700 rate:0.0000 gloss:-6.1346 dloss:0.1596 dlossR:0.0251 dlossQ:0.1346
Episode:18810 meanR:0.1600 rate:0.0000 gloss:-9.0319 dloss:0.0600 dlossR:0.0054 dlossQ:0.0546
Episode:18811 meanR:0.1600 rate:0.0000 gloss:-7.7235 dloss:0.0751 dlossR:0.0089 dlossQ:0.0662
Episode:18812 meanR:0.1700 rate:0.0769 gloss:-4.0082 dloss:0.5415 dlossR:0.2923 dlossQ:0.2492
Episode:18813 meanR:0.1800 rate:0.0000 gloss:-7.0938 dloss:0.2764 dlossR:0.0552 dlossQ:0.2212
Episode:18814 meanR:0.1800 rate:0.0000 gloss:-11.0990 dloss:0.1155 dlossR:0.0146 dlossQ:0.1009
Episode:18815 meanR:0.2000 rate:0.0769 gloss:-7.3994 dloss:0.6641 dlossR:0.4661 dlossQ:0.1979
Episode:18816 meanR:0.2300 rate:0.3846 gloss:-5.7429 dloss:2.0980 dlossR:1.8223 dlossQ:0.2758
Episode:18817 meanR:0.2500 rate:0.0769 gloss:-5.1952 dloss:

Episode:18894 meanR:0.6300 rate:0.0000 gloss:-3.3588 dloss:0.4515 dlossR:0.1060 dlossQ:0.3455
Episode:18895 meanR:0.6700 rate:0.0769 gloss:-3.1215 dloss:0.5927 dlossR:0.2731 dlossQ:0.3195
Episode:18896 meanR:0.6800 rate:0.0769 gloss:-3.3176 dloss:0.5746 dlossR:0.2753 dlossQ:0.2992
Episode:18897 meanR:0.6700 rate:-0.1538 gloss:-3.9777 dloss:-0.1194 dlossR:-0.3544 dlossQ:0.2350
Episode:18898 meanR:0.6500 rate:0.0000 gloss:-3.9206 dloss:0.3038 dlossR:0.0628 dlossQ:0.2410
Episode:18899 meanR:0.6700 rate:0.5385 gloss:-3.4253 dloss:2.0302 dlossR:1.6969 dlossQ:0.3332
Episode:18900 meanR:0.6800 rate:0.0000 gloss:-4.0543 dloss:0.2933 dlossR:0.0578 dlossQ:0.2355
Episode:18901 meanR:0.6800 rate:0.0000 gloss:-4.0269 dloss:0.2752 dlossR:0.0549 dlossQ:0.2203
Episode:18902 meanR:0.7000 rate:0.1538 gloss:-4.8669 dloss:0.8512 dlossR:0.6170 dlossQ:0.2342
Episode:18903 meanR:0.7100 rate:0.0000 gloss:-4.6289 dloss:0.1885 dlossR:0.0337 dlossQ:0.1548
Episode:18904 meanR:0.7200 rate:0.0000 gloss:-10.6978 dlo

Episode:18981 meanR:0.1400 rate:-0.0769 gloss:-11.3656 dloss:-0.6030 dlossR:-0.6332 dlossQ:0.0303
Episode:18982 meanR:0.1100 rate:-0.0769 gloss:-13.2309 dloss:-0.7197 dlossR:-0.7367 dlossQ:0.0170
Episode:18983 meanR:0.0900 rate:-0.0769 gloss:-10.1892 dloss:-0.5469 dlossR:-0.5681 dlossQ:0.0211
Episode:18984 meanR:0.0600 rate:-0.0769 gloss:-8.6283 dloss:-0.4476 dlossR:-0.4767 dlossQ:0.0292
Episode:18985 meanR:0.0600 rate:0.0000 gloss:-8.8855 dloss:0.0250 dlossR:0.0017 dlossQ:0.0234
Episode:18986 meanR:0.1000 rate:0.3846 gloss:-8.4386 dloss:2.5576 dlossR:2.5235 dlossQ:0.0341
Episode:18987 meanR:0.1000 rate:0.0000 gloss:-8.7208 dloss:0.0307 dlossR:0.0024 dlossQ:0.0283
Episode:18988 meanR:0.1300 rate:0.3077 gloss:-7.9899 dloss:1.9407 dlossR:1.8957 dlossQ:0.0450
Episode:18989 meanR:0.1500 rate:0.0000 gloss:-8.2412 dloss:0.0311 dlossR:0.0025 dlossQ:0.0286
Episode:18990 meanR:0.1700 rate:0.2308 gloss:-7.7180 dloss:1.4198 dlossR:1.3612 dlossQ:0.0586
Episode:18991 meanR:0.1800 rate:0.0769 gloss:

Episode:19068 meanR:0.4000 rate:0.0769 gloss:-9.8104 dloss:0.6968 dlossR:0.5677 dlossQ:0.1291
Episode:19069 meanR:0.4000 rate:0.0000 gloss:-8.6061 dloss:0.1295 dlossR:0.0132 dlossQ:0.1163
Episode:19070 meanR:0.4100 rate:0.1538 gloss:-6.1440 dloss:0.8987 dlossR:0.7396 dlossQ:0.1591
Episode:19071 meanR:0.3800 rate:-0.2308 gloss:-7.3109 dloss:-0.9843 dlossR:-1.1489 dlossQ:0.1647
Episode:19072 meanR:0.4200 rate:0.2308 gloss:-4.7523 dloss:1.0511 dlossR:0.8861 dlossQ:0.1650
Episode:19073 meanR:0.4200 rate:0.0000 gloss:-5.7740 dloss:0.1555 dlossR:0.0248 dlossQ:0.1307
Episode:19074 meanR:0.4000 rate:-0.1538 gloss:-7.0562 dloss:-0.6516 dlossR:-0.7510 dlossQ:0.0995
Episode:19075 meanR:0.4200 rate:0.1538 gloss:-4.9574 dloss:0.8108 dlossR:0.6181 dlossQ:0.1926
Episode:19076 meanR:0.3800 rate:-0.0769 gloss:-9.8224 dloss:-0.4303 dlossR:-0.5302 dlossQ:0.1000
Episode:19077 meanR:0.3400 rate:-0.2308 gloss:-12.9210 dloss:-2.0372 dlossR:-2.1117 dlossQ:0.0745
Episode:19078 meanR:0.3200 rate:-0.0769 gloss:-

Episode:19155 meanR:0.2500 rate:0.0000 gloss:-8.1361 dloss:0.0385 dlossR:0.0037 dlossQ:0.0348
Episode:19156 meanR:0.2400 rate:0.0000 gloss:-7.7615 dloss:0.0413 dlossR:0.0043 dlossQ:0.0370
Episode:19157 meanR:0.2200 rate:0.0000 gloss:-8.4032 dloss:0.0299 dlossR:0.0024 dlossQ:0.0275
Episode:19158 meanR:0.2300 rate:0.0769 gloss:-10.0219 dloss:0.5954 dlossR:0.5703 dlossQ:0.0251
Episode:19159 meanR:0.2300 rate:0.0000 gloss:-13.3419 dloss:0.0225 dlossR:0.0012 dlossQ:0.0214
Episode:19160 meanR:0.2100 rate:0.0000 gloss:-10.1424 dloss:0.0204 dlossR:0.0008 dlossQ:0.0197
Episode:19161 meanR:0.1900 rate:-0.0769 gloss:-10.8741 dloss:-0.5724 dlossR:-0.6024 dlossQ:0.0300
Episode:19162 meanR:0.2000 rate:0.0769 gloss:-8.3737 dloss:0.5313 dlossR:0.4825 dlossQ:0.0487
Episode:19163 meanR:0.2100 rate:0.0769 gloss:-7.3569 dloss:0.4758 dlossR:0.4277 dlossQ:0.0481
Episode:19164 meanR:0.2300 rate:0.2308 gloss:-8.8862 dloss:1.5950 dlossR:1.5587 dlossQ:0.0363
Episode:19165 meanR:0.2200 rate:0.1538 gloss:-8.2448 

Episode:19242 meanR:0.1700 rate:0.0000 gloss:-5.2837 dloss:0.1945 dlossR:0.0337 dlossQ:0.1608
Episode:19243 meanR:0.1900 rate:0.0000 gloss:-4.0928 dloss:0.2861 dlossR:0.0591 dlossQ:0.2270
Episode:19244 meanR:0.1800 rate:-0.0769 gloss:-7.1371 dloss:-0.2394 dlossR:-0.3720 dlossQ:0.1327
Episode:19245 meanR:0.2000 rate:0.0000 gloss:-5.6336 dloss:0.2139 dlossR:0.0373 dlossQ:0.1766
Episode:19246 meanR:0.2100 rate:-0.0769 gloss:-8.1922 dloss:-0.3089 dlossR:-0.4346 dlossQ:0.1257
Episode:19247 meanR:0.1800 rate:0.0769 gloss:-10.9183 dloss:0.7016 dlossR:0.6267 dlossQ:0.0749
Episode:19248 meanR:0.2100 rate:0.0769 gloss:-16.5014 dloss:1.1106 dlossR:0.9538 dlossQ:0.1568
Episode:19249 meanR:0.1800 rate:-0.1538 gloss:-5.6065 dloss:-0.4743 dlossR:-0.6054 dlossQ:0.1311
Episode:19250 meanR:0.2000 rate:0.0769 gloss:-5.1557 dloss:0.4620 dlossR:0.3225 dlossQ:0.1395
Episode:19251 meanR:0.2400 rate:0.1538 gloss:-5.4027 dloss:0.7857 dlossR:0.6587 dlossQ:0.1271
Episode:19252 meanR:0.2500 rate:0.2308 gloss:-5.5

Episode:19329 meanR:0.5400 rate:0.0769 gloss:-6.5625 dloss:0.4461 dlossR:0.3834 dlossQ:0.0627
Episode:19330 meanR:0.5300 rate:-0.0769 gloss:-6.9617 dloss:-0.3232 dlossR:-0.3775 dlossQ:0.0543
Episode:19331 meanR:0.5600 rate:0.1538 gloss:-6.6032 dloss:0.8383 dlossR:0.7742 dlossQ:0.0641
Episode:19332 meanR:0.5600 rate:0.0000 gloss:-9.5693 dloss:0.0482 dlossR:0.0043 dlossQ:0.0439
Episode:19333 meanR:0.5500 rate:0.0000 gloss:-9.3294 dloss:0.0512 dlossR:0.0044 dlossQ:0.0468
Episode:19334 meanR:0.5300 rate:0.0000 gloss:-9.7003 dloss:0.0467 dlossR:0.0041 dlossQ:0.0427
Episode:19335 meanR:0.5200 rate:0.0000 gloss:-9.6622 dloss:0.0644 dlossR:0.0048 dlossQ:0.0596
Episode:19336 meanR:0.5500 rate:0.0769 gloss:-6.9659 dloss:0.4562 dlossR:0.4041 dlossQ:0.0521
Episode:19337 meanR:0.5600 rate:0.0769 gloss:-7.5497 dloss:0.4798 dlossR:0.4353 dlossQ:0.0444
Episode:19338 meanR:0.5900 rate:0.1538 gloss:-6.2294 dloss:0.8162 dlossR:0.7357 dlossQ:0.0805
Episode:19339 meanR:0.5900 rate:-0.1538 gloss:-7.6464 dlo

Episode:19415 meanR:-0.3100 rate:0.6154 gloss:-51.6669 dloss:23.6453 dlossR:23.5977 dlossQ:0.0476
Episode:19416 meanR:-0.3400 rate:0.0000 gloss:-39.7035 dloss:0.0479 dlossR:0.0000 dlossQ:0.0479
Episode:19417 meanR:-0.3400 rate:0.0000 gloss:-36.2571 dloss:0.0414 dlossR:0.0000 dlossQ:0.0414
Episode:19418 meanR:-0.3100 rate:0.0769 gloss:-29.2722 dloss:1.6998 dlossR:1.6646 dlossQ:0.0351
Episode:19419 meanR:-0.3000 rate:0.0000 gloss:-56.2653 dloss:0.0871 dlossR:0.0000 dlossQ:0.0871
Episode:19420 meanR:-0.2900 rate:0.0769 gloss:-30.8549 dloss:1.7807 dlossR:1.7473 dlossQ:0.0334
Episode:19421 meanR:-0.2900 rate:-0.0769 gloss:-37.8909 dloss:-2.0753 dlossR:-2.1278 dlossQ:0.0525
Episode:19422 meanR:-0.2900 rate:0.0000 gloss:-31.8024 dloss:0.0340 dlossR:0.0000 dlossQ:0.0340
Episode:19423 meanR:-0.3200 rate:-0.2308 gloss:-32.6268 dloss:-5.4072 dlossR:-5.4465 dlossQ:0.0394
Episode:19424 meanR:-0.3200 rate:0.0000 gloss:-33.3300 dloss:0.0324 dlossR:0.0000 dlossQ:0.0324
Episode:19425 meanR:-0.3400 rate

Episode:19500 meanR:-0.0100 rate:0.0000 gloss:-32.3521 dloss:0.0385 dlossR:0.0000 dlossQ:0.0385
Episode:19501 meanR:0.0100 rate:0.0000 gloss:-28.6140 dloss:0.0309 dlossR:0.0000 dlossQ:0.0309
Episode:19502 meanR:0.0200 rate:0.0000 gloss:-25.6377 dloss:0.0223 dlossR:0.0000 dlossQ:0.0223
Episode:19503 meanR:0.0000 rate:-0.0769 gloss:-24.5893 dloss:-1.3510 dlossR:-1.3767 dlossQ:0.0258
Episode:19504 meanR:0.0000 rate:-0.1538 gloss:-27.4174 dloss:-3.0045 dlossR:-3.0582 dlossQ:0.0537
Episode:19505 meanR:-0.0100 rate:0.0000 gloss:-27.4372 dloss:0.0799 dlossR:0.0000 dlossQ:0.0799
Episode:19506 meanR:0.0100 rate:0.0000 gloss:-38.0764 dloss:0.1172 dlossR:0.0000 dlossQ:0.1172
Episode:19507 meanR:0.0000 rate:0.0000 gloss:-30.7327 dloss:0.0444 dlossR:0.0000 dlossQ:0.0444
Episode:19508 meanR:0.0300 rate:0.0000 gloss:-33.7850 dloss:0.0782 dlossR:0.0000 dlossQ:0.0782
Episode:19509 meanR:0.0300 rate:0.0000 gloss:-59.4121 dloss:0.1115 dlossR:0.0000 dlossQ:0.1115
Episode:19510 meanR:0.0200 rate:-0.0769 gl

Episode:19586 meanR:0.2100 rate:0.0000 gloss:-15.6339 dloss:0.0425 dlossR:0.0045 dlossQ:0.0380
Episode:19587 meanR:0.2400 rate:0.2308 gloss:-13.0570 dloss:2.3633 dlossR:2.2923 dlossQ:0.0711
Episode:19588 meanR:0.2400 rate:0.1538 gloss:-14.9465 dloss:1.8286 dlossR:1.7182 dlossQ:0.1104
Episode:19589 meanR:0.2500 rate:0.2308 gloss:-9.0882 dloss:1.7114 dlossR:1.6005 dlossQ:0.1109
Episode:19590 meanR:0.2400 rate:-0.0769 gloss:-11.7791 dloss:-0.6066 dlossR:-0.6493 dlossQ:0.0428
Episode:19591 meanR:0.2200 rate:-0.0769 gloss:-13.2903 dloss:-0.6939 dlossR:-0.7354 dlossQ:0.0415
Episode:19592 meanR:0.2200 rate:0.0000 gloss:-21.0651 dloss:0.1177 dlossR:0.0056 dlossQ:0.1122
Episode:19593 meanR:0.2400 rate:0.0000 gloss:-34.4981 dloss:0.0697 dlossR:0.0018 dlossQ:0.0679
Episode:19594 meanR:0.2500 rate:0.0000 gloss:-8.0518 dloss:0.7175 dlossR:0.1211 dlossQ:0.5964
Episode:19595 meanR:0.2700 rate:0.2308 gloss:-13.3775 dloss:2.5603 dlossR:2.3491 dlossQ:0.2112
Episode:19596 meanR:0.2800 rate:0.0769 gloss:-

Episode:19672 meanR:0.3400 rate:0.0000 gloss:-6.0251 dloss:0.1836 dlossR:0.0338 dlossQ:0.1497
Episode:19673 meanR:0.3600 rate:0.0769 gloss:-4.9684 dloss:0.4794 dlossR:0.3174 dlossQ:0.1620
Episode:19674 meanR:0.3600 rate:-0.0769 gloss:-5.3690 dloss:-0.1169 dlossR:-0.2602 dlossQ:0.1433
Episode:19675 meanR:0.3600 rate:0.0000 gloss:-5.0518 dloss:0.2047 dlossR:0.0380 dlossQ:0.1668
Episode:19676 meanR:0.3900 rate:-0.0769 gloss:-6.2042 dloss:-0.1373 dlossR:-0.3048 dlossQ:0.1675
Episode:19677 meanR:0.3700 rate:-0.0769 gloss:-9.7300 dloss:-0.3230 dlossR:-0.5050 dlossQ:0.1819
Episode:19678 meanR:0.3500 rate:-0.0769 gloss:-14.8001 dloss:-0.5411 dlossR:-0.8025 dlossQ:0.2614
Episode:19679 meanR:0.3600 rate:0.0000 gloss:-7.2020 dloss:0.2233 dlossR:0.0355 dlossQ:0.1878
Episode:19680 meanR:0.3500 rate:-0.0769 gloss:-6.8417 dloss:-0.2020 dlossR:-0.3507 dlossQ:0.1487
Episode:19681 meanR:0.3400 rate:-0.0769 gloss:-7.9675 dloss:-0.2423 dlossR:-0.4099 dlossQ:0.1676
Episode:19682 meanR:0.3700 rate:0.2308 gl

Episode:19759 meanR:0.5000 rate:-0.0769 gloss:-5.0678 dloss:-0.0946 dlossR:-0.2452 dlossQ:0.1506
Episode:19760 meanR:0.4800 rate:-0.0769 gloss:-8.0553 dloss:-0.2714 dlossR:-0.4165 dlossQ:0.1451
Episode:19761 meanR:0.4200 rate:-0.1538 gloss:-9.5678 dloss:-0.8643 dlossR:-1.0208 dlossQ:0.1565
Episode:19762 meanR:0.4000 rate:-0.0769 gloss:-6.5080 dloss:-0.2609 dlossR:-0.3457 dlossQ:0.0848
Episode:19763 meanR:0.3600 rate:-0.0769 gloss:-17.3291 dloss:-0.7906 dlossR:-0.9535 dlossQ:0.1629
Episode:19764 meanR:0.3600 rate:0.0000 gloss:-8.2081 dloss:0.1785 dlossR:0.0157 dlossQ:0.1629
Episode:19765 meanR:0.3600 rate:0.0769 gloss:-5.4657 dloss:0.4677 dlossR:0.3362 dlossQ:0.1315
Episode:19766 meanR:0.3200 rate:-0.2308 gloss:-10.0255 dloss:-1.5482 dlossR:-1.6249 dlossQ:0.0767
Episode:19767 meanR:0.3300 rate:0.0000 gloss:-13.7417 dloss:0.0914 dlossR:0.0110 dlossQ:0.0804
Episode:19768 meanR:0.3100 rate:-0.2308 gloss:-10.3247 dloss:-1.6271 dlossR:-1.6820 dlossQ:0.0550
Episode:19769 meanR:0.2900 rate:-0.

In [None]:
%matplotlib inline
import matplotlib.pyplot as plt

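def running_mean(x, N):
    # Moving average over a window of N samples, computed from a cumulative sum.
    # The result has len(x) - N + 1 entries.
    cumsum = np.cumsum(np.insert(x, 0, 0))
    return (cumsum[N:] - cumsum[:-N]) / N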
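
As a quick sanity check of the cumulative-sum trick used in `running_mean`, the sketch below applies it to a small made-up array and compares the result with `np.convolve` in `'valid'` mode. The sample values and the window size of 3 are illustrative assumptions, not data from this training run.

```python
import numpy as np

def running_mean(x, N):
    # Same definition as in the notebook: windowed mean via a cumulative sum.
    cumsum = np.cumsum(np.insert(x, 0, 0))
    return (cumsum[N:] - cumsum[:-N]) / N

# Illustrative input (assumption, not taken from the training log).
values = np.array([1.0, 2.0, 3.0, 4.0, 5.0])

smoothed = running_mean(values, 3)

# Reference result: a plain moving average computed by convolution.
reference = np.convolve(values, np.ones(3) / 3, mode='valid')

print(smoothed)                         # [2. 3. 4.]
print(np.allclose(smoothed, reference)) # True
```

The output has `len(x) - N + 1` entries, which is why the plotting cells slice the episode axis with `eps[-len(smoothed_arr):]`.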

In [None]:
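# rewards_list is assumed to hold (episode, total_reward) pairs collected during training.
eps, arr = np.array(rewards_list).T
smoothed_arr = running_mean(arr, 10)
# The smoothed series is shorter than the raw one, so align it with the last episodes.
plt.plot(eps[-len(smoothed_arr):], smoothed_arr)
plt.plot(eps, arr, color='grey', alpha=0.3)
plt.xlabel('Episode')
plt.ylabel('Total rewards')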
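
The unpacking `eps, arr = np.array(rewards_list).T` only works if every entry of `rewards_list` is a pair. A minimal sketch of that layout, assuming hypothetical `(episode, total_reward)` tuples (the real list is filled by the training loop):

```python
import numpy as np

# Hypothetical (episode, total_reward) pairs, for illustration only.
rewards_list = [(0, 1.0), (1, 0.0), (2, 2.0), (3, -1.0)]

# Stacking the pairs gives an array of shape (4, 2); transposing it yields
# one row of episode indices and one row of rewards, which unpack cleanly.
eps, arr = np.array(rewards_list).T

print(eps)  # episode indices (stored as floats after stacking)
print(arr)  # per-episode total rewards
```

The same layout applies to `loss_list` in the next cell, under the same assumption that it stores one `(episode, loss)` pair per episode.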

In [None]:
# Same plot as above, applied to the per-episode losses stored in loss_list.
eps, arr = np.array(loss_list).T
smoothed_arr = running_mean(arr, 10)
plt.plot(eps[-len(smoothed_arr):], smoothed_arr)
plt.plot(eps, arr, color='grey', alpha=0.3)
plt.xlabel('Episode')
plt.ylabel('Average losses')

In [33]:
# # import gym
# # # env = gym.make('CartPole-v0')
# # env = gym.make('CartPole-v1')
# # # env = gym.make('Acrobot-v1')
# # # env = gym.make('MountainCar-v0')
# # # env = gym.make('Pendulum-v0')
# # # env = gym.make('Blackjack-v0')
# # # env = gym.make('FrozenLake-v0')
# # # env = gym.make('AirRaid-ram-v0')
# # # env = gym.make('AirRaid-v0')
# # # env = gym.make('BipedalWalker-v2')
# # # env = gym.make('Copy-v0')
# # # env = gym.make('CarRacing-v0')
# # # env = gym.make('Ant-v2') #mujoco
# # # env = gym.make('FetchPickAndPlace-v1') # mujoco required!

# with tf.Session() as sess:
#     #sess.run(tf.global_variables_initializer())
#     saver.restore(sess, 'checkpoints/model-nav.ckpt')    
#     #saver.restore(sess, tf.train.latest_checkpoint('checkpoints'))
    
#     # Episodes/epochs
#     for _ in range(1):
#         state = env.reset()
#         total_reward = 0

#         # Steps/batches
#         #for _ in range(111111111111111111):
#         while True:
#             env.render()
#             action_logits = sess.run(model.actions_logits, feed_dict={model.states: np.reshape(state, [1, -1])})
#             action = np.argmax(action_logits)
#             state, reward, done, _ = env.step(action)
#             total_reward += reward
#             if done:
#                 break
                
#         # Closing the env
#         print('total_reward: {:.2f}'.format(total_reward))
#         env.close()