# [DEV][PPO] Crawler

---

In this notebook, you will learn how to use the Unity ML-Agents environment for the second project of the [Deep Reinforcement Learning Nanodegree](https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893) program.

### 1. Start the Environment

We begin by importing the necessary packages.  If the code cell below returns an error, please revisit the project instructions to double-check that you have installed [Unity ML-Agents](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Installation.md) and [NumPy](http://www.numpy.org/).

In [8]:
from unityagents import UnityEnvironment
import torch
import numpy as np

# widget bar to display progress
#!pip install progressbar
import progressbar as pb

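Next, we will start the environment!  **_Before running the code cell below_**, change the `file_name` parameter to match the location of the Unity environment that you downloaded.  The paths listed here are for the Reacher environment; this notebook loads the Crawler build (`Crawler.app` in the code cell further down), so substitute the corresponding Crawler file for your platform.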

- **Mac**: `"path/to/Reacher.app"`
- **Windows** (x86): `"path/to/Reacher_Windows_x86/Reacher.exe"`
- **Windows** (x86_64): `"path/to/Reacher_Windows_x86_64/Reacher.exe"`
- **Linux** (x86): `"path/to/Reacher_Linux/Reacher.x86"`
- **Linux** (x86_64): `"path/to/Reacher_Linux/Reacher.x86_64"`
- **Linux** (x86, headless): `"path/to/Reacher_Linux_NoVis/Reacher.x86"`
- **Linux** (x86_64, headless): `"path/to/Reacher_Linux_NoVis/Reacher.x86_64"`

For instance, if you are using a Mac, then you downloaded `Reacher.app`.  If this file is in the same folder as the notebook, then the line below should appear as follows:
```
env = UnityEnvironment(file_name="Reacher.app")
```

In [9]:
env = UnityEnvironment(file_name='Crawler.app', worker_id=101,  no_graphics=True)
#env = UnityEnvironment(file_name='Reacher.app')

INFO:unityagents:
'Academy' started successfully!
Unity Academy name: Academy
        Number of Brains: 1
        Number of External Brains : 1
        Lesson number : 0
        Reset Parameters :
		
Unity brain name: CrawlerBrain
        Number of Visual Observations (per agent): 0
        Vector Observation space type: continuous
        Vector Observation space size (per agent): 129
        Number of stacked Vector Observation: 1
        Vector Action space type: continuous
        Vector Action space size (per agent): 20
        Vector Action descriptions: , , , , , , , , , , , , , , , , , , , 


Environments contain **_brains_** which are responsible for deciding the actions of their associated agents. Here we check for the first brain available, and set it as the default brain we will be controlling from Python.

### 2. Examine the State and Action Spaces

In this environment, `12` crawler agents run in parallel, all controlled by the `CrawlerBrain` brain. The training loop later in this notebook treats the environment as solved once the score, averaged over all agents and over the last 100 episodes, reaches `100`.

The observation space consists of `129` variables per agent.  Each action is a vector with `20` numbers.  Every entry in the action vector must be a number between `-1` and `1`.

Run the code cell below to print some information about the environment.

In [10]:
# get the default brain
brain_name = env.brain_names[0]
brain = env.brains[brain_name]

# reset the environment
env_info = env.reset(train_mode=True)[brain_name]

# name of brain
print('Name of brain:', brain_name)

# number of agents
num_agents = len(env_info.agents)
print('Number of agents:', num_agents)

# size of each action
action_size = brain.vector_action_space_size
print('Size of each action:', action_size)

# examine the state space 
states = env_info.vector_observations
state_size = states.shape[1]
print('There are {} agents. Each observes a state with length: {}'.format(states.shape[0], state_size))
print('The state for the first agent looks like:', states[0])

Name of brain: CrawlerBrain
Number of agents: 12
Size of each action: 20
There are 12 agents. Each observes a state with length: 129
The state for the first agent looks like: [ 0.00000000e+00  0.00000000e+00  0.00000000e+00  2.25000000e+00
  1.00000000e+00  0.00000000e+00  1.78813934e-07  0.00000000e+00
  1.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  6.06093168e-01 -1.42857209e-01 -6.06078804e-01  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  1.33339906e+00 -1.42857209e-01
 -1.33341408e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e+00  0.00000000e+00
  0.00000000e+00  0.00000000e+00  0.00000000e

### 3. Take Random Actions in the Environment

In the next code cell, you will learn how to use the Python API to control the agent and receive feedback from the environment.

When this cell is executed, the agents select an action at random at each time step.  Because the environment above was started with `no_graphics=True`, no window will appear; remove that argument if you want a window to pop up so that you can watch the agents as they move through the environment.

Of course, as part of the project, you'll have to change the code so that the agent is able to use its experience to gradually choose better actions when interacting with the environment!
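
As a rough illustration of what "choosing better actions" can look like for a continuous action space, here is a minimal sketch of a Gaussian policy in PyTorch. It is not the `PPO_Agent` used later in this notebook (that source is not shown here); the class name `GaussianPolicy`, the hidden size, and the zero-filled placeholder states are assumptions made for the example. Only the sizes (12 agents, 129 observations, 20 actions) come from the environment summary above.

```python
import torch
import torch.nn as nn
from torch.distributions import Normal


class GaussianPolicy(nn.Module):
    """Hypothetical actor: maps a batch of states to a Normal distribution over actions."""

    def __init__(self, state_size, action_size, hidden_size=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(state_size, hidden_size),
            nn.Tanh(),
            nn.Linear(hidden_size, action_size),
            nn.Tanh(),  # keeps the mean action inside [-1, 1]
        )
        # state-independent log standard deviation, learned along with the network
        self.log_std = nn.Parameter(torch.zeros(action_size))

    def forward(self, states):
        mean = self.body(states)
        return Normal(mean, self.log_std.exp())


torch.manual_seed(0)
policy = GaussianPolicy(state_size=129, action_size=20)
states = torch.zeros(12, 129)                # placeholder batch: 12 agents, 129 observations each
dist = policy(states)
actions = dist.sample()                      # shape (12, 20)
log_probs = dist.log_prob(actions).sum(-1)   # one log-probability per agent, shape (12,)
env_actions = actions.clamp(-1, 1).numpy()   # clip before passing to env.step
```

Replacing `np.random.randn` in the loop below with samples from such a policy is one way to make the actions depend on the observed state; training then adjusts the network so that actions with higher return become more likely.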

In [11]:
env_info = env.reset(train_mode=False)[brain_name]     # reset the environment    
states = env_info.vector_observations                  # get the current state (for each agent)
scores = np.zeros(num_agents)                          # initialize the score (for each agent)
while True:
    actions = np.random.randn(num_agents, action_size) # select an action (for each agent)
    actions = np.clip(actions, -1, 1)                  # all actions between -1 and 1
    env_info = env.step(actions)[brain_name]           # send all actions to the environment
    next_states = env_info.vector_observations         # get next state (for each agent)
    rewards = env_info.rewards                         # get reward (for each agent)
    dones = env_info.local_done                        # see if episode finished
    scores += env_info.rewards                         # update the score (for each agent)
    states = next_states                               # roll over states to next time step
    if np.any(dones):                                  # exit loop if episode finished
        break
print('Total score (averaged over agents) this episode: {}'.format(np.mean(scores)))
#env.close()

Total score (averaged over agents) this episode: 0.49039648139538866


### 4. It's Your Turn!

Now it's your turn to train your own agent to solve the environment!  When training the environment, set `train_mode=True`, so that the line for resetting the environment looks like the following:
```python
env_info = env.reset(train_mode=True)[brain_name]
```

In [12]:
def saveTrainedModel(agent, path):
    # store the local model's weights under the 'model' key
    state_dicts = {'model': agent.model_local.state_dict()}
    torch.save(state_dicts, path)
    
def loadTrainedModel(agent, path):
    # remap tensors saved on 'cuda:0' to the CPU so the checkpoint loads without a GPU
    state_dicts = torch.load(path, map_location={'cuda:0': 'cpu'})
    agent.model_local.load_state_dict(state_dicts['model'])
    return agent
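
A quick way to check a save/load pair like this is a round trip through a temporary file. The sketch below is self-contained, so it re-declares the two helpers under snake_case names and uses a hypothetical `DummyAgent` whose only purpose is to expose a `model_local` attribute; the real `PPO_Agent` is assumed to provide that attribute, since the helpers above rely on it.

```python
import os
import tempfile

import torch
import torch.nn as nn


class DummyAgent:
    """Hypothetical stand-in exposing the `model_local` attribute the helpers expect."""

    def __init__(self):
        self.model_local = nn.Linear(129, 20)


def save_trained_model(agent, path):
    # store only the weights, keyed by 'model'
    torch.save({'model': agent.model_local.state_dict()}, path)


def load_trained_model(agent, path):
    # map every saved tensor to the CPU, whatever device it was saved from
    state_dicts = torch.load(path, map_location='cpu')
    agent.model_local.load_state_dict(state_dicts['model'])
    return agent


trained, fresh = DummyAgent(), DummyAgent()
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, 'checkpoint.pt')
    save_trained_model(trained, path)
    fresh = load_trained_model(fresh, path)

weights_match = torch.equal(trained.model_local.weight, fresh.model_local.weight)
print('Weights match after reload:', weights_match)
```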

In [13]:
from PPO_agent import PPO_Agent
#import PPO_util 

model_dir = 'saved_models/'
model_name = 'unity_continuous_' + str(brain_name) + '_' + str(num_agents) + '_agents.pt'

agent = PPO_Agent(env, state_size, action_size, num_agents=num_agents, seed=1234)
#agent = loadTrainedModel(agent, model_dir+model_name)

current device:  cpu
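
The internals of `PPO_Agent` are not shown in this notebook. For orientation, the following is a minimal sketch of the clipped surrogate objective that PPO is generally built around, assuming the common default `epsilon=0.2`; the function name and the toy tensors are made up for the example and are not taken from `PPO_agent.py`.

```python
import torch


def clipped_surrogate(new_log_probs, old_log_probs, advantages, epsilon=0.2):
    """PPO clipped objective (to be maximized), averaged over the batch."""
    # probability ratio between the updated policy and the one that collected the data
    ratio = (new_log_probs - old_log_probs).exp()
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - epsilon, 1.0 + epsilon) * advantages
    # the element-wise minimum removes any incentive to push the ratio outside the clip range
    return torch.min(unclipped, clipped).mean()


# toy batch: ratios of 1.0, 1.5, 0.5 and 1.1 with mixed-sign advantages
old_log_probs = torch.zeros(4)
new_log_probs = torch.log(torch.tensor([1.0, 1.5, 0.5, 1.1]))
advantages = torch.tensor([1.0, 1.0, -1.0, -1.0])
objective = clipped_surrogate(new_log_probs, old_log_probs, advantages)
```

In a training step, the negative of this value would typically be minimized with a gradient-based optimizer, alongside a separate value-function (critic) loss.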


In [None]:
episode_max = 50000 # training loop max iterations
episode_reward = 0.0
mean_rewards = []
e = 0

widget = ['training loop: ', pb.Percentage(), ' ', 
          pb.Bar(), ' ', pb.ETA() ]

#widget = ['Episode: ', pb.Counter(),'/',str(episode_max),'  ',  
#          'eps reward: ', str(np.mean(episode_reward)) ,'  ',
#          'Avg score (100e): ', str(mean_rewards[-100:]) ,'  ',
#          'actor gain: ', str(np.mean(agent.actor_gain)) ,'  ',
#          'critic loss: ', str(np.mean(agent.critic_loss)) ,'  ',
#           pb.ETA(), ' ', pb.Bar(marker=pb.RotatingMarker()), '  ' ]

timer = pb.ProgressBar(widgets=widget, maxval=episode_max).start()


while e < episode_max:

    # collect trajectories
    agent.step()
    episode_reward = agent.episodic_rewards
    
    # once training has started, log progress every iteration
    if agent.is_training:

        # get the average reward of the parallel environments
        mean_rewards.append(np.mean(episode_reward))        
        
        if (e+1)%1==0 :
            print("Episode: {}   score: {:.2f}   Avg score (100e): {:.2f}   "
                  "actor gain: {:.2f}   critic loss: {:.2f}   steps: {}".format(e+1, np.mean(episode_reward),
                                                                         np.mean(mean_rewards[-100:]),
                                                                         np.mean(agent.actor_gain), 
                                                                         np.mean(agent.critic_loss),
                                                                         agent.t_step))
            
        if np.mean(mean_rewards[-100:]) >= 100:
            print("Average score over all agents across the last 100 episodes >= 100. Problem solved!")
            break
                
        timer.update(e)
        
        e += 1
    else:
        print('\rFetching experiences... {} '.format(len(agent.memory.memory)), end="")
        
    #update progress widget bar
    #timer.update(e+1)
    
timer.finish()

training loop:   0% |                                          | ETA:  --:--:--


Prefetch completed. Training starts! 
Number of Agents:  12
Device:  cpu


training loop:   0% |                                          | ETA:  --:--:--

Episode: 1   score: 2.09   Avg score (100e): 2.09   actor gain: -0.49   critic loss: 1.08   steps: 1


training loop:   0% |                                 | ETA:  45 days, 21:05:49

Episode: 2   score: 2.02   Avg score (100e): 2.06   actor gain: -0.49   critic loss: 0.95   steps: 2


training loop:   0% |                                  | ETA:  35 days, 7:48:22

Episode: 3   score: 2.12   Avg score (100e): 2.08   actor gain: -0.50   critic loss: 0.86   steps: 3


training loop:   0% |                                 | ETA:  32 days, 14:35:58

Episode: 4   score: 1.97   Avg score (100e): 2.05   actor gain: -0.49   critic loss: 0.81   steps: 4


training loop:   0% |                                 | ETA:  31 days, 15:25:59

Episode: 5   score: 2.03   Avg score (100e): 2.05   actor gain: -0.49   critic loss: 0.76   steps: 5


training loop:   0% |                                 | ETA:  30 days, 16:06:07

Episode: 6   score: 2.04   Avg score (100e): 2.05   actor gain: -0.49   critic loss: 0.73   steps: 6


training loop:   0% |                                  | ETA:  30 days, 2:13:04

Episode: 7   score: 2.07   Avg score (100e): 2.05   actor gain: -0.49   critic loss: 0.70   steps: 7


training loop:   0% |                                 | ETA:  29 days, 20:09:20

Episode: 8   score: 2.08   Avg score (100e): 2.05   actor gain: -0.49   critic loss: 0.68   steps: 8


training loop:   0% |                                 | ETA:  29 days, 21:22:56

Episode: 9   score: 2.08   Avg score (100e): 2.06   actor gain: -0.49   critic loss: 0.66   steps: 9


training loop:   0% |                                  | ETA:  30 days, 4:45:02

Episode: 10   score: 2.11   Avg score (100e): 2.06   actor gain: -0.49   critic loss: 0.65   steps: 10


training loop:   0% |                                 | ETA:  30 days, 15:22:55

Episode: 11   score: 2.26   Avg score (100e): 2.08   actor gain: -0.50   critic loss: 0.63   steps: 11


training loop:   0% |                                 | ETA:  30 days, 23:07:04

Episode: 12   score: 2.34   Avg score (100e): 2.10   actor gain: -0.50   critic loss: 0.62   steps: 12


training loop:   0% |                                  | ETA:  31 days, 5:49:21

Episode: 13   score: 2.34   Avg score (100e): 2.12   actor gain: -0.49   critic loss: 0.61   steps: 13


training loop:   0% |                                 | ETA:  31 days, 16:57:24

Episode: 14   score: 2.35   Avg score (100e): 2.14   actor gain: -0.49   critic loss: 0.60   steps: 14


training loop:   0% |                                 | ETA:  31 days, 18:16:17

Episode: 15   score: 2.34   Avg score (100e): 2.15   actor gain: -0.49   critic loss: 0.59   steps: 15


training loop:   0% |                                 | ETA:  31 days, 20:11:45

Episode: 16   score: 2.36   Avg score (100e): 2.16   actor gain: -0.49   critic loss: 0.58   steps: 16


training loop:   0% |                                  | ETA:  32 days, 0:45:33

Episode: 17   score: 2.38   Avg score (100e): 2.18   actor gain: -0.49   critic loss: 0.57   steps: 17


training loop:   0% |                                  | ETA:  32 days, 5:58:44

Episode: 18   score: 2.44   Avg score (100e): 2.19   actor gain: -0.49   critic loss: 0.57   steps: 18


training loop:   0% |                                 | ETA:  32 days, 13:20:49

Episode: 19   score: 2.50   Avg score (100e): 2.21   actor gain: -0.49   critic loss: 0.56   steps: 19


training loop:   0% |                                 | ETA:  32 days, 18:04:20

Episode: 20   score: 2.54   Avg score (100e): 2.22   actor gain: -0.49   critic loss: 0.56   steps: 20


training loop:   0% |                                 | ETA:  32 days, 23:28:19

Episode: 21   score: 2.62   Avg score (100e): 2.24   actor gain: -0.49   critic loss: 0.55   steps: 21


training loop:   0% |                                 | ETA:  32 days, 22:19:40

Episode: 22   score: 2.63   Avg score (100e): 2.26   actor gain: -0.49   critic loss: 0.55   steps: 22


training loop:   0% |                                 | ETA:  32 days, 22:59:11

Episode: 23   score: 2.62   Avg score (100e): 2.28   actor gain: -0.49   critic loss: 0.54   steps: 23


training loop:   0% |                                 | ETA:  32 days, 21:41:23

Episode: 24   score: 2.65   Avg score (100e): 2.29   actor gain: -0.49   critic loss: 0.54   steps: 24


training loop:   0% |                                 | ETA:  32 days, 21:32:49

Episode: 25   score: 2.67   Avg score (100e): 2.31   actor gain: -0.48   critic loss: 0.54   steps: 25


training loop:   0% |                                 | ETA:  32 days, 22:08:03

Episode: 26   score: 2.71   Avg score (100e): 2.32   actor gain: -0.48   critic loss: 0.51   steps: 26


training loop:   0% |                                  | ETA:  33 days, 2:21:06

Episode: 27   score: 2.80   Avg score (100e): 2.34   actor gain: -0.48   critic loss: 0.50   steps: 27


training loop:   0% |                                  | ETA:  33 days, 3:53:47

Episode: 28   score: 2.84   Avg score (100e): 2.36   actor gain: -0.48   critic loss: 0.49   steps: 28


training loop:   0% |                                  | ETA:  33 days, 6:59:36

Episode: 29   score: 2.88   Avg score (100e): 2.38   actor gain: -0.48   critic loss: 0.48   steps: 29


training loop:   0% |                                 | ETA:  33 days, 10:03:50

Episode: 30   score: 2.91   Avg score (100e): 2.39   actor gain: -0.48   critic loss: 0.47   steps: 30


training loop:   0% |                                 | ETA:  33 days, 11:58:29

Episode: 31   score: 2.93   Avg score (100e): 2.41   actor gain: -0.48   critic loss: 0.47   steps: 31


training loop:   0% |                                 | ETA:  33 days, 15:55:03

Episode: 32   score: 2.96   Avg score (100e): 2.43   actor gain: -0.48   critic loss: 0.47   steps: 32


training loop:   0% |                                 | ETA:  33 days, 21:51:40

Episode: 33   score: 3.00   Avg score (100e): 2.45   actor gain: -0.48   critic loss: 0.47   steps: 33


training loop:   0% |                                  | ETA:  34 days, 0:40:58

Episode: 34   score: 3.02   Avg score (100e): 2.46   actor gain: -0.48   critic loss: 0.46   steps: 34


training loop:   0% |                                 | ETA:  33 days, 23:44:10

Episode: 35   score: 3.01   Avg score (100e): 2.48   actor gain: -0.48   critic loss: 0.46   steps: 35


training loop:   0% |                                 | ETA:  33 days, 23:21:43

Episode: 36   score: 3.05   Avg score (100e): 2.49   actor gain: -0.47   critic loss: 0.46   steps: 36


training loop:   0% |                                 | ETA:  33 days, 23:31:51

Episode: 37   score: 3.09   Avg score (100e): 2.51   actor gain: -0.47   critic loss: 0.46   steps: 37


training loop:   0% |                                  | ETA:  34 days, 1:40:00

Episode: 38   score: 3.14   Avg score (100e): 2.53   actor gain: -0.47   critic loss: 0.46   steps: 38


training loop:   0% |                                  | ETA:  34 days, 1:06:21

Episode: 39   score: 3.19   Avg score (100e): 2.54   actor gain: -0.47   critic loss: 0.46   steps: 39


training loop:   0% |                                  | ETA:  34 days, 1:05:23

Episode: 40   score: 3.25   Avg score (100e): 2.56   actor gain: -0.47   critic loss: 0.46   steps: 40


training loop:   0% |                                  | ETA:  34 days, 0:51:03

Episode: 41   score: 3.29   Avg score (100e): 2.58   actor gain: -0.47   critic loss: 0.46   steps: 41


training loop:   0% |                                  | ETA:  34 days, 5:19:38

Episode: 42   score: 3.32   Avg score (100e): 2.60   actor gain: -2.32   critic loss: 0.46   steps: 42


training loop:   0% |                                  | ETA:  34 days, 4:19:24

Episode: 43   score: 3.39   Avg score (100e): 2.61   actor gain: -2.32   critic loss: 0.46   steps: 43


training loop:   0% |                                  | ETA:  34 days, 4:13:47

Episode: 44   score: 3.42   Avg score (100e): 2.63   actor gain: -2.32   critic loss: 0.45   steps: 44


training loop:   0% |                                  | ETA:  34 days, 3:06:03

Episode: 45   score: 3.42   Avg score (100e): 2.65   actor gain: -2.32   critic loss: 0.46   steps: 45


training loop:   0% |                                  | ETA:  34 days, 3:03:48

Episode: 46   score: 3.49   Avg score (100e): 2.67   actor gain: -2.32   critic loss: 0.46   steps: 46


training loop:   0% |                                  | ETA:  34 days, 2:18:54

Episode: 47   score: 3.50   Avg score (100e): 2.69   actor gain: -2.32   critic loss: 0.46   steps: 47


training loop:   0% |                                  | ETA:  34 days, 2:28:04

Episode: 48   score: 3.57   Avg score (100e): 2.70   actor gain: -2.32   critic loss: 0.46   steps: 48


training loop:   0% |                                  | ETA:  34 days, 2:27:22

Episode: 49   score: 3.62   Avg score (100e): 2.72   actor gain: -2.32   critic loss: 0.46   steps: 49


training loop:   0% |                                  | ETA:  34 days, 3:29:37

Episode: 50   score: 3.64   Avg score (100e): 2.74   actor gain: -2.32   critic loss: 0.46   steps: 50


training loop:   0% |                                  | ETA:  34 days, 4:44:36

Episode: 51   score: 3.69   Avg score (100e): 2.76   actor gain: -2.32   critic loss: 0.46   steps: 51


training loop:   0% |                                  | ETA:  34 days, 4:31:44

Episode: 52   score: 3.70   Avg score (100e): 2.78   actor gain: -2.32   critic loss: 0.46   steps: 52


training loop:   0% |                                  | ETA:  34 days, 4:17:24

Episode: 53   score: 3.73   Avg score (100e): 2.80   actor gain: -2.32   critic loss: 0.45   steps: 53


training loop:   0% |                                  | ETA:  34 days, 4:03:14

Episode: 54   score: 3.77   Avg score (100e): 2.81   actor gain: -2.32   critic loss: 0.45   steps: 54


training loop:   0% |                                  | ETA:  34 days, 3:20:56

Episode: 55   score: 3.80   Avg score (100e): 2.83   actor gain: -2.32   critic loss: 0.45   steps: 55


training loop:   0% |                                  | ETA:  34 days, 3:14:09

Episode: 56   score: 3.86   Avg score (100e): 2.85   actor gain: -2.32   critic loss: 0.45   steps: 56


training loop:   0% |                                  | ETA:  34 days, 3:22:02

Episode: 57   score: 3.89   Avg score (100e): 2.87   actor gain: -2.33   critic loss: 0.45   steps: 57


training loop:   0% |                                  | ETA:  34 days, 3:17:12

Episode: 58   score: 3.92   Avg score (100e): 2.89   actor gain: -2.32   critic loss: 0.45   steps: 58


training loop:   0% |                                  | ETA:  34 days, 3:33:39

Episode: 59   score: 3.97   Avg score (100e): 2.90   actor gain: -2.33   critic loss: 0.45   steps: 59


training loop:   0% |                                  | ETA:  34 days, 0:30:36

Episode: 60   score: 3.98   Avg score (100e): 2.92   actor gain: -2.32   critic loss: 0.45   steps: 60


training loop:   0% |                                 | ETA:  33 days, 21:49:21

Episode: 61   score: 4.00   Avg score (100e): 2.94   actor gain: -2.32   critic loss: 0.45   steps: 61


training loop:   0% |                                 | ETA:  33 days, 17:41:31

Episode: 62   score: 4.03   Avg score (100e): 2.96   actor gain: -2.32   critic loss: 0.45   steps: 62


training loop:   0% |                                 | ETA:  33 days, 14:27:35

Episode: 63   score: 4.09   Avg score (100e): 2.98   actor gain: -2.32   critic loss: 0.45   steps: 63


training loop:   0% |                                 | ETA:  33 days, 12:59:18

Episode: 64   score: 4.12   Avg score (100e): 2.99   actor gain: -2.32   critic loss: 0.44   steps: 64


training loop:   0% |                                 | ETA:  33 days, 13:05:17

Episode: 65   score: 4.16   Avg score (100e): 3.01   actor gain: -2.33   critic loss: 0.44   steps: 65


training loop:   0% |                                 | ETA:  33 days, 11:41:07

Episode: 66   score: 4.19   Avg score (100e): 3.03   actor gain: -2.33   critic loss: 0.44   steps: 66


training loop:   0% |                                  | ETA:  33 days, 9:42:23

Episode: 67   score: 4.22   Avg score (100e): 3.05   actor gain: -0.48   critic loss: 0.44   steps: 67


training loop:   0% |                                  | ETA:  33 days, 7:57:03

Episode: 68   score: 4.21   Avg score (100e): 3.06   actor gain: -0.48   critic loss: 0.44   steps: 68


training loop:   0% |                                  | ETA:  33 days, 4:40:38

Episode: 69   score: 4.24   Avg score (100e): 3.08   actor gain: -0.48   critic loss: 0.44   steps: 69


training loop:   0% |                                  | ETA:  33 days, 2:16:52

Episode: 70   score: 4.29   Avg score (100e): 3.10   actor gain: -0.48   critic loss: 0.44   steps: 70


training loop:   0% |                                  | ETA:  33 days, 0:26:22

Episode: 71   score: 4.31   Avg score (100e): 3.12   actor gain: -0.48   critic loss: 0.44   steps: 71


training loop:   0% |                                 | ETA:  32 days, 22:36:42

Episode: 72   score: 4.34   Avg score (100e): 3.13   actor gain: -0.48   critic loss: 0.44   steps: 72


training loop:   0% |                                 | ETA:  32 days, 19:58:39

Episode: 73   score: 4.35   Avg score (100e): 3.15   actor gain: -0.48   critic loss: 0.44   steps: 73


training loop:   0% |                                 | ETA:  32 days, 19:54:32

Episode: 74   score: 4.39   Avg score (100e): 3.17   actor gain: -0.48   critic loss: 0.43   steps: 74


training loop:   0% |                                 | ETA:  32 days, 17:23:42

Episode: 75   score: 4.42   Avg score (100e): 3.18   actor gain: -0.49   critic loss: 0.43   steps: 75


training loop:   0% |                                 | ETA:  32 days, 15:46:35

Episode: 76   score: 4.45   Avg score (100e): 3.20   actor gain: -0.49   critic loss: 0.43   steps: 76


training loop:   0% |                                 | ETA:  32 days, 14:15:45

Episode: 77   score: 4.46   Avg score (100e): 3.22   actor gain: -0.49   critic loss: 0.43   steps: 77


training loop:   0% |                                 | ETA:  32 days, 11:56:20

Episode: 78   score: 4.47   Avg score (100e): 3.23   actor gain: -0.49   critic loss: 0.43   steps: 78


training loop:   0% |                                  | ETA:  32 days, 9:04:08

Episode: 79   score: 4.49   Avg score (100e): 3.25   actor gain: -0.49   critic loss: 0.43   steps: 79


training loop:   0% |                                  | ETA:  32 days, 6:37:53

Episode: 80   score: 4.51   Avg score (100e): 3.26   actor gain: -0.49   critic loss: 0.43   steps: 80


training loop:   0% |                                  | ETA:  32 days, 3:59:57

Episode: 81   score: 4.54   Avg score (100e): 3.28   actor gain: -0.49   critic loss: 0.43   steps: 81


training loop:   0% |                                  | ETA:  32 days, 1:06:20

Episode: 82   score: 4.56   Avg score (100e): 3.30   actor gain: -0.49   critic loss: 0.43   steps: 82


training loop:   0% |                                 | ETA:  31 days, 23:13:45

Episode: 83   score: 4.59   Avg score (100e): 3.31   actor gain: -0.49   critic loss: 0.43   steps: 83


training loop:   0% |                                 | ETA:  31 days, 20:47:31

Episode: 84   score: 4.60   Avg score (100e): 3.33   actor gain: -0.49   critic loss: 0.43   steps: 84


training loop:   0% |                                 | ETA:  31 days, 19:53:15

Episode: 85   score: 4.63   Avg score (100e): 3.34   actor gain: -0.49   critic loss: 0.43   steps: 85


training loop:   0% |                                 | ETA:  31 days, 19:18:40

Episode: 86   score: 4.65   Avg score (100e): 3.36   actor gain: -0.49   critic loss: 0.43   steps: 86


training loop:   0% |                                 | ETA:  31 days, 16:37:32

Episode: 87   score: 4.65   Avg score (100e): 3.37   actor gain: -0.49   critic loss: 0.43   steps: 87


training loop:   0% |                                 | ETA:  31 days, 13:40:10

Episode: 88   score: 4.66   Avg score (100e): 3.39   actor gain: -0.49   critic loss: 0.43   steps: 88


training loop:   0% |                                 | ETA:  31 days, 11:40:56

Episode: 89   score: 4.69   Avg score (100e): 3.40   actor gain: -0.49   critic loss: 0.43   steps: 89


training loop:   0% |                                 | ETA:  31 days, 11:01:01

Episode: 90   score: 4.71   Avg score (100e): 3.42   actor gain: -0.49   critic loss: 0.43   steps: 90


training loop:   0% |                                  | ETA:  31 days, 9:43:21

Episode: 91   score: 4.73   Avg score (100e): 3.43   actor gain: -0.49   critic loss: 0.43   steps: 91


training loop:   0% |                                  | ETA:  31 days, 6:48:56

Episode: 92   score: 4.72   Avg score (100e): 3.44   actor gain: -0.49   critic loss: 0.42   steps: 92


training loop:   0% |                                  | ETA:  31 days, 4:18:49

Episode: 93   score: 4.75   Avg score (100e): 3.46   actor gain: -0.49   critic loss: 0.42   steps: 93


training loop:   0% |                                  | ETA:  31 days, 1:53:24

Episode: 94   score: 4.78   Avg score (100e): 3.47   actor gain: -0.49   critic loss: 0.42   steps: 94


training loop:   0% |                                 | ETA:  30 days, 23:14:29

Episode: 95   score: 4.83   Avg score (100e): 3.49   actor gain: -0.48   critic loss: 0.42   steps: 95


training loop:   0% |                                 | ETA:  30 days, 20:55:46

Episode: 96   score: 4.86   Avg score (100e): 3.50   actor gain: -0.48   critic loss: 0.42   steps: 96


training loop:   0% |                                 | ETA:  30 days, 18:52:05

Episode: 97   score: 4.89   Avg score (100e): 3.52   actor gain: -0.48   critic loss: 0.42   steps: 97


training loop:   0% |                                 | ETA:  30 days, 16:39:30

Episode: 98   score: 4.90   Avg score (100e): 3.53   actor gain: -0.48   critic loss: 0.42   steps: 98


training loop:   0% |                                 | ETA:  30 days, 14:39:20

Episode: 99   score: 4.93   Avg score (100e): 3.54   actor gain: -0.48   critic loss: 0.42   steps: 99


training loop:   0% |                                 | ETA:  30 days, 12:17:34

Episode: 100   score: 4.95   Avg score (100e): 3.56   actor gain: -0.48   critic loss: 0.42   steps: 100


training loop:   0% |                                 | ETA:  30 days, 11:56:50

Episode: 101   score: 4.96   Avg score (100e): 3.59   actor gain: -0.48   critic loss: 0.42   steps: 101


training loop:   0% |                                  | ETA:  30 days, 9:58:52

Episode: 102   score: 4.98   Avg score (100e): 3.62   actor gain: -0.48   critic loss: 0.42   steps: 102


training loop:   0% |                                  | ETA:  30 days, 7:38:46

Episode: 103   score: 4.98   Avg score (100e): 3.64   actor gain: -0.48   critic loss: 0.42   steps: 103


training loop:   0% |                                  | ETA:  30 days, 5:47:42

Episode: 104   score: 5.00   Avg score (100e): 3.67   actor gain: -0.49   critic loss: 0.42   steps: 104


training loop:   0% |                                  | ETA:  30 days, 4:08:18

Episode: 105   score: 5.04   Avg score (100e): 3.70   actor gain: -0.49   critic loss: 0.42   steps: 105


training loop:   0% |                                  | ETA:  30 days, 2:33:12

Episode: 106   score: 5.04   Avg score (100e): 3.73   actor gain: -0.49   critic loss: 0.42   steps: 106


training loop:   0% |                                  | ETA:  30 days, 2:36:20

Episode: 107   score: 5.06   Avg score (100e): 3.76   actor gain: -0.49   critic loss: 0.42   steps: 107


training loop:   0% |                                  | ETA:  30 days, 1:22:15

Episode: 108   score: 5.11   Avg score (100e): 3.79   actor gain: -0.49   critic loss: 0.42   steps: 108


training loop:   0% |                                 | ETA:  29 days, 23:42:10

Episode: 109   score: 5.13   Avg score (100e): 3.83   actor gain: -0.49   critic loss: 0.42   steps: 109


training loop:   0% |                                 | ETA:  29 days, 21:47:52

Episode: 110   score: 5.15   Avg score (100e): 3.86   actor gain: -0.49   critic loss: 0.42   steps: 110


training loop:   0% |                                 | ETA:  29 days, 20:06:43

Episode: 111   score: 5.18   Avg score (100e): 3.88   actor gain: -0.49   critic loss: 0.42   steps: 111


training loop:   0% |                                 | ETA:  29 days, 18:54:19

Episode: 112   score: 5.20   Avg score (100e): 3.91   actor gain: -0.49   critic loss: 0.42   steps: 112


training loop:   0% |                                 | ETA:  29 days, 17:19:20

Episode: 113   score: 5.22   Avg score (100e): 3.94   actor gain: -0.49   critic loss: 0.42   steps: 113


training loop:   0% |                                 | ETA:  29 days, 15:37:37

Episode: 114   score: 5.24   Avg score (100e): 3.97   actor gain: -0.49   critic loss: 0.42   steps: 114


training loop:   0% |                                 | ETA:  29 days, 13:43:01

Episode: 115   score: 5.23   Avg score (100e): 4.00   actor gain: -0.49   critic loss: 0.42   steps: 115


training loop:   0% |                                 | ETA:  29 days, 11:40:43

Episode: 116   score: 5.25   Avg score (100e): 4.03   actor gain: -0.49   critic loss: 0.42   steps: 116


training loop:   0% |                                  | ETA:  29 days, 9:58:15

Episode: 117   score: 5.26   Avg score (100e): 4.06   actor gain: -0.49   critic loss: 0.42   steps: 117


training loop:   0% |                                  | ETA:  29 days, 9:40:25

Episode: 118   score: 5.29   Avg score (100e): 4.09   actor gain: -0.49   critic loss: 0.42   steps: 118


training loop:   0% |                                 | ETA:  29 days, 10:06:03

Episode: 119   score: 5.31   Avg score (100e): 4.11   actor gain: -0.50   critic loss: 0.42   steps: 119


training loop:   0% |                                  | ETA:  29 days, 9:07:14

Episode: 120   score: 5.33   Avg score (100e): 4.14   actor gain: -0.50   critic loss: 0.42   steps: 120


training loop:   0% |                                  | ETA:  29 days, 8:40:45

Episode: 121   score: 5.34   Avg score (100e): 4.17   actor gain: -0.50   critic loss: 0.42   steps: 121


training loop:   0% |                                  | ETA:  29 days, 9:02:35

Episode: 122   score: 5.36   Avg score (100e): 4.20   actor gain: -0.50   critic loss: 0.42   steps: 122


training loop:   0% |                                  | ETA:  29 days, 7:31:58

Episode: 123   score: 5.36   Avg score (100e): 4.22   actor gain: -0.50   critic loss: 0.42   steps: 123


training loop:   0% |                                  | ETA:  29 days, 6:38:25

Episode: 124   score: 5.36   Avg score (100e): 4.25   actor gain: -0.50   critic loss: 0.42   steps: 124


training loop:   0% |                                  | ETA:  29 days, 5:39:25

Episode: 125   score: 5.38   Avg score (100e): 4.28   actor gain: -0.50   critic loss: 0.42   steps: 125


training loop:   0% |                                  | ETA:  29 days, 4:51:10

Episode: 126   score: 5.41   Avg score (100e): 4.31   actor gain: -0.50   critic loss: 0.42   steps: 126


training loop:   0% |                                  | ETA:  29 days, 4:06:03

Episode: 127   score: 5.44   Avg score (100e): 4.33   actor gain: -0.50   critic loss: 0.42   steps: 127


training loop:   0% |                                  | ETA:  29 days, 4:21:57

Episode: 128   score: 5.45   Avg score (100e): 4.36   actor gain: -0.50   critic loss: 0.42   steps: 128


training loop:   0% |                                  | ETA:  29 days, 3:33:16

Episode: 129   score: 5.50   Avg score (100e): 4.38   actor gain: -0.48   critic loss: 0.42   steps: 129


training loop:   0% |                                  | ETA:  29 days, 2:44:24

Episode: 130   score: 5.51   Avg score (100e): 4.41   actor gain: -0.48   critic loss: 0.42   steps: 130


training loop:   0% |                                  | ETA:  29 days, 1:58:03

Episode: 131   score: 5.50   Avg score (100e): 4.44   actor gain: -0.48   critic loss: 0.42   steps: 131


training loop:   0% |                                  | ETA:  29 days, 0:59:35

Episode: 132   score: 5.53   Avg score (100e): 4.46   actor gain: -0.48   critic loss: 0.42   steps: 132


training loop:   0% |                                  | ETA:  29 days, 0:01:37

Episode: 133   score: 5.57   Avg score (100e): 4.49   actor gain: -0.48   critic loss: 0.42   steps: 133


training loop:   0% |                                 | ETA:  28 days, 23:50:47

Episode: 134   score: 5.59   Avg score (100e): 4.51   actor gain: -0.48   critic loss: 0.42   steps: 134


training loop:   0% |                                  | ETA:  29 days, 0:30:30

Episode: 135   score: 5.62   Avg score (100e): 4.54   actor gain: -0.48   critic loss: 0.42   steps: 135


training loop:   0% |                                  | ETA:  29 days, 0:32:42

Episode: 136   score: 5.63   Avg score (100e): 4.56   actor gain: -0.48   critic loss: 0.42   steps: 136


training loop:   0% |                                 | ETA:  28 days, 23:56:04

Episode: 137   score: 5.69   Avg score (100e): 4.59   actor gain: -0.48   critic loss: 0.42   steps: 137


training loop:   0% |                                 | ETA:  28 days, 23:35:29

Episode: 138   score: 5.72   Avg score (100e): 4.62   actor gain: -0.48   critic loss: 0.42   steps: 138


training loop:   0% |                                  | ETA:  30 days, 1:33:51

Episode: 139   score: 5.75   Avg score (100e): 4.64   actor gain: -0.48   critic loss: 0.42   steps: 139


training loop:   0% |                                  | ETA:  30 days, 1:06:52

Episode: 140   score: 5.77   Avg score (100e): 4.67   actor gain: -0.48   critic loss: 0.42   steps: 140


training loop:   0% |                                  | ETA:  30 days, 0:24:25

Episode: 141   score: 5.81   Avg score (100e): 4.69   actor gain: -0.48   critic loss: 0.42   steps: 141


training loop:   0% |                                  | ETA:  30 days, 1:36:00

Episode: 142   score: 5.83   Avg score (100e): 4.72   actor gain: -0.48   critic loss: 0.42   steps: 142


training loop:   0% |                                  | ETA:  30 days, 4:01:14

Episode: 143   score: 5.83   Avg score (100e): 4.74   actor gain: -0.48   critic loss: 0.42   steps: 143


training loop:   0% |                                  | ETA:  30 days, 6:29:59

Episode: 144   score: 5.83   Avg score (100e): 4.77   actor gain: -0.47   critic loss: 0.42   steps: 144


training loop:   0% |                                  | ETA:  30 days, 7:28:50

Episode: 145   score: 5.83   Avg score (100e): 4.79   actor gain: -0.47   critic loss: 0.42   steps: 145


training loop:   0% |                                  | ETA:  30 days, 8:17:39

Episode: 146   score: 5.82   Avg score (100e): 4.81   actor gain: -0.47   critic loss: 0.42   steps: 146


training loop:   0% |                                  | ETA:  30 days, 9:11:20

Episode: 147   score: 5.84   Avg score (100e): 4.84   actor gain: -0.47   critic loss: 0.42   steps: 147


training loop:   0% |                                  | ETA:  30 days, 9:48:06

Episode: 148   score: 5.83   Avg score (100e): 4.86   actor gain: -0.47   critic loss: 0.42   steps: 148


training loop:   0% |                                 | ETA:  30 days, 11:04:57

Episode: 149   score: 5.84   Avg score (100e): 4.88   actor gain: -0.47   critic loss: 0.42   steps: 149


training loop:   0% |                                 | ETA:  30 days, 11:47:50

Episode: 150   score: 5.85   Avg score (100e): 4.90   actor gain: -0.47   critic loss: 0.42   steps: 150


training loop:   0% |                                 | ETA:  30 days, 12:25:54

Episode: 151   score: 5.88   Avg score (100e): 4.93   actor gain: -0.47   critic loss: 0.42   steps: 151


training loop:   0% |                                 | ETA:  30 days, 13:01:01

Episode: 152   score: 5.92   Avg score (100e): 4.95   actor gain: -0.47   critic loss: 0.42   steps: 152


training loop:   0% |                                 | ETA:  30 days, 13:38:10

Episode: 153   score: 5.95   Avg score (100e): 4.97   actor gain: -0.47   critic loss: 0.42   steps: 153


training loop:   0% |                                 | ETA:  30 days, 14:41:15

Episode: 154   score: 5.98   Avg score (100e): 4.99   actor gain: -0.47   critic loss: 0.42   steps: 154


training loop:   0% |                                 | ETA:  30 days, 16:24:53

Episode: 155   score: 6.00   Avg score (100e): 5.01   actor gain: -0.47   critic loss: 0.42   steps: 155


training loop:   0% |                                 | ETA:  30 days, 17:39:42

Episode: 156   score: 6.00   Avg score (100e): 5.04   actor gain: -0.47   critic loss: 0.42   steps: 156


training loop:   0% |                                 | ETA:  30 days, 18:24:23

Episode: 157   score: 6.03   Avg score (100e): 5.06   actor gain: -0.47   critic loss: 0.42   steps: 157


training loop:   0% |                                 | ETA:  33 days, 15:32:53

Episode: 158   score: 6.05   Avg score (100e): 5.08   actor gain: -0.47   critic loss: 0.42   steps: 158


training loop:   0% |                                 | ETA:  33 days, 17:07:06

Episode: 159   score: 6.10   Avg score (100e): 5.10   actor gain: -0.47   critic loss: 0.42   steps: 159


training loop:   0% |                                 | ETA:  33 days, 17:55:12

Episode: 160   score: 6.10   Avg score (100e): 5.12   actor gain: -0.47   critic loss: 0.42   steps: 160


training loop:   0% |                                 | ETA:  33 days, 18:35:46

Episode: 161   score: 6.12   Avg score (100e): 5.14   actor gain: -0.47   critic loss: 0.42   steps: 161


training loop:   0% |                                 | ETA:  33 days, 19:05:07

Episode: 162   score: 6.14   Avg score (100e): 5.16   actor gain: -0.47   critic loss: 0.42   steps: 162


training loop:   0% |                                 | ETA:  33 days, 18:01:57

Episode: 163   score: 6.18   Avg score (100e): 5.18   actor gain: -0.47   critic loss: 0.42   steps: 163


training loop:   0% |                                 | ETA:  33 days, 17:58:17

Episode: 164   score: 6.20   Avg score (100e): 5.21   actor gain: -0.47   critic loss: 0.42   steps: 164


training loop:   0% |                                 | ETA:  33 days, 17:45:36

Episode: 165   score: 6.20   Avg score (100e): 5.23   actor gain: -0.47   critic loss: 0.42   steps: 165


training loop:   0% |                                 | ETA:  33 days, 17:37:51

Episode: 166   score: 6.22   Avg score (100e): 5.25   actor gain: -0.47   critic loss: 0.42   steps: 166


training loop:   0% |                                 | ETA:  33 days, 17:13:20

Episode: 167   score: 6.22   Avg score (100e): 5.27   actor gain: -0.47   critic loss: 0.42   steps: 167


training loop:   0% |                                 | ETA:  33 days, 16:49:49

Episode: 168   score: 6.23   Avg score (100e): 5.29   actor gain: -0.47   critic loss: 0.42   steps: 168


training loop:   0% |                                 | ETA:  33 days, 16:20:58

Episode: 169   score: 6.24   Avg score (100e): 5.31   actor gain: -0.47   critic loss: 0.42   steps: 169


training loop:   0% |                                 | ETA:  33 days, 16:52:57

Episode: 170   score: 6.25   Avg score (100e): 5.33   actor gain: -0.47   critic loss: 0.42   steps: 170


training loop:   0% |                                 | ETA:  33 days, 17:55:11

Episode: 171   score: 6.25   Avg score (100e): 5.35   actor gain: -0.47   critic loss: 0.42   steps: 171


training loop:   0% |                                 | ETA:  33 days, 18:09:10

Episode: 172   score: 6.26   Avg score (100e): 5.36   actor gain: -0.47   critic loss: 0.42   steps: 172


training loop:   0% |                                 | ETA:  33 days, 18:31:43

Episode: 173   score: 6.28   Avg score (100e): 5.38   actor gain: -0.47   critic loss: 0.42   steps: 173


training loop:   0% |                                 | ETA:  33 days, 18:33:37

Episode: 174   score: 6.26   Avg score (100e): 5.40   actor gain: -0.47   critic loss: 0.42   steps: 174


training loop:   0% |                                 | ETA:  33 days, 18:33:14

Episode: 175   score: 6.25   Avg score (100e): 5.42   actor gain: -0.47   critic loss: 0.42   steps: 175


training loop:   0% |                                 | ETA:  33 days, 18:55:52

Episode: 176   score: 6.25   Avg score (100e): 5.44   actor gain: -0.47   critic loss: 0.42   steps: 176


training loop:   0% |                                 | ETA:  33 days, 19:05:03

Episode: 177   score: 6.27   Avg score (100e): 5.46   actor gain: -0.46   critic loss: 0.42   steps: 177


training loop:   0% |                                 | ETA:  33 days, 19:24:38

Episode: 178   score: 6.26   Avg score (100e): 5.47   actor gain: -0.46   critic loss: 0.42   steps: 178


training loop:   0% |                                 | ETA:  33 days, 19:21:03

Episode: 179   score: 6.28   Avg score (100e): 5.49   actor gain: -0.46   critic loss: 0.42   steps: 179


training loop:   0% |                                 | ETA:  33 days, 19:19:38

Episode: 180   score: 6.28   Avg score (100e): 5.51   actor gain: -0.46   critic loss: 0.42   steps: 180


training loop:   0% |                                 | ETA:  33 days, 19:26:36

Episode: 181   score: 6.30   Avg score (100e): 5.53   actor gain: -0.46   critic loss: 0.42   steps: 181


training loop:   0% |                                 | ETA:  38 days, 22:53:07

Episode: 182   score: 6.29   Avg score (100e): 5.54   actor gain: -0.47   critic loss: 0.42   steps: 182


training loop:   0% |                                 | ETA:  38 days, 22:31:47

Episode: 183   score: 6.31   Avg score (100e): 5.56   actor gain: -0.47   critic loss: 0.42   steps: 183


training loop:   0% |                                 | ETA:  38 days, 22:20:29

Episode: 184   score: 6.33   Avg score (100e): 5.58   actor gain: -0.47   critic loss: 0.42   steps: 184


training loop:   0% |                                 | ETA:  38 days, 22:01:33

Episode: 185   score: 6.34   Avg score (100e): 5.60   actor gain: -0.47   critic loss: 0.42   steps: 185


training loop:   0% |                                 | ETA:  38 days, 21:33:33

Episode: 186   score: 6.37   Avg score (100e): 5.61   actor gain: -0.46   critic loss: 0.42   steps: 186


training loop:   0% |                                 | ETA:  38 days, 20:54:28

Episode: 187   score: 6.38   Avg score (100e): 5.63   actor gain: -0.46   critic loss: 0.42   steps: 187


training loop:   0% |                                 | ETA:  38 days, 20:20:27

Episode: 188   score: 6.39   Avg score (100e): 5.65   actor gain: -0.46   critic loss: 0.42   steps: 188


training loop:   0% |                                 | ETA:  38 days, 20:02:34

Episode: 189   score: 6.43   Avg score (100e): 5.67   actor gain: -0.46   critic loss: 0.42   steps: 189


training loop:   0% |                                 | ETA:  38 days, 19:30:17

Episode: 190   score: 6.46   Avg score (100e): 5.68   actor gain: -0.46   critic loss: 0.42   steps: 190


training loop:   0% |                                 | ETA:  38 days, 18:47:38

Episode: 191   score: 6.46   Avg score (100e): 5.70   actor gain: -0.46   critic loss: 0.42   steps: 191


training loop:   0% |                                 | ETA:  38 days, 18:11:07

Episode: 192   score: 6.47   Avg score (100e): 5.72   actor gain: -0.46   critic loss: 0.42   steps: 192


training loop:   0% |                                 | ETA:  38 days, 17:28:47

Episode: 193   score: 6.49   Avg score (100e): 5.74   actor gain: -0.46   critic loss: 0.42   steps: 193


training loop:   0% |                                 | ETA:  38 days, 17:07:51

Episode: 194   score: 6.50   Avg score (100e): 5.75   actor gain: -0.46   critic loss: 0.42   steps: 194


training loop:   0% |                                 | ETA:  38 days, 16:32:02

Episode: 195   score: 6.53   Avg score (100e): 5.77   actor gain: -0.47   critic loss: 0.42   steps: 195


training loop:   0% |                                 | ETA:  38 days, 16:09:04

Episode: 196   score: 6.55   Avg score (100e): 5.79   actor gain: -0.47   critic loss: 0.42   steps: 196


training loop:   0% |                                 | ETA:  38 days, 15:30:23

Episode: 197   score: 6.56   Avg score (100e): 5.80   actor gain: -0.49   critic loss: 0.42   steps: 197


training loop:   0% |                                 | ETA:  38 days, 14:51:37

Episode: 198   score: 6.55   Avg score (100e): 5.82   actor gain: -0.49   critic loss: 0.42   steps: 198


training loop:   0% |                                 | ETA:  38 days, 14:22:32

Episode: 199   score: 6.59   Avg score (100e): 5.84   actor gain: -0.49   critic loss: 0.42   steps: 199


training loop:   0% |                                 | ETA:  38 days, 13:44:59

Episode: 200   score: 6.63   Avg score (100e): 5.85   actor gain: -0.49   critic loss: 0.42   steps: 200


training loop:   0% |                                 | ETA:  38 days, 13:17:06

Episode: 201   score: 6.65   Avg score (100e): 5.87   actor gain: -0.49   critic loss: 0.42   steps: 201


training loop:   0% |                                 | ETA:  38 days, 12:51:40

Episode: 202   score: 6.68   Avg score (100e): 5.89   actor gain: -0.49   critic loss: 0.42   steps: 202


training loop:   0% |                                 | ETA:  38 days, 12:15:04

Episode: 203   score: 6.72   Avg score (100e): 5.90   actor gain: -0.49   critic loss: 0.42   steps: 203


training loop:   0% |                                 | ETA:  38 days, 12:23:15

Episode: 204   score: 6.75   Avg score (100e): 5.92   actor gain: -0.49   critic loss: 0.42   steps: 204


training loop:   0% |                                 | ETA:  38 days, 12:06:59

Episode: 205   score: 6.79   Avg score (100e): 5.94   actor gain: -0.49   critic loss: 0.42   steps: 205


training loop:   0% |                                 | ETA:  38 days, 11:55:21

Episode: 206   score: 6.82   Avg score (100e): 5.96   actor gain: -0.49   critic loss: 0.42   steps: 206


training loop:   0% |                                 | ETA:  38 days, 11:26:00

Episode: 207   score: 6.86   Avg score (100e): 5.98   actor gain: -0.49   critic loss: 0.42   steps: 207


training loop:   0% |                                 | ETA:  38 days, 11:36:12

Episode: 208   score: 6.88   Avg score (100e): 5.99   actor gain: -0.49   critic loss: 0.42   steps: 208


training loop:   0% |                                 | ETA:  38 days, 11:17:27

Episode: 209   score: 6.92   Avg score (100e): 6.01   actor gain: -0.49   critic loss: 0.42   steps: 209


training loop:   0% |                                 | ETA:  38 days, 10:43:04

Episode: 210   score: 6.96   Avg score (100e): 6.03   actor gain: -0.49   critic loss: 0.42   steps: 210


training loop:   0% |                                 | ETA:  38 days, 10:13:11

Episode: 211   score: 6.97   Avg score (100e): 6.05   actor gain: -0.49   critic loss: 0.42   steps: 211


training loop:   0% |                                  | ETA:  38 days, 9:45:55

Episode: 212   score: 6.99   Avg score (100e): 6.07   actor gain: -0.49   critic loss: 0.42   steps: 212


training loop:   0% |                                  | ETA:  38 days, 9:21:52

Episode: 213   score: 6.98   Avg score (100e): 6.08   actor gain: -0.49   critic loss: 0.42   steps: 213


training loop:   0% |                                  | ETA:  38 days, 9:05:00

Episode: 214   score: 7.01   Avg score (100e): 6.10   actor gain: -0.49   critic loss: 0.42   steps: 214


training loop:   0% |                                  | ETA:  38 days, 8:29:00

Episode: 215   score: 7.04   Avg score (100e): 6.12   actor gain: -0.49   critic loss: 0.42   steps: 215


training loop:   0% |                                  | ETA:  38 days, 7:56:41

Episode: 216   score: 7.07   Avg score (100e): 6.14   actor gain: -0.49   critic loss: 0.42   steps: 216


training loop:   0% |                                  | ETA:  38 days, 7:29:06

Episode: 217   score: 7.10   Avg score (100e): 6.16   actor gain: -0.49   critic loss: 0.42   steps: 217


training loop:   0% |                                  | ETA:  38 days, 7:03:20

Episode: 218   score: 7.15   Avg score (100e): 6.17   actor gain: -0.49   critic loss: 0.42   steps: 218


training loop:   0% |                                  | ETA:  38 days, 6:30:33

Episode: 219   score: 7.18   Avg score (100e): 6.19   actor gain: -0.49   critic loss: 0.42   steps: 219


training loop:   0% |                                  | ETA:  38 days, 6:00:27

Episode: 220   score: 7.21   Avg score (100e): 6.21   actor gain: -0.49   critic loss: 0.42   steps: 220


training loop:   0% |                                  | ETA:  38 days, 5:33:06

Episode: 221   score: 7.24   Avg score (100e): 6.23   actor gain: -0.49   critic loss: 0.42   steps: 221


training loop:   0% |                                  | ETA:  38 days, 4:57:56

Episode: 222   score: 7.28   Avg score (100e): 6.25   actor gain: -0.46   critic loss: 0.42   steps: 222


training loop:   0% |                                  | ETA:  38 days, 4:58:40

Episode: 223   score: 7.27   Avg score (100e): 6.27   actor gain: -0.46   critic loss: 0.42   steps: 223


training loop:   0% |                                  | ETA:  38 days, 4:42:58

Episode: 224   score: 7.30   Avg score (100e): 6.29   actor gain: -0.46   critic loss: 0.42   steps: 224


training loop:   0% |                                  | ETA:  38 days, 4:34:46

Episode: 225   score: 7.36   Avg score (100e): 6.31   actor gain: -0.47   critic loss: 0.42   steps: 225


training loop:   0% |                                  | ETA:  38 days, 3:59:10

Episode: 226   score: 7.38   Avg score (100e): 6.33   actor gain: -0.47   critic loss: 0.42   steps: 226


training loop:   0% |                                  | ETA:  38 days, 3:04:47

Episode: 227   score: 7.41   Avg score (100e): 6.35   actor gain: -0.47   critic loss: 0.42   steps: 227


training loop:   0% |                                  | ETA:  38 days, 3:05:11

Episode: 228   score: 7.42   Avg score (100e): 6.37   actor gain: -0.47   critic loss: 0.42   steps: 228


training loop:   0% |                                  | ETA:  38 days, 2:56:20

Episode: 229   score: 7.46   Avg score (100e): 6.39   actor gain: -0.47   critic loss: 0.42   steps: 229


training loop:   0% |                                  | ETA:  38 days, 2:36:57

Episode: 230   score: 7.46   Avg score (100e): 6.41   actor gain: -0.47   critic loss: 0.42   steps: 230


training loop:   0% |                                  | ETA:  38 days, 2:21:01

Episode: 231   score: 7.47   Avg score (100e): 6.43   actor gain: -0.47   critic loss: 0.42   steps: 231


training loop:   0% |                                  | ETA:  38 days, 2:40:36

Episode: 232   score: 7.49   Avg score (100e): 6.45   actor gain: -0.47   critic loss: 0.42   steps: 232


training loop:   0% |                                  | ETA:  38 days, 2:53:42

Episode: 233   score: 7.53   Avg score (100e): 6.46   actor gain: -0.47   critic loss: 0.42   steps: 233


training loop:   0% |                                  | ETA:  38 days, 2:57:51

Episode: 234   score: 7.58   Avg score (100e): 6.48   actor gain: -0.47   critic loss: 0.42   steps: 234


training loop:   0% |                                  | ETA:  38 days, 2:33:17

Episode: 235   score: 7.60   Avg score (100e): 6.50   actor gain: -0.47   critic loss: 0.42   steps: 235


training loop:   0% |                                  | ETA:  38 days, 3:54:30

Episode: 236   score: 7.63   Avg score (100e): 6.52   actor gain: -0.47   critic loss: 0.42   steps: 236


training loop:   0% |                                  | ETA:  38 days, 4:25:36

Episode: 237   score: 7.67   Avg score (100e): 6.54   actor gain: -0.47   critic loss: 0.42   steps: 237


training loop:   0% |                                  | ETA:  38 days, 5:17:31

Episode: 238   score: 7.69   Avg score (100e): 6.56   actor gain: -0.47   critic loss: 0.42   steps: 238


training loop:   0% |                                  | ETA:  38 days, 5:08:02

Episode: 239   score: 7.72   Avg score (100e): 6.58   actor gain: -0.47   critic loss: 0.42   steps: 239


training loop:   0% |                                  | ETA:  38 days, 4:45:56

Episode: 240   score: 7.77   Avg score (100e): 6.60   actor gain: -0.47   critic loss: 0.42   steps: 240


training loop:   0% |                                  | ETA:  38 days, 4:42:07

Episode: 241   score: 7.79   Avg score (100e): 6.62   actor gain: -0.47   critic loss: 0.42   steps: 241


training loop:   0% |                                  | ETA:  38 days, 4:18:13

Episode: 242   score: 7.80   Avg score (100e): 6.64   actor gain: -0.47   critic loss: 0.42   steps: 242


training loop:   0% |                                  | ETA:  38 days, 3:46:52

Episode: 243   score: 7.83   Avg score (100e): 6.66   actor gain: -0.47   critic loss: 0.42   steps: 243


training loop:   0% |                                  | ETA:  38 days, 3:21:33

Episode: 244   score: 7.84   Avg score (100e): 6.68   actor gain: -0.47   critic loss: 0.42   steps: 244


training loop:   0% |                                  | ETA:  38 days, 3:04:13

Episode: 245   score: 7.88   Avg score (100e): 6.70   actor gain: -0.47   critic loss: 0.42   steps: 245


training loop:   0% |                                  | ETA:  38 days, 2:38:28

Episode: 246   score: 7.91   Avg score (100e): 6.72   actor gain: -0.47   critic loss: 0.42   steps: 246


training loop:   0% |                                  | ETA:  38 days, 2:15:09

Episode: 247   score: 7.92   Avg score (100e): 6.75   actor gain: -0.47   critic loss: 0.42   steps: 247


training loop:   0% |                                  | ETA:  38 days, 1:46:47

Episode: 248   score: 7.96   Avg score (100e): 6.77   actor gain: -0.47   critic loss: 0.42   steps: 248


training loop:   0% |                                  | ETA:  38 days, 1:16:06

Episode: 249   score: 7.98   Avg score (100e): 6.79   actor gain: -0.46   critic loss: 0.42   steps: 249


training loop:   0% |                                  | ETA:  38 days, 0:46:15

Episode: 250   score: 7.99   Avg score (100e): 6.81   actor gain: -0.47   critic loss: 0.41   steps: 250


training loop:   0% |                                  | ETA:  38 days, 0:14:02

Episode: 251   score: 8.01   Avg score (100e): 6.83   actor gain: -0.47   critic loss: 0.41   steps: 251


training loop:   0% |                                 | ETA:  37 days, 23:50:33

Episode: 252   score: 8.03   Avg score (100e): 6.85   actor gain: -0.47   critic loss: 0.41   steps: 252


training loop:   0% |                                 | ETA:  37 days, 23:34:47

Episode: 253   score: 8.06   Avg score (100e): 6.87   actor gain: -0.47   critic loss: 0.41   steps: 253


training loop:   0% |                                 | ETA:  37 days, 23:08:16

Episode: 254   score: 8.09   Avg score (100e): 6.89   actor gain: -0.46   critic loss: 0.41   steps: 254


training loop:   0% |                                 | ETA:  37 days, 22:48:52

Episode: 255   score: 8.12   Avg score (100e): 6.92   actor gain: -0.46   critic loss: 0.41   steps: 255


training loop:   0% |                                 | ETA:  37 days, 22:24:38

Episode: 256   score: 8.14   Avg score (100e): 6.94   actor gain: -0.46   critic loss: 0.41   steps: 256


training loop:   0% |                                 | ETA:  37 days, 22:02:03

Episode: 257   score: 8.16   Avg score (100e): 6.96   actor gain: -0.46   critic loss: 0.42   steps: 257


training loop:   0% |                                 | ETA:  37 days, 21:35:46

Episode: 258   score: 8.15   Avg score (100e): 6.98   actor gain: -0.46   critic loss: 0.42   steps: 258


training loop:   0% |                                 | ETA:  37 days, 21:14:01

Episode: 259   score: 8.18   Avg score (100e): 7.00   actor gain: -0.54   critic loss: 0.42   steps: 259


training loop:   0% |                                 | ETA:  37 days, 20:51:51

Episode: 260   score: 8.18   Avg score (100e): 7.02   actor gain: -0.54   critic loss: 0.42   steps: 260


training loop:   0% |                                 | ETA:  37 days, 20:33:56

Episode: 261   score: 8.21   Avg score (100e): 7.04   actor gain: -0.54   critic loss: 0.42   steps: 261


training loop:   0% |                                 | ETA:  37 days, 20:42:35

Episode: 262   score: 8.25   Avg score (100e): 7.06   actor gain: -0.54   critic loss: 0.42   steps: 262


training loop:   0% |                                 | ETA:  37 days, 20:27:19

Episode: 263   score: 8.28   Avg score (100e): 7.08   actor gain: -0.54   critic loss: 0.42   steps: 263


training loop:   0% |                                 | ETA:  37 days, 20:36:05

Episode: 264   score: 8.31   Avg score (100e): 7.10   actor gain: -0.54   critic loss: 0.42   steps: 264


training loop:   0% |                                 | ETA:  37 days, 20:22:03

Episode: 265   score: 8.33   Avg score (100e): 7.13   actor gain: -0.54   critic loss: 0.42   steps: 265


training loop:   0% |                                 | ETA:  37 days, 20:05:34

Episode: 266   score: 8.36   Avg score (100e): 7.15   actor gain: -0.54   critic loss: 0.42   steps: 266


training loop:   0% |                                 | ETA:  37 days, 19:43:55

Episode: 267   score: 8.38   Avg score (100e): 7.17   actor gain: -0.54   critic loss: 0.42   steps: 267


training loop:   0% |                                 | ETA:  37 days, 19:56:51

Episode: 268   score: 8.39   Avg score (100e): 7.19   actor gain: -0.54   critic loss: 0.42   steps: 268


training loop:   0% |                                 | ETA:  37 days, 19:51:56

Episode: 269   score: 8.42   Avg score (100e): 7.21   actor gain: -0.54   critic loss: 0.42   steps: 269


training loop:   0% |                                 | ETA:  37 days, 19:36:37

Episode: 270   score: 8.43   Avg score (100e): 7.23   actor gain: -0.54   critic loss: 0.42   steps: 270


training loop:   0% |                                 | ETA:  37 days, 19:24:58

Episode: 271   score: 8.46   Avg score (100e): 7.26   actor gain: -0.54   critic loss: 0.42   steps: 271


training loop:   0% |                                 | ETA:  37 days, 19:09:41

Episode: 272   score: 8.48   Avg score (100e): 7.28   actor gain: -0.54   critic loss: 0.42   steps: 272


training loop:   0% |                                 | ETA:  37 days, 18:51:15

Episode: 273   score: 8.51   Avg score (100e): 7.30   actor gain: -0.54   critic loss: 0.42   steps: 273


training loop:   0% |                                 | ETA:  37 days, 18:31:02

Episode: 274   score: 8.53   Avg score (100e): 7.32   actor gain: -0.54   critic loss: 0.42   steps: 274


training loop:   0% |                                 | ETA:  37 days, 18:14:43

Episode: 275   score: 8.55   Avg score (100e): 7.35   actor gain: -0.54   critic loss: 0.42   steps: 275


training loop:   0% |                                 | ETA:  37 days, 17:54:23

Episode: 276   score: 8.58   Avg score (100e): 7.37   actor gain: -0.55   critic loss: 0.42   steps: 276


training loop:   0% |                                 | ETA:  37 days, 17:29:14

Episode: 277   score: 8.62   Avg score (100e): 7.39   actor gain: -0.55   critic loss: 0.42   steps: 277


training loop:   0% |                                 | ETA:  37 days, 17:10:17

Episode: 278   score: 8.64   Avg score (100e): 7.42   actor gain: -0.55   critic loss: 0.42   steps: 278


training loop:   0% |                                 | ETA:  37 days, 16:49:00

Episode: 279   score: 8.66   Avg score (100e): 7.44   actor gain: -0.55   critic loss: 0.42   steps: 279


training loop:   0% |                                 | ETA:  37 days, 16:26:35

Episode: 280   score: 8.68   Avg score (100e): 7.46   actor gain: -0.55   critic loss: 0.42   steps: 280


training loop:   0% |                                 | ETA:  37 days, 16:05:31

Episode: 281   score: 8.69   Avg score (100e): 7.49   actor gain: -0.55   critic loss: 0.42   steps: 281


training loop:   0% |                                 | ETA:  37 days, 15:40:21

Episode: 282   score: 8.72   Avg score (100e): 7.51   actor gain: -0.55   critic loss: 0.42   steps: 282


training loop:   0% |                                 | ETA:  37 days, 15:21:03

Episode: 283   score: 8.74   Avg score (100e): 7.54   actor gain: -0.55   critic loss: 0.42   steps: 283


training loop:   0% |                                 | ETA:  37 days, 14:58:37

Episode: 284   score: 8.75   Avg score (100e): 7.56   actor gain: -0.46   critic loss: 0.42   steps: 284


training loop:   0% |                                 | ETA:  37 days, 14:41:55

Episode: 285   score: 8.77   Avg score (100e): 7.59   actor gain: -0.46   critic loss: 0.42   steps: 285


training loop:   0% |                                 | ETA:  37 days, 14:37:37

Episode: 286   score: 8.79   Avg score (100e): 7.61   actor gain: -0.47   critic loss: 0.42   steps: 286


training loop:   0% |                                 | ETA:  37 days, 14:31:10

Episode: 287   score: 8.81   Avg score (100e): 7.63   actor gain: -0.47   critic loss: 0.42   steps: 287


training loop:   0% |                                 | ETA:  37 days, 14:15:21

Episode: 288   score: 8.82   Avg score (100e): 7.66   actor gain: -0.46   critic loss: 0.42   steps: 288


training loop:   0% |                                 | ETA:  37 days, 13:57:33

Episode: 289   score: 8.87   Avg score (100e): 7.68   actor gain: -0.46   critic loss: 0.42   steps: 289


training loop:   0% |                                 | ETA:  37 days, 13:37:15

Episode: 290   score: 8.89   Avg score (100e): 7.71   actor gain: -0.47   critic loss: 0.42   steps: 290


training loop:   0% |                                 | ETA:  37 days, 13:17:57

Episode: 291   score: 8.92   Avg score (100e): 7.73   actor gain: -0.47   critic loss: 0.42   steps: 291


training loop:   0% |                                 | ETA:  37 days, 13:06:53

Episode: 292   score: 8.94   Avg score (100e): 7.76   actor gain: -0.46   critic loss: 0.41   steps: 292


training loop:   0% |                                 | ETA:  37 days, 12:50:47

Episode: 293   score: 8.97   Avg score (100e): 7.78   actor gain: -0.46   critic loss: 0.41   steps: 293


training loop:   0% |                                 | ETA:  37 days, 12:34:42

Episode: 294   score: 9.00   Avg score (100e): 7.81   actor gain: -0.48   critic loss: 0.41   steps: 294


training loop:   0% |                                 | ETA:  37 days, 12:18:44

Episode: 295   score: 9.01   Avg score (100e): 7.83   actor gain: -0.48   critic loss: 0.41   steps: 295


training loop:   0% |                                 | ETA:  37 days, 12:01:13

Episode: 296   score: 9.04   Avg score (100e): 7.86   actor gain: -0.48   critic loss: 0.41   steps: 296


training loop:   0% |                                 | ETA:  37 days, 11:42:57

Episode: 297   score: 9.05   Avg score (100e): 7.88   actor gain: -0.48   critic loss: 0.41   steps: 297


training loop:   0% |                                 | ETA:  37 days, 11:19:02

Episode: 298   score: 9.07   Avg score (100e): 7.91   actor gain: -0.48   critic loss: 0.41   steps: 298


training loop:   0% |                                 | ETA:  37 days, 11:01:48

Episode: 299   score: 9.10   Avg score (100e): 7.93   actor gain: -0.48   critic loss: 0.41   steps: 299


training loop:   0% |                                 | ETA:  37 days, 10:37:46

Episode: 300   score: 9.12   Avg score (100e): 7.96   actor gain: -0.48   critic loss: 0.41   steps: 300


training loop:   0% |                                 | ETA:  37 days, 10:46:28

Episode: 301   score: 9.13   Avg score (100e): 7.98   actor gain: -0.46   critic loss: 0.41   steps: 301


training loop:   0% |                                 | ETA:  37 days, 10:38:12

Episode: 302   score: 9.15   Avg score (100e): 8.01   actor gain: -0.46   critic loss: 0.41   steps: 302


training loop:   0% |                                 | ETA:  37 days, 10:29:21

Episode: 303   score: 9.15   Avg score (100e): 8.03   actor gain: -0.46   critic loss: 0.41   steps: 303


training loop:   0% |                                 | ETA:  37 days, 10:14:06

Episode: 304   score: 9.16   Avg score (100e): 8.05   actor gain: -0.46   critic loss: 0.41   steps: 304


training loop:   0% |                                  | ETA:  37 days, 9:56:54

Episode: 305   score: 9.18   Avg score (100e): 8.08   actor gain: -0.46   critic loss: 0.40   steps: 305


training loop:   0% |                                  | ETA:  37 days, 9:47:40

Episode: 306   score: 9.22   Avg score (100e): 8.10   actor gain: -0.46   critic loss: 0.40   steps: 306


training loop:   0% |                                  | ETA:  37 days, 9:28:33

Episode: 307   score: 9.22   Avg score (100e): 8.13   actor gain: -0.47   critic loss: 0.40   steps: 307


training loop:   0% |                                  | ETA:  37 days, 9:09:34

Episode: 308   score: 9.26   Avg score (100e): 8.15   actor gain: -0.47   critic loss: 0.40   steps: 308


training loop:   0% |                                  | ETA:  37 days, 8:55:53

Episode: 309   score: 9.26   Avg score (100e): 8.17   actor gain: -0.47   critic loss: 0.40   steps: 309


training loop:   0% |                                  | ETA:  37 days, 8:36:06

Episode: 310   score: 9.28   Avg score (100e): 8.20   actor gain: -0.47   critic loss: 0.40   steps: 310


training loop:   0% |                                  | ETA:  37 days, 8:20:30

Episode: 311   score: 9.32   Avg score (100e): 8.22   actor gain: -0.47   critic loss: 0.40   steps: 311


training loop:   0% |                                  | ETA:  37 days, 8:00:01

Episode: 312   score: 9.34   Avg score (100e): 8.24   actor gain: -0.47   critic loss: 0.40   steps: 312


training loop:   0% |                                  | ETA:  37 days, 7:42:00

Episode: 313   score: 9.38   Avg score (100e): 8.27   actor gain: -0.47   critic loss: 0.40   steps: 313


training loop:   0% |                                  | ETA:  37 days, 7:22:33

Episode: 314   score: 9.41   Avg score (100e): 8.29   actor gain: -0.47   critic loss: 0.40   steps: 314


training loop:   0% |                                  | ETA:  37 days, 7:11:21

Episode: 315   score: 9.44   Avg score (100e): 8.32   actor gain: -0.47   critic loss: 0.40   steps: 315


training loop:   0% |                                  | ETA:  37 days, 7:17:21

Episode: 316   score: 9.45   Avg score (100e): 8.34   actor gain: -0.47   critic loss: 0.40   steps: 316


training loop:   0% |                                  | ETA:  37 days, 7:10:27

Episode: 317   score: 9.45   Avg score (100e): 8.36   actor gain: -0.47   critic loss: 0.40   steps: 317


training loop:   0% |                                  | ETA:  37 days, 6:59:17

Episode: 318   score: 9.48   Avg score (100e): 8.39   actor gain: -0.47   critic loss: 0.40   steps: 318


training loop:   0% |                                  | ETA:  37 days, 7:18:26

Episode: 319   score: 9.48   Avg score (100e): 8.41   actor gain: -0.45   critic loss: 0.40   steps: 319


training loop:   0% |                                  | ETA:  37 days, 7:31:02

Episode: 320   score: 9.51   Avg score (100e): 8.43   actor gain: -0.45   critic loss: 0.40   steps: 320


training loop:   0% |                                  | ETA:  37 days, 7:51:13

Episode: 321   score: 9.53   Avg score (100e): 8.45   actor gain: -0.45   critic loss: 0.40   steps: 321


training loop:   0% |                                  | ETA:  37 days, 7:53:42

Episode: 322   score: 9.55   Avg score (100e): 8.48   actor gain: -0.45   critic loss: 0.40   steps: 322


training loop:   0% |                                  | ETA:  37 days, 7:52:37

Episode: 323   score: 9.56   Avg score (100e): 8.50   actor gain: -0.45   critic loss: 0.40   steps: 323


training loop:   0% |                                  | ETA:  37 days, 8:26:38

Episode: 324   score: 9.59   Avg score (100e): 8.52   actor gain: -0.45   critic loss: 0.40   steps: 324


training loop:   0% |                                  | ETA:  37 days, 8:27:00

Episode: 325   score: 9.61   Avg score (100e): 8.55   actor gain: -0.45   critic loss: 0.40   steps: 325


training loop:   0% |                                  | ETA:  37 days, 8:35:29

Episode: 326   score: 9.64   Avg score (100e): 8.57   actor gain: -0.45   critic loss: 0.40   steps: 326


training loop:   0% |                                  | ETA:  37 days, 8:29:11

Episode: 327   score: 9.64   Avg score (100e): 8.59   actor gain: -0.45   critic loss: 0.40   steps: 327


training loop:   0% |                                  | ETA:  37 days, 8:13:00

Episode: 328   score: 9.66   Avg score (100e): 8.61   actor gain: -0.45   critic loss: 0.40   steps: 328


training loop:   0% |                                  | ETA:  37 days, 8:00:20

Episode: 329   score: 9.70   Avg score (100e): 8.64   actor gain: -0.45   critic loss: 0.40   steps: 329


training loop:   0% |                                  | ETA:  37 days, 7:45:43

Episode: 330   score: 9.70   Avg score (100e): 8.66   actor gain: -0.45   critic loss: 0.40   steps: 330


training loop:   0% |                                  | ETA:  37 days, 7:25:26

Episode: 331   score: 9.72   Avg score (100e): 8.68   actor gain: -0.45   critic loss: 0.40   steps: 331


training loop:   0% |                                  | ETA:  37 days, 7:22:35

Episode: 332   score: 9.72   Avg score (100e): 8.70   actor gain: -0.45   critic loss: 0.41   steps: 332


training loop:   0% |                                  | ETA:  37 days, 7:40:22

Episode: 333   score: 9.75   Avg score (100e): 8.72   actor gain: -0.45   critic loss: 0.41   steps: 333


training loop:   0% |                                  | ETA:  37 days, 7:41:53

Episode: 334   score: 9.77   Avg score (100e): 8.75   actor gain: -0.45   critic loss: 0.41   steps: 334


training loop:   0% |                                  | ETA:  37 days, 7:46:12

Episode: 335   score: 9.80   Avg score (100e): 8.77   actor gain: -0.45   critic loss: 0.41   steps: 335


training loop:   0% |                                  | ETA:  37 days, 7:44:21

Episode: 336   score: 9.82   Avg score (100e): 8.79   actor gain: -0.45   critic loss: 0.41   steps: 336


training loop:   0% |                                  | ETA:  37 days, 7:36:58

Episode: 337   score: 9.86   Avg score (100e): 8.81   actor gain: -0.45   critic loss: 0.41   steps: 337


training loop:   0% |                                  | ETA:  37 days, 7:38:24

Episode: 338   score: 9.87   Avg score (100e): 8.83   actor gain: -0.44   critic loss: 0.41   steps: 338


training loop:   0% |                                  | ETA:  37 days, 7:32:24

Episode: 339   score: 9.89   Avg score (100e): 8.86   actor gain: -0.44   critic loss: 0.41   steps: 339


training loop:   0% |                                  | ETA:  37 days, 7:42:30

Episode: 340   score: 9.92   Avg score (100e): 8.88   actor gain: -0.44   critic loss: 0.41   steps: 340


training loop:   0% |                                  | ETA:  37 days, 7:40:51

Episode: 341   score: 9.95   Avg score (100e): 8.90   actor gain: -0.45   critic loss: 0.41   steps: 341


training loop:   0% |                                  | ETA:  37 days, 7:45:34

Episode: 342   score: 9.98   Avg score (100e): 8.92   actor gain: -0.45   critic loss: 0.41   steps: 342


training loop:   0% |                                  | ETA:  37 days, 7:33:25

Episode: 343   score: 10.00   Avg score (100e): 8.94   actor gain: -0.45   critic loss: 0.41   steps: 343


training loop:   0% |                                  | ETA:  37 days, 7:24:04

Episode: 344   score: 10.03   Avg score (100e): 8.96   actor gain: -0.45   critic loss: 0.41   steps: 344


training loop:   0% |                                  | ETA:  37 days, 7:07:47

Episode: 345   score: 10.04   Avg score (100e): 8.99   actor gain: -0.45   critic loss: 0.41   steps: 345


training loop:   0% |                                  | ETA:  37 days, 6:55:19

Episode: 346   score: 10.06   Avg score (100e): 9.01   actor gain: -0.45   critic loss: 0.41   steps: 346


training loop:   0% |                                  | ETA:  37 days, 6:36:10

Episode: 347   score: 10.06   Avg score (100e): 9.03   actor gain: -0.45   critic loss: 0.41   steps: 347


training loop:   0% |                                  | ETA:  37 days, 6:39:13

Episode: 348   score: 10.07   Avg score (100e): 9.05   actor gain: -0.45   critic loss: 0.41   steps: 348


training loop:   0% |                                  | ETA:  37 days, 6:45:13

Episode: 349   score: 10.09   Avg score (100e): 9.07   actor gain: -0.45   critic loss: 0.41   steps: 349


training loop:   0% |                                  | ETA:  37 days, 6:55:42

Episode: 350   score: 10.11   Avg score (100e): 9.09   actor gain: -0.45   critic loss: 0.40   steps: 350


training loop:   0% |                                  | ETA:  37 days, 6:46:55

Episode: 351   score: 10.13   Avg score (100e): 9.11   actor gain: -0.45   critic loss: 0.40   steps: 351


training loop:   0% |                                  | ETA:  37 days, 6:33:24

Episode: 352   score: 10.14   Avg score (100e): 9.13   actor gain: -0.45   critic loss: 0.40   steps: 352


training loop:   0% |                                  | ETA:  37 days, 6:22:34

Episode: 353   score: 10.16   Avg score (100e): 9.16   actor gain: -0.45   critic loss: 0.40   steps: 353


training loop:   0% |                                  | ETA:  37 days, 6:06:23

Episode: 354   score: 10.20   Avg score (100e): 9.18   actor gain: -0.45   critic loss: 0.40   steps: 354


training loop:   0% |                                  | ETA:  37 days, 5:48:27

Episode: 355   score: 10.22   Avg score (100e): 9.20   actor gain: -0.45   critic loss: 0.40   steps: 355


training loop:   0% |                                  | ETA:  37 days, 5:42:07

Episode: 356   score: 10.26   Avg score (100e): 9.22   actor gain: -0.45   critic loss: 0.40   steps: 356


training loop:   0% |                                  | ETA:  37 days, 5:51:20

Episode: 357   score: 10.27   Avg score (100e): 9.24   actor gain: -0.45   critic loss: 0.40   steps: 357


training loop:   0% |                                  | ETA:  37 days, 5:49:30

Episode: 358   score: 10.29   Avg score (100e): 9.26   actor gain: -0.45   critic loss: 0.40   steps: 358


training loop:   0% |                                  | ETA:  37 days, 5:41:10

Episode: 359   score: 10.32   Avg score (100e): 9.28   actor gain: -0.45   critic loss: 0.41   steps: 359


training loop:   0% |                                  | ETA:  37 days, 5:40:04

Episode: 360   score: 10.32   Avg score (100e): 9.30   actor gain: -0.45   critic loss: 0.41   steps: 360


training loop:   0% |                                  | ETA:  37 days, 5:24:26

Episode: 361   score: 10.34   Avg score (100e): 9.33   actor gain: -0.46   critic loss: 0.41   steps: 361


training loop:   0% |                                  | ETA:  37 days, 5:14:55

Episode: 362   score: 10.37   Avg score (100e): 9.35   actor gain: -0.46   critic loss: 0.41   steps: 362


training loop:   0% |                                  | ETA:  37 days, 5:10:28

Episode: 363   score: 10.39   Avg score (100e): 9.37   actor gain: -0.46   critic loss: 0.41   steps: 363


training loop:   0% |                                  | ETA:  37 days, 5:26:40

Episode: 364   score: 10.42   Avg score (100e): 9.39   actor gain: -0.46   critic loss: 0.41   steps: 364


training loop:   0% |                                  | ETA:  37 days, 5:47:39

Episode: 365   score: 10.43   Avg score (100e): 9.41   actor gain: -0.47   critic loss: 0.41   steps: 365


training loop:   0% |                                  | ETA:  37 days, 5:38:18

Episode: 366   score: 10.46   Avg score (100e): 9.43   actor gain: -0.46   critic loss: 0.41   steps: 366


training loop:   0% |                                  | ETA:  37 days, 5:29:22

Episode: 367   score: 10.49   Avg score (100e): 9.45   actor gain: -0.46   critic loss: 0.41   steps: 367


training loop:   0% |                                  | ETA:  37 days, 5:17:37

Episode: 368   score: 10.51   Avg score (100e): 9.47   actor gain: -0.47   critic loss: 0.41   steps: 368


training loop:   0% |                                  | ETA:  37 days, 4:56:58

Episode: 369   score: 10.51   Avg score (100e): 9.49   actor gain: -0.47   critic loss: 0.41   steps: 369


training loop:   0% |                                  | ETA:  37 days, 4:59:48

Episode: 370   score: 10.52   Avg score (100e): 9.51   actor gain: -0.47   critic loss: 0.41   steps: 370


training loop:   0% |                                  | ETA:  37 days, 4:55:41

Episode: 371   score: 10.52   Avg score (100e): 9.54   actor gain: -0.47   critic loss: 0.41   steps: 371


training loop:   0% |                                  | ETA:  37 days, 4:50:55

Episode: 372   score: 10.53   Avg score (100e): 9.56   actor gain: -0.47   critic loss: 0.41   steps: 372


training loop:   0% |                                  | ETA:  37 days, 4:46:26

Episode: 373   score: 10.54   Avg score (100e): 9.58   actor gain: -0.47   critic loss: 0.41   steps: 373


training loop:   0% |                                  | ETA:  37 days, 4:29:10

Episode: 374   score: 10.54   Avg score (100e): 9.60   actor gain: -0.47   critic loss: 0.41   steps: 374


training loop:   0% |                                  | ETA:  37 days, 4:17:20

Episode: 375   score: 10.55   Avg score (100e): 9.62   actor gain: -0.47   critic loss: 0.41   steps: 375


training loop:   0% |                                  | ETA:  37 days, 4:04:29

Episode: 376   score: 10.57   Avg score (100e): 9.64   actor gain: -0.47   critic loss: 0.41   steps: 376


training loop:   0% |                                  | ETA:  37 days, 3:52:15

Episode: 377   score: 10.57   Avg score (100e): 9.66   actor gain: -0.47   critic loss: 0.41   steps: 377


training loop:   0% |                                  | ETA:  37 days, 3:47:14

Episode: 378   score: 10.59   Avg score (100e): 9.68   actor gain: -0.47   critic loss: 0.41   steps: 378


training loop:   0% |                                  | ETA:  37 days, 3:42:49

Episode: 379   score: 10.58   Avg score (100e): 9.69   actor gain: -0.47   critic loss: 0.41   steps: 379


training loop:   0% |                                  | ETA:  37 days, 3:28:57

Episode: 380   score: 10.59   Avg score (100e): 9.71   actor gain: -0.47   critic loss: 0.41   steps: 380


training loop:   0% |                                  | ETA:  37 days, 3:17:13

Episode: 381   score: 10.59   Avg score (100e): 9.73   actor gain: -0.47   critic loss: 0.40   steps: 381


training loop:   0% |                                  | ETA:  37 days, 3:06:38

Episode: 382   score: 10.61   Avg score (100e): 9.75   actor gain: -0.48   critic loss: 0.40   steps: 382


training loop:   0% |                                  | ETA:  37 days, 2:50:55

Episode: 383   score: 10.59   Avg score (100e): 9.77   actor gain: -0.48   critic loss: 0.40   steps: 383


training loop:   0% |                                  | ETA:  37 days, 2:40:26

Episode: 384   score: 10.59   Avg score (100e): 9.79   actor gain: -0.48   critic loss: 0.40   steps: 384


training loop:   0% |                                  | ETA:  37 days, 2:31:02

Episode: 385   score: 10.61   Avg score (100e): 9.81   actor gain: -0.48   critic loss: 0.40   steps: 385


training loop:   0% |                                  | ETA:  37 days, 2:19:25

Episode: 386   score: 10.62   Avg score (100e): 9.83   actor gain: -0.47   critic loss: 0.40   steps: 386


training loop:   0% |                                  | ETA:  37 days, 2:06:41

Episode: 387   score: 10.62   Avg score (100e): 9.84   actor gain: -0.46   critic loss: 0.40   steps: 387


training loop:   0% |                                  | ETA:  37 days, 1:52:41

Episode: 388   score: 10.64   Avg score (100e): 9.86   actor gain: -0.46   critic loss: 0.40   steps: 388


training loop:   0% |                                  | ETA:  37 days, 1:56:11

Episode: 389   score: 10.67   Avg score (100e): 9.88   actor gain: -0.46   critic loss: 0.40   steps: 389


training loop:   0% |                                  | ETA:  37 days, 1:44:23

Episode: 390   score: 10.69   Avg score (100e): 9.90   actor gain: -0.45   critic loss: 0.40   steps: 390


training loop:   0% |                                  | ETA:  37 days, 1:28:17

Episode: 391   score: 10.70   Avg score (100e): 9.92   actor gain: -0.45   critic loss: 0.40   steps: 391


training loop:   0% |                                  | ETA:  37 days, 1:18:25

Episode: 392   score: 10.71   Avg score (100e): 9.93   actor gain: -0.45   critic loss: 0.40   steps: 392


training loop:   0% |                                  | ETA:  37 days, 1:08:42

Episode: 393   score: 10.73   Avg score (100e): 9.95   actor gain: -0.44   critic loss: 0.39   steps: 393


training loop:   0% |                                  | ETA:  37 days, 0:52:20

Episode: 394   score: 10.74   Avg score (100e): 9.97   actor gain: -0.45   critic loss: 0.39   steps: 394


training loop:   0% |                                  | ETA:  37 days, 0:41:47

Episode: 395   score: 10.75   Avg score (100e): 9.99   actor gain: -0.44   critic loss: 0.39   steps: 395


training loop:   0% |                                  | ETA:  37 days, 0:25:59

Episode: 396   score: 10.75   Avg score (100e): 10.00   actor gain: -0.44   critic loss: 0.39   steps: 396


training loop:   0% |                                  | ETA:  37 days, 0:14:42

Episode: 397   score: 10.75   Avg score (100e): 10.02   actor gain: -0.44   critic loss: 0.39   steps: 397


training loop:   0% |                                  | ETA:  37 days, 0:26:31

Episode: 398   score: 10.77   Avg score (100e): 10.04   actor gain: -0.44   critic loss: 0.39   steps: 398


training loop:   0% |                                  | ETA:  37 days, 0:16:06

Episode: 399   score: 10.77   Avg score (100e): 10.05   actor gain: -0.44   critic loss: 0.39   steps: 399


training loop:   0% |                                  | ETA:  37 days, 0:13:21

Episode: 400   score: 10.80   Avg score (100e): 10.07   actor gain: -0.45   critic loss: 0.39   steps: 400


training loop:   0% |                                  | ETA:  37 days, 0:08:40

Episode: 401   score: 10.80   Avg score (100e): 10.09   actor gain: -0.45   critic loss: 0.39   steps: 401


training loop:   0% |                                 | ETA:  36 days, 23:56:55

Episode: 402   score: 10.81   Avg score (100e): 10.10   actor gain: -0.45   critic loss: 0.39   steps: 402


training loop:   0% |                                 | ETA:  36 days, 23:42:34

Episode: 403   score: 10.82   Avg score (100e): 10.12   actor gain: -0.44   critic loss: 0.39   steps: 403


training loop:   0% |                                 | ETA:  36 days, 23:29:54

Episode: 404   score: 10.83   Avg score (100e): 10.14   actor gain: -0.44   critic loss: 0.39   steps: 404


training loop:   0% |                                 | ETA:  36 days, 23:18:51

Episode: 405   score: 10.83   Avg score (100e): 10.15   actor gain: -0.44   critic loss: 0.39   steps: 405


training loop:   0% |                                 | ETA:  36 days, 23:04:19

Episode: 406   score: 10.84   Avg score (100e): 10.17   actor gain: -0.44   critic loss: 0.39   steps: 406


training loop:   0% |                                 | ETA:  36 days, 22:52:22

Episode: 407   score: 10.84   Avg score (100e): 10.19   actor gain: -0.44   critic loss: 0.39   steps: 407


training loop:   0% |                                 | ETA:  36 days, 22:38:46

Episode: 408   score: 10.84   Avg score (100e): 10.20   actor gain: -0.44   critic loss: 0.39   steps: 408


training loop:   0% |                                 | ETA:  36 days, 22:25:15

Episode: 409   score: 10.87   Avg score (100e): 10.22   actor gain: -0.44   critic loss: 0.39   steps: 409


training loop:   0% |                                 | ETA:  36 days, 22:08:30

Episode: 410   score: 10.87   Avg score (100e): 10.23   actor gain: -0.44   critic loss: 0.39   steps: 410


training loop:   0% |                                 | ETA:  36 days, 21:54:03

Episode: 411   score: 10.89   Avg score (100e): 10.25   actor gain: -0.44   critic loss: 0.39   steps: 411


training loop:   0% |                                 | ETA:  36 days, 21:42:46

Episode: 412   score: 10.89   Avg score (100e): 10.26   actor gain: -0.44   critic loss: 0.39   steps: 412


training loop:   0% |                                 | ETA:  36 days, 21:29:39

Episode: 413   score: 10.90   Avg score (100e): 10.28   actor gain: -0.44   critic loss: 0.39   steps: 413


training loop:   0% |                                 | ETA:  36 days, 21:15:16

Episode: 414   score: 10.89   Avg score (100e): 10.29   actor gain: -0.44   critic loss: 0.40   steps: 414


training loop:   0% |                                 | ETA:  36 days, 21:00:34

Episode: 415   score: 10.90   Avg score (100e): 10.31   actor gain: -0.44   critic loss: 0.40   steps: 415


training loop:   0% |                                 | ETA:  36 days, 20:51:29

Episode: 416   score: 10.91   Avg score (100e): 10.32   actor gain: -0.44   critic loss: 0.40   steps: 416


training loop:   0% |                                 | ETA:  36 days, 20:44:22

Episode: 417   score: 10.90   Avg score (100e): 10.34   actor gain: -0.44   critic loss: 0.40   steps: 417


training loop:   0% |                                 | ETA:  36 days, 20:26:07

Episode: 418   score: 10.92   Avg score (100e): 10.35   actor gain: -0.44   critic loss: 0.40   steps: 418


training loop:   0% |                                 | ETA:  36 days, 20:11:38

Episode: 419   score: 10.90   Avg score (100e): 10.37   actor gain: -0.44   critic loss: 0.39   steps: 419


training loop:   0% |                                 | ETA:  36 days, 19:58:47

Episode: 420   score: 10.91   Avg score (100e): 10.38   actor gain: -0.44   critic loss: 0.39   steps: 420


training loop:   0% |                                 | ETA:  36 days, 19:47:28

Episode: 421   score: 10.92   Avg score (100e): 10.39   actor gain: -0.44   critic loss: 0.40   steps: 421


training loop:   0% |                                 | ETA:  36 days, 19:30:54

Episode: 422   score: 10.92   Avg score (100e): 10.41   actor gain: -0.44   critic loss: 0.39   steps: 422


training loop:   0% |                                 | ETA:  36 days, 19:14:21

Episode: 423   score: 10.92   Avg score (100e): 10.42   actor gain: -0.44   critic loss: 0.39   steps: 423


training loop:   0% |                                 | ETA:  36 days, 19:00:43

Episode: 424   score: 10.94   Avg score (100e): 10.44   actor gain: -0.44   critic loss: 0.39   steps: 424


training loop:   0% |                                 | ETA:  36 days, 18:49:12

Episode: 425   score: 10.95   Avg score (100e): 10.45   actor gain: -0.43   critic loss: 0.39   steps: 425


training loop:   0% |                                 | ETA:  36 days, 18:35:44

Episode: 426   score: 10.95   Avg score (100e): 10.46   actor gain: -0.43   critic loss: 0.39   steps: 426


training loop:   0% |                                 | ETA:  36 days, 18:24:32

Episode: 427   score: 10.95   Avg score (100e): 10.47   actor gain: -0.43   critic loss: 0.39   steps: 427


training loop:   0% |                                 | ETA:  36 days, 18:11:03

Episode: 428   score: 10.96   Avg score (100e): 10.49   actor gain: -0.46   critic loss: 0.39   steps: 428


training loop:   0% |                                 | ETA:  36 days, 18:05:50

Episode: 429   score: 10.97   Avg score (100e): 10.50   actor gain: -0.47   critic loss: 0.39   steps: 429


training loop:   0% |                                 | ETA:  36 days, 18:05:18

Episode: 430   score: 10.96   Avg score (100e): 10.51   actor gain: -0.46   critic loss: 0.39   steps: 430


training loop:   0% |                                 | ETA:  36 days, 17:55:33

Episode: 431   score: 10.96   Avg score (100e): 10.53   actor gain: -0.46   critic loss: 0.39   steps: 431


training loop:   0% |                                 | ETA:  36 days, 17:47:48

Episode: 432   score: 10.99   Avg score (100e): 10.54   actor gain: -0.46   critic loss: 0.39   steps: 432


training loop:   0% |                                 | ETA:  36 days, 17:37:56

Episode: 433   score: 10.98   Avg score (100e): 10.55   actor gain: -0.46   critic loss: 0.39   steps: 433


training loop:   0% |                                 | ETA:  36 days, 17:30:37

Episode: 434   score: 10.98   Avg score (100e): 10.56   actor gain: -0.46   critic loss: 0.39   steps: 434


training loop:   0% |                                 | ETA:  36 days, 17:19:06

Episode: 435   score: 10.98   Avg score (100e): 10.57   actor gain: -0.51   critic loss: 0.39   steps: 435


training loop:   0% |                                 | ETA:  36 days, 17:08:34

Episode: 436   score: 10.99   Avg score (100e): 10.59   actor gain: -0.51   critic loss: 0.39   steps: 436


training loop:   0% |                                 | ETA:  36 days, 16:56:04

Episode: 437   score: 10.98   Avg score (100e): 10.60   actor gain: -0.51   critic loss: 0.39   steps: 437


training loop:   0% |                                 | ETA:  36 days, 16:59:09

Episode: 438   score: 10.99   Avg score (100e): 10.61   actor gain: -0.51   critic loss: 0.39   steps: 438


training loop:   0% |                                 | ETA:  36 days, 16:47:40

Episode: 439   score: 10.98   Avg score (100e): 10.62   actor gain: -0.51   critic loss: 0.39   steps: 439


training loop:   0% |                                 | ETA:  36 days, 16:34:37

Episode: 440   score: 10.98   Avg score (100e): 10.63   actor gain: -0.51   critic loss: 0.39   steps: 440


training loop:   0% |                                 | ETA:  36 days, 16:25:23

Episode: 441   score: 10.98   Avg score (100e): 10.64   actor gain: -0.51   critic loss: 0.39   steps: 441


training loop:   0% |                                 | ETA:  36 days, 16:11:33

Episode: 442   score: 10.98   Avg score (100e): 10.65   actor gain: -0.51   critic loss: 0.39   steps: 442


training loop:   0% |                                 | ETA:  36 days, 15:57:41

Episode: 443   score: 10.98   Avg score (100e): 10.66   actor gain: -0.51   critic loss: 0.39   steps: 443


training loop:   0% |                                 | ETA:  36 days, 15:42:54

Episode: 444   score: 10.98   Avg score (100e): 10.67   actor gain: -0.51   critic loss: 0.39   steps: 444


training loop:   0% |                                 | ETA:  36 days, 15:27:52

Episode: 445   score: 10.98   Avg score (100e): 10.68   actor gain: -0.51   critic loss: 0.39   steps: 445


training loop:   0% |                                 | ETA:  36 days, 15:14:34

Episode: 446   score: 10.99   Avg score (100e): 10.69   actor gain: -0.51   critic loss: 0.39   steps: 446


training loop:   0% |                                 | ETA:  36 days, 14:59:43

Episode: 447   score: 10.99   Avg score (100e): 10.70   actor gain: -0.51   critic loss: 0.39   steps: 447


training loop:   0% |                                 | ETA:  36 days, 14:45:46

Episode: 448   score: 11.00   Avg score (100e): 10.71   actor gain: -0.51   critic loss: 0.39   steps: 448


training loop:   0% |                                 | ETA:  36 days, 14:36:03

Episode: 449   score: 11.00   Avg score (100e): 10.72   actor gain: -0.51   critic loss: 0.39   steps: 449


training loop:   0% |                                 | ETA:  36 days, 14:21:05

Episode: 450   score: 11.00   Avg score (100e): 10.72   actor gain: -0.51   critic loss: 0.39   steps: 450


training loop:   0% |                                 | ETA:  36 days, 14:04:36

Episode: 451   score: 11.00   Avg score (100e): 10.73   actor gain: -0.51   critic loss: 0.39   steps: 451


training loop:   0% |                                 | ETA:  36 days, 13:47:39

Episode: 452   score: 11.00   Avg score (100e): 10.74   actor gain: -0.51   critic loss: 0.39   steps: 452


training loop:   0% |                                 | ETA:  36 days, 13:33:34

Episode: 453   score: 10.99   Avg score (100e): 10.75   actor gain: -0.48   critic loss: 0.39   steps: 453


training loop:   0% |                                 | ETA:  36 days, 13:19:26

Episode: 454   score: 11.00   Avg score (100e): 10.76   actor gain: -0.48   critic loss: 0.39   steps: 454


training loop:   0% |                                 | ETA:  36 days, 13:01:26

Episode: 455   score: 11.01   Avg score (100e): 10.77   actor gain: -0.48   critic loss: 0.39   steps: 455


training loop:   0% |                                 | ETA:  36 days, 12:47:47

Episode: 456   score: 11.00   Avg score (100e): 10.77   actor gain: -0.48   critic loss: 0.39   steps: 456


training loop:   0% |                                 | ETA:  36 days, 12:34:54

Episode: 457   score: 11.00   Avg score (100e): 10.78   actor gain: -0.48   critic loss: 0.39   steps: 457


training loop:   0% |                                 | ETA:  36 days, 12:20:04

Episode: 458   score: 11.00   Avg score (100e): 10.79   actor gain: -0.48   critic loss: 0.39   steps: 458


training loop:   0% |                                 | ETA:  36 days, 12:07:28

Episode: 459   score: 11.00   Avg score (100e): 10.80   actor gain: -0.48   critic loss: 0.39   steps: 459


training loop:   0% |                                 | ETA:  36 days, 11:59:54

Episode: 460   score: 11.00   Avg score (100e): 10.80   actor gain: -0.43   critic loss: 0.39   steps: 460


training loop:   0% |                                 | ETA:  36 days, 11:44:43

Episode: 461   score: 11.00   Avg score (100e): 10.81   actor gain: -0.43   critic loss: 0.39   steps: 461


training loop:   0% |                                 | ETA:  36 days, 11:51:25

Episode: 462   score: 11.00   Avg score (100e): 10.81   actor gain: -0.43   critic loss: 0.39   steps: 462


training loop:   0% |                                 | ETA:  36 days, 11:40:15

Episode: 463   score: 11.00   Avg score (100e): 10.82   actor gain: -0.43   critic loss: 0.39   steps: 463


training loop:   0% |                                 | ETA:  36 days, 11:32:32

Episode: 464   score: 11.00   Avg score (100e): 10.83   actor gain: -0.43   critic loss: 0.39   steps: 464


training loop:   0% |                                 | ETA:  36 days, 11:25:30

Episode: 465   score: 11.01   Avg score (100e): 10.83   actor gain: -0.43   critic loss: 0.39   steps: 465


training loop:   0% |                                 | ETA:  36 days, 11:17:26

Episode: 466   score: 11.01   Avg score (100e): 10.84   actor gain: -0.43   critic loss: 0.39   steps: 466


training loop:   0% |                                 | ETA:  36 days, 11:05:46

Episode: 467   score: 11.01   Avg score (100e): 10.84   actor gain: -0.43   critic loss: 0.39   steps: 467


training loop:   0% |                                 | ETA:  36 days, 10:54:57

Episode: 468   score: 11.02   Avg score (100e): 10.85   actor gain: -0.43   critic loss: 0.39   steps: 468


training loop:   0% |                                 | ETA:  36 days, 10:46:14

Episode: 469   score: 11.02   Avg score (100e): 10.85   actor gain: -0.43   critic loss: 0.39   steps: 469


training loop:   0% |                                 | ETA:  36 days, 10:37:12

Episode: 470   score: 11.02   Avg score (100e): 10.86   actor gain: -0.43   critic loss: 0.39   steps: 470


training loop:   0% |                                 | ETA:  36 days, 10:29:21

Episode: 471   score: 11.02   Avg score (100e): 10.86   actor gain: -0.43   critic loss: 0.39   steps: 471


training loop:   0% |                                 | ETA:  36 days, 10:18:09

Episode: 472   score: 11.01   Avg score (100e): 10.87   actor gain: -0.43   critic loss: 0.39   steps: 472


training loop:   0% |                                 | ETA:  36 days, 10:06:17

Episode: 473   score: 11.02   Avg score (100e): 10.87   actor gain: -0.43   critic loss: 0.39   steps: 473


training loop:   0% |                                  | ETA:  36 days, 9:56:13

Episode: 474   score: 11.02   Avg score (100e): 10.88   actor gain: -0.43   critic loss: 0.39   steps: 474


training loop:   0% |                                  | ETA:  36 days, 9:44:47

Episode: 475   score: 11.03   Avg score (100e): 10.88   actor gain: -0.43   critic loss: 0.39   steps: 475


training loop:   0% |                                  | ETA:  36 days, 9:32:07

Episode: 476   score: 11.04   Avg score (100e): 10.89   actor gain: -0.43   critic loss: 0.39   steps: 476


training loop:   0% |                                  | ETA:  36 days, 9:24:42

Episode: 477   score: 11.04   Avg score (100e): 10.89   actor gain: -0.43   critic loss: 0.39   steps: 477


training loop:   0% |                                  | ETA:  36 days, 9:12:43

Episode: 478   score: 11.05   Avg score (100e): 10.90   actor gain: -0.43   critic loss: 0.39   steps: 478


training loop:   0% |                                  | ETA:  36 days, 9:02:47

Episode: 479   score: 11.06   Avg score (100e): 10.90   actor gain: -0.43   critic loss: 0.39   steps: 479


training loop:   0% |                                  | ETA:  36 days, 8:51:27

Episode: 480   score: 11.05   Avg score (100e): 10.91   actor gain: -0.43   critic loss: 0.39   steps: 480


training loop:   0% |                                  | ETA:  36 days, 8:41:23

Episode: 481   score: 11.04   Avg score (100e): 10.91   actor gain: -0.43   critic loss: 0.39   steps: 481


training loop:   0% |                                  | ETA:  36 days, 8:31:59

Episode: 482   score: 11.06   Avg score (100e): 10.92   actor gain: -0.43   critic loss: 0.39   steps: 482


training loop:   0% |                                  | ETA:  36 days, 8:21:28

Episode: 483   score: 11.06   Avg score (100e): 10.92   actor gain: -0.43   critic loss: 0.39   steps: 483


training loop:   0% |                                  | ETA:  36 days, 8:11:24

Episode: 484   score: 11.06   Avg score (100e): 10.92   actor gain: -0.43   critic loss: 0.39   steps: 484


training loop:   0% |                                  | ETA:  36 days, 8:03:51

Episode: 485   score: 11.06   Avg score (100e): 10.93   actor gain: -0.43   critic loss: 0.39   steps: 485


training loop:   0% |                                  | ETA:  36 days, 7:51:43

Episode: 486   score: 11.07   Avg score (100e): 10.93   actor gain: -0.43   critic loss: 0.39   steps: 486


training loop:   0% |                                  | ETA:  36 days, 7:39:08

Episode: 487   score: 11.07   Avg score (100e): 10.94   actor gain: -0.43   critic loss: 0.39   steps: 487


training loop:   0% |                                  | ETA:  36 days, 7:30:05

Episode: 488   score: 11.07   Avg score (100e): 10.94   actor gain: -0.44   critic loss: 0.39   steps: 488


training loop:   0% |                                  | ETA:  36 days, 7:22:53

Episode: 489   score: 11.07   Avg score (100e): 10.95   actor gain: -0.44   critic loss: 0.39   steps: 489


training loop:   0% |                                  | ETA:  36 days, 7:13:25

Episode: 490   score: 11.07   Avg score (100e): 10.95   actor gain: -0.44   critic loss: 0.39   steps: 490


training loop:   0% |                                  | ETA:  36 days, 7:02:01

Episode: 491   score: 11.07   Avg score (100e): 10.95   actor gain: -0.44   critic loss: 0.39   steps: 491


training loop:   0% |                                  | ETA:  36 days, 6:52:00

Episode: 492   score: 11.07   Avg score (100e): 10.96   actor gain: -0.44   critic loss: 0.39   steps: 492


training loop:   0% |                                  | ETA:  36 days, 6:40:22

Episode: 493   score: 11.07   Avg score (100e): 10.96   actor gain: -0.44   critic loss: 0.39   steps: 493


training loop:   0% |                                  | ETA:  36 days, 6:32:34

Episode: 494   score: 11.08   Avg score (100e): 10.96   actor gain: -0.44   critic loss: 0.39   steps: 494


training loop:   0% |                                  | ETA:  36 days, 6:39:50

Episode: 495   score: 11.07   Avg score (100e): 10.97   actor gain: -0.44   critic loss: 0.39   steps: 495


training loop:   0% |                                  | ETA:  36 days, 6:37:00

Episode: 496   score: 11.08   Avg score (100e): 10.97   actor gain: -0.44   critic loss: 0.39   steps: 496


training loop:   0% |                                  | ETA:  36 days, 6:33:43

Episode: 497   score: 11.08   Avg score (100e): 10.97   actor gain: -0.44   critic loss: 0.39   steps: 497


training loop:   0% |                                  | ETA:  36 days, 6:25:41

Episode: 498   score: 11.09   Avg score (100e): 10.98   actor gain: -0.43   critic loss: 0.39   steps: 498


training loop:   0% |                                  | ETA:  36 days, 6:18:30

Episode: 499   score: 11.09   Avg score (100e): 10.98   actor gain: -0.43   critic loss: 0.39   steps: 499


training loop:   0% |                                  | ETA:  36 days, 6:11:21

Episode: 500   score: 11.10   Avg score (100e): 10.98   actor gain: -0.43   critic loss: 0.39   steps: 500


training loop:   1% |                                  | ETA:  36 days, 6:02:30

Episode: 501   score: 11.10   Avg score (100e): 10.99   actor gain: -0.44   critic loss: 0.39   steps: 501


training loop:   1% |                                  | ETA:  36 days, 5:52:31

Episode: 502   score: 11.10   Avg score (100e): 10.99   actor gain: -0.44   critic loss: 0.39   steps: 502


training loop:   1% |                                  | ETA:  36 days, 5:50:51

Episode: 503   score: 11.11   Avg score (100e): 10.99   actor gain: -0.44   critic loss: 0.39   steps: 503


training loop:   1% |                                  | ETA:  36 days, 5:41:25

Episode: 504   score: 11.10   Avg score (100e): 10.99   actor gain: -0.44   critic loss: 0.39   steps: 504


training loop:   1% |                                  | ETA:  36 days, 5:33:36

Episode: 505   score: 11.10   Avg score (100e): 11.00   actor gain: -0.44   critic loss: 0.39   steps: 505


training loop:   1% |                                  | ETA:  36 days, 5:20:40

Episode: 506   score: 11.11   Avg score (100e): 11.00   actor gain: -0.43   critic loss: 0.39   steps: 506


training loop:   1% |                                  | ETA:  36 days, 5:11:15

Episode: 507   score: 11.12   Avg score (100e): 11.00   actor gain: -0.43   critic loss: 0.39   steps: 507


training loop:   1% |                                  | ETA:  36 days, 5:03:31

Episode: 508   score: 11.12   Avg score (100e): 11.01   actor gain: -0.43   critic loss: 0.39   steps: 508


training loop:   1% |                                  | ETA:  36 days, 4:52:54

Episode: 509   score: 11.12   Avg score (100e): 11.01   actor gain: -0.43   critic loss: 0.39   steps: 509


training loop:   1% |                                  | ETA:  36 days, 4:41:25

Episode: 510   score: 11.12   Avg score (100e): 11.01   actor gain: -0.43   critic loss: 0.39   steps: 510


training loop:   1% |                                  | ETA:  36 days, 4:31:03

Episode: 511   score: 11.12   Avg score (100e): 11.01   actor gain: -0.43   critic loss: 0.39   steps: 511


training loop:   1% |                                  | ETA:  36 days, 4:21:33

Episode: 512   score: 11.12   Avg score (100e): 11.02   actor gain: -0.52   critic loss: 0.39   steps: 512


training loop:   1% |                                  | ETA:  36 days, 4:09:04

Episode: 513   score: 11.12   Avg score (100e): 11.02   actor gain: -0.51   critic loss: 0.39   steps: 513


training loop:   1% |                                  | ETA:  36 days, 3:58:27

Episode: 514   score: 11.13   Avg score (100e): 11.02   actor gain: -0.51   critic loss: 0.39   steps: 514


training loop:   1% |                                  | ETA:  36 days, 3:48:57

Episode: 515   score: 11.13   Avg score (100e): 11.02   actor gain: -0.51   critic loss: 0.39   steps: 515


training loop:   1% |                                  | ETA:  36 days, 3:36:35

Episode: 516   score: 11.16   Avg score (100e): 11.02   actor gain: -0.51   critic loss: 0.39   steps: 516


training loop:   1% |                                  | ETA:  36 days, 3:27:30

Episode: 517   score: 11.18   Avg score (100e): 11.03   actor gain: -0.51   critic loss: 0.39   steps: 517


training loop:   1% |                                  | ETA:  36 days, 3:15:30

Episode: 518   score: 11.17   Avg score (100e): 11.03   actor gain: -0.51   critic loss: 0.39   steps: 518


training loop:   1% |                                  | ETA:  36 days, 3:03:16

Episode: 519   score: 11.17   Avg score (100e): 11.03   actor gain: -0.51   critic loss: 0.40   steps: 519


training loop:   1% |                                  | ETA:  36 days, 2:57:32

Episode: 520   score: 11.19   Avg score (100e): 11.04   actor gain: -0.51   critic loss: 0.40   steps: 520


training loop:   1% |                                  | ETA:  36 days, 2:57:53

Episode: 521   score: 11.21   Avg score (100e): 11.04   actor gain: -0.51   critic loss: 0.40   steps: 521


training loop:   1% |                                  | ETA:  36 days, 2:54:31

Episode: 522   score: 11.21   Avg score (100e): 11.04   actor gain: -0.51   critic loss: 0.40   steps: 522


training loop:   1% |                                  | ETA:  36 days, 2:48:55

Episode: 523   score: 11.22   Avg score (100e): 11.04   actor gain: -0.51   critic loss: 0.40   steps: 523


training loop:   1% |                                  | ETA:  36 days, 2:37:50

Episode: 524   score: 11.22   Avg score (100e): 11.05   actor gain: -0.51   critic loss: 0.40   steps: 524


training loop:   1% |                                  | ETA:  36 days, 2:31:12

Episode: 525   score: 11.23   Avg score (100e): 11.05   actor gain: -0.51   critic loss: 0.40   steps: 525


training loop:   1% |                                  | ETA:  36 days, 2:22:18

Episode: 526   score: 11.24   Avg score (100e): 11.05   actor gain: -0.51   critic loss: 0.40   steps: 526


training loop:   1% |                                  | ETA:  36 days, 2:30:32

Episode: 527   score: 11.25   Avg score (100e): 11.06   actor gain: -0.51   critic loss: 0.40   steps: 527


training loop:   1% |                                  | ETA:  36 days, 2:25:40

Episode: 528   score: 11.26   Avg score (100e): 11.06   actor gain: -0.51   critic loss: 0.40   steps: 528


training loop:   1% |                                  | ETA:  36 days, 2:19:13

Episode: 529   score: 11.28   Avg score (100e): 11.06   actor gain: -0.51   critic loss: 0.40   steps: 529


training loop:   1% |                                  | ETA:  36 days, 2:13:22

Episode: 530   score: 11.29   Avg score (100e): 11.07   actor gain: -0.51   critic loss: 0.40   steps: 530


training loop:   1% |                                  | ETA:  36 days, 2:07:02

Episode: 531   score: 11.30   Avg score (100e): 11.07   actor gain: -0.52   critic loss: 0.40   steps: 531


training loop:   1% |                                  | ETA:  36 days, 1:59:49

Episode: 532   score: 11.29   Avg score (100e): 11.07   actor gain: -0.52   critic loss: 0.40   steps: 532


training loop:   1% |                                  | ETA:  36 days, 1:51:40

Episode: 533   score: 11.30   Avg score (100e): 11.07   actor gain: -0.52   critic loss: 0.40   steps: 533


training loop:   1% |                                  | ETA:  36 days, 1:45:49

Episode: 534   score: 11.30   Avg score (100e): 11.08   actor gain: -0.52   critic loss: 0.40   steps: 534


training loop:   1% |                                  | ETA:  36 days, 1:37:41

Episode: 535   score: 11.30   Avg score (100e): 11.08   actor gain: -0.52   critic loss: 0.40   steps: 535


training loop:   1% |                                  | ETA:  36 days, 1:29:14

Episode: 536   score: 11.32   Avg score (100e): 11.08   actor gain: -0.52   critic loss: 0.40   steps: 536


training loop:   1% |                                  | ETA:  36 days, 1:27:01

Episode: 537   score: 11.31   Avg score (100e): 11.09   actor gain: -0.43   critic loss: 0.40   steps: 537


training loop:   1% |                                  | ETA:  36 days, 1:20:55

Episode: 538   score: 11.30   Avg score (100e): 11.09   actor gain: -0.43   critic loss: 0.40   steps: 538


training loop:   1% |                                  | ETA:  36 days, 1:12:38

Episode: 539   score: 11.30   Avg score (100e): 11.09   actor gain: -0.43   critic loss: 0.40   steps: 539


training loop:   1% |                                  | ETA:  36 days, 1:07:50

Episode: 540   score: 11.31   Avg score (100e): 11.10   actor gain: -0.43   critic loss: 0.40   steps: 540


training loop:   1% |                                  | ETA:  36 days, 0:59:13

Episode: 541   score: 11.32   Avg score (100e): 11.10   actor gain: -0.43   critic loss: 0.40   steps: 541


training loop:   1% |                                  | ETA:  36 days, 0:51:50

Episode: 542   score: 11.34   Avg score (100e): 11.10   actor gain: -0.43   critic loss: 0.40   steps: 542


training loop:   1% |                                  | ETA:  36 days, 0:43:49

Episode: 543   score: 11.35   Avg score (100e): 11.11   actor gain: -0.43   critic loss: 0.40   steps: 543


training loop:   1% |                                  | ETA:  36 days, 0:35:43

Episode: 544   score: 11.36   Avg score (100e): 11.11   actor gain: -0.43   critic loss: 0.40   steps: 544


training loop:   1% |                                  | ETA:  36 days, 0:31:11

Episode: 545   score: 11.36   Avg score (100e): 11.12   actor gain: -0.43   critic loss: 0.40   steps: 545


training loop:   1% |                                  | ETA:  36 days, 0:25:50

Episode: 546   score: 11.36   Avg score (100e): 11.12   actor gain: -0.43   critic loss: 0.40   steps: 546


training loop:   1% |                                  | ETA:  36 days, 0:15:38

Episode: 547   score: 11.37   Avg score (100e): 11.12   actor gain: -0.43   critic loss: 0.40   steps: 547


training loop:   1% |                                  | ETA:  36 days, 0:04:41

Episode: 548   score: 11.37   Avg score (100e): 11.13   actor gain: -0.43   critic loss: 0.40   steps: 548


training loop:   1% |                                 | ETA:  35 days, 23:55:38

Episode: 549   score: 11.38   Avg score (100e): 11.13   actor gain: -0.43   critic loss: 0.40   steps: 549


training loop:   1% |                                 | ETA:  35 days, 23:55:13

Episode: 550   score: 11.39   Avg score (100e): 11.13   actor gain: -0.43   critic loss: 0.40   steps: 550


training loop:   1% |                                 | ETA:  35 days, 23:46:49

Episode: 551   score: 11.39   Avg score (100e): 11.14   actor gain: -0.43   critic loss: 0.40   steps: 551


training loop:   1% |                                 | ETA:  35 days, 23:39:17

Episode: 552   score: 11.40   Avg score (100e): 11.14   actor gain: -0.43   critic loss: 0.40   steps: 552


training loop:   1% |                                 | ETA:  35 days, 23:33:01

Episode: 553   score: 11.40   Avg score (100e): 11.15   actor gain: -0.43   critic loss: 0.40   steps: 553


training loop:   1% |                                 | ETA:  35 days, 23:25:36

Episode: 554   score: 11.42   Avg score (100e): 11.15   actor gain: -0.43   critic loss: 0.40   steps: 554


training loop:   1% |                                 | ETA:  35 days, 23:20:56

Episode: 555   score: 11.41   Avg score (100e): 11.16   actor gain: -0.43   critic loss: 0.40   steps: 555


training loop:   1% |                                 | ETA:  35 days, 23:12:43

Episode: 556   score: 11.41   Avg score (100e): 11.16   actor gain: -0.43   critic loss: 0.40   steps: 556


training loop:   1% |                                 | ETA:  35 days, 23:09:23

Episode: 557   score: 11.41   Avg score (100e): 11.16   actor gain: -0.42   critic loss: 0.40   steps: 557


training loop:   1% |                                 | ETA:  35 days, 23:00:51

Episode: 558   score: 11.42   Avg score (100e): 11.17   actor gain: -0.42   critic loss: 0.39   steps: 558


training loop:   1% |                                 | ETA:  35 days, 23:04:00

Episode: 559   score: 11.42   Avg score (100e): 11.17   actor gain: -0.42   critic loss: 0.39   steps: 559


training loop:   1% |                                 | ETA:  35 days, 22:59:47

Episode: 560   score: 11.42   Avg score (100e): 11.18   actor gain: -0.42   critic loss: 0.39   steps: 560


training loop:   1% |                                 | ETA:  35 days, 22:53:14

Episode: 561   score: 11.42   Avg score (100e): 11.18   actor gain: -0.42   critic loss: 0.39   steps: 561


training loop:   1% |                                 | ETA:  35 days, 22:45:59

Episode: 562   score: 11.42   Avg score (100e): 11.18   actor gain: -0.43   critic loss: 0.39   steps: 562


training loop:   1% |                                 | ETA:  35 days, 22:42:44

Episode: 563   score: 11.43   Avg score (100e): 11.19   actor gain: -0.42   critic loss: 0.39   steps: 563


training loop:   1% |                                 | ETA:  35 days, 22:34:31

Episode: 564   score: 11.46   Avg score (100e): 11.19   actor gain: -0.43   critic loss: 0.39   steps: 564


training loop:   1% |                                 | ETA:  35 days, 22:28:40

Episode: 565   score: 11.46   Avg score (100e): 11.20   actor gain: -0.43   critic loss: 0.39   steps: 565


training loop:   1% |                                 | ETA:  35 days, 22:20:50

Episode: 566   score: 11.45   Avg score (100e): 11.20   actor gain: -0.43   critic loss: 0.39   steps: 566


training loop:   1% |                                 | ETA:  35 days, 22:11:19

Episode: 567   score: 11.46   Avg score (100e): 11.21   actor gain: -0.44   critic loss: 0.39   steps: 567


training loop:   1% |                                 | ETA:  35 days, 22:02:40

Episode: 568   score: 11.47   Avg score (100e): 11.21   actor gain: -0.43   critic loss: 0.39   steps: 568


training loop:   1% |                                 | ETA:  35 days, 21:54:48

Episode: 569   score: 11.47   Avg score (100e): 11.22   actor gain: -0.44   critic loss: 0.39   steps: 569


training loop:   1% |                                 | ETA:  35 days, 21:48:44

Episode: 570   score: 11.47   Avg score (100e): 11.22   actor gain: -0.44   critic loss: 0.40   steps: 570


training loop:   1% |                                 | ETA:  35 days, 21:46:51

Episode: 571   score: 11.48   Avg score (100e): 11.22   actor gain: -0.44   critic loss: 0.40   steps: 571


training loop:   1% |                                 | ETA:  35 days, 21:40:02

Episode: 572   score: 11.48   Avg score (100e): 11.23   actor gain: -0.44   critic loss: 0.40   steps: 572


training loop:   1% |                                 | ETA:  35 days, 21:31:22

Episode: 573   score: 11.49   Avg score (100e): 11.23   actor gain: -0.44   critic loss: 0.40   steps: 573


training loop:   1% |                                 | ETA:  35 days, 21:24:00

Episode: 574   score: 11.48   Avg score (100e): 11.24   actor gain: -0.43   critic loss: 0.40   steps: 574


training loop:   1% |                                 | ETA:  35 days, 21:15:31

Episode: 575   score: 11.50   Avg score (100e): 11.24   actor gain: -0.43   critic loss: 0.40   steps: 575


training loop:   1% |                                 | ETA:  35 days, 21:10:00

Episode: 576   score: 11.50   Avg score (100e): 11.25   actor gain: -0.43   critic loss: 0.40   steps: 576


training loop:   1% |                                 | ETA:  35 days, 21:00:43

Episode: 577   score: 11.51   Avg score (100e): 11.25   actor gain: -0.43   critic loss: 0.40   steps: 577


training loop:   1% |                                 | ETA:  35 days, 20:54:35

Episode: 578   score: 11.52   Avg score (100e): 11.26   actor gain: -0.43   critic loss: 0.40   steps: 578


training loop:   1% |                                 | ETA:  35 days, 20:47:42

Episode: 579   score: 11.52   Avg score (100e): 11.26   actor gain: -0.43   critic loss: 0.40   steps: 579


training loop:   1% |                                 | ETA:  35 days, 20:40:55

Episode: 580   score: 11.54   Avg score (100e): 11.27   actor gain: -0.44   critic loss: 0.40   steps: 580


training loop:   1% |                                 | ETA:  35 days, 20:34:26

Episode: 581   score: 11.54   Avg score (100e): 11.27   actor gain: -0.44   critic loss: 0.40   steps: 581


training loop:   1% |                                 | ETA:  35 days, 20:33:54

Episode: 582   score: 11.53   Avg score (100e): 11.28   actor gain: -0.44   critic loss: 0.40   steps: 582


training loop:   1% |                                 | ETA:  35 days, 20:27:37

Episode: 583   score: 11.54   Avg score (100e): 11.28   actor gain: -0.44   critic loss: 0.40   steps: 583


training loop:   1% |                                 | ETA:  35 days, 20:19:15

Episode: 584   score: 11.54   Avg score (100e): 11.29   actor gain: -0.44   critic loss: 0.40   steps: 584


training loop:   1% |                                 | ETA:  35 days, 20:12:06

Episode: 585   score: 11.54   Avg score (100e): 11.29   actor gain: -0.44   critic loss: 0.40   steps: 585


training loop:   1% |                                 | ETA:  35 days, 20:06:57

Episode: 586   score: 11.55   Avg score (100e): 11.30   actor gain: -0.44   critic loss: 0.40   steps: 586


training loop:   1% |                                 | ETA:  35 days, 20:00:07

Episode: 587   score: 11.55   Avg score (100e): 11.30   actor gain: -0.44   critic loss: 0.40   steps: 587


training loop:   1% |                                 | ETA:  35 days, 19:54:39

Episode: 588   score: 11.57   Avg score (100e): 11.31   actor gain: -0.45   critic loss: 0.40   steps: 588


training loop:   1% |                                 | ETA:  35 days, 19:47:51

Episode: 589   score: 11.58   Avg score (100e): 11.31   actor gain: -0.44   critic loss: 0.40   steps: 589


training loop:   1% |                                 | ETA:  35 days, 19:38:49

Episode: 590   score: 11.57   Avg score (100e): 11.32   actor gain: -0.45   critic loss: 0.40   steps: 590


training loop:   1% |                                 | ETA:  35 days, 19:33:29

Episode: 591   score: 11.59   Avg score (100e): 11.32   actor gain: -0.45   critic loss: 0.40   steps: 591


training loop:   1% |                                 | ETA:  35 days, 19:40:30

Episode: 592   score: 11.60   Avg score (100e): 11.33   actor gain: -0.44   critic loss: 0.40   steps: 592


training loop:   1% |                                 | ETA:  35 days, 19:35:33

Episode: 593   score: 11.61   Avg score (100e): 11.33   actor gain: -0.45   critic loss: 0.40   steps: 593


training loop:   1% |                                 | ETA:  35 days, 19:33:17

Episode: 594   score: 11.61   Avg score (100e): 11.34   actor gain: -0.44   critic loss: 0.40   steps: 594


training loop:   1% |                                 | ETA:  35 days, 19:30:16

Episode: 595   score: 11.63   Avg score (100e): 11.34   actor gain: -0.45   critic loss: 0.40   steps: 595


training loop:   1% |                                 | ETA:  35 days, 19:23:05

Episode: 596   score: 11.63   Avg score (100e): 11.35   actor gain: -0.45   critic loss: 0.40   steps: 596


training loop:   1% |                                 | ETA:  35 days, 19:19:10

Episode: 597   score: 11.63   Avg score (100e): 11.35   actor gain: -0.45   critic loss: 0.40   steps: 597


training loop:   1% |                                 | ETA:  35 days, 19:10:29

Episode: 598   score: 11.64   Avg score (100e): 11.36   actor gain: -0.45   critic loss: 0.40   steps: 598


training loop:   1% |                                 | ETA:  35 days, 19:06:41

Episode: 599   score: 11.65   Avg score (100e): 11.36   actor gain: -0.46   critic loss: 0.40   steps: 599


training loop:   1% |                                 | ETA:  35 days, 19:02:34

Episode: 600   score: 11.65   Avg score (100e): 11.37   actor gain: -0.46   critic loss: 0.40   steps: 600


training loop:   1% |                                 | ETA:  35 days, 18:54:35

Episode: 601   score: 11.66   Avg score (100e): 11.38   actor gain: -0.46   critic loss: 0.40   steps: 601


training loop:   1% |                                 | ETA:  35 days, 18:45:29

Episode: 602   score: 11.66   Avg score (100e): 11.38   actor gain: -0.47   critic loss: 0.40   steps: 602


training loop:   1% |                                 | ETA:  35 days, 18:42:29

Episode: 603   score: 11.67   Avg score (100e): 11.39   actor gain: -0.47   critic loss: 0.40   steps: 603


training loop:   1% |                                 | ETA:  35 days, 18:36:06

Episode: 604   score: 11.69   Avg score (100e): 11.39   actor gain: -0.47   critic loss: 0.40   steps: 604


training loop:   1% |                                 | ETA:  35 days, 18:28:34

Episode: 605   score: 11.70   Avg score (100e): 11.40   actor gain: -0.46   critic loss: 0.40   steps: 605


training loop:   1% |                                 | ETA:  35 days, 18:23:49

Episode: 606   score: 11.72   Avg score (100e): 11.41   actor gain: -0.46   critic loss: 0.40   steps: 606


training loop:   1% |                                 | ETA:  35 days, 18:17:16

Episode: 607   score: 11.72   Avg score (100e): 11.41   actor gain: -0.46   critic loss: 0.40   steps: 607


training loop:   1% |                                 | ETA:  35 days, 18:08:06

Episode: 608   score: 11.73   Avg score (100e): 11.42   actor gain: -0.46   critic loss: 0.40   steps: 608


training loop:   1% |                                 | ETA:  35 days, 17:59:55

Episode: 609   score: 11.75   Avg score (100e): 11.42   actor gain: -0.47   critic loss: 0.40   steps: 609


training loop:   1% |                                 | ETA:  35 days, 17:52:06

Episode: 610   score: 11.75   Avg score (100e): 11.43   actor gain: -0.46   critic loss: 0.40   steps: 610


training loop:   1% |                                 | ETA:  35 days, 17:45:45

Episode: 611   score: 11.77   Avg score (100e): 11.44   actor gain: -0.46   critic loss: 0.40   steps: 611


training loop:   1% |                                 | ETA:  35 days, 17:43:07

Episode: 612   score: 11.78   Avg score (100e): 11.44   actor gain: -0.46   critic loss: 0.40   steps: 612


training loop:   1% |                                 | ETA:  35 days, 17:35:35

Episode: 613   score: 11.77   Avg score (100e): 11.45   actor gain: -0.48   critic loss: 0.40   steps: 613


training loop:   1% |                                 | ETA:  35 days, 17:28:47

Episode: 614   score: 11.77   Avg score (100e): 11.46   actor gain: -0.47   critic loss: 0.40   steps: 614


training loop:   1% |                                 | ETA:  35 days, 17:24:37

Episode: 615   score: 11.76   Avg score (100e): 11.46   actor gain: -0.47   critic loss: 0.40   steps: 615


training loop:   1% |                                 | ETA:  35 days, 17:20:23

Episode: 616   score: 11.77   Avg score (100e): 11.47   actor gain: -0.47   critic loss: 0.40   steps: 616


training loop:   1% |                                 | ETA:  35 days, 17:13:15

Episode: 617   score: 11.78   Avg score (100e): 11.47   actor gain: -0.47   critic loss: 0.40   steps: 617


training loop:   1% |                                 | ETA:  35 days, 17:09:57

Episode: 618   score: 11.78   Avg score (100e): 11.48   actor gain: -0.46   critic loss: 0.40   steps: 618


training loop:   1% |                                 | ETA:  35 days, 17:05:24

Episode: 619   score: 11.78   Avg score (100e): 11.49   actor gain: -0.46   critic loss: 0.40   steps: 619


training loop:   1% |                                 | ETA:  35 days, 17:01:50

Episode: 620   score: 11.79   Avg score (100e): 11.49   actor gain: -0.46   critic loss: 0.40   steps: 620


training loop:   1% |                                 | ETA:  35 days, 16:57:05

Episode: 621   score: 11.80   Avg score (100e): 11.50   actor gain: -0.46   critic loss: 0.40   steps: 621


training loop:   1% |                                 | ETA:  35 days, 16:49:36

Episode: 622   score: 11.82   Avg score (100e): 11.50   actor gain: -0.46   critic loss: 0.40   steps: 622


training loop:   1% |                                 | ETA:  35 days, 16:44:47

Episode: 623   score: 11.82   Avg score (100e): 11.51   actor gain: -0.46   critic loss: 0.40   steps: 623


training loop:   1% |                                 | ETA:  35 days, 16:48:49

Episode: 624   score: 11.83   Avg score (100e): 11.52   actor gain: -0.45   critic loss: 0.40   steps: 624


training loop:   1% |                                 | ETA:  35 days, 16:45:55

Episode: 625   score: 11.84   Avg score (100e): 11.52   actor gain: -0.45   critic loss: 0.40   steps: 625


training loop:   1% |                                 | ETA:  35 days, 16:45:26

Episode: 626   score: 11.85   Avg score (100e): 11.53   actor gain: -0.44   critic loss: 0.40   steps: 626


training loop:   1% |                                 | ETA:  35 days, 16:38:10

Episode: 627   score: 11.86   Avg score (100e): 11.54   actor gain: -0.44   critic loss: 0.40   steps: 627


training loop:   1% |                                 | ETA:  35 days, 16:34:15

Episode: 628   score: 11.87   Avg score (100e): 11.54   actor gain: -0.44   critic loss: 0.40   steps: 628


training loop:   1% |                                 | ETA:  35 days, 16:29:37

Episode: 629   score: 11.87   Avg score (100e): 11.55   actor gain: -0.44   critic loss: 0.40   steps: 629


training loop:   1% |                                 | ETA:  35 days, 16:23:31

Episode: 630   score: 11.88   Avg score (100e): 11.55   actor gain: -0.45   critic loss: 0.40   steps: 630


training loop:   1% |                                 | ETA:  35 days, 16:20:29

Episode: 631   score: 11.90   Avg score (100e): 11.56   actor gain: -0.45   critic loss: 0.40   steps: 631


training loop:   1% |                                 | ETA:  35 days, 16:16:58

Episode: 632   score: 11.89   Avg score (100e): 11.57   actor gain: -0.44   critic loss: 0.40   steps: 632


training loop:   1% |                                 | ETA:  35 days, 16:10:56

Episode: 633   score: 11.90   Avg score (100e): 11.57   actor gain: -0.44   critic loss: 0.40   steps: 633


training loop:   1% |                                 | ETA:  35 days, 16:04:10

Episode: 634   score: 11.91   Avg score (100e): 11.58   actor gain: -0.44   critic loss: 0.40   steps: 634


training loop:   1% |                                 | ETA:  35 days, 15:57:38

Episode: 635   score: 11.92   Avg score (100e): 11.58   actor gain: -0.44   critic loss: 0.40   steps: 635


training loop:   1% |                                 | ETA:  35 days, 15:53:14

Episode: 636   score: 11.93   Avg score (100e): 11.59   actor gain: -0.44   critic loss: 0.40   steps: 636


training loop:   1% |                                 | ETA:  35 days, 15:49:00

Episode: 637   score: 11.94   Avg score (100e): 11.60   actor gain: -0.44   critic loss: 0.40   steps: 637


training loop:   1% |                                 | ETA:  35 days, 15:42:26

Episode: 638   score: 11.96   Avg score (100e): 11.60   actor gain: -0.42   critic loss: 0.40   steps: 638


training loop:   1% |                                 | ETA:  35 days, 15:35:56

Episode: 639   score: 11.97   Avg score (100e): 11.61   actor gain: -0.42   critic loss: 0.40   steps: 639


training loop:   1% |                                 | ETA:  35 days, 15:32:45

Episode: 640   score: 11.97   Avg score (100e): 11.62   actor gain: -0.42   critic loss: 0.40   steps: 640


training loop:   1% |                                 | ETA:  35 days, 15:27:59

Episode: 641   score: 11.97   Avg score (100e): 11.62   actor gain: -0.42   critic loss: 0.40   steps: 641


training loop:   1% |                                 | ETA:  35 days, 15:26:44

Episode: 642   score: 11.96   Avg score (100e): 11.63   actor gain: -0.42   critic loss: 0.40   steps: 642


training loop:   1% |                                 | ETA:  35 days, 15:27:00

Episode: 643   score: 11.98   Avg score (100e): 11.63   actor gain: -0.42   critic loss: 0.40   steps: 643


training loop:   1% |                                 | ETA:  35 days, 15:20:17

Episode: 644   score: 11.99   Avg score (100e): 11.64   actor gain: -0.42   critic loss: 0.41   steps: 644


training loop:   1% |                                 | ETA:  35 days, 15:13:06

Episode: 645   score: 12.00   Avg score (100e): 11.65   actor gain: -0.42   critic loss: 0.41   steps: 645


training loop:   1% |                                 | ETA:  35 days, 15:08:55

Episode: 646   score: 11.99   Avg score (100e): 11.65   actor gain: -0.42   critic loss: 0.41   steps: 646


training loop:   1% |                                 | ETA:  35 days, 15:00:24

Episode: 647   score: 12.00   Avg score (100e): 11.66   actor gain: -0.42   critic loss: 0.41   steps: 647


training loop:   1% |                                 | ETA:  35 days, 14:53:55

Episode: 648   score: 12.00   Avg score (100e): 11.67   actor gain: -0.42   critic loss: 0.41   steps: 648


training loop:   1% |                                 | ETA:  35 days, 14:50:42

Episode: 649   score: 12.01   Avg score (100e): 11.67   actor gain: -0.42   critic loss: 0.41   steps: 649


training loop:   1% |                                 | ETA:  35 days, 14:43:35

Episode: 650   score: 12.01   Avg score (100e): 11.68   actor gain: -0.42   critic loss: 0.41   steps: 650


training loop:   1% |                                 | ETA:  35 days, 14:39:21

Episode: 651   score: 12.02   Avg score (100e): 11.68   actor gain: -0.42   critic loss: 0.41   steps: 651


training loop:   1% |                                 | ETA:  35 days, 14:37:18

Episode: 652   score: 12.03   Avg score (100e): 11.69   actor gain: -0.42   critic loss: 0.41   steps: 652


training loop:   1% |                                 | ETA:  35 days, 14:31:49

Episode: 653   score: 12.04   Avg score (100e): 11.70   actor gain: -0.42   critic loss: 0.41   steps: 653


training loop:   1% |                                 | ETA:  35 days, 14:29:36

Episode: 654   score: 12.05   Avg score (100e): 11.70   actor gain: -0.42   critic loss: 0.41   steps: 654


training loop:   1% |                                 | ETA:  35 days, 14:25:13

Episode: 655   score: 12.05   Avg score (100e): 11.71   actor gain: -0.42   critic loss: 0.40   steps: 655


training loop:   1% |                                 | ETA:  35 days, 14:31:36

Episode: 656   score: 12.06   Avg score (100e): 11.72   actor gain: -0.42   critic loss: 0.40   steps: 656


training loop:   1% |                                 | ETA:  35 days, 14:29:34

Episode: 657   score: 12.08   Avg score (100e): 11.72   actor gain: -0.62   critic loss: 0.40   steps: 657


training loop:   1% |                                 | ETA:  35 days, 14:24:17

Episode: 658   score: 12.10   Avg score (100e): 11.73   actor gain: -0.63   critic loss: 0.40   steps: 658


training loop:   1% |                                 | ETA:  35 days, 14:24:55

Episode: 659   score: 12.11   Avg score (100e): 11.74   actor gain: -0.63   critic loss: 0.40   steps: 659


training loop:   1% |                                 | ETA:  35 days, 14:22:14

Episode: 660   score: 12.10   Avg score (100e): 11.74   actor gain: -0.62   critic loss: 0.40   steps: 660


training loop:   1% |                                 | ETA:  35 days, 14:20:03

Episode: 661   score: 12.10   Avg score (100e): 11.75   actor gain: -0.63   critic loss: 0.40   steps: 661


training loop:   1% |                                 | ETA:  35 days, 14:16:02

Episode: 662   score: 12.09   Avg score (100e): 11.76   actor gain: -0.63   critic loss: 0.40   steps: 662


training loop:   1% |                                 | ETA:  35 days, 14:13:49

Episode: 663   score: 12.10   Avg score (100e): 11.76   actor gain: -0.62   critic loss: 0.40   steps: 663


training loop:   1% |                                 | ETA:  35 days, 14:07:47

Episode: 664   score: 12.10   Avg score (100e): 11.77   actor gain: -0.62   critic loss: 0.40   steps: 664


training loop:   1% |                                 | ETA:  35 days, 14:01:15

Episode: 665   score: 12.08   Avg score (100e): 11.78   actor gain: -0.62   critic loss: 0.40   steps: 665


training loop:   1% |                                 | ETA:  35 days, 13:55:13

Episode: 666   score: 12.09   Avg score (100e): 11.78   actor gain: -0.62   critic loss: 0.40   steps: 666


training loop:   1% |                                 | ETA:  35 days, 13:49:19

Episode: 667   score: 12.08   Avg score (100e): 11.79   actor gain: -0.62   critic loss: 0.40   steps: 667


training loop:   1% |                                 | ETA:  35 days, 13:42:19

Episode: 668   score: 12.08   Avg score (100e): 11.80   actor gain: -0.62   critic loss: 0.40   steps: 668


training loop:   1% |                                 | ETA:  35 days, 13:38:27

Episode: 669   score: 12.08   Avg score (100e): 11.80   actor gain: -0.62   critic loss: 0.40   steps: 669


training loop:   1% |                                 | ETA:  35 days, 13:31:48

Episode: 670   score: 12.09   Avg score (100e): 11.81   actor gain: -0.62   critic loss: 0.40   steps: 670


training loop:   1% |                                 | ETA:  35 days, 13:28:41

Episode: 671   score: 12.06   Avg score (100e): 11.81   actor gain: -0.62   critic loss: 0.40   steps: 671


training loop:   1% |                                 | ETA:  35 days, 13:21:23

Episode: 672   score: 12.06   Avg score (100e): 11.82   actor gain: -0.62   critic loss: 0.40   steps: 672


training loop:   1% |                                 | ETA:  35 days, 13:15:28

Episode: 673   score: 12.05   Avg score (100e): 11.82   actor gain: -0.62   critic loss: 0.40   steps: 673


training loop:   1% |                                 | ETA:  35 days, 13:11:07

Episode: 674   score: 12.05   Avg score (100e): 11.83   actor gain: -0.62   critic loss: 0.39   steps: 674


training loop:   1% |                                 | ETA:  35 days, 13:07:23

Episode: 675   score: 12.05   Avg score (100e): 11.84   actor gain: -0.62   critic loss: 0.39   steps: 675


training loop:   1% |                                 | ETA:  35 days, 13:03:20

Episode: 676   score: 12.04   Avg score (100e): 11.84   actor gain: -0.63   critic loss: 0.39   steps: 676


training loop:   1% |                                 | ETA:  35 days, 13:02:55

Episode: 677   score: 12.04   Avg score (100e): 11.85   actor gain: -0.62   critic loss: 0.39   steps: 677


training loop:   1% |                                 | ETA:  35 days, 12:58:45

Episode: 678   score: 12.02   Avg score (100e): 11.85   actor gain: -0.62   critic loss: 0.39   steps: 678


training loop:   1% |                                 | ETA:  35 days, 12:53:18

Episode: 679   score: 12.02   Avg score (100e): 11.86   actor gain: -0.62   critic loss: 0.39   steps: 679


training loop:   1% |                                 | ETA:  35 days, 12:48:27

Episode: 680   score: 12.00   Avg score (100e): 11.86   actor gain: -0.62   critic loss: 0.39   steps: 680


training loop:   1% |                                 | ETA:  35 days, 12:44:02

Episode: 681   score: 12.01   Avg score (100e): 11.87   actor gain: -0.62   critic loss: 0.39   steps: 681


training loop:   1% |                                 | ETA:  35 days, 12:38:18

Episode: 682   score: 12.00   Avg score (100e): 11.87   actor gain: -0.41   critic loss: 0.39   steps: 682


training loop:   1% |                                 | ETA:  35 days, 12:34:12

Episode: 683   score: 11.99   Avg score (100e): 11.88   actor gain: -0.41   critic loss: 0.38   steps: 683


training loop:   1% |                                 | ETA:  35 days, 12:29:00

Episode: 684   score: 11.99   Avg score (100e): 11.88   actor gain: -0.41   critic loss: 0.38   steps: 684


training loop:   1% |                                 | ETA:  35 days, 12:25:03

Episode: 685   score: 11.99   Avg score (100e): 11.88   actor gain: -0.41   critic loss: 0.38   steps: 685


training loop:   1% |                                 | ETA:  35 days, 12:21:14

Episode: 686   score: 11.98   Avg score (100e): 11.89   actor gain: -0.43   critic loss: 0.38   steps: 686


training loop:   1% |                                 | ETA:  35 days, 12:14:44

Episode: 687   score: 11.98   Avg score (100e): 11.89   actor gain: -0.43   critic loss: 0.38   steps: 687


training loop:   1% |                                 | ETA:  35 days, 12:07:53

Episode: 688   score: 11.99   Avg score (100e): 11.90   actor gain: -0.43   critic loss: 0.38   steps: 688
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  35 days, 12:12:53

Episode: 689   score: 11.98   Avg score (100e): 11.90   actor gain: -0.43   critic loss: 0.38   steps: 689


training loop:   1% |                                 | ETA:  35 days, 12:09:11

Episode: 690   score: 11.99   Avg score (100e): 11.91   actor gain: -0.43   critic loss: 0.38   steps: 690


training loop:   1% |                                 | ETA:  35 days, 12:11:36

Episode: 691   score: 11.99   Avg score (100e): 11.91   actor gain: -0.43   critic loss: 0.38   steps: 691


training loop:   1% |                                 | ETA:  35 days, 12:07:11

Episode: 692   score: 11.98   Avg score (100e): 11.91   actor gain: -0.44   critic loss: 0.38   steps: 692


training loop:   1% |                                 | ETA:  35 days, 12:06:41

Episode: 693   score: 11.97   Avg score (100e): 11.92   actor gain: -0.44   critic loss: 0.38   steps: 693


training loop:   1% |                                 | ETA:  35 days, 12:02:29

Episode: 694   score: 11.97   Avg score (100e): 11.92   actor gain: -0.44   critic loss: 0.38   steps: 694


training loop:   1% |                                 | ETA:  35 days, 11:57:39

Episode: 695   score: 11.96   Avg score (100e): 11.92   actor gain: -0.44   critic loss: 0.38   steps: 695


training loop:   1% |                                 | ETA:  35 days, 11:52:09

Episode: 696   score: 11.96   Avg score (100e): 11.93   actor gain: -0.44   critic loss: 0.38   steps: 696


training loop:   1% |                                 | ETA:  35 days, 11:46:45

Episode: 697   score: 11.97   Avg score (100e): 11.93   actor gain: -0.44   critic loss: 0.38   steps: 697


training loop:   1% |                                 | ETA:  35 days, 11:41:10

Episode: 698   score: 11.97   Avg score (100e): 11.93   actor gain: -0.44   critic loss: 0.38   steps: 698


training loop:   1% |                                 | ETA:  35 days, 11:35:26

Episode: 699   score: 11.98   Avg score (100e): 11.94   actor gain: -0.44   critic loss: 0.38   steps: 699


training loop:   1% |                                 | ETA:  35 days, 11:29:14

Episode: 700   score: 11.97   Avg score (100e): 11.94   actor gain: -0.44   critic loss: 0.38   steps: 700


training loop:   1% |                                 | ETA:  35 days, 11:25:36

Episode: 701   score: 11.97   Avg score (100e): 11.94   actor gain: -0.45   critic loss: 0.38   steps: 701


training loop:   1% |                                 | ETA:  35 days, 11:24:27

Episode: 702   score: 11.97   Avg score (100e): 11.95   actor gain: -0.45   critic loss: 0.38   steps: 702


training loop:   1% |                                 | ETA:  35 days, 11:23:37

Episode: 703   score: 11.98   Avg score (100e): 11.95   actor gain: -0.44   critic loss: 0.38   steps: 703


training loop:   1% |                                 | ETA:  35 days, 11:18:46

Episode: 704   score: 11.97   Avg score (100e): 11.95   actor gain: -0.44   critic loss: 0.38   steps: 704


training loop:   1% |                                 | ETA:  35 days, 11:16:59

Episode: 705   score: 11.97   Avg score (100e): 11.95   actor gain: -0.44   critic loss: 0.38   steps: 705


training loop:   1% |                                 | ETA:  35 days, 11:14:18

Episode: 706   score: 11.98   Avg score (100e): 11.96   actor gain: -0.44   critic loss: 0.38   steps: 706


training loop:   1% |                                 | ETA:  35 days, 11:09:12

Episode: 707   score: 11.98   Avg score (100e): 11.96   actor gain: -0.46   critic loss: 0.38   steps: 707


training loop:   1% |                                 | ETA:  35 days, 11:05:56

Episode: 708   score: 11.98   Avg score (100e): 11.96   actor gain: -0.46   critic loss: 0.38   steps: 708


training loop:   1% |                                 | ETA:  35 days, 11:01:10

Episode: 709   score: 11.98   Avg score (100e): 11.96   actor gain: -0.45   critic loss: 0.38   steps: 709


training loop:   1% |                                 | ETA:  35 days, 10:57:20

Episode: 710   score: 11.99   Avg score (100e): 11.97   actor gain: -0.47   critic loss: 0.38   steps: 710


training loop:   1% |                                 | ETA:  35 days, 10:54:49

Episode: 711   score: 11.98   Avg score (100e): 11.97   actor gain: -0.45   critic loss: 0.38   steps: 711


training loop:   1% |                                 | ETA:  35 days, 10:50:04

Episode: 712   score: 11.98   Avg score (100e): 11.97   actor gain: -0.45   critic loss: 0.38   steps: 712


training loop:   1% |                                 | ETA:  35 days, 10:44:40

Episode: 713   score: 11.99   Avg score (100e): 11.97   actor gain: -0.45   critic loss: 0.38   steps: 713


training loop:   1% |                                 | ETA:  35 days, 10:40:19

Episode: 714   score: 11.99   Avg score (100e): 11.98   actor gain: -0.45   critic loss: 0.38   steps: 714


training loop:   1% |                                 | ETA:  35 days, 10:34:41

Episode: 715   score: 12.00   Avg score (100e): 11.98   actor gain: -0.45   critic loss: 0.38   steps: 715


training loop:   1% |                                 | ETA:  35 days, 10:29:10

Episode: 716   score: 11.99   Avg score (100e): 11.98   actor gain: -0.45   critic loss: 0.38   steps: 716


training loop:   1% |                                 | ETA:  35 days, 10:27:09

Episode: 717   score: 11.99   Avg score (100e): 11.98   actor gain: -0.45   critic loss: 0.38   steps: 717


training loop:   1% |                                 | ETA:  35 days, 10:22:21

Episode: 718   score: 11.98   Avg score (100e): 11.98   actor gain: -0.45   critic loss: 0.38   steps: 718


training loop:   1% |                                 | ETA:  35 days, 10:16:12

Episode: 719   score: 11.98   Avg score (100e): 11.99   actor gain: -0.45   critic loss: 0.38   steps: 719


training loop:   1% |                                 | ETA:  35 days, 10:11:53

Episode: 720   score: 11.98   Avg score (100e): 11.99   actor gain: -0.45   critic loss: 0.38   steps: 720


training loop:   1% |                                 | ETA:  35 days, 10:17:18

Episode: 721   score: 11.99   Avg score (100e): 11.99   actor gain: -0.45   critic loss: 0.38   steps: 721


training loop:   1% |                                 | ETA:  35 days, 10:16:15

Episode: 722   score: 11.98   Avg score (100e): 11.99   actor gain: -0.46   critic loss: 0.38   steps: 722


training loop:   1% |                                 | ETA:  35 days, 10:15:03

Episode: 723   score: 11.98   Avg score (100e): 11.99   actor gain: -0.45   critic loss: 0.38   steps: 723


training loop:   1% |                                 | ETA:  35 days, 10:10:56

Episode: 724   score: 11.98   Avg score (100e): 11.99   actor gain: -0.46   critic loss: 0.38   steps: 724


training loop:   1% |                                 | ETA:  35 days, 10:08:45

Episode: 725   score: 11.98   Avg score (100e): 12.00   actor gain: -0.46   critic loss: 0.38   steps: 725


training loop:   1% |                                 | ETA:  35 days, 10:07:25

Episode: 726   score: 11.98   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 726


training loop:   1% |                                 | ETA:  35 days, 10:03:01

Episode: 727   score: 11.97   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 727
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 9:57:40

Episode: 728   score: 11.99   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 728


training loop:   1% |                                  | ETA:  35 days, 9:53:08

Episode: 729   score: 11.97   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 729


training loop:   1% |                                  | ETA:  35 days, 9:47:37

Episode: 730   score: 11.98   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 730


training loop:   1% |                                  | ETA:  35 days, 9:46:40

Episode: 731   score: 11.99   Avg score (100e): 12.00   actor gain: -0.51   critic loss: 0.38   steps: 731


training loop:   1% |                                  | ETA:  35 days, 9:42:04

Episode: 732   score: 11.99   Avg score (100e): 12.00   actor gain: -0.52   critic loss: 0.38   steps: 732


training loop:   1% |                                  | ETA:  35 days, 9:37:54

Episode: 733   score: 11.99   Avg score (100e): 12.00   actor gain: -0.52   critic loss: 0.39   steps: 733


training loop:   1% |                                  | ETA:  35 days, 9:35:37

Episode: 734   score: 11.99   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 734


training loop:   1% |                                  | ETA:  35 days, 9:32:28

Episode: 735   score: 11.98   Avg score (100e): 12.01   actor gain: -0.50   critic loss: 0.39   steps: 735


training loop:   1% |                                  | ETA:  35 days, 9:28:58

Episode: 736   score: 11.98   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 736


training loop:   1% |                                  | ETA:  35 days, 9:26:20

Episode: 737   score: 11.99   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 737


training loop:   1% |                                  | ETA:  35 days, 9:24:20

Episode: 738   score: 11.99   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 738


training loop:   1% |                                  | ETA:  35 days, 9:21:57

Episode: 739   score: 12.00   Avg score (100e): 12.01   actor gain: -0.51   critic loss: 0.39   steps: 739


training loop:   1% |                                  | ETA:  35 days, 9:15:44

Episode: 740   score: 12.01   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 740


training loop:   1% |                                  | ETA:  35 days, 9:12:22

Episode: 741   score: 12.01   Avg score (100e): 12.01   actor gain: -0.51   critic loss: 0.39   steps: 741


training loop:   1% |                                  | ETA:  35 days, 9:09:54

Episode: 742   score: 12.01   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 742


training loop:   1% |                                  | ETA:  35 days, 9:09:23

Episode: 743   score: 12.00   Avg score (100e): 12.01   actor gain: -0.52   critic loss: 0.39   steps: 743


training loop:   1% |                                  | ETA:  35 days, 9:02:58

Episode: 744   score: 12.01   Avg score (100e): 12.01   actor gain: -0.51   critic loss: 0.39   steps: 744


training loop:   1% |                                  | ETA:  35 days, 9:00:40

Episode: 745   score: 12.01   Avg score (100e): 12.01   actor gain: -0.51   critic loss: 0.39   steps: 745


training loop:   1% |                                  | ETA:  35 days, 8:57:13

Episode: 746   score: 12.02   Avg score (100e): 12.01   actor gain: -0.51   critic loss: 0.39   steps: 746


training loop:   1% |                                  | ETA:  35 days, 8:53:19

Episode: 747   score: 12.03   Avg score (100e): 12.01   actor gain: -0.50   critic loss: 0.39   steps: 747


training loop:   1% |                                  | ETA:  35 days, 8:49:58

Episode: 748   score: 12.02   Avg score (100e): 12.01   actor gain: -0.50   critic loss: 0.39   steps: 748


training loop:   1% |                                  | ETA:  35 days, 8:46:42

Episode: 749   score: 12.03   Avg score (100e): 12.01   actor gain: -0.50   critic loss: 0.39   steps: 749


training loop:   1% |                                  | ETA:  35 days, 8:43:26

Episode: 750   score: 12.02   Avg score (100e): 12.01   actor gain: -0.50   critic loss: 0.39   steps: 750


training loop:   1% |                                  | ETA:  35 days, 8:42:03

Episode: 751   score: 12.02   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 751


training loop:   1% |                                  | ETA:  35 days, 8:36:56

Episode: 752   score: 12.01   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 752


training loop:   1% |                                  | ETA:  35 days, 8:42:28

Episode: 753   score: 12.01   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 753


training loop:   1% |                                  | ETA:  35 days, 8:42:37

Episode: 754   score: 12.01   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 754


training loop:   1% |                                  | ETA:  35 days, 8:39:05

Episode: 755   score: 12.01   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 755


training loop:   1% |                                  | ETA:  35 days, 8:38:10

Episode: 756   score: 12.01   Avg score (100e): 12.01   actor gain: -0.44   critic loss: 0.39   steps: 756


training loop:   1% |                                  | ETA:  35 days, 8:34:40

Episode: 757   score: 12.00   Avg score (100e): 12.01   actor gain: -0.42   critic loss: 0.39   steps: 757


training loop:   1% |                                  | ETA:  35 days, 8:32:59

Episode: 758   score: 11.99   Avg score (100e): 12.01   actor gain: -0.42   critic loss: 0.39   steps: 758


training loop:   1% |                                  | ETA:  35 days, 8:30:29

Episode: 759   score: 12.00   Avg score (100e): 12.01   actor gain: -0.42   critic loss: 0.39   steps: 759


training loop:   1% |                                  | ETA:  35 days, 8:26:00

Episode: 760   score: 12.00   Avg score (100e): 12.00   actor gain: -0.42   critic loss: 0.39   steps: 760


training loop:   1% |                                  | ETA:  35 days, 8:22:32

Episode: 761   score: 11.99   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.39   steps: 761


training loop:   1% |                                  | ETA:  35 days, 8:23:10

Episode: 762   score: 12.00   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.38   steps: 762


training loop:   1% |                                  | ETA:  35 days, 8:21:20

Episode: 763   score: 12.01   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.38   steps: 763


training loop:   1% |                                  | ETA:  35 days, 8:16:05

Episode: 764   score: 12.02   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.38   steps: 764


training loop:   1% |                                  | ETA:  35 days, 8:13:44

Episode: 765   score: 12.02   Avg score (100e): 12.00   actor gain: -0.42   critic loss: 0.38   steps: 765


training loop:   1% |                                  | ETA:  35 days, 8:11:00

Episode: 766   score: 12.02   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.38   steps: 766


training loop:   1% |                                  | ETA:  35 days, 8:06:10

Episode: 767   score: 12.02   Avg score (100e): 12.00   actor gain: -0.43   critic loss: 0.38   steps: 767


training loop:   1% |                                  | ETA:  35 days, 8:04:10

Episode: 768   score: 12.02   Avg score (100e): 12.00   actor gain: -0.46   critic loss: 0.38   steps: 768


training loop:   1% |                                  | ETA:  35 days, 8:00:37

Episode: 769   score: 12.02   Avg score (100e): 12.00   actor gain: -0.46   critic loss: 0.38   steps: 769


training loop:   1% |                                  | ETA:  35 days, 7:56:58

Episode: 770   score: 12.01   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.38   steps: 770


training loop:   1% |                                  | ETA:  35 days, 7:51:56

Episode: 771   score: 12.02   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.38   steps: 771


training loop:   1% |                                  | ETA:  35 days, 7:47:36

Episode: 772   score: 12.02   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.38   steps: 772


training loop:   1% |                                  | ETA:  35 days, 7:43:11

Episode: 773   score: 12.02   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 773


training loop:   1% |                                  | ETA:  35 days, 7:36:37

Episode: 774   score: 12.01   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 774


training loop:   1% |                                  | ETA:  35 days, 7:31:20

Episode: 775   score: 12.02   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 775


training loop:   1% |                                  | ETA:  35 days, 7:29:42

Episode: 776   score: 12.02   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 776


training loop:   1% |                                  | ETA:  35 days, 7:25:20

Episode: 777   score: 12.04   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 777


training loop:   1% |                                  | ETA:  35 days, 7:20:40

Episode: 778   score: 12.05   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 778


training loop:   1% |                                  | ETA:  35 days, 7:17:16

Episode: 779   score: 12.05   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 779


training loop:   1% |                                  | ETA:  35 days, 7:13:25

Episode: 780   score: 12.06   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 780


training loop:   1% |                                  | ETA:  35 days, 7:09:48

Episode: 781   score: 12.06   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 781


training loop:   1% |                                  | ETA:  35 days, 7:08:35

Episode: 782   score: 12.07   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 782


training loop:   1% |                                  | ETA:  35 days, 7:05:14

Episode: 783   score: 12.07   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 783


training loop:   1% |                                  | ETA:  35 days, 6:59:13

Episode: 784   score: 12.07   Avg score (100e): 12.00   actor gain: -0.48   critic loss: 0.38   steps: 784


training loop:   1% |                                  | ETA:  35 days, 6:54:52

Episode: 785   score: 12.07   Avg score (100e): 12.00   actor gain: -0.49   critic loss: 0.38   steps: 785


training loop:   1% |                                  | ETA:  35 days, 6:58:53

Episode: 786   score: 12.09   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.38   steps: 786


training loop:   1% |                                  | ETA:  35 days, 6:59:56

Episode: 787   score: 12.09   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.38   steps: 787


training loop:   1% |                                  | ETA:  35 days, 6:59:02

Episode: 788   score: 12.09   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.39   steps: 788


training loop:   1% |                                  | ETA:  35 days, 6:57:23

Episode: 789   score: 12.09   Avg score (100e): 12.00   actor gain: -0.47   critic loss: 0.39   steps: 789


training loop:   1% |                                  | ETA:  35 days, 6:53:53

Episode: 790   score: 12.09   Avg score (100e): 12.01   actor gain: -0.47   critic loss: 0.39   steps: 790


training loop:   1% |                                  | ETA:  35 days, 6:53:47

Episode: 791   score: 12.10   Avg score (100e): 12.01   actor gain: -0.47   critic loss: 0.39   steps: 791


training loop:   1% |                                  | ETA:  35 days, 6:50:52

Episode: 792   score: 12.11   Avg score (100e): 12.01   actor gain: -6.07   critic loss: 0.38   steps: 792


training loop:   1% |                                  | ETA:  35 days, 6:49:49

Episode: 793   score: 12.12   Avg score (100e): 12.01   actor gain: -6.04   critic loss: 0.38   steps: 793


training loop:   1% |                                  | ETA:  35 days, 6:45:30

Episode: 794   score: 12.12   Avg score (100e): 12.01   actor gain: -6.03   critic loss: 0.38   steps: 794


training loop:   1% |                                  | ETA:  35 days, 6:39:26

Episode: 795   score: 12.12   Avg score (100e): 12.01   actor gain: -6.02   critic loss: 0.38   steps: 795


training loop:   1% |                                  | ETA:  35 days, 6:37:46

Episode: 796   score: 12.12   Avg score (100e): 12.01   actor gain: -6.02   critic loss: 0.38   steps: 796
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 6:32:43

Episode: 797   score: 12.13   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.39   steps: 797


training loop:   1% |                                  | ETA:  35 days, 6:31:39

Episode: 798   score: 12.13   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.38   steps: 798


training loop:   1% |                                  | ETA:  35 days, 6:29:59

Episode: 799   score: 12.14   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.39   steps: 799


training loop:   1% |                                  | ETA:  35 days, 6:25:00

Episode: 800   score: 12.15   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.39   steps: 800


training loop:   1% |                                  | ETA:  35 days, 6:21:47

Episode: 801   score: 12.16   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.39   steps: 801


training loop:   1% |                                  | ETA:  35 days, 6:18:19

Episode: 802   score: 12.17   Avg score (100e): 12.02   actor gain: -6.02   critic loss: 0.39   steps: 802


training loop:   1% |                                  | ETA:  35 days, 6:13:50

Episode: 803   score: 12.19   Avg score (100e): 12.03   actor gain: -6.02   critic loss: 0.39   steps: 803


training loop:   1% |                                  | ETA:  35 days, 6:09:31

Episode: 804   score: 12.20   Avg score (100e): 12.03   actor gain: -6.02   critic loss: 0.39   steps: 804


training loop:   1% |                                  | ETA:  35 days, 6:08:35

Episode: 805   score: 12.20   Avg score (100e): 12.03   actor gain: -6.02   critic loss: 0.39   steps: 805


training loop:   1% |                                  | ETA:  35 days, 6:06:28

Episode: 806   score: 12.22   Avg score (100e): 12.03   actor gain: -6.03   critic loss: 0.39   steps: 806


training loop:   1% |                                  | ETA:  35 days, 6:05:46

Episode: 807   score: 12.22   Avg score (100e): 12.04   actor gain: -6.03   critic loss: 0.39   steps: 807


training loop:   1% |                                  | ETA:  35 days, 6:01:21

Episode: 808   score: 12.23   Avg score (100e): 12.04   actor gain: -6.03   critic loss: 0.40   steps: 808


training loop:   1% |                                  | ETA:  35 days, 5:58:01

Episode: 809   score: 12.23   Avg score (100e): 12.04   actor gain: -6.03   critic loss: 0.40   steps: 809


training loop:   1% |                                  | ETA:  35 days, 5:56:14

Episode: 810   score: 12.25   Avg score (100e): 12.04   actor gain: -6.02   critic loss: 0.40   steps: 810


training loop:   1% |                                  | ETA:  35 days, 5:52:51

Episode: 811   score: 12.24   Avg score (100e): 12.05   actor gain: -6.02   critic loss: 0.40   steps: 811
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 5:48:17

Episode: 812   score: 12.26   Avg score (100e): 12.05   actor gain: -6.02   critic loss: 0.40   steps: 812


training loop:   1% |                                  | ETA:  35 days, 5:44:23

Episode: 813   score: 12.27   Avg score (100e): 12.05   actor gain: -6.02   critic loss: 0.40   steps: 813


training loop:   1% |                                  | ETA:  35 days, 5:40:25

Episode: 814   score: 12.28   Avg score (100e): 12.05   actor gain: -6.02   critic loss: 0.40   steps: 814


training loop:   1% |                                  | ETA:  35 days, 5:36:46

Episode: 815   score: 12.28   Avg score (100e): 12.06   actor gain: -6.02   critic loss: 0.40   steps: 815


training loop:   1% |                                  | ETA:  35 days, 5:34:20

Episode: 816   score: 12.28   Avg score (100e): 12.06   actor gain: -6.02   critic loss: 0.41   steps: 816


training loop:   1% |                                  | ETA:  35 days, 5:30:44

Episode: 817   score: 12.29   Avg score (100e): 12.06   actor gain: -0.42   critic loss: 0.41   steps: 817


training loop:   1% |                                  | ETA:  35 days, 5:37:36

Episode: 818   score: 12.31   Avg score (100e): 12.07   actor gain: -0.42   critic loss: 0.41   steps: 818


training loop:   1% |                                  | ETA:  35 days, 5:36:11

Episode: 819   score: 12.30   Avg score (100e): 12.07   actor gain: -0.42   critic loss: 0.41   steps: 819


training loop:   1% |                                  | ETA:  35 days, 5:33:39

Episode: 820   score: 12.31   Avg score (100e): 12.07   actor gain: -0.42   critic loss: 0.41   steps: 820


training loop:   1% |                                  | ETA:  35 days, 5:37:09

Episode: 821   score: 12.32   Avg score (100e): 12.08   actor gain: -0.42   critic loss: 0.41   steps: 821


training loop:   1% |                                  | ETA:  35 days, 5:37:53

Episode: 822   score: 12.33   Avg score (100e): 12.08   actor gain: -0.42   critic loss: 0.41   steps: 822


training loop:   1% |                                  | ETA:  35 days, 5:40:07

Episode: 823   score: 12.33   Avg score (100e): 12.08   actor gain: -0.41   critic loss: 0.41   steps: 823
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 5:40:15

Episode: 824   score: 12.34   Avg score (100e): 12.09   actor gain: -0.41   critic loss: 0.41   steps: 824


training loop:   1% |                                  | ETA:  35 days, 5:39:16

Episode: 825   score: 12.34   Avg score (100e): 12.09   actor gain: -0.41   critic loss: 0.41   steps: 825


training loop:   1% |                                  | ETA:  35 days, 5:37:03

Episode: 826   score: 12.35   Avg score (100e): 12.09   actor gain: -0.41   critic loss: 0.41   steps: 826
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 5:36:06

Episode: 827   score: 12.37   Avg score (100e): 12.10   actor gain: -0.41   critic loss: 0.41   steps: 827


training loop:   1% |                                  | ETA:  35 days, 5:35:49

Episode: 828   score: 12.39   Avg score (100e): 12.10   actor gain: -0.41   critic loss: 0.41   steps: 828


training loop:   1% |                                  | ETA:  35 days, 5:34:44

Episode: 829   score: 12.39   Avg score (100e): 12.11   actor gain: -0.41   critic loss: 0.41   steps: 829


training loop:   1% |                                  | ETA:  35 days, 5:32:21

Episode: 830   score: 12.40   Avg score (100e): 12.11   actor gain: -0.41   critic loss: 0.41   steps: 830


training loop:   1% |                                  | ETA:  35 days, 5:30:37

Episode: 831   score: 12.41   Avg score (100e): 12.11   actor gain: -0.41   critic loss: 0.41   steps: 831
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 5:28:11

Episode: 832   score: 12.40   Avg score (100e): 12.12   actor gain: -0.40   critic loss: 0.41   steps: 832
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 5:23:35

Episode: 833   score: 12.40   Avg score (100e): 12.12   actor gain: -0.40   critic loss: 0.41   steps: 833


training loop:   1% |                                  | ETA:  35 days, 5:17:46

Episode: 834   score: 12.41   Avg score (100e): 12.13   actor gain: -0.40   critic loss: 0.41   steps: 834


training loop:   1% |                                  | ETA:  35 days, 5:14:59

Episode: 835   score: 12.41   Avg score (100e): 12.13   actor gain: -0.40   critic loss: 0.41   steps: 835


training loop:   1% |                                  | ETA:  35 days, 5:13:14

Episode: 836   score: 12.43   Avg score (100e): 12.14   actor gain: -0.40   critic loss: 0.41   steps: 836


training loop:   1% |                                  | ETA:  35 days, 5:10:28

Episode: 837   score: 12.43   Avg score (100e): 12.14   actor gain: -0.40   critic loss: 0.41   steps: 837


training loop:   1% |                                  | ETA:  35 days, 5:08:12

Episode: 838   score: 12.44   Avg score (100e): 12.14   actor gain: -0.40   critic loss: 0.41   steps: 838


training loop:   1% |                                  | ETA:  35 days, 5:04:15

Episode: 839   score: 12.45   Avg score (100e): 12.15   actor gain: -0.62   critic loss: 0.41   steps: 839


training loop:   1% |                                  | ETA:  35 days, 5:05:14

Episode: 840   score: 12.45   Avg score (100e): 12.15   actor gain: -0.62   critic loss: 0.41   steps: 840


training loop:   1% |                                  | ETA:  35 days, 5:02:16

Episode: 841   score: 12.46   Avg score (100e): 12.16   actor gain: -0.62   critic loss: 0.41   steps: 841


training loop:   1% |                                  | ETA:  35 days, 4:59:22

Episode: 842   score: 12.46   Avg score (100e): 12.16   actor gain: -0.62   critic loss: 0.41   steps: 842


training loop:   1% |                                  | ETA:  35 days, 4:57:21

Episode: 843   score: 12.47   Avg score (100e): 12.17   actor gain: -0.62   critic loss: 0.41   steps: 843


training loop:   1% |                                  | ETA:  35 days, 4:53:59

Episode: 844   score: 12.48   Avg score (100e): 12.17   actor gain: -0.62   critic loss: 0.41   steps: 844


training loop:   1% |                                  | ETA:  35 days, 4:51:51

Episode: 845   score: 12.50   Avg score (100e): 12.18   actor gain: -0.62   critic loss: 0.41   steps: 845


training loop:   1% |                                  | ETA:  35 days, 4:49:50

Episode: 846   score: 12.50   Avg score (100e): 12.18   actor gain: -0.62   critic loss: 0.41   steps: 846


training loop:   1% |                                  | ETA:  35 days, 4:47:06

Episode: 847   score: 12.52   Avg score (100e): 12.19   actor gain: -0.62   critic loss: 0.41   steps: 847


training loop:   1% |                                  | ETA:  35 days, 4:46:05

Episode: 848   score: 12.53   Avg score (100e): 12.19   actor gain: -0.62   critic loss: 0.41   steps: 848


training loop:   1% |                                  | ETA:  35 days, 4:44:27

Episode: 849   score: 12.54   Avg score (100e): 12.20   actor gain: -0.62   critic loss: 0.41   steps: 849


training loop:   1% |                                  | ETA:  35 days, 4:50:54

Episode: 850   score: 12.55   Avg score (100e): 12.20   actor gain: -0.62   critic loss: 0.41   steps: 850


training loop:   1% |                                  | ETA:  35 days, 4:49:07

Episode: 851   score: 12.54   Avg score (100e): 12.21   actor gain: -0.62   critic loss: 0.41   steps: 851


training loop:   1% |                                  | ETA:  35 days, 4:48:42

Episode: 852   score: 12.55   Avg score (100e): 12.21   actor gain: -0.62   critic loss: 0.41   steps: 852


training loop:   1% |                                  | ETA:  35 days, 4:48:52

Episode: 853   score: 12.57   Avg score (100e): 12.22   actor gain: -0.62   critic loss: 0.41   steps: 853


training loop:   1% |                                  | ETA:  35 days, 4:49:07

Episode: 854   score: 12.58   Avg score (100e): 12.22   actor gain: -0.62   critic loss: 0.41   steps: 854


training loop:   1% |                                  | ETA:  35 days, 4:47:30

Episode: 855   score: 12.58   Avg score (100e): 12.23   actor gain: -0.62   critic loss: 0.41   steps: 855


training loop:   1% |                                  | ETA:  35 days, 4:45:56

Episode: 856   score: 12.59   Avg score (100e): 12.24   actor gain: -0.62   critic loss: 0.41   steps: 856


training loop:   1% |                                  | ETA:  35 days, 4:45:10

Episode: 857   score: 12.60   Avg score (100e): 12.24   actor gain: -0.62   critic loss: 0.41   steps: 857


training loop:   1% |                                  | ETA:  35 days, 4:45:25

Episode: 858   score: 12.60   Avg score (100e): 12.25   actor gain: -0.62   critic loss: 0.41   steps: 858


training loop:   1% |                                  | ETA:  35 days, 4:40:07

Episode: 859   score: 12.60   Avg score (100e): 12.25   actor gain: -0.65   critic loss: 0.41   steps: 859


training loop:   1% |                                  | ETA:  35 days, 4:39:16

Episode: 860   score: 12.61   Avg score (100e): 12.26   actor gain: -0.65   critic loss: 0.41   steps: 860


training loop:   1% |                                  | ETA:  35 days, 4:39:26

Episode: 861   score: 12.63   Avg score (100e): 12.27   actor gain: -0.65   critic loss: 0.41   steps: 861


training loop:   1% |                                  | ETA:  35 days, 4:39:17

Episode: 862   score: 12.63   Avg score (100e): 12.27   actor gain: -0.65   critic loss: 0.41   steps: 862


training loop:   1% |                                  | ETA:  35 days, 4:48:32

Episode: 863   score: 12.62   Avg score (100e): 12.28   actor gain: -0.65   critic loss: 0.41   steps: 863


training loop:   1% |                                  | ETA:  35 days, 4:50:26

Episode: 864   score: 12.63   Avg score (100e): 12.28   actor gain: -0.43   critic loss: 0.41   steps: 864


training loop:   1% |                                  | ETA:  35 days, 4:50:14

Episode: 865   score: 12.63   Avg score (100e): 12.29   actor gain: -0.43   critic loss: 0.41   steps: 865


training loop:   1% |                                  | ETA:  35 days, 4:46:06

Episode: 866   score: 12.64   Avg score (100e): 12.30   actor gain: -0.43   critic loss: 0.41   steps: 866


training loop:   1% |                                  | ETA:  35 days, 4:43:43

Episode: 867   score: 12.64   Avg score (100e): 12.30   actor gain: -0.51   critic loss: 0.41   steps: 867


training loop:   1% |                                  | ETA:  35 days, 4:40:59

Episode: 868   score: 12.64   Avg score (100e): 12.31   actor gain: -0.54   critic loss: 0.41   steps: 868


training loop:   1% |                                  | ETA:  35 days, 4:38:26

Episode: 869   score: 12.66   Avg score (100e): 12.32   actor gain: -0.54   critic loss: 0.41   steps: 869
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 4:34:13

Episode: 870   score: 12.65   Avg score (100e): 12.32   actor gain: -0.54   critic loss: 0.41   steps: 870


training loop:   1% |                                  | ETA:  35 days, 4:31:13

Episode: 871   score: 12.66   Avg score (100e): 12.33   actor gain: -0.54   critic loss: 0.41   steps: 871


training loop:   1% |                                  | ETA:  35 days, 4:28:09

Episode: 872   score: 12.67   Avg score (100e): 12.34   actor gain: -0.55   critic loss: 0.41   steps: 872


training loop:   1% |                                  | ETA:  35 days, 4:24:23

Episode: 873   score: 12.67   Avg score (100e): 12.34   actor gain: -0.55   critic loss: 0.41   steps: 873


training loop:   1% |                                  | ETA:  35 days, 4:21:59

Episode: 874   score: 12.68   Avg score (100e): 12.35   actor gain: -0.55   critic loss: 0.41   steps: 874


training loop:   1% |                                  | ETA:  35 days, 4:17:31

Episode: 875   score: 12.68   Avg score (100e): 12.36   actor gain: -0.55   critic loss: 0.41   steps: 875


training loop:   1% |                                  | ETA:  35 days, 4:11:29

Episode: 876   score: 12.68   Avg score (100e): 12.36   actor gain: -0.54   critic loss: 0.41   steps: 876


training loop:   1% |                                  | ETA:  35 days, 4:07:49

Episode: 877   score: 12.69   Avg score (100e): 12.37   actor gain: -0.54   critic loss: 0.41   steps: 877


training loop:   1% |                                  | ETA:  35 days, 4:02:58

Episode: 878   score: 12.69   Avg score (100e): 12.37   actor gain: -0.55   critic loss: 0.41   steps: 878


training loop:   1% |                                  | ETA:  35 days, 3:59:03

Episode: 879   score: 12.70   Avg score (100e): 12.38   actor gain: -0.55   critic loss: 0.41   steps: 879


training loop:   1% |                                  | ETA:  35 days, 4:02:10

Episode: 880   score: 12.71   Avg score (100e): 12.39   actor gain: -0.55   critic loss: 0.41   steps: 880


training loop:   1% |                                  | ETA:  35 days, 4:01:38

Episode: 881   score: 12.72   Avg score (100e): 12.39   actor gain: -0.55   critic loss: 0.41   steps: 881


training loop:   1% |                                  | ETA:  35 days, 4:01:35

Episode: 882   score: 12.73   Avg score (100e): 12.40   actor gain: -0.55   critic loss: 0.41   steps: 882


training loop:   1% |                                  | ETA:  35 days, 4:07:07

Episode: 883   score: 12.73   Avg score (100e): 12.41   actor gain: -0.55   critic loss: 0.41   steps: 883


training loop:   1% |                                  | ETA:  35 days, 4:11:20

Episode: 884   score: 12.74   Avg score (100e): 12.41   actor gain: -0.52   critic loss: 0.42   steps: 884


training loop:   1% |                                  | ETA:  35 days, 4:10:52

Episode: 885   score: 12.73   Avg score (100e): 12.42   actor gain: -0.52   critic loss: 0.42   steps: 885


training loop:   1% |                                  | ETA:  35 days, 4:08:55

Episode: 886   score: 12.75   Avg score (100e): 12.43   actor gain: -0.52   critic loss: 0.42   steps: 886


training loop:   1% |                                  | ETA:  35 days, 4:10:50

Episode: 887   score: 12.76   Avg score (100e): 12.43   actor gain: -0.52   critic loss: 0.42   steps: 887


training loop:   1% |                                  | ETA:  35 days, 4:08:03

Episode: 888   score: 12.77   Avg score (100e): 12.44   actor gain: -0.52   critic loss: 0.42   steps: 888


training loop:   1% |                                  | ETA:  35 days, 4:06:58

Episode: 889   score: 12.77   Avg score (100e): 12.45   actor gain: -0.52   critic loss: 0.42   steps: 889


training loop:   1% |                                  | ETA:  35 days, 4:05:56

Episode: 890   score: 12.78   Avg score (100e): 12.45   actor gain: -0.52   critic loss: 0.42   steps: 890


training loop:   1% |                                  | ETA:  35 days, 4:07:37

Episode: 891   score: 12.80   Avg score (100e): 12.46   actor gain: -0.52   critic loss: 0.42   steps: 891


training loop:   1% |                                  | ETA:  35 days, 4:03:31

Episode: 892   score: 12.79   Avg score (100e): 12.47   actor gain: -0.44   critic loss: 0.42   steps: 892


training loop:   1% |                                  | ETA:  35 days, 4:00:32

Episode: 893   score: 12.80   Avg score (100e): 12.47   actor gain: -0.40   critic loss: 0.42   steps: 893


training loop:   1% |                                  | ETA:  35 days, 3:57:52

Episode: 894   score: 12.80   Avg score (100e): 12.48   actor gain: -0.40   critic loss: 0.42   steps: 894


training loop:   1% |                                  | ETA:  35 days, 3:53:45

Episode: 895   score: 12.81   Avg score (100e): 12.49   actor gain: -0.43   critic loss: 0.42   steps: 895


training loop:   1% |                                  | ETA:  35 days, 3:51:12

Episode: 896   score: 12.83   Avg score (100e): 12.50   actor gain: -0.43   critic loss: 0.42   steps: 896


training loop:   1% |                                  | ETA:  35 days, 3:54:59

Episode: 897   score: 12.83   Avg score (100e): 12.50   actor gain: -0.43   critic loss: 0.42   steps: 897


training loop:   1% |                                  | ETA:  35 days, 4:33:26

Episode: 898   score: 12.83   Avg score (100e): 12.51   actor gain: -0.43   critic loss: 0.42   steps: 898


training loop:   1% |                                  | ETA:  35 days, 4:39:43

Episode: 899   score: 12.83   Avg score (100e): 12.52   actor gain: -0.43   critic loss: 0.42   steps: 899


training loop:   1% |                                  | ETA:  35 days, 4:41:20

Episode: 900   score: 12.83   Avg score (100e): 12.52   actor gain: -0.43   critic loss: 0.42   steps: 900


training loop:   1% |                                  | ETA:  35 days, 4:41:27

Episode: 901   score: 12.84   Avg score (100e): 12.53   actor gain: -0.43   critic loss: 0.42   steps: 901


training loop:   1% |                                  | ETA:  35 days, 4:43:10

Episode: 902   score: 12.84   Avg score (100e): 12.54   actor gain: -0.43   critic loss: 0.42   steps: 902


training loop:   1% |                                  | ETA:  35 days, 6:43:39

Episode: 903   score: 12.84   Avg score (100e): 12.54   actor gain: -0.42   critic loss: 0.42   steps: 903


training loop:   1% |                                  | ETA:  35 days, 6:41:18

Episode: 904   score: 12.85   Avg score (100e): 12.55   actor gain: -0.42   critic loss: 0.42   steps: 904


training loop:   1% |                                  | ETA:  35 days, 6:41:06

Episode: 905   score: 12.85   Avg score (100e): 12.56   actor gain: -0.42   critic loss: 0.42   steps: 905
np.all(done) is true! miracle!


training loop:   1% |                                  | ETA:  35 days, 6:35:51

Episode: 906   score: 12.85   Avg score (100e): 12.56   actor gain: -0.42   critic loss: 0.42   steps: 906


training loop:   1% |                                  | ETA:  35 days, 6:36:49

Episode: 907   score: 12.85   Avg score (100e): 12.57   actor gain: -0.42   critic loss: 0.42   steps: 907


training loop:   1% |                                  | ETA:  36 days, 9:35:33

Episode: 908   score: 12.85   Avg score (100e): 12.58   actor gain: -0.42   critic loss: 0.42   steps: 908


training loop:   1% |                                  | ETA:  36 days, 9:31:10

Episode: 909   score: 12.86   Avg score (100e): 12.58   actor gain: -0.42   critic loss: 0.42   steps: 909


training loop:   1% |                                  | ETA:  36 days, 9:31:29

Episode: 910   score: 12.86   Avg score (100e): 12.59   actor gain: -0.42   critic loss: 0.42   steps: 910


training loop:   1% |                                  | ETA:  36 days, 9:26:03

Episode: 911   score: 12.86   Avg score (100e): 12.59   actor gain: -0.42   critic loss: 0.42   steps: 911


training loop:   1% |                                  | ETA:  36 days, 9:20:55

Episode: 912   score: 12.87   Avg score (100e): 12.60   actor gain: -0.42   critic loss: 0.42   steps: 912


training loop:   1% |                                  | ETA:  36 days, 9:17:04

Episode: 913   score: 12.87   Avg score (100e): 12.61   actor gain: -0.42   critic loss: 0.41   steps: 913


training loop:   1% |                                 | ETA:  36 days, 11:00:36

Episode: 914   score: 12.87   Avg score (100e): 12.61   actor gain: -0.43   critic loss: 0.41   steps: 914


training loop:   1% |                                 | ETA:  36 days, 11:13:41

Episode: 915   score: 12.89   Avg score (100e): 12.62   actor gain: -0.43   critic loss: 0.41   steps: 915


training loop:   1% |                                 | ETA:  36 days, 11:14:17

Episode: 916   score: 12.88   Avg score (100e): 12.62   actor gain: -0.43   critic loss: 0.41   steps: 916


training loop:   1% |                                 | ETA:  36 days, 11:12:21

Episode: 917   score: 12.88   Avg score (100e): 12.63   actor gain: -0.43   critic loss: 0.41   steps: 917


training loop:   1% |                                 | ETA:  36 days, 11:13:31

Episode: 918   score: 12.88   Avg score (100e): 12.64   actor gain: -0.44   critic loss: 0.41   steps: 918


training loop:   1% |                                 | ETA:  36 days, 11:09:04

Episode: 919   score: 12.89   Avg score (100e): 12.64   actor gain: -0.44   critic loss: 0.41   steps: 919


training loop:   1% |                                 | ETA:  36 days, 11:07:49

Episode: 920   score: 12.91   Avg score (100e): 12.65   actor gain: -0.41   critic loss: 0.41   steps: 920


training loop:   1% |                                 | ETA:  36 days, 11:12:25

Episode: 921   score: 12.90   Avg score (100e): 12.65   actor gain: -0.41   critic loss: 0.41   steps: 921


training loop:   1% |                                 | ETA:  36 days, 11:08:51

Episode: 922   score: 12.90   Avg score (100e): 12.66   actor gain: -0.41   critic loss: 0.41   steps: 922


training loop:   1% |                                 | ETA:  36 days, 11:03:36

Episode: 923   score: 12.89   Avg score (100e): 12.66   actor gain: -0.41   critic loss: 0.41   steps: 923


training loop:   1% |                                 | ETA:  36 days, 10:58:46

Episode: 924   score: 12.89   Avg score (100e): 12.67   actor gain: -0.41   critic loss: 0.41   steps: 924


training loop:   1% |                                 | ETA:  36 days, 10:52:50

Episode: 925   score: 12.91   Avg score (100e): 12.68   actor gain: -0.41   critic loss: 0.41   steps: 925


training loop:   1% |                                 | ETA:  36 days, 10:48:37

Episode: 926   score: 12.92   Avg score (100e): 12.68   actor gain: -0.41   critic loss: 0.41   steps: 926


training loop:   1% |                                 | ETA:  36 days, 16:06:58

Episode: 927   score: 12.93   Avg score (100e): 12.69   actor gain: -0.42   critic loss: 0.41   steps: 927


training loop:   1% |                                 | ETA:  36 days, 16:17:07

Episode: 928   score: 12.92   Avg score (100e): 12.69   actor gain: -0.42   critic loss: 0.41   steps: 928


training loop:   1% |                                 | ETA:  36 days, 16:29:20

Episode: 929   score: 12.93   Avg score (100e): 12.70   actor gain: -0.42   critic loss: 0.41   steps: 929


training loop:   1% |                                 | ETA:  36 days, 16:36:25

Episode: 930   score: 12.93   Avg score (100e): 12.70   actor gain: -0.41   critic loss: 0.41   steps: 930
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  36 days, 16:31:05

Episode: 931   score: 12.93   Avg score (100e): 12.71   actor gain: -0.41   critic loss: 0.41   steps: 931


training loop:   1% |                                 | ETA:  36 days, 16:19:31

Episode: 932   score: 12.93   Avg score (100e): 12.71   actor gain: -0.41   critic loss: 0.41   steps: 932


training loop:   1% |                                 | ETA:  36 days, 16:23:04

Episode: 933   score: 12.93   Avg score (100e): 12.72   actor gain: -0.41   critic loss: 0.42   steps: 933


training loop:   1% |                                 | ETA:  36 days, 16:14:46

Episode: 934   score: 12.95   Avg score (100e): 12.72   actor gain: -0.41   critic loss: 0.42   steps: 934
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  36 days, 16:11:22

Episode: 935   score: 12.94   Avg score (100e): 12.73   actor gain: -0.41   critic loss: 0.42   steps: 935


training loop:   1% |                                 | ETA:  36 days, 16:10:17

Episode: 936   score: 12.94   Avg score (100e): 12.74   actor gain: -0.41   critic loss: 0.42   steps: 936


training loop:   1% |                                 | ETA:  36 days, 16:03:35

Episode: 937   score: 12.95   Avg score (100e): 12.74   actor gain: -0.41   critic loss: 0.42   steps: 937


training loop:   1% |                                 | ETA:  36 days, 15:59:34

Episode: 938   score: 12.95   Avg score (100e): 12.75   actor gain: -0.41   critic loss: 0.42   steps: 938


training loop:   1% |                                 | ETA:  36 days, 16:02:45

Episode: 939   score: 12.95   Avg score (100e): 12.75   actor gain: -0.41   critic loss: 0.42   steps: 939


training loop:   1% |                                 | ETA:  36 days, 16:08:52

Episode: 940   score: 12.95   Avg score (100e): 12.76   actor gain: -0.41   critic loss: 0.42   steps: 940


training loop:   1% |                                 | ETA:  36 days, 16:44:13

Episode: 941   score: 12.95   Avg score (100e): 12.76   actor gain: -0.41   critic loss: 0.42   steps: 941


training loop:   1% |                                 | ETA:  36 days, 16:47:22

Episode: 942   score: 12.95   Avg score (100e): 12.77   actor gain: -0.41   critic loss: 0.42   steps: 942


training loop:   1% |                                 | ETA:  36 days, 16:46:43

Episode: 943   score: 12.96   Avg score (100e): 12.77   actor gain: -0.40   critic loss: 0.42   steps: 943


training loop:   1% |                                 | ETA:  36 days, 16:54:00

Episode: 944   score: 12.97   Avg score (100e): 12.77   actor gain: -0.41   critic loss: 0.42   steps: 944


training loop:   1% |                                 | ETA:  36 days, 16:56:37

Episode: 945   score: 12.98   Avg score (100e): 12.78   actor gain: -0.41   critic loss: 0.42   steps: 945


training loop:   1% |                                 | ETA:  36 days, 16:59:56

Episode: 946   score: 12.99   Avg score (100e): 12.78   actor gain: -0.41   critic loss: 0.42   steps: 946


training loop:   1% |                                 | ETA:  36 days, 17:11:59

Episode: 947   score: 13.00   Avg score (100e): 12.79   actor gain: -0.41   critic loss: 0.42   steps: 947


training loop:   1% |                                 | ETA:  36 days, 17:11:56

Episode: 948   score: 13.00   Avg score (100e): 12.79   actor gain: -0.41   critic loss: 0.42   steps: 948


training loop:   1% |                                 | ETA:  36 days, 17:14:32

Episode: 949   score: 13.00   Avg score (100e): 12.80   actor gain: -0.41   critic loss: 0.42   steps: 949


training loop:   1% |                                 | ETA:  36 days, 17:13:31

Episode: 950   score: 13.01   Avg score (100e): 12.80   actor gain: -0.41   critic loss: 0.42   steps: 950


training loop:   1% |                                 | ETA:  36 days, 17:18:23

Episode: 951   score: 13.01   Avg score (100e): 12.81   actor gain: -0.41   critic loss: 0.42   steps: 951


training loop:   1% |                                 | ETA:  36 days, 17:16:46

Episode: 952   score: 13.02   Avg score (100e): 12.81   actor gain: -0.40   critic loss: 0.42   steps: 952


training loop:   1% |                                 | ETA:  36 days, 17:17:01

Episode: 953   score: 13.02   Avg score (100e): 12.82   actor gain: -0.40   critic loss: 0.42   steps: 953


training loop:   1% |                                 | ETA:  36 days, 17:40:40

Episode: 954   score: 13.01   Avg score (100e): 12.82   actor gain: -0.40   critic loss: 0.42   steps: 954


training loop:   1% |                                 | ETA:  36 days, 17:53:19

Episode: 955   score: 13.02   Avg score (100e): 12.83   actor gain: -0.40   critic loss: 0.42   steps: 955


training loop:   1% |                                 | ETA:  36 days, 18:00:41

Episode: 956   score: 13.03   Avg score (100e): 12.83   actor gain: -0.40   critic loss: 0.42   steps: 956


training loop:   1% |                                 | ETA:  36 days, 18:06:12

Episode: 957   score: 13.03   Avg score (100e): 12.83   actor gain: -0.40   critic loss: 0.42   steps: 957


training loop:   1% |                                 | ETA:  36 days, 18:09:58

Episode: 958   score: 13.03   Avg score (100e): 12.84   actor gain: -0.40   critic loss: 0.42   steps: 958


training loop:   1% |                                 | ETA:  36 days, 18:09:19

Episode: 959   score: 13.04   Avg score (100e): 12.84   actor gain: -0.40   critic loss: 0.42   steps: 959


training loop:   1% |                                 | ETA:  36 days, 18:12:18

Episode: 960   score: 13.04   Avg score (100e): 12.85   actor gain: -0.40   critic loss: 0.42   steps: 960


training loop:   1% |                                 | ETA:  36 days, 18:21:16

Episode: 961   score: 13.05   Avg score (100e): 12.85   actor gain: -0.40   critic loss: 0.42   steps: 961


training loop:   1% |                                 | ETA:  36 days, 18:31:47

Episode: 962   score: 13.05   Avg score (100e): 12.86   actor gain: -0.40   critic loss: 0.41   steps: 962


training loop:   1% |                                 | ETA:  36 days, 18:33:38

Episode: 963   score: 13.05   Avg score (100e): 12.86   actor gain: -0.40   critic loss: 0.41   steps: 963
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  36 days, 18:33:00

Episode: 964   score: 13.05   Avg score (100e): 12.86   actor gain: -0.40   critic loss: 0.41   steps: 964


training loop:   1% |                                 | ETA:  36 days, 18:51:39

Episode: 965   score: 13.05   Avg score (100e): 12.87   actor gain: -0.40   critic loss: 0.41   steps: 965


training loop:   1% |                                 | ETA:  36 days, 18:57:40

Episode: 966   score: 13.06   Avg score (100e): 12.87   actor gain: -0.40   critic loss: 0.41   steps: 966


training loop:   1% |                                 | ETA:  36 days, 19:04:22

Episode: 967   score: 13.06   Avg score (100e): 12.88   actor gain: -0.40   critic loss: 0.41   steps: 967
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  36 days, 19:17:44

Episode: 968   score: 13.07   Avg score (100e): 12.88   actor gain: -0.40   critic loss: 0.41   steps: 968


training loop:   1% |                                 | ETA:  36 days, 19:51:01

Episode: 969   score: 13.07   Avg score (100e): 12.88   actor gain: -0.39   critic loss: 0.41   steps: 969


training loop:   1% |                                 | ETA:  36 days, 20:08:25

Episode: 970   score: 13.07   Avg score (100e): 12.89   actor gain: -0.39   critic loss: 0.41   steps: 970


training loop:   1% |                                 | ETA:  36 days, 20:09:45

Episode: 971   score: 13.07   Avg score (100e): 12.89   actor gain: -0.40   critic loss: 0.41   steps: 971


training loop:   1% |                                 | ETA:  36 days, 20:08:25

Episode: 972   score: 13.08   Avg score (100e): 12.90   actor gain: -0.39   critic loss: 0.41   steps: 972


training loop:   1% |                                 | ETA:  36 days, 20:10:47

Episode: 973   score: 13.07   Avg score (100e): 12.90   actor gain: -0.39   critic loss: 0.41   steps: 973


training loop:   1% |                                 | ETA:  36 days, 20:10:13

Episode: 974   score: 13.09   Avg score (100e): 12.91   actor gain: -0.39   critic loss: 0.41   steps: 974


training loop:   1% |                                 | ETA:  36 days, 20:24:13

Episode: 975   score: 13.09   Avg score (100e): 12.91   actor gain: -0.39   critic loss: 0.41   steps: 975


training loop:   1% |                                 | ETA:  36 days, 20:35:40

Episode: 976   score: 13.10   Avg score (100e): 12.91   actor gain: -0.39   critic loss: 0.41   steps: 976


training loop:   1% |                                 | ETA:  36 days, 20:34:11

Episode: 977   score: 13.10   Avg score (100e): 12.92   actor gain: -0.40   critic loss: 0.41   steps: 977


training loop:   1% |                                 | ETA:  36 days, 20:47:47

Episode: 978   score: 13.11   Avg score (100e): 12.92   actor gain: -0.39   critic loss: 0.41   steps: 978


training loop:   1% |                                 | ETA:  36 days, 20:55:24

Episode: 979   score: 13.12   Avg score (100e): 12.93   actor gain: -0.39   critic loss: 0.41   steps: 979


training loop:   1% |                                 | ETA:  36 days, 21:08:57

Episode: 980   score: 13.11   Avg score (100e): 12.93   actor gain: -0.39   critic loss: 0.41   steps: 980


training loop:   1% |                                 | ETA:  36 days, 21:13:14

Episode: 981   score: 13.12   Avg score (100e): 12.93   actor gain: -0.39   critic loss: 0.41   steps: 981


training loop:   1% |                                 | ETA:  36 days, 21:35:11

Episode: 982   score: 13.13   Avg score (100e): 12.94   actor gain: -0.39   critic loss: 0.41   steps: 982


training loop:   1% |                                 | ETA:  36 days, 21:38:09

Episode: 983   score: 13.14   Avg score (100e): 12.94   actor gain: -0.39   critic loss: 0.41   steps: 983


training loop:   1% |                                 | ETA:  36 days, 21:42:17

Episode: 984   score: 13.16   Avg score (100e): 12.95   actor gain: -0.39   critic loss: 0.41   steps: 984


training loop:   1% |                                 | ETA:  36 days, 21:38:39

Episode: 985   score: 13.16   Avg score (100e): 12.95   actor gain: -0.39   critic loss: 0.41   steps: 985


training loop:   1% |                                 | ETA:  36 days, 21:32:54

Episode: 986   score: 13.17   Avg score (100e): 12.96   actor gain: -0.39   critic loss: 0.41   steps: 986


training loop:   1% |                                 | ETA:  36 days, 21:31:16

Episode: 987   score: 13.17   Avg score (100e): 12.96   actor gain: -0.39   critic loss: 0.41   steps: 987


training loop:   1% |                                 | ETA:  36 days, 21:20:31

Episode: 988   score: 13.18   Avg score (100e): 12.96   actor gain: -0.39   critic loss: 0.42   steps: 988


training loop:   1% |                                 | ETA:  36 days, 21:09:40

Episode: 989   score: 13.19   Avg score (100e): 12.97   actor gain: -0.39   critic loss: 0.42   steps: 989


training loop:   1% |                                 | ETA:  36 days, 20:59:18

Episode: 990   score: 13.19   Avg score (100e): 12.97   actor gain: -0.53   critic loss: 0.42   steps: 990


training loop:   1% |                                 | ETA:  36 days, 21:06:19

Episode: 991   score: 13.19   Avg score (100e): 12.98   actor gain: -0.52   critic loss: 0.42   steps: 991


training loop:   1% |                                 | ETA:  36 days, 21:05:45

Episode: 992   score: 13.20   Avg score (100e): 12.98   actor gain: -0.52   critic loss: 0.42   steps: 992


training loop:   1% |                                 | ETA:  36 days, 21:06:47

Episode: 993   score: 13.21   Avg score (100e): 12.98   actor gain: -0.53   critic loss: 0.42   steps: 993


training loop:   1% |                                 | ETA:  36 days, 21:06:48

Episode: 994   score: 13.22   Avg score (100e): 12.99   actor gain: -0.53   critic loss: 0.42   steps: 994


training loop:   1% |                                 | ETA:  36 days, 21:03:59

Episode: 995   score: 13.23   Avg score (100e): 12.99   actor gain: -0.53   critic loss: 0.42   steps: 995
np.all(done) is true! miracle!


training loop:   1% |                                 | ETA:  36 days, 21:02:04

Episode: 996   score: 13.24   Avg score (100e): 13.00   actor gain: -0.52   critic loss: 0.42   steps: 996


training loop:   1% |                                 | ETA:  36 days, 20:59:29

Episode: 997   score: 13.25   Avg score (100e): 13.00   actor gain: -0.53   critic loss: 0.42   steps: 997


training loop:   1% |                                 | ETA:  36 days, 20:58:14

Episode: 998   score: 13.25   Avg score (100e): 13.00   actor gain: -0.53   critic loss: 0.42   steps: 998


training loop:   1% |                                 | ETA:  36 days, 20:57:44

Episode: 999   score: 13.24   Avg score (100e): 13.01   actor gain: -0.53   critic loss: 0.42   steps: 999


training loop:   1% |                                 | ETA:  36 days, 20:44:53

Episode: 1000   score: 13.26   Avg score (100e): 13.01   actor gain: -0.52   critic loss: 0.42   steps: 1000


training loop:   2% |                                 | ETA:  36 days, 20:27:15

Episode: 1001   score: 13.26   Avg score (100e): 13.02   actor gain: -0.52   critic loss: 0.42   steps: 1001


training loop:   2% |                                 | ETA:  36 days, 20:10:41

Episode: 1002   score: 13.26   Avg score (100e): 13.02   actor gain: -0.53   critic loss: 0.42   steps: 1002


training loop:   2% |                                 | ETA:  36 days, 19:56:48

Episode: 1003   score: 13.28   Avg score (100e): 13.03   actor gain: -0.53   critic loss: 0.42   steps: 1003


training loop:   2% |                                 | ETA:  36 days, 19:42:12

Episode: 1004   score: 13.29   Avg score (100e): 13.03   actor gain: -0.53   critic loss: 0.41   steps: 1004


training loop:   2% |                                 | ETA:  36 days, 19:35:55

Episode: 1005   score: 13.29   Avg score (100e): 13.03   actor gain: -0.53   critic loss: 0.41   steps: 1005


training loop:   2% |                                 | ETA:  36 days, 19:24:45

Episode: 1006   score: 13.30   Avg score (100e): 13.04   actor gain: -0.53   critic loss: 0.41   steps: 1006


training loop:   2% |                                 | ETA:  36 days, 19:11:04

Episode: 1007   score: 13.30   Avg score (100e): 13.04   actor gain: -0.53   critic loss: 0.41   steps: 1007
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  36 days, 18:52:16

Episode: 1008   score: 13.30   Avg score (100e): 13.05   actor gain: -0.53   critic loss: 0.41   steps: 1008


training loop:   2% |                                 | ETA:  36 days, 18:32:56

Episode: 1009   score: 13.30   Avg score (100e): 13.05   actor gain: -0.53   critic loss: 0.41   steps: 1009


training loop:   2% |                                 | ETA:  36 days, 18:16:20

Episode: 1010   score: 13.30   Avg score (100e): 13.06   actor gain: -0.53   critic loss: 0.41   steps: 1010


training loop:   2% |                                 | ETA:  36 days, 18:14:17

Episode: 1011   score: 13.32   Avg score (100e): 13.06   actor gain: -0.53   critic loss: 0.41   steps: 1011


training loop:   2% |                                 | ETA:  36 days, 18:18:06

Episode: 1012   score: 13.32   Avg score (100e): 13.07   actor gain: -0.53   critic loss: 0.41   steps: 1012


training loop:   2% |                                 | ETA:  36 days, 18:24:54

Episode: 1013   score: 13.33   Avg score (100e): 13.07   actor gain: -0.53   critic loss: 0.41   steps: 1013


training loop:   2% |                                 | ETA:  36 days, 18:45:32

Episode: 1014   score: 13.34   Avg score (100e): 13.08   actor gain: -0.53   critic loss: 0.41   steps: 1014


training loop:   2% |                                 | ETA:  36 days, 18:52:34

Episode: 1015   score: 13.34   Avg score (100e): 13.08   actor gain: -0.40   critic loss: 0.41   steps: 1015


training loop:   2% |                                 | ETA:  36 days, 19:00:39

Episode: 1016   score: 13.34   Avg score (100e): 13.08   actor gain: -0.40   critic loss: 0.41   steps: 1016


training loop:   2% |                                 | ETA:  36 days, 19:03:19

Episode: 1017   score: 13.35   Avg score (100e): 13.09   actor gain: -0.40   critic loss: 0.41   steps: 1017


training loop:   2% |                                 | ETA:  36 days, 19:04:48

Episode: 1018   score: 13.35   Avg score (100e): 13.09   actor gain: -0.40   critic loss: 0.41   steps: 1018


training loop:   2% |                                 | ETA:  36 days, 19:03:41

Episode: 1019   score: 13.36   Avg score (100e): 13.10   actor gain: -0.40   critic loss: 0.41   steps: 1019


training loop:   2% |                                 | ETA:  36 days, 18:59:28

Episode: 1020   score: 13.36   Avg score (100e): 13.10   actor gain: -0.40   critic loss: 0.41   steps: 1020


training loop:   2% |                                 | ETA:  36 days, 19:04:11

Episode: 1021   score: 13.37   Avg score (100e): 13.11   actor gain: -0.39   critic loss: 0.41   steps: 1021
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  36 days, 18:56:27

Episode: 1022   score: 13.38   Avg score (100e): 13.11   actor gain: -0.39   critic loss: 0.41   steps: 1022


training loop:   2% |                                 | ETA:  36 days, 18:48:57

Episode: 1023   score: 13.38   Avg score (100e): 13.12   actor gain: -0.40   critic loss: 0.41   steps: 1023


training loop:   2% |                                 | ETA:  36 days, 18:56:40

Episode: 1024   score: 13.39   Avg score (100e): 13.12   actor gain: -0.40   critic loss: 0.41   steps: 1024


training loop:   2% |                                 | ETA:  36 days, 19:00:12

Episode: 1025   score: 13.40   Avg score (100e): 13.13   actor gain: -0.40   critic loss: 0.41   steps: 1025


training loop:   2% |                                 | ETA:  36 days, 19:06:36

Episode: 1026   score: 13.41   Avg score (100e): 13.13   actor gain: -0.40   critic loss: 0.41   steps: 1026


training loop:   2% |                                 | ETA:  36 days, 19:07:43

Episode: 1027   score: 13.42   Avg score (100e): 13.14   actor gain: -0.40   critic loss: 0.41   steps: 1027


training loop:   2% |                                 | ETA:  36 days, 19:07:21

Episode: 1028   score: 13.43   Avg score (100e): 13.14   actor gain: -0.39   critic loss: 0.41   steps: 1028


training loop:   2% |                                 | ETA:  36 days, 19:15:52

Episode: 1029   score: 13.43   Avg score (100e): 13.15   actor gain: -0.40   critic loss: 0.41   steps: 1029


training loop:   2% |                                 | ETA:  36 days, 19:12:59

Episode: 1030   score: 13.43   Avg score (100e): 13.15   actor gain: -0.40   critic loss: 0.41   steps: 1030


training loop:   2% |                                 | ETA:  36 days, 19:06:12

Episode: 1031   score: 13.44   Avg score (100e): 13.16   actor gain: -0.41   critic loss: 0.41   steps: 1031


training loop:   2% |                                 | ETA:  36 days, 18:53:54

Episode: 1032   score: 13.45   Avg score (100e): 13.16   actor gain: -0.41   critic loss: 0.41   steps: 1032


training loop:   2% |                                 | ETA:  36 days, 18:46:43

Episode: 1033   score: 13.44   Avg score (100e): 13.17   actor gain: -0.41   critic loss: 0.41   steps: 1033


training loop:   2% |                                 | ETA:  36 days, 18:43:39

Episode: 1034   score: 13.44   Avg score (100e): 13.17   actor gain: -0.41   critic loss: 0.41   steps: 1034


training loop:   2% |                                 | ETA:  36 days, 18:46:25

Episode: 1035   score: 13.45   Avg score (100e): 13.18   actor gain: -0.41   critic loss: 0.41   steps: 1035


training loop:   2% |                                 | ETA:  36 days, 18:42:15

Episode: 1036   score: 13.46   Avg score (100e): 13.18   actor gain: -0.41   critic loss: 0.41   steps: 1036


training loop:   2% |                                 | ETA:  36 days, 18:43:49

Episode: 1037   score: 13.47   Avg score (100e): 13.19   actor gain: -0.41   critic loss: 0.41   steps: 1037


training loop:   2% |                                 | ETA:  36 days, 18:45:45

Episode: 1038   score: 13.47   Avg score (100e): 13.19   actor gain: -0.41   critic loss: 0.41   steps: 1038


training loop:   2% |                                 | ETA:  36 days, 18:47:09

Episode: 1039   score: 13.48   Avg score (100e): 13.20   actor gain: -0.41   critic loss: 0.41   steps: 1039


training loop:   2% |                                 | ETA:  36 days, 18:49:49

Episode: 1040   score: 13.48   Avg score (100e): 13.20   actor gain: -0.41   critic loss: 0.41   steps: 1040


training loop:   2% |                                 | ETA:  36 days, 18:50:52

Episode: 1041   score: 13.48   Avg score (100e): 13.21   actor gain: -0.41   critic loss: 0.41   steps: 1041


training loop:   2% |                                 | ETA:  36 days, 18:54:24

Episode: 1042   score: 13.48   Avg score (100e): 13.21   actor gain: -0.41   critic loss: 0.41   steps: 1042


training loop:   2% |                                 | ETA:  36 days, 19:21:14

Episode: 1043   score: 13.49   Avg score (100e): 13.22   actor gain: -0.41   critic loss: 0.41   steps: 1043


training loop:   2% |                                 | ETA:  36 days, 19:34:22

Episode: 1044   score: 13.50   Avg score (100e): 13.22   actor gain: -0.42   critic loss: 0.41   steps: 1044


training loop:   2% |                                 | ETA:  36 days, 19:31:48

Episode: 1045   score: 13.50   Avg score (100e): 13.23   actor gain: -0.42   critic loss: 0.41   steps: 1045


training loop:   2% |                                 | ETA:  36 days, 19:35:25

Episode: 1046   score: 13.52   Avg score (100e): 13.23   actor gain: -0.42   critic loss: 0.41   steps: 1046


training loop:   2% |                                 | ETA:  36 days, 19:44:29

Episode: 1047   score: 13.52   Avg score (100e): 13.24   actor gain: -0.42   critic loss: 0.41   steps: 1047
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  36 days, 20:03:35

Episode: 1048   score: 13.53   Avg score (100e): 13.25   actor gain: -0.42   critic loss: 0.41   steps: 1048


training loop:   2% |                                 | ETA:  36 days, 20:07:31

Episode: 1049   score: 13.53   Avg score (100e): 13.25   actor gain: -0.42   critic loss: 0.41   steps: 1049


training loop:   2% |                                 | ETA:  36 days, 20:07:24

Episode: 1050   score: 13.53   Avg score (100e): 13.26   actor gain: -0.42   critic loss: 0.41   steps: 1050


training loop:   2% |                                 | ETA:  36 days, 20:07:40

Episode: 1051   score: 13.53   Avg score (100e): 13.26   actor gain: -0.42   critic loss: 0.41   steps: 1051
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  36 days, 20:07:09

Episode: 1052   score: 13.54   Avg score (100e): 13.27   actor gain: -0.42   critic loss: 0.41   steps: 1052


training loop:   2% |                                 | ETA:  36 days, 20:05:43

Episode: 1053   score: 13.54   Avg score (100e): 13.27   actor gain: -0.41   critic loss: 0.41   steps: 1053


training loop:   2% |                                 | ETA:  36 days, 20:01:13

Episode: 1054   score: 13.55   Avg score (100e): 13.28   actor gain: -0.41   critic loss: 0.41   steps: 1054


training loop:   2% |                                 | ETA:  36 days, 19:58:50

Episode: 1055   score: 13.56   Avg score (100e): 13.28   actor gain: -0.42   critic loss: 0.41   steps: 1055


training loop:   2% |                                 | ETA:  36 days, 19:54:48

Episode: 1056   score: 13.56   Avg score (100e): 13.29   actor gain: -0.40   critic loss: 0.41   steps: 1056


training loop:   2% |                                 | ETA:  36 days, 19:53:45

Episode: 1057   score: 13.57   Avg score (100e): 13.29   actor gain: -0.40   critic loss: 0.41   steps: 1057


training loop:   2% |                                 | ETA:  36 days, 19:52:31

Episode: 1058   score: 13.57   Avg score (100e): 13.30   actor gain: -0.40   critic loss: 0.41   steps: 1058


training loop:   2% |                                 | ETA:  36 days, 20:09:14

Episode: 1059   score: 13.57   Avg score (100e): 13.30   actor gain: -0.40   critic loss: 0.41   steps: 1059


training loop:   2% |                                 | ETA:  36 days, 20:19:31

Episode: 1060   score: 13.58   Avg score (100e): 13.31   actor gain: -0.40   critic loss: 0.41   steps: 1060
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  36 days, 20:22:11

Episode: 1061   score: 13.57   Avg score (100e): 13.31   actor gain: -0.40   critic loss: 0.41   steps: 1061


training loop:   2% |                                 | ETA:  36 days, 20:37:04

Episode: 1062   score: 13.57   Avg score (100e): 13.32   actor gain: -0.40   critic loss: 0.41   steps: 1062


training loop:   2% |                                 | ETA:  36 days, 20:42:23

Episode: 1063   score: 13.58   Avg score (100e): 13.32   actor gain: -0.40   critic loss: 0.41   steps: 1063


training loop:   2% |                                 | ETA:  36 days, 20:39:48

Episode: 1064   score: 13.59   Avg score (100e): 13.33   actor gain: -0.40   critic loss: 0.41   steps: 1064


training loop:   2% |                                 | ETA:  36 days, 20:41:11

Episode: 1065   score: 13.59   Avg score (100e): 13.34   actor gain: -0.40   critic loss: 0.41   steps: 1065


training loop:   2% |                                 | ETA:  36 days, 20:43:19

Episode: 1066   score: 13.59   Avg score (100e): 13.34   actor gain: -0.40   critic loss: 0.41   steps: 1066


training loop:   2% |                                 | ETA:  36 days, 20:45:46

Episode: 1067   score: 13.59   Avg score (100e): 13.35   actor gain: -0.40   critic loss: 0.41   steps: 1067


training loop:   2% |                                 | ETA:  36 days, 20:48:49

Episode: 1068   score: 13.60   Avg score (100e): 13.35   actor gain: -0.40   critic loss: 0.41   steps: 1068


training loop:   2% |                                 | ETA:  36 days, 20:53:04

Episode: 1069   score: 13.60   Avg score (100e): 13.36   actor gain: -0.39   critic loss: 0.41   steps: 1069


training loop:   2% |                                 | ETA:  36 days, 21:06:08

Episode: 1070   score: 13.61   Avg score (100e): 13.36   actor gain: -0.39   critic loss: 0.41   steps: 1070


training loop:   2% |                                 | ETA:  36 days, 21:09:18

Episode: 1071   score: 13.61   Avg score (100e): 13.37   actor gain: -0.39   critic loss: 0.41   steps: 1071


training loop:   2% |                                 | ETA:  36 days, 21:14:10

Episode: 1072   score: 13.62   Avg score (100e): 13.37   actor gain: -0.39   critic loss: 0.41   steps: 1072


training loop:   2% |                                 | ETA:  36 days, 21:36:59

Episode: 1073   score: 13.63   Avg score (100e): 13.38   actor gain: -0.39   critic loss: 0.41   steps: 1073


training loop:   2% |                                 | ETA:  36 days, 21:44:28

Episode: 1074   score: 13.63   Avg score (100e): 13.38   actor gain: -0.39   critic loss: 0.41   steps: 1074


training loop:   2% |                                 | ETA:  36 days, 21:40:45

Episode: 1075   score: 13.64   Avg score (100e): 13.39   actor gain: -0.39   critic loss: 0.41   steps: 1075


training loop:   2% |                                 | ETA:  36 days, 21:48:54

Episode: 1076   score: 13.65   Avg score (100e): 13.40   actor gain: -0.39   critic loss: 0.41   steps: 1076


training loop:   2% |                                 | ETA:  36 days, 21:58:06

Episode: 1077   score: 13.65   Avg score (100e): 13.40   actor gain: -0.39   critic loss: 0.41   steps: 1077


training loop:   2% |                                 | ETA:  36 days, 22:01:19

Episode: 1078   score: 13.66   Avg score (100e): 13.41   actor gain: -0.39   critic loss: 0.41   steps: 1078


training loop:   2% |                                 | ETA:  36 days, 22:01:35

Episode: 1079   score: 13.67   Avg score (100e): 13.41   actor gain: -0.39   critic loss: 0.41   steps: 1079


training loop:   2% |                                 | ETA:  36 days, 21:59:50

Episode: 1080   score: 13.68   Avg score (100e): 13.42   actor gain: -0.39   critic loss: 0.41   steps: 1080


training loop:   2% |                                 | ETA:  36 days, 22:01:27

Episode: 1081   score: 13.68   Avg score (100e): 13.42   actor gain: -0.41   critic loss: 0.41   steps: 1081


training loop:   2% |                                 | ETA:  36 days, 21:55:14

Episode: 1082   score: 13.68   Avg score (100e): 13.43   actor gain: -0.42   critic loss: 0.41   steps: 1082


training loop:   2% |                                 | ETA:  36 days, 21:54:05

Episode: 1083   score: 13.68   Avg score (100e): 13.43   actor gain: -0.43   critic loss: 0.41   steps: 1083


training loop:   2% |                                 | ETA:  36 days, 22:02:12

Episode: 1084   score: 13.68   Avg score (100e): 13.44   actor gain: -0.43   critic loss: 0.40   steps: 1084


training loop:   2% |                                 | ETA:  36 days, 22:05:44

Episode: 1085   score: 13.69   Avg score (100e): 13.44   actor gain: -0.43   critic loss: 0.40   steps: 1085


training loop:   2% |                                 | ETA:  36 days, 22:12:37

Episode: 1086   score: 13.69   Avg score (100e): 13.45   actor gain: -0.42   critic loss: 0.40   steps: 1086


training loop:   2% |                                 | ETA:  36 days, 22:16:19

Episode: 1087   score: 13.71   Avg score (100e): 13.46   actor gain: -0.42   critic loss: 0.40   steps: 1087


training loop:   2% |                                 | ETA:  36 days, 22:15:41

Episode: 1088   score: 13.71   Avg score (100e): 13.46   actor gain: -0.43   critic loss: 0.40   steps: 1088


training loop:   2% |                                 | ETA:  36 days, 22:10:40

Episode: 1089   score: 13.72   Avg score (100e): 13.47   actor gain: -1.03   critic loss: 0.40   steps: 1089


training loop:   2% |                                 | ETA:  36 days, 22:05:21

Episode: 1090   score: 13.71   Avg score (100e): 13.47   actor gain: -1.03   critic loss: 0.40   steps: 1090


training loop:   2% |                                 | ETA:  36 days, 21:59:06

Episode: 1091   score: 13.72   Avg score (100e): 13.48   actor gain: -1.03   critic loss: 0.41   steps: 1091


training loop:   2% |                                 | ETA:  36 days, 21:57:54

Episode: 1092   score: 13.72   Avg score (100e): 13.48   actor gain: -1.03   critic loss: 0.41   steps: 1092


training loop:   2% |                                 | ETA:  36 days, 21:52:51

Episode: 1093   score: 13.73   Avg score (100e): 13.49   actor gain: -1.03   critic loss: 0.40   steps: 1093


training loop:   2% |                                 | ETA:  36 days, 21:45:23

Episode: 1094   score: 13.74   Avg score (100e): 13.49   actor gain: -1.32   critic loss: 0.41   steps: 1094


training loop:   2% |                                 | ETA:  36 days, 21:52:57

Episode: 1095   score: 13.76   Avg score (100e): 13.50   actor gain: -1.31   critic loss: 0.40   steps: 1095


training loop:   2% |                                 | ETA:  36 days, 21:51:23

Episode: 1096   score: 13.76   Avg score (100e): 13.50   actor gain: -1.31   critic loss: 0.40   steps: 1096


training loop:   2% |                                 | ETA:  36 days, 22:11:26

Episode: 1097   score: 13.77   Avg score (100e): 13.51   actor gain: -1.31   critic loss: 0.40   steps: 1097


training loop:   2% |                                 | ETA:  36 days, 22:09:01

Episode: 1098   score: 13.78   Avg score (100e): 13.51   actor gain: -1.31   critic loss: 0.40   steps: 1098


training loop:   2% |                                 | ETA:  36 days, 22:06:21

Episode: 1099   score: 13.80   Avg score (100e): 13.52   actor gain: -1.31   critic loss: 0.40   steps: 1099


training loop:   2% |                                 | ETA:  36 days, 22:09:40

Episode: 1100   score: 13.81   Avg score (100e): 13.52   actor gain: -1.31   critic loss: 0.40   steps: 1100


training loop:   2% |                                 | ETA:  36 days, 22:11:03

Episode: 1101   score: 13.82   Avg score (100e): 13.53   actor gain: -1.32   critic loss: 0.40   steps: 1101


training loop:   2% |                                 | ETA:  36 days, 22:09:32

Episode: 1102   score: 13.83   Avg score (100e): 13.54   actor gain: -1.32   critic loss: 0.41   steps: 1102


training loop:   2% |                                 | ETA:  36 days, 22:15:30

Episode: 1103   score: 13.84   Avg score (100e): 13.54   actor gain: -1.32   critic loss: 0.41   steps: 1103


training loop:   2% |                                 | ETA:  36 days, 22:15:56

Episode: 1104   score: 13.85   Avg score (100e): 13.55   actor gain: -1.31   critic loss: 0.41   steps: 1104


training loop:   2% |                                 | ETA:  36 days, 22:25:59

Episode: 1105   score: 13.87   Avg score (100e): 13.55   actor gain: -1.31   critic loss: 0.41   steps: 1105


training loop:   2% |                                 | ETA:  36 days, 22:35:06

Episode: 1106   score: 13.87   Avg score (100e): 13.56   actor gain: -1.29   critic loss: 0.41   steps: 1106


training loop:   2% |                                 | ETA:  36 days, 22:33:40

Episode: 1107   score: 13.88   Avg score (100e): 13.56   actor gain: -1.28   critic loss: 0.41   steps: 1107


training loop:   2% |                                 | ETA:  36 days, 22:39:53

Episode: 1108   score: 13.90   Avg score (100e): 13.57   actor gain: -1.28   critic loss: 0.41   steps: 1108


training loop:   2% |                                 | ETA:  36 days, 22:46:54

Episode: 1109   score: 13.90   Avg score (100e): 13.58   actor gain: -1.27   critic loss: 0.41   steps: 1109


training loop:   2% |                                 | ETA:  36 days, 22:51:09

Episode: 1110   score: 13.91   Avg score (100e): 13.58   actor gain: -1.27   critic loss: 0.41   steps: 1110


training loop:   2% |                                 | ETA:  36 days, 22:52:50

Episode: 1111   score: 13.93   Avg score (100e): 13.59   actor gain: -1.28   critic loss: 0.41   steps: 1111


training loop:   2% |                                 | ETA:  36 days, 22:51:16

Episode: 1112   score: 13.94   Avg score (100e): 13.59   actor gain: -1.27   critic loss: 0.41   steps: 1112


training loop:   2% |                                 | ETA:  36 days, 22:51:33

Episode: 1113   score: 13.94   Avg score (100e): 13.60   actor gain: -1.27   critic loss: 0.41   steps: 1113


training loop:   2% |                                 | ETA:  36 days, 22:48:26

Episode: 1114   score: 13.95   Avg score (100e): 13.61   actor gain: -0.67   critic loss: 0.41   steps: 1114


training loop:   2% |                                 | ETA:  36 days, 22:48:08

Episode: 1115   score: 13.96   Avg score (100e): 13.61   actor gain: -0.67   critic loss: 0.41   steps: 1115


training loop:   2% |                                 | ETA:  36 days, 23:13:51

Episode: 1116   score: 13.98   Avg score (100e): 13.62   actor gain: -0.67   critic loss: 0.41   steps: 1116


training loop:   2% |                                 | ETA:  36 days, 23:22:43

Episode: 1117   score: 13.99   Avg score (100e): 13.63   actor gain: -0.66   critic loss: 0.41   steps: 1117


training loop:   2% |                                 | ETA:  36 days, 23:22:55

Episode: 1118   score: 14.01   Avg score (100e): 13.63   actor gain: -0.66   critic loss: 0.41   steps: 1118


training loop:   2% |                                 | ETA:  36 days, 23:34:15

Episode: 1119   score: 14.01   Avg score (100e): 13.64   actor gain: -0.38   critic loss: 0.41   steps: 1119


training loop:   2% |                                 | ETA:  36 days, 23:37:49

Episode: 1120   score: 14.03   Avg score (100e): 13.65   actor gain: -0.38   critic loss: 0.41   steps: 1120


training loop:   2% |                                 | ETA:  36 days, 23:38:22

Episode: 1121   score: 14.05   Avg score (100e): 13.65   actor gain: -0.38   critic loss: 0.41   steps: 1121


training loop:   2% |                                 | ETA:  36 days, 23:37:10

Episode: 1122   score: 14.07   Avg score (100e): 13.66   actor gain: -0.38   critic loss: 0.41   steps: 1122


training loop:   2% |                                 | ETA:  36 days, 23:34:48

Episode: 1123   score: 14.09   Avg score (100e): 13.67   actor gain: -0.38   critic loss: 0.41   steps: 1123


training loop:   2% |                                 | ETA:  36 days, 23:34:41

Episode: 1124   score: 14.11   Avg score (100e): 13.67   actor gain: -0.38   critic loss: 0.41   steps: 1124


training loop:   2% |                                 | ETA:  36 days, 23:38:56

Episode: 1125   score: 14.12   Avg score (100e): 13.68   actor gain: -0.38   critic loss: 0.41   steps: 1125


training loop:   2% |                                 | ETA:  36 days, 23:46:14

Episode: 1126   score: 14.13   Avg score (100e): 13.69   actor gain: -0.38   critic loss: 0.41   steps: 1126


training loop:   2% |                                 | ETA:  36 days, 23:52:24

Episode: 1127   score: 14.14   Avg score (100e): 13.70   actor gain: -0.38   critic loss: 0.41   steps: 1127


training loop:   2% |                                 | ETA:  36 days, 23:55:22

Episode: 1128   score: 14.16   Avg score (100e): 13.70   actor gain: -0.38   critic loss: 0.41   steps: 1128


training loop:   2% |                                 | ETA:  36 days, 23:57:48

Episode: 1129   score: 14.17   Avg score (100e): 13.71   actor gain: -0.38   critic loss: 0.41   steps: 1129


training loop:   2% |                                 | ETA:  36 days, 23:58:25

Episode: 1130   score: 14.18   Avg score (100e): 13.72   actor gain: -0.38   critic loss: 0.41   steps: 1130


training loop:   2% |                                 | ETA:  36 days, 23:55:01

Episode: 1131   score: 14.19   Avg score (100e): 13.72   actor gain: -0.38   critic loss: 0.41   steps: 1131


training loop:   2% |                                 | ETA:  36 days, 23:57:36

Episode: 1132   score: 14.20   Avg score (100e): 13.73   actor gain: -0.38   critic loss: 0.41   steps: 1132


training loop:   2% |                                 | ETA:  36 days, 23:55:38

Episode: 1133   score: 14.21   Avg score (100e): 13.74   actor gain: -0.38   critic loss: 0.41   steps: 1133


training loop:   2% |                                 | ETA:  36 days, 23:56:41

Episode: 1134   score: 14.22   Avg score (100e): 13.75   actor gain: -0.38   critic loss: 0.41   steps: 1134


training loop:   2% |                                  | ETA:  37 days, 0:05:10

Episode: 1135   score: 14.25   Avg score (100e): 13.76   actor gain: -0.38   critic loss: 0.41   steps: 1135


training loop:   2% |                                  | ETA:  37 days, 0:00:26

Episode: 1136   score: 14.25   Avg score (100e): 13.76   actor gain: -0.38   critic loss: 0.41   steps: 1136


training loop:   2% |                                  | ETA:  37 days, 0:08:23

Episode: 1137   score: 14.26   Avg score (100e): 13.77   actor gain: -0.38   critic loss: 0.41   steps: 1137


training loop:   2% |                                  | ETA:  37 days, 0:16:28

Episode: 1138   score: 14.27   Avg score (100e): 13.78   actor gain: -0.38   critic loss: 0.41   steps: 1138


training loop:   2% |                                  | ETA:  37 days, 0:25:49

Episode: 1139   score: 14.28   Avg score (100e): 13.79   actor gain: -0.38   critic loss: 0.41   steps: 1139


training loop:   2% |                                  | ETA:  37 days, 0:43:59

Episode: 1140   score: 14.30   Avg score (100e): 13.80   actor gain: -0.39   critic loss: 0.41   steps: 1140


training loop:   2% |                                  | ETA:  37 days, 0:59:12

Episode: 1141   score: 14.31   Avg score (100e): 13.80   actor gain: -0.39   critic loss: 0.41   steps: 1141


training loop:   2% |                                  | ETA:  37 days, 1:07:04

Episode: 1142   score: 14.33   Avg score (100e): 13.81   actor gain: -0.39   critic loss: 0.41   steps: 1142


training loop:   2% |                                  | ETA:  37 days, 1:08:48

Episode: 1143   score: 14.34   Avg score (100e): 13.82   actor gain: -0.40   critic loss: 0.41   steps: 1143


training loop:   2% |                                  | ETA:  37 days, 1:11:37

Episode: 1144   score: 14.36   Avg score (100e): 13.83   actor gain: -0.40   critic loss: 0.41   steps: 1144


training loop:   2% |                                  | ETA:  37 days, 1:13:05

Episode: 1145   score: 14.37   Avg score (100e): 13.84   actor gain: -0.40   critic loss: 0.41   steps: 1145


training loop:   2% |                                  | ETA:  37 days, 1:14:40

Episode: 1146   score: 14.39   Avg score (100e): 13.85   actor gain: -0.40   critic loss: 0.41   steps: 1146


training loop:   2% |                                  | ETA:  37 days, 1:15:48

Episode: 1147   score: 14.39   Avg score (100e): 13.86   actor gain: -0.40   critic loss: 0.41   steps: 1147


training loop:   2% |                                  | ETA:  37 days, 1:18:11

Episode: 1148   score: 14.41   Avg score (100e): 13.86   actor gain: -0.40   critic loss: 0.41   steps: 1148


training loop:   2% |                                  | ETA:  37 days, 1:28:57

Episode: 1149   score: 14.42   Avg score (100e): 13.87   actor gain: -0.40   critic loss: 0.41   steps: 1149


training loop:   2% |                                  | ETA:  37 days, 1:36:24

Episode: 1150   score: 14.43   Avg score (100e): 13.88   actor gain: -0.41   critic loss: 0.41   steps: 1150


training loop:   2% |                                  | ETA:  37 days, 1:36:07

Episode: 1151   score: 14.45   Avg score (100e): 13.89   actor gain: -0.42   critic loss: 0.41   steps: 1151


training loop:   2% |                                  | ETA:  37 days, 1:35:38

Episode: 1152   score: 14.46   Avg score (100e): 13.90   actor gain: -0.42   critic loss: 0.41   steps: 1152


training loop:   2% |                                  | ETA:  37 days, 1:44:39

Episode: 1153   score: 14.47   Avg score (100e): 13.91   actor gain: -0.42   critic loss: 0.41   steps: 1153


training loop:   2% |                                  | ETA:  37 days, 1:53:15

Episode: 1154   score: 14.49   Avg score (100e): 13.92   actor gain: -0.42   critic loss: 0.41   steps: 1154


training loop:   2% |                                  | ETA:  37 days, 9:38:24

Episode: 1155   score: 14.49   Avg score (100e): 13.93   actor gain: -0.42   critic loss: 0.41   steps: 1155


training loop:   2% |                                  | ETA:  37 days, 9:43:21

Episode: 1156   score: 14.51   Avg score (100e): 13.94   actor gain: -0.42   critic loss: 0.41   steps: 1156


training loop:   2% |                                  | ETA:  37 days, 9:58:51

Episode: 1157   score: 14.52   Avg score (100e): 13.95   actor gain: -0.59   critic loss: 0.41   steps: 1157


training loop:   2% |                                 | ETA:  37 days, 10:08:48

Episode: 1158   score: 14.54   Avg score (100e): 13.96   actor gain: -0.59   critic loss: 0.41   steps: 1158


training loop:   2% |                                 | ETA:  37 days, 10:08:33

Episode: 1159   score: 14.55   Avg score (100e): 13.97   actor gain: -0.60   critic loss: 0.41   steps: 1159


training loop:   2% |                                 | ETA:  37 days, 10:08:45

Episode: 1160   score: 14.56   Avg score (100e): 13.98   actor gain: -0.60   critic loss: 0.41   steps: 1160


training loop:   2% |                                 | ETA:  37 days, 10:07:53

Episode: 1161   score: 14.58   Avg score (100e): 13.99   actor gain: -0.61   critic loss: 0.41   steps: 1161


training loop:   2% |                                 | ETA:  37 days, 10:11:07

Episode: 1162   score: 14.59   Avg score (100e): 14.00   actor gain: -0.61   critic loss: 0.41   steps: 1162


training loop:   2% |                                 | ETA:  37 days, 10:10:30

Episode: 1163   score: 14.61   Avg score (100e): 14.01   actor gain: -0.60   critic loss: 0.41   steps: 1163


training loop:   2% |                                 | ETA:  37 days, 10:12:33

Episode: 1164   score: 14.62   Avg score (100e): 14.02   actor gain: -0.60   critic loss: 0.41   steps: 1164


training loop:   2% |                                 | ETA:  37 days, 10:12:24

Episode: 1165   score: 14.63   Avg score (100e): 14.03   actor gain: -0.60   critic loss: 0.41   steps: 1165


training loop:   2% |                                 | ETA:  37 days, 10:11:12

Episode: 1166   score: 14.66   Avg score (100e): 14.04   actor gain: -0.60   critic loss: 0.41   steps: 1166


training loop:   2% |                                 | ETA:  37 days, 10:09:10

Episode: 1167   score: 14.67   Avg score (100e): 14.05   actor gain: -0.60   critic loss: 0.41   steps: 1167


training loop:   2% |                                 | ETA:  37 days, 10:10:12

Episode: 1168   score: 14.67   Avg score (100e): 14.06   actor gain: -0.59   critic loss: 0.41   steps: 1168


training loop:   2% |                                 | ETA:  37 days, 10:13:19

Episode: 1169   score: 14.68   Avg score (100e): 14.07   actor gain: -0.59   critic loss: 0.41   steps: 1169


training loop:   2% |                                 | ETA:  37 days, 10:17:49

Episode: 1170   score: 14.70   Avg score (100e): 14.08   actor gain: -0.59   critic loss: 0.41   steps: 1170


training loop:   2% |                                 | ETA:  37 days, 10:17:49

Episode: 1171   score: 14.71   Avg score (100e): 14.09   actor gain: -0.59   critic loss: 0.41   steps: 1171


training loop:   2% |                                 | ETA:  37 days, 10:20:34

Episode: 1172   score: 14.72   Avg score (100e): 14.10   actor gain: -0.59   critic loss: 0.41   steps: 1172


training loop:   2% |                                 | ETA:  37 days, 10:19:21

Episode: 1173   score: 14.73   Avg score (100e): 14.12   actor gain: -0.59   critic loss: 0.41   steps: 1173


training loop:   2% |                                 | ETA:  37 days, 10:38:53

Episode: 1174   score: 14.75   Avg score (100e): 14.13   actor gain: -0.60   critic loss: 0.41   steps: 1174


training loop:   2% |                                 | ETA:  37 days, 10:46:23

Episode: 1175   score: 14.76   Avg score (100e): 14.14   actor gain: -0.59   critic loss: 0.41   steps: 1175


training loop:   2% |                                 | ETA:  37 days, 10:59:32

Episode: 1176   score: 14.78   Avg score (100e): 14.15   actor gain: -0.58   critic loss: 0.41   steps: 1176


training loop:   2% |                                 | ETA:  37 days, 11:12:03

Episode: 1177   score: 14.78   Avg score (100e): 14.16   actor gain: -0.58   critic loss: 0.41   steps: 1177


training loop:   2% |                                 | ETA:  37 days, 11:30:10

Episode: 1178   score: 14.80   Avg score (100e): 14.17   actor gain: -0.58   critic loss: 0.41   steps: 1178


training loop:   2% |                                 | ETA:  37 days, 11:37:53

Episode: 1179   score: 14.81   Avg score (100e): 14.18   actor gain: -0.58   critic loss: 0.41   steps: 1179


training loop:   2% |                                 | ETA:  37 days, 11:40:57

Episode: 1180   score: 14.82   Avg score (100e): 14.19   actor gain: -0.61   critic loss: 0.41   steps: 1180


training loop:   2% |                                 | ETA:  37 days, 11:44:13

Episode: 1181   score: 14.84   Avg score (100e): 14.21   actor gain: -0.61   critic loss: 0.41   steps: 1181


training loop:   2% |                                 | ETA:  37 days, 11:46:56

Episode: 1182   score: 14.85   Avg score (100e): 14.22   actor gain: -0.44   critic loss: 0.41   steps: 1182


training loop:   2% |                                 | ETA:  37 days, 11:50:34

Episode: 1183   score: 14.86   Avg score (100e): 14.23   actor gain: -0.44   critic loss: 0.41   steps: 1183


training loop:   2% |                                 | ETA:  37 days, 11:49:19

Episode: 1184   score: 14.87   Avg score (100e): 14.24   actor gain: -0.43   critic loss: 0.41   steps: 1184


training loop:   2% |                                 | ETA:  37 days, 11:48:03

Episode: 1185   score: 14.88   Avg score (100e): 14.25   actor gain: -0.43   critic loss: 0.41   steps: 1185


training loop:   2% |                                 | ETA:  37 days, 11:46:57

Episode: 1186   score: 14.90   Avg score (100e): 14.27   actor gain: -0.42   critic loss: 0.41   steps: 1186


training loop:   2% |                                 | ETA:  37 days, 11:48:20

Episode: 1187   score: 14.91   Avg score (100e): 14.28   actor gain: -0.42   critic loss: 0.41   steps: 1187


training loop:   2% |                                 | ETA:  37 days, 11:50:38

Episode: 1188   score: 14.92   Avg score (100e): 14.29   actor gain: -0.42   critic loss: 0.41   steps: 1188


training loop:   2% |                                 | ETA:  37 days, 11:59:17

Episode: 1189   score: 14.93   Avg score (100e): 14.30   actor gain: -0.42   critic loss: 0.41   steps: 1189


training loop:   2% |                                 | ETA:  37 days, 12:05:14

Episode: 1190   score: 14.96   Avg score (100e): 14.31   actor gain: -0.41   critic loss: 0.41   steps: 1190


training loop:   2% |                                 | ETA:  37 days, 12:26:08

Episode: 1191   score: 14.97   Avg score (100e): 14.33   actor gain: -0.41   critic loss: 0.41   steps: 1191


training loop:   2% |                                 | ETA:  37 days, 12:30:20

Episode: 1192   score: 14.99   Avg score (100e): 14.34   actor gain: -0.41   critic loss: 0.41   steps: 1192


training loop:   2% |                                 | ETA:  37 days, 12:46:19

Episode: 1193   score: 15.01   Avg score (100e): 14.35   actor gain: -0.41   critic loss: 0.41   steps: 1193


training loop:   2% |                                 | ETA:  37 days, 12:59:39

Episode: 1194   score: 15.02   Avg score (100e): 14.37   actor gain: -0.41   critic loss: 0.41   steps: 1194


training loop:   2% |                                 | ETA:  37 days, 12:59:27

Episode: 1195   score: 15.03   Avg score (100e): 14.38   actor gain: -0.41   critic loss: 0.41   steps: 1195


training loop:   2% |                                 | ETA:  37 days, 13:04:21

Episode: 1196   score: 15.04   Avg score (100e): 14.39   actor gain: -0.41   critic loss: 0.41   steps: 1196


training loop:   2% |                                 | ETA:  37 days, 13:14:29

Episode: 1197   score: 15.06   Avg score (100e): 14.40   actor gain: -0.45   critic loss: 0.41   steps: 1197


training loop:   2% |                                 | ETA:  37 days, 13:29:52

Episode: 1198   score: 15.07   Avg score (100e): 14.42   actor gain: -0.45   critic loss: 0.41   steps: 1198


training loop:   2% |                                 | ETA:  37 days, 13:37:41

Episode: 1199   score: 15.08   Avg score (100e): 14.43   actor gain: -0.44   critic loss: 0.41   steps: 1199


training loop:   2% |                                 | ETA:  37 days, 13:40:00

Episode: 1200   score: 15.09   Avg score (100e): 14.44   actor gain: -0.44   critic loss: 0.41   steps: 1200


training loop:   2% |                                 | ETA:  37 days, 13:39:13

Episode: 1201   score: 15.12   Avg score (100e): 14.46   actor gain: -0.44   critic loss: 0.41   steps: 1201


training loop:   2% |                                 | ETA:  37 days, 13:38:53

Episode: 1202   score: 15.13   Avg score (100e): 14.47   actor gain: -0.46   critic loss: 0.41   steps: 1202


training loop:   2% |                                 | ETA:  37 days, 13:39:27

Episode: 1203   score: 15.15   Avg score (100e): 14.48   actor gain: -0.46   critic loss: 0.41   steps: 1203


training loop:   2% |                                 | ETA:  37 days, 13:38:35

Episode: 1204   score: 15.16   Avg score (100e): 14.49   actor gain: -0.46   critic loss: 0.41   steps: 1204


training loop:   2% |                                 | ETA:  37 days, 13:37:39

Episode: 1205   score: 15.18   Avg score (100e): 14.51   actor gain: -0.43   critic loss: 0.41   steps: 1205


training loop:   2% |                                 | ETA:  37 days, 13:44:22

Episode: 1206   score: 15.19   Avg score (100e): 14.52   actor gain: -0.43   critic loss: 0.41   steps: 1206


training loop:   2% |                                 | ETA:  37 days, 13:44:59

Episode: 1207   score: 15.20   Avg score (100e): 14.53   actor gain: -0.43   critic loss: 0.41   steps: 1207


training loop:   2% |                                 | ETA:  37 days, 13:43:09

Episode: 1208   score: 15.21   Avg score (100e): 14.55   actor gain: -0.43   critic loss: 0.41   steps: 1208


training loop:   2% |                                 | ETA:  37 days, 13:42:03

Episode: 1209   score: 15.23   Avg score (100e): 14.56   actor gain: -0.43   critic loss: 0.41   steps: 1209


training loop:   2% |                                 | ETA:  37 days, 13:41:32

Episode: 1210   score: 15.25   Avg score (100e): 14.57   actor gain: -0.59   critic loss: 0.41   steps: 1210


training loop:   2% |                                 | ETA:  37 days, 13:39:34

Episode: 1211   score: 15.26   Avg score (100e): 14.59   actor gain: -0.59   critic loss: 0.41   steps: 1211


training loop:   2% |                                 | ETA:  37 days, 13:36:26

Episode: 1212   score: 15.27   Avg score (100e): 14.60   actor gain: -0.63   critic loss: 0.41   steps: 1212


training loop:   2% |                                 | ETA:  37 days, 13:33:10

Episode: 1213   score: 15.29   Avg score (100e): 14.61   actor gain: -0.62   critic loss: 0.41   steps: 1213


training loop:   2% |                                 | ETA:  37 days, 13:31:29

Episode: 1214   score: 15.30   Avg score (100e): 14.63   actor gain: -0.62   critic loss: 0.41   steps: 1214


training loop:   2% |                                 | ETA:  37 days, 13:26:55

Episode: 1215   score: 15.32   Avg score (100e): 14.64   actor gain: -0.62   critic loss: 0.41   steps: 1215


training loop:   2% |                                 | ETA:  37 days, 13:22:51

Episode: 1216   score: 15.33   Avg score (100e): 14.65   actor gain: -0.63   critic loss: 0.42   steps: 1216


training loop:   2% |                                 | ETA:  37 days, 13:19:17

Episode: 1217   score: 15.35   Avg score (100e): 14.67   actor gain: -0.63   critic loss: 0.42   steps: 1217


training loop:   2% |                                 | ETA:  37 days, 13:16:36

Episode: 1218   score: 15.37   Avg score (100e): 14.68   actor gain: -0.63   critic loss: 0.42   steps: 1218


training loop:   2% |                                 | ETA:  37 days, 13:10:13

Episode: 1219   score: 15.38   Avg score (100e): 14.70   actor gain: -0.63   critic loss: 0.42   steps: 1219


training loop:   2% |                                 | ETA:  37 days, 13:07:32

Episode: 1220   score: 15.39   Avg score (100e): 14.71   actor gain: -0.63   critic loss: 0.42   steps: 1220


training loop:   2% |                                 | ETA:  37 days, 13:14:46

Episode: 1221   score: 15.41   Avg score (100e): 14.72   actor gain: -0.63   critic loss: 0.42   steps: 1221


training loop:   2% |                                 | ETA:  37 days, 13:11:21

Episode: 1222   score: 15.42   Avg score (100e): 14.74   actor gain: -0.59   critic loss: 0.42   steps: 1222


training loop:   2% |                                 | ETA:  37 days, 13:08:50

Episode: 1223   score: 15.43   Avg score (100e): 14.75   actor gain: -0.59   critic loss: 0.42   steps: 1223


training loop:   2% |                                 | ETA:  37 days, 13:06:54

Episode: 1224   score: 15.44   Avg score (100e): 14.76   actor gain: -0.59   critic loss: 0.42   steps: 1224


training loop:   2% |                                 | ETA:  37 days, 13:22:01

Episode: 1225   score: 15.46   Avg score (100e): 14.78   actor gain: -0.59   critic loss: 0.42   steps: 1225


training loop:   2% |                                 | ETA:  37 days, 13:19:30

Episode: 1226   score: 15.47   Avg score (100e): 14.79   actor gain: -0.59   critic loss: 0.42   steps: 1226


training loop:   2% |                                 | ETA:  37 days, 13:18:17

Episode: 1227   score: 15.49   Avg score (100e): 14.80   actor gain: -0.57   critic loss: 0.42   steps: 1227


training loop:   2% |                                 | ETA:  37 days, 13:21:24

Episode: 1228   score: 15.51   Avg score (100e): 14.82   actor gain: -0.57   critic loss: 0.42   steps: 1228


training loop:   2% |                                 | ETA:  37 days, 13:26:14

Episode: 1229   score: 15.52   Avg score (100e): 14.83   actor gain: -0.57   critic loss: 0.42   steps: 1229


training loop:   2% |                                 | ETA:  37 days, 13:26:55

Episode: 1230   score: 15.54   Avg score (100e): 14.84   actor gain: -0.58   critic loss: 0.42   steps: 1230


training loop:   2% |                                 | ETA:  37 days, 13:27:11

Episode: 1231   score: 15.55   Avg score (100e): 14.86   actor gain: -0.58   critic loss: 0.42   steps: 1231


training loop:   2% |                                 | ETA:  37 days, 13:22:33

Episode: 1232   score: 15.58   Avg score (100e): 14.87   actor gain: -0.58   critic loss: 0.42   steps: 1232


training loop:   2% |                                 | ETA:  37 days, 13:20:10

Episode: 1233   score: 15.59   Avg score (100e): 14.88   actor gain: -0.58   critic loss: 0.42   steps: 1233


training loop:   2% |                                 | ETA:  37 days, 13:15:09

Episode: 1234   score: 15.59   Avg score (100e): 14.90   actor gain: -0.58   critic loss: 0.42   steps: 1234


training loop:   2% |                                 | ETA:  37 days, 13:15:02

Episode: 1235   score: 15.61   Avg score (100e): 14.91   actor gain: -0.42   critic loss: 0.42   steps: 1235


training loop:   2% |                                 | ETA:  37 days, 13:11:44

Episode: 1236   score: 15.62   Avg score (100e): 14.93   actor gain: -0.42   critic loss: 0.42   steps: 1236


training loop:   2% |                                 | ETA:  37 days, 13:07:51

Episode: 1237   score: 15.63   Avg score (100e): 14.94   actor gain: -0.38   critic loss: 0.42   steps: 1237


training loop:   2% |                                 | ETA:  37 days, 13:17:50

Episode: 1238   score: 15.63   Avg score (100e): 14.95   actor gain: -0.38   critic loss: 0.42   steps: 1238


training loop:   2% |                                 | ETA:  37 days, 13:14:35

Episode: 1239   score: 15.64   Avg score (100e): 14.97   actor gain: -0.38   critic loss: 0.42   steps: 1239


training loop:   2% |                                 | ETA:  37 days, 13:13:39

Episode: 1240   score: 15.65   Avg score (100e): 14.98   actor gain: -0.38   critic loss: 0.42   steps: 1240
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 13:12:59

Episode: 1241   score: 15.66   Avg score (100e): 14.99   actor gain: -0.38   critic loss: 0.42   steps: 1241


training loop:   2% |                                 | ETA:  37 days, 13:12:53

Episode: 1242   score: 15.67   Avg score (100e): 15.01   actor gain: -0.38   critic loss: 0.42   steps: 1242


training loop:   2% |                                 | ETA:  37 days, 13:13:58

Episode: 1243   score: 15.68   Avg score (100e): 15.02   actor gain: -0.38   critic loss: 0.42   steps: 1243


training loop:   2% |                                 | ETA:  37 days, 13:14:58

Episode: 1244   score: 15.69   Avg score (100e): 15.03   actor gain: -0.38   critic loss: 0.42   steps: 1244


training loop:   2% |                                 | ETA:  37 days, 13:21:20

Episode: 1245   score: 15.71   Avg score (100e): 15.05   actor gain: -0.38   critic loss: 0.42   steps: 1245


training loop:   2% |                                 | ETA:  37 days, 13:23:42

Episode: 1246   score: 15.72   Avg score (100e): 15.06   actor gain: -0.38   critic loss: 0.41   steps: 1246


training loop:   2% |                                 | ETA:  37 days, 13:26:51

Episode: 1247   score: 15.73   Avg score (100e): 15.07   actor gain: -0.38   critic loss: 0.41   steps: 1247


training loop:   2% |                                 | ETA:  37 days, 13:24:06

Episode: 1248   score: 15.76   Avg score (100e): 15.09   actor gain: -0.38   critic loss: 0.41   steps: 1248


training loop:   2% |                                 | ETA:  37 days, 13:18:34

Episode: 1249   score: 15.77   Avg score (100e): 15.10   actor gain: -0.38   critic loss: 0.41   steps: 1249


training loop:   2% |                                 | ETA:  37 days, 13:15:42

Episode: 1250   score: 15.78   Avg score (100e): 15.11   actor gain: -0.38   critic loss: 0.41   steps: 1250


training loop:   2% |                                 | ETA:  37 days, 13:14:57

Episode: 1251   score: 15.80   Avg score (100e): 15.13   actor gain: -0.38   critic loss: 0.41   steps: 1251


training loop:   2% |                                 | ETA:  37 days, 13:14:50

Episode: 1252   score: 15.82   Avg score (100e): 15.14   actor gain: -0.38   critic loss: 0.41   steps: 1252


training loop:   2% |                                 | ETA:  37 days, 13:16:55

Episode: 1253   score: 15.84   Avg score (100e): 15.15   actor gain: -0.38   critic loss: 0.41   steps: 1253


training loop:   2% |                                 | ETA:  37 days, 13:22:25

Episode: 1254   score: 15.85   Avg score (100e): 15.17   actor gain: -0.39   critic loss: 0.41   steps: 1254


training loop:   2% |                                 | ETA:  37 days, 13:23:07

Episode: 1255   score: 15.86   Avg score (100e): 15.18   actor gain: -0.38   critic loss: 0.41   steps: 1255


training loop:   2% |                                 | ETA:  37 days, 13:19:54

Episode: 1256   score: 15.87   Avg score (100e): 15.20   actor gain: -0.38   critic loss: 0.41   steps: 1256


training loop:   2% |                                 | ETA:  37 days, 13:24:48

Episode: 1257   score: 15.89   Avg score (100e): 15.21   actor gain: -0.38   critic loss: 0.41   steps: 1257
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 13:13:47

Episode: 1258   score: 15.90   Avg score (100e): 15.22   actor gain: -0.38   critic loss: 0.41   steps: 1258


training loop:   2% |                                 | ETA:  37 days, 13:04:03

Episode: 1259   score: 15.91   Avg score (100e): 15.24   actor gain: -0.38   critic loss: 0.41   steps: 1259


training loop:   2% |                                 | ETA:  37 days, 12:53:32

Episode: 1260   score: 15.92   Avg score (100e): 15.25   actor gain: -0.38   critic loss: 0.41   steps: 1260


training loop:   2% |                                 | ETA:  37 days, 12:50:16

Episode: 1261   score: 15.92   Avg score (100e): 15.26   actor gain: -0.38   critic loss: 0.41   steps: 1261


training loop:   2% |                                 | ETA:  37 days, 12:45:11

Episode: 1262   score: 15.94   Avg score (100e): 15.28   actor gain: -0.38   critic loss: 0.41   steps: 1262


training loop:   2% |                                 | ETA:  37 days, 12:39:30

Episode: 1263   score: 15.95   Avg score (100e): 15.29   actor gain: -0.38   critic loss: 0.41   steps: 1263
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:37:26

Episode: 1264   score: 15.97   Avg score (100e): 15.30   actor gain: -0.38   critic loss: 0.41   steps: 1264


training loop:   2% |                                 | ETA:  37 days, 12:43:20

Episode: 1265   score: 15.98   Avg score (100e): 15.32   actor gain: -0.38   critic loss: 0.41   steps: 1265


training loop:   2% |                                 | ETA:  37 days, 12:43:39

Episode: 1266   score: 15.99   Avg score (100e): 15.33   actor gain: -0.38   critic loss: 0.41   steps: 1266


training loop:   2% |                                 | ETA:  37 days, 12:45:59

Episode: 1267   score: 16.01   Avg score (100e): 15.34   actor gain: -0.38   critic loss: 0.41   steps: 1267


training loop:   2% |                                 | ETA:  37 days, 12:55:07

Episode: 1268   score: 16.02   Avg score (100e): 15.36   actor gain: -0.38   critic loss: 0.41   steps: 1268


training loop:   2% |                                 | ETA:  37 days, 12:52:07

Episode: 1269   score: 16.03   Avg score (100e): 15.37   actor gain: -0.38   critic loss: 0.41   steps: 1269


training loop:   2% |                                 | ETA:  37 days, 12:49:27

Episode: 1270   score: 16.04   Avg score (100e): 15.38   actor gain: -0.38   critic loss: 0.41   steps: 1270


training loop:   2% |                                 | ETA:  37 days, 12:55:32

Episode: 1271   score: 16.04   Avg score (100e): 15.40   actor gain: -0.39   critic loss: 0.41   steps: 1271


training loop:   2% |                                 | ETA:  37 days, 12:52:31

Episode: 1272   score: 16.06   Avg score (100e): 15.41   actor gain: -0.39   critic loss: 0.41   steps: 1272


training loop:   2% |                                 | ETA:  37 days, 12:50:46

Episode: 1273   score: 16.07   Avg score (100e): 15.42   actor gain: -0.39   critic loss: 0.41   steps: 1273


training loop:   2% |                                 | ETA:  37 days, 12:47:11

Episode: 1274   score: 16.09   Avg score (100e): 15.44   actor gain: -0.39   critic loss: 0.41   steps: 1274


training loop:   2% |                                 | ETA:  37 days, 12:44:04

Episode: 1275   score: 16.10   Avg score (100e): 15.45   actor gain: -0.39   critic loss: 0.41   steps: 1275


training loop:   2% |                                 | ETA:  37 days, 12:42:05

Episode: 1276   score: 16.11   Avg score (100e): 15.46   actor gain: -0.39   critic loss: 0.41   steps: 1276
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:40:17

Episode: 1277   score: 16.14   Avg score (100e): 15.48   actor gain: -0.39   critic loss: 0.41   steps: 1277


training loop:   2% |                                 | ETA:  37 days, 12:37:53

Episode: 1278   score: 16.15   Avg score (100e): 15.49   actor gain: -0.39   critic loss: 0.41   steps: 1278


training loop:   2% |                                 | ETA:  37 days, 12:33:18

Episode: 1279   score: 16.17   Avg score (100e): 15.51   actor gain: -0.38   critic loss: 0.41   steps: 1279


training loop:   2% |                                 | ETA:  37 days, 12:29:33

Episode: 1280   score: 16.18   Avg score (100e): 15.52   actor gain: -0.38   critic loss: 0.41   steps: 1280


training loop:   2% |                                 | ETA:  37 days, 12:24:33

Episode: 1281   score: 16.19   Avg score (100e): 15.53   actor gain: -0.38   critic loss: 0.41   steps: 1281


training loop:   2% |                                 | ETA:  37 days, 12:22:21

Episode: 1282   score: 16.21   Avg score (100e): 15.55   actor gain: -0.39   critic loss: 0.41   steps: 1282


training loop:   2% |                                 | ETA:  37 days, 12:19:05

Episode: 1283   score: 16.22   Avg score (100e): 15.56   actor gain: -0.42   critic loss: 0.41   steps: 1283


training loop:   2% |                                 | ETA:  37 days, 12:24:39

Episode: 1284   score: 16.24   Avg score (100e): 15.57   actor gain: -0.44   critic loss: 0.41   steps: 1284
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:22:42

Episode: 1285   score: 16.24   Avg score (100e): 15.59   actor gain: -7.29   critic loss: 0.41   steps: 1285


training loop:   2% |                                 | ETA:  37 days, 12:24:37

Episode: 1286   score: 16.26   Avg score (100e): 15.60   actor gain: -7.28   critic loss: 0.41   steps: 1286


training loop:   2% |                                 | ETA:  37 days, 12:22:19

Episode: 1287   score: 16.27   Avg score (100e): 15.61   actor gain: -7.28   critic loss: 0.41   steps: 1287


training loop:   2% |                                 | ETA:  37 days, 12:22:16

Episode: 1288   score: 16.27   Avg score (100e): 15.63   actor gain: -7.28   critic loss: 0.41   steps: 1288


training loop:   2% |                                 | ETA:  37 days, 12:20:38

Episode: 1289   score: 16.28   Avg score (100e): 15.64   actor gain: -7.28   critic loss: 0.41   steps: 1289


training loop:   2% |                                 | ETA:  37 days, 12:18:38

Episode: 1290   score: 16.30   Avg score (100e): 15.65   actor gain: -7.28   critic loss: 0.41   steps: 1290
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:33:27

Episode: 1291   score: 16.32   Avg score (100e): 15.67   actor gain: -7.28   critic loss: 0.41   steps: 1291


training loop:   2% |                                 | ETA:  37 days, 12:38:25

Episode: 1292   score: 16.32   Avg score (100e): 15.68   actor gain: -7.28   critic loss: 0.41   steps: 1292


training loop:   2% |                                 | ETA:  37 days, 12:39:55

Episode: 1293   score: 16.33   Avg score (100e): 15.69   actor gain: -7.28   critic loss: 0.41   steps: 1293


training loop:   2% |                                 | ETA:  37 days, 12:38:50

Episode: 1294   score: 16.34   Avg score (100e): 15.71   actor gain: -7.28   critic loss: 0.41   steps: 1294


training loop:   2% |                                 | ETA:  37 days, 12:38:58

Episode: 1295   score: 16.34   Avg score (100e): 15.72   actor gain: -7.28   critic loss: 0.41   steps: 1295


training loop:   2% |                                 | ETA:  37 days, 12:40:26

Episode: 1296   score: 16.35   Avg score (100e): 15.73   actor gain: -7.28   critic loss: 0.41   steps: 1296


training loop:   2% |                                 | ETA:  37 days, 12:49:56

Episode: 1297   score: 16.35   Avg score (100e): 15.75   actor gain: -7.28   critic loss: 0.41   steps: 1297


training loop:   2% |                                 | ETA:  37 days, 12:55:59

Episode: 1298   score: 16.36   Avg score (100e): 15.76   actor gain: -7.28   critic loss: 0.41   steps: 1298


training loop:   2% |                                 | ETA:  37 days, 13:00:17

Episode: 1299   score: 16.37   Avg score (100e): 15.77   actor gain: -7.28   critic loss: 0.41   steps: 1299


training loop:   2% |                                 | ETA:  37 days, 12:58:45

Episode: 1300   score: 16.37   Avg score (100e): 15.79   actor gain: -7.28   critic loss: 0.41   steps: 1300
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:55:05

Episode: 1301   score: 16.38   Avg score (100e): 15.80   actor gain: -7.28   critic loss: 0.41   steps: 1301


training loop:   2% |                                 | ETA:  37 days, 12:55:06

Episode: 1302   score: 16.39   Avg score (100e): 15.81   actor gain: -7.28   critic loss: 0.41   steps: 1302


training loop:   2% |                                 | ETA:  37 days, 13:04:56

Episode: 1303   score: 16.39   Avg score (100e): 15.82   actor gain: -7.28   critic loss: 0.41   steps: 1303


training loop:   2% |                                 | ETA:  37 days, 13:02:25

Episode: 1304   score: 16.40   Avg score (100e): 15.84   actor gain: -7.28   critic loss: 0.41   steps: 1304


training loop:   2% |                                 | ETA:  37 days, 13:02:08

Episode: 1305   score: 16.42   Avg score (100e): 15.85   actor gain: -7.28   critic loss: 0.41   steps: 1305


training loop:   2% |                                 | ETA:  37 days, 13:00:42

Episode: 1306   score: 16.42   Avg score (100e): 15.86   actor gain: -7.29   critic loss: 0.41   steps: 1306


training loop:   2% |                                 | ETA:  37 days, 12:57:41

Episode: 1307   score: 16.44   Avg score (100e): 15.87   actor gain: -7.29   critic loss: 0.41   steps: 1307


training loop:   2% |                                 | ETA:  37 days, 12:53:25

Episode: 1308   score: 16.45   Avg score (100e): 15.89   actor gain: -7.26   critic loss: 0.41   steps: 1308


training loop:   2% |                                 | ETA:  37 days, 12:50:46

Episode: 1309   score: 16.46   Avg score (100e): 15.90   actor gain: -7.24   critic loss: 0.41   steps: 1309


training loop:   2% |                                 | ETA:  37 days, 12:47:01

Episode: 1310   score: 16.46   Avg score (100e): 15.91   actor gain: -0.40   critic loss: 0.41   steps: 1310


training loop:   2% |                                 | ETA:  37 days, 12:44:31

Episode: 1311   score: 16.48   Avg score (100e): 15.92   actor gain: -0.40   critic loss: 0.41   steps: 1311


training loop:   2% |                                 | ETA:  37 days, 12:42:01

Episode: 1312   score: 16.50   Avg score (100e): 15.93   actor gain: -0.41   critic loss: 0.41   steps: 1312


training loop:   2% |                                 | ETA:  37 days, 12:49:10

Episode: 1313   score: 16.50   Avg score (100e): 15.95   actor gain: -0.41   critic loss: 0.41   steps: 1313


training loop:   2% |                                 | ETA:  37 days, 12:48:53

Episode: 1314   score: 16.51   Avg score (100e): 15.96   actor gain: -0.41   critic loss: 0.41   steps: 1314


training loop:   2% |                                 | ETA:  37 days, 12:58:27

Episode: 1315   score: 16.52   Avg score (100e): 15.97   actor gain: -0.41   critic loss: 0.41   steps: 1315


training loop:   2% |                                 | ETA:  37 days, 12:59:49

Episode: 1316   score: 16.52   Avg score (100e): 15.98   actor gain: -0.42   critic loss: 0.41   steps: 1316


training loop:   2% |                                 | ETA:  37 days, 12:58:39

Episode: 1317   score: 16.53   Avg score (100e): 15.99   actor gain: -0.42   critic loss: 0.41   steps: 1317


training loop:   2% |                                 | ETA:  37 days, 12:55:25

Episode: 1318   score: 16.55   Avg score (100e): 16.01   actor gain: -0.42   critic loss: 0.41   steps: 1318


training loop:   2% |                                 | ETA:  37 days, 12:58:12

Episode: 1319   score: 16.55   Avg score (100e): 16.02   actor gain: -0.42   critic loss: 0.41   steps: 1319


training loop:   2% |                                 | ETA:  37 days, 12:57:25

Episode: 1320   score: 16.56   Avg score (100e): 16.03   actor gain: -0.45   critic loss: 0.41   steps: 1320


training loop:   2% |                                 | ETA:  37 days, 12:55:17

Episode: 1321   score: 16.57   Avg score (100e): 16.04   actor gain: -0.46   critic loss: 0.41   steps: 1321


training loop:   2% |                                 | ETA:  37 days, 12:51:33

Episode: 1322   score: 16.59   Avg score (100e): 16.05   actor gain: -0.51   critic loss: 0.41   steps: 1322


training loop:   2% |                                 | ETA:  37 days, 12:49:34

Episode: 1323   score: 16.60   Avg score (100e): 16.06   actor gain: -0.51   critic loss: 0.41   steps: 1323


training loop:   2% |                                 | ETA:  37 days, 12:51:13

Episode: 1324   score: 16.60   Avg score (100e): 16.08   actor gain: -0.51   critic loss: 0.41   steps: 1324
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 12:55:45

Episode: 1325   score: 16.61   Avg score (100e): 16.09   actor gain: -0.51   critic loss: 0.41   steps: 1325


training loop:   2% |                                 | ETA:  37 days, 12:57:56

Episode: 1326   score: 16.61   Avg score (100e): 16.10   actor gain: -0.51   critic loss: 0.41   steps: 1326


training loop:   2% |                                 | ETA:  37 days, 12:54:56

Episode: 1327   score: 16.62   Avg score (100e): 16.11   actor gain: -0.51   critic loss: 0.41   steps: 1327


training loop:   2% |                                 | ETA:  37 days, 13:03:42

Episode: 1328   score: 16.63   Avg score (100e): 16.12   actor gain: -0.57   critic loss: 0.41   steps: 1328


training loop:   2% |                                 | ETA:  37 days, 13:01:24

Episode: 1329   score: 16.63   Avg score (100e): 16.13   actor gain: -0.57   critic loss: 0.41   steps: 1329


training loop:   2% |                                 | ETA:  37 days, 13:18:57

Episode: 1330   score: 16.64   Avg score (100e): 16.14   actor gain: -0.57   critic loss: 0.41   steps: 1330


training loop:   2% |                                 | ETA:  37 days, 13:25:33

Episode: 1331   score: 16.65   Avg score (100e): 16.15   actor gain: -0.56   critic loss: 0.41   steps: 1331


training loop:   2% |                                 | ETA:  37 days, 13:30:30

Episode: 1332   score: 16.66   Avg score (100e): 16.16   actor gain: -0.56   critic loss: 0.41   steps: 1332
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 13:37:13

Episode: 1333   score: 16.67   Avg score (100e): 16.18   actor gain: -0.56   critic loss: 0.41   steps: 1333


training loop:   2% |                                 | ETA:  37 days, 13:37:52

Episode: 1334   score: 16.69   Avg score (100e): 16.19   actor gain: -0.56   critic loss: 0.41   steps: 1334


training loop:   2% |                                 | ETA:  37 days, 13:57:42

Episode: 1335   score: 16.69   Avg score (100e): 16.20   actor gain: -0.56   critic loss: 0.41   steps: 1335


training loop:   2% |                                 | ETA:  37 days, 14:02:34

Episode: 1336   score: 16.68   Avg score (100e): 16.21   actor gain: -0.56   critic loss: 0.41   steps: 1336


training loop:   2% |                                 | ETA:  37 days, 14:02:55

Episode: 1337   score: 16.69   Avg score (100e): 16.22   actor gain: -0.55   critic loss: 0.41   steps: 1337


training loop:   2% |                                 | ETA:  37 days, 14:01:56

Episode: 1338   score: 16.70   Avg score (100e): 16.23   actor gain: -0.55   critic loss: 0.41   steps: 1338


training loop:   2% |                                 | ETA:  37 days, 14:01:19

Episode: 1339   score: 16.70   Avg score (100e): 16.24   actor gain: -0.56   critic loss: 0.41   steps: 1339


training loop:   2% |                                 | ETA:  37 days, 13:58:37

Episode: 1340   score: 16.71   Avg score (100e): 16.25   actor gain: -0.56   critic loss: 0.41   steps: 1340


training loop:   2% |                                 | ETA:  37 days, 13:59:16

Episode: 1341   score: 16.74   Avg score (100e): 16.26   actor gain: -0.56   critic loss: 0.41   steps: 1341


training loop:   2% |                                 | ETA:  37 days, 14:02:44

Episode: 1342   score: 16.75   Avg score (100e): 16.27   actor gain: -0.59   critic loss: 0.41   steps: 1342


training loop:   2% |                                 | ETA:  37 days, 14:02:51

Episode: 1343   score: 16.76   Avg score (100e): 16.28   actor gain: -0.59   critic loss: 0.41   steps: 1343


training loop:   2% |                                 | ETA:  37 days, 14:01:45

Episode: 1344   score: 16.77   Avg score (100e): 16.29   actor gain: -0.59   critic loss: 0.41   steps: 1344


training loop:   2% |                                 | ETA:  37 days, 14:03:57

Episode: 1345   score: 16.78   Avg score (100e): 16.30   actor gain: -0.56   critic loss: 0.41   steps: 1345


training loop:   2% |                                 | ETA:  37 days, 14:01:51

Episode: 1346   score: 16.78   Avg score (100e): 16.32   actor gain: -0.54   critic loss: 0.41   steps: 1346


training loop:   2% |                                 | ETA:  37 days, 14:01:11

Episode: 1347   score: 16.78   Avg score (100e): 16.33   actor gain: -0.50   critic loss: 0.41   steps: 1347


training loop:   2% |                                 | ETA:  37 days, 13:56:47

Episode: 1348   score: 16.79   Avg score (100e): 16.34   actor gain: -0.50   critic loss: 0.41   steps: 1348


training loop:   2% |                                 | ETA:  37 days, 13:57:57

Episode: 1349   score: 16.80   Avg score (100e): 16.35   actor gain: -0.50   critic loss: 0.41   steps: 1349


training loop:   2% |                                 | ETA:  37 days, 14:02:20

Episode: 1350   score: 16.81   Avg score (100e): 16.36   actor gain: -0.50   critic loss: 0.41   steps: 1350


training loop:   2% |                                 | ETA:  37 days, 14:04:38

Episode: 1351   score: 16.82   Avg score (100e): 16.37   actor gain: -0.50   critic loss: 0.41   steps: 1351


training loop:   2% |                                 | ETA:  37 days, 14:03:51

Episode: 1352   score: 16.83   Avg score (100e): 16.38   actor gain: -0.50   critic loss: 0.41   steps: 1352


training loop:   2% |                                 | ETA:  37 days, 14:01:25

Episode: 1353   score: 16.84   Avg score (100e): 16.39   actor gain: -0.44   critic loss: 0.41   steps: 1353


training loop:   2% |                                 | ETA:  37 days, 13:57:17

Episode: 1354   score: 16.85   Avg score (100e): 16.40   actor gain: -0.44   critic loss: 0.41   steps: 1354


training loop:   2% |                                 | ETA:  37 days, 13:55:17

Episode: 1355   score: 16.86   Avg score (100e): 16.41   actor gain: -0.44   critic loss: 0.41   steps: 1355


training loop:   2% |                                 | ETA:  37 days, 13:54:00

Episode: 1356   score: 16.87   Avg score (100e): 16.42   actor gain: -0.46   critic loss: 0.41   steps: 1356


training loop:   2% |                                 | ETA:  37 days, 13:51:43

Episode: 1357   score: 16.87   Avg score (100e): 16.43   actor gain: -0.47   critic loss: 0.41   steps: 1357


training loop:   2% |                                 | ETA:  37 days, 13:49:31

Episode: 1358   score: 16.88   Avg score (100e): 16.44   actor gain: -0.47   critic loss: 0.41   steps: 1358


training loop:   2% |                                 | ETA:  37 days, 13:47:17

Episode: 1359   score: 16.89   Avg score (100e): 16.45   actor gain: -0.47   critic loss: 0.41   steps: 1359


training loop:   2% |                                 | ETA:  37 days, 13:43:53

Episode: 1360   score: 16.90   Avg score (100e): 16.46   actor gain: -0.46   critic loss: 0.41   steps: 1360


training loop:   2% |                                 | ETA:  37 days, 13:41:05

Episode: 1361   score: 16.91   Avg score (100e): 16.47   actor gain: -0.46   critic loss: 0.41   steps: 1361


training loop:   2% |                                 | ETA:  37 days, 13:36:57

Episode: 1362   score: 16.92   Avg score (100e): 16.48   actor gain: -0.46   critic loss: 0.41   steps: 1362


training loop:   2% |                                 | ETA:  37 days, 13:33:49

Episode: 1363   score: 16.91   Avg score (100e): 16.49   actor gain: -0.46   critic loss: 0.41   steps: 1363


training loop:   2% |                                 | ETA:  37 days, 13:29:37

Episode: 1364   score: 16.92   Avg score (100e): 16.49   actor gain: -0.45   critic loss: 0.41   steps: 1364


training loop:   2% |                                 | ETA:  37 days, 13:27:10

Episode: 1365   score: 16.92   Avg score (100e): 16.50   actor gain: -0.44   critic loss: 0.41   steps: 1365


training loop:   2% |                                 | ETA:  37 days, 13:39:19

Episode: 1366   score: 16.93   Avg score (100e): 16.51   actor gain: -0.44   critic loss: 0.41   steps: 1366


training loop:   2% |                                 | ETA:  37 days, 13:47:27

Episode: 1367   score: 16.93   Avg score (100e): 16.52   actor gain: -0.41   critic loss: 0.41   steps: 1367


training loop:   2% |                                 | ETA:  37 days, 13:59:14

Episode: 1368   score: 16.95   Avg score (100e): 16.53   actor gain: -0.41   critic loss: 0.41   steps: 1368


training loop:   2% |                                 | ETA:  37 days, 14:00:36

Episode: 1369   score: 16.95   Avg score (100e): 16.54   actor gain: -0.41   critic loss: 0.41   steps: 1369


training loop:   2% |                                 | ETA:  37 days, 13:58:15

Episode: 1370   score: 16.96   Avg score (100e): 16.55   actor gain: -0.41   critic loss: 0.41   steps: 1370


training loop:   2% |                                 | ETA:  37 days, 13:58:23

Episode: 1371   score: 16.97   Avg score (100e): 16.56   actor gain: -0.41   critic loss: 0.41   steps: 1371


training loop:   2% |                                 | ETA:  37 days, 13:55:49

Episode: 1372   score: 16.99   Avg score (100e): 16.57   actor gain: -0.41   critic loss: 0.41   steps: 1372


training loop:   2% |                                 | ETA:  37 days, 13:52:46

Episode: 1373   score: 17.00   Avg score (100e): 16.58   actor gain: -0.41   critic loss: 0.41   steps: 1373


training loop:   2% |                                 | ETA:  37 days, 13:59:01

Episode: 1374   score: 17.00   Avg score (100e): 16.59   actor gain: -0.41   critic loss: 0.41   steps: 1374


training loop:   2% |                                 | ETA:  37 days, 13:58:51

Episode: 1375   score: 17.01   Avg score (100e): 16.60   actor gain: -0.41   critic loss: 0.41   steps: 1375


training loop:   2% |                                 | ETA:  37 days, 13:58:29

Episode: 1376   score: 17.02   Avg score (100e): 16.61   actor gain: -0.41   critic loss: 0.41   steps: 1376


training loop:   2% |                                 | ETA:  37 days, 13:55:10

Episode: 1377   score: 17.02   Avg score (100e): 16.61   actor gain: -0.41   critic loss: 0.41   steps: 1377


training loop:   2% |                                 | ETA:  37 days, 13:49:16

Episode: 1378   score: 17.02   Avg score (100e): 16.62   actor gain: -0.41   critic loss: 0.41   steps: 1378


training loop:   2% |                                 | ETA:  37 days, 13:45:08

Episode: 1379   score: 17.03   Avg score (100e): 16.63   actor gain: -0.41   critic loss: 0.41   steps: 1379
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 13:39:48

Episode: 1380   score: 17.04   Avg score (100e): 16.64   actor gain: -0.41   critic loss: 0.41   steps: 1380


training loop:   2% |                                 | ETA:  37 days, 13:35:54

Episode: 1381   score: 17.05   Avg score (100e): 16.65   actor gain: -0.39   critic loss: 0.41   steps: 1381


training loop:   2% |                                 | ETA:  37 days, 13:33:18

Episode: 1382   score: 17.06   Avg score (100e): 16.66   actor gain: -0.38   critic loss: 0.41   steps: 1382


training loop:   2% |                                 | ETA:  37 days, 13:30:51

Episode: 1383   score: 17.07   Avg score (100e): 16.67   actor gain: -0.38   critic loss: 0.41   steps: 1383


training loop:   2% |                                 | ETA:  37 days, 13:26:08

Episode: 1384   score: 17.08   Avg score (100e): 16.67   actor gain: -0.38   critic loss: 0.41   steps: 1384


training loop:   2% |                                 | ETA:  37 days, 13:20:31

Episode: 1385   score: 17.09   Avg score (100e): 16.68   actor gain: -0.38   critic loss: 0.41   steps: 1385
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  37 days, 13:15:58

Episode: 1386   score: 17.10   Avg score (100e): 16.69   actor gain: -0.38   critic loss: 0.41   steps: 1386


training loop:   2% |                                 | ETA:  37 days, 13:15:41

Episode: 1387   score: 17.11   Avg score (100e): 16.70   actor gain: -0.38   critic loss: 0.41   steps: 1387


training loop:   2% |                                 | ETA:  37 days, 13:17:58

Episode: 1388   score: 17.11   Avg score (100e): 16.71   actor gain: -0.38   critic loss: 0.41   steps: 1388


training loop:   2% |                                 | ETA:  37 days, 13:22:07

Episode: 1389   score: 17.12   Avg score (100e): 16.72   actor gain: -0.38   critic loss: 0.41   steps: 1389


training loop:   2% |                                 | ETA:  37 days, 13:25:49

Episode: 1390   score: 17.12   Avg score (100e): 16.72   actor gain: -0.38   critic loss: 0.41   steps: 1390


training loop:   2% |                                 | ETA:  37 days, 13:27:20

Episode: 1391   score: 17.13   Avg score (100e): 16.73   actor gain: -0.39   critic loss: 0.41   steps: 1391


training loop:   2% |                                 | ETA:  37 days, 13:24:33

Episode: 1392   score: 17.15   Avg score (100e): 16.74   actor gain: -0.38   critic loss: 0.41   steps: 1392


training loop:   2% |                                 | ETA:  37 days, 13:22:04

Episode: 1393   score: 17.15   Avg score (100e): 16.75   actor gain: -0.38   critic loss: 0.41   steps: 1393


training loop:   2% |                                 | ETA:  37 days, 13:18:46

Episode: 1394   score: 17.16   Avg score (100e): 16.76   actor gain: -0.38   critic loss: 0.41   steps: 1394


training loop:   2% |                                 | ETA:  37 days, 14:12:48

Episode: 1395   score: 17.16   Avg score (100e): 16.77   actor gain: -0.38   critic loss: 0.41   steps: 1395


training loop:   2% |                                 | ETA:  37 days, 19:14:06

Episode: 1396   score: 17.18   Avg score (100e): 16.77   actor gain: -0.38   critic loss: 0.41   steps: 1396


training loop:   2% |                                 | ETA:  37 days, 19:16:03

Episode: 1397   score: 17.19   Avg score (100e): 16.78   actor gain: -0.38   critic loss: 0.41   steps: 1397


training loop:   2% |                                 | ETA:  37 days, 19:12:26

Episode: 1398   score: 17.19   Avg score (100e): 16.79   actor gain: -0.38   critic loss: 0.41   steps: 1398


training loop:   2% |                                 | ETA:  37 days, 19:11:07

Episode: 1399   score: 17.20   Avg score (100e): 16.80   actor gain: -0.38   critic loss: 0.41   steps: 1399


training loop:   2% |                                 | ETA:  37 days, 19:13:35

Episode: 1400   score: 17.21   Avg score (100e): 16.81   actor gain: -0.38   critic loss: 0.41   steps: 1400


training loop:   2% |                                 | ETA:  37 days, 19:10:56

Episode: 1401   score: 17.21   Avg score (100e): 16.82   actor gain: -0.38   critic loss: 0.41   steps: 1401


training loop:   2% |                                 | ETA:  37 days, 19:08:16

Episode: 1402   score: 17.23   Avg score (100e): 16.82   actor gain: -0.38   critic loss: 0.41   steps: 1402


training loop:   2% |                                 | ETA:  37 days, 19:01:54

Episode: 1403   score: 17.24   Avg score (100e): 16.83   actor gain: -0.38   critic loss: 0.41   steps: 1403


training loop:   2% |                                 | ETA:  37 days, 18:55:32

Episode: 1404   score: 17.24   Avg score (100e): 16.84   actor gain: -0.38   critic loss: 0.41   steps: 1404


training loop:   2% |                                 | ETA:  37 days, 18:51:39

Episode: 1405   score: 17.25   Avg score (100e): 16.85   actor gain: -0.38   critic loss: 0.41   steps: 1405


training loop:   2% |                                 | ETA:  37 days, 18:45:52

Episode: 1406   score: 17.25   Avg score (100e): 16.86   actor gain: -0.38   critic loss: 0.41   steps: 1406


training loop:   2% |                                 | ETA:  37 days, 18:40:46

Episode: 1407   score: 17.26   Avg score (100e): 16.87   actor gain: -0.38   critic loss: 0.41   steps: 1407


training loop:   2% |                                 | ETA:  37 days, 18:35:21

Episode: 1408   score: 17.26   Avg score (100e): 16.87   actor gain: -0.39   critic loss: 0.41   steps: 1408


training loop:   2% |                                 | ETA:  37 days, 18:30:07

Episode: 1409   score: 17.27   Avg score (100e): 16.88   actor gain: -0.39   critic loss: 0.41   steps: 1409


training loop:   2% |                                 | ETA:  37 days, 18:24:50

Episode: 1410   score: 17.28   Avg score (100e): 16.89   actor gain: -0.39   critic loss: 0.41   steps: 1410


training loop:   2% |                                 | ETA:  37 days, 18:19:40

Episode: 1411   score: 17.28   Avg score (100e): 16.90   actor gain: -0.39   critic loss: 0.41   steps: 1411


training loop:   2% |                                  | ETA:  38 days, 7:03:53

Episode: 1412   score: 17.30   Avg score (100e): 16.91   actor gain: -0.39   critic loss: 0.41   steps: 1412


training loop:   2% |                                  | ETA:  38 days, 7:00:13

Episode: 1413   score: 17.30   Avg score (100e): 16.91   actor gain: -0.40   critic loss: 0.41   steps: 1413


training loop:   2% |                                  | ETA:  38 days, 6:55:34

Episode: 1414   score: 17.30   Avg score (100e): 16.92   actor gain: -0.39   critic loss: 0.41   steps: 1414


training loop:   2% |                                  | ETA:  38 days, 6:48:50

Episode: 1415   score: 17.31   Avg score (100e): 16.93   actor gain: -0.39   critic loss: 0.41   steps: 1415


training loop:   2% |                                  | ETA:  38 days, 6:46:28

Episode: 1416   score: 17.32   Avg score (100e): 16.94   actor gain: -0.40   critic loss: 0.41   steps: 1416


training loop:   2% |                                  | ETA:  38 days, 6:39:36

Episode: 1417   score: 17.32   Avg score (100e): 16.95   actor gain: -0.41   critic loss: 0.41   steps: 1417


training loop:   2% |                                  | ETA:  38 days, 6:34:25

Episode: 1418   score: 17.35   Avg score (100e): 16.95   actor gain: -0.41   critic loss: 0.41   steps: 1418


training loop:   2% |                                  | ETA:  38 days, 6:28:05

Episode: 1419   score: 17.36   Avg score (100e): 16.96   actor gain: -0.41   critic loss: 0.41   steps: 1419


training loop:   2% |                                  | ETA:  38 days, 6:21:00

Episode: 1420   score: 17.37   Avg score (100e): 16.97   actor gain: -0.41   critic loss: 0.41   steps: 1420


training loop:   2% |                                 | ETA:  38 days, 14:03:05

Episode: 1421   score: 17.37   Avg score (100e): 16.98   actor gain: -0.41   critic loss: 0.41   steps: 1421


training loop:   2% |                                 | ETA:  38 days, 14:06:28

Episode: 1422   score: 17.37   Avg score (100e): 16.99   actor gain: -0.41   critic loss: 0.41   steps: 1422


training loop:   2% |                                 | ETA:  38 days, 14:00:33

Episode: 1423   score: 17.39   Avg score (100e): 16.99   actor gain: -0.42   critic loss: 0.41   steps: 1423


training loop:   2% |                                 | ETA:  38 days, 13:55:05

Episode: 1424   score: 17.40   Avg score (100e): 17.00   actor gain: -0.42   critic loss: 0.41   steps: 1424


training loop:   2% |                                 | ETA:  38 days, 13:47:51

Episode: 1425   score: 17.40   Avg score (100e): 17.01   actor gain: -0.42   critic loss: 0.41   steps: 1425


training loop:   2% |                                 | ETA:  38 days, 13:44:16

Episode: 1426   score: 17.42   Avg score (100e): 17.02   actor gain: -0.43   critic loss: 0.41   steps: 1426


training loop:   2% |                                 | ETA:  38 days, 13:38:02

Episode: 1427   score: 17.43   Avg score (100e): 17.03   actor gain: -0.43   critic loss: 0.41   steps: 1427


training loop:   2% |                                 | ETA:  38 days, 13:31:01

Episode: 1428   score: 17.45   Avg score (100e): 17.03   actor gain: -0.43   critic loss: 0.41   steps: 1428


training loop:   2% |                                 | ETA:  38 days, 13:27:52

Episode: 1429   score: 17.46   Avg score (100e): 17.04   actor gain: -0.43   critic loss: 0.41   steps: 1429


training loop:   2% |                                 | ETA:  38 days, 13:24:35

Episode: 1430   score: 17.46   Avg score (100e): 17.05   actor gain: -0.43   critic loss: 0.41   steps: 1430


training loop:   2% |                                 | ETA:  38 days, 13:46:48

Episode: 1431   score: 17.48   Avg score (100e): 17.06   actor gain: -0.43   critic loss: 0.41   steps: 1431


training loop:   2% |                                 | ETA:  38 days, 14:01:54

Episode: 1432   score: 17.49   Avg score (100e): 17.07   actor gain: -0.44   critic loss: 0.41   steps: 1432


training loop:   2% |                                 | ETA:  38 days, 14:10:00

Episode: 1433   score: 17.50   Avg score (100e): 17.08   actor gain: -0.42   critic loss: 0.41   steps: 1433


training loop:   2% |                                 | ETA:  38 days, 14:18:08

Episode: 1434   score: 17.51   Avg score (100e): 17.08   actor gain: -0.42   critic loss: 0.41   steps: 1434


training loop:   2% |                                 | ETA:  38 days, 14:29:45

Episode: 1435   score: 17.52   Avg score (100e): 17.09   actor gain: -0.43   critic loss: 0.41   steps: 1435


training loop:   2% |                                 | ETA:  38 days, 14:36:23

Episode: 1436   score: 17.52   Avg score (100e): 17.10   actor gain: -0.43   critic loss: 0.41   steps: 1436


training loop:   2% |                                 | ETA:  38 days, 14:46:56

Episode: 1437   score: 17.52   Avg score (100e): 17.11   actor gain: -0.43   critic loss: 0.41   steps: 1437


training loop:   2% |                                 | ETA:  38 days, 14:53:19

Episode: 1438   score: 17.53   Avg score (100e): 17.12   actor gain: -0.43   critic loss: 0.42   steps: 1438


training loop:   2% |                                 | ETA:  38 days, 14:57:26

Episode: 1439   score: 17.54   Avg score (100e): 17.13   actor gain: -0.43   critic loss: 0.42   steps: 1439


training loop:   2% |                                 | ETA:  38 days, 15:02:17

Episode: 1440   score: 17.55   Avg score (100e): 17.13   actor gain: -0.43   critic loss: 0.42   steps: 1440


training loop:   2% |                                 | ETA:  38 days, 15:06:02

Episode: 1441   score: 17.55   Avg score (100e): 17.14   actor gain: -0.42   critic loss: 0.42   steps: 1441


training loop:   2% |                                 | ETA:  38 days, 15:11:10

Episode: 1442   score: 17.56   Avg score (100e): 17.15   actor gain: -0.41   critic loss: 0.42   steps: 1442


training loop:   2% |                                 | ETA:  38 days, 15:15:06

Episode: 1443   score: 17.57   Avg score (100e): 17.16   actor gain: -0.41   critic loss: 0.42   steps: 1443


training loop:   2% |                                 | ETA:  38 days, 15:29:23

Episode: 1444   score: 17.57   Avg score (100e): 17.17   actor gain: -0.41   critic loss: 0.42   steps: 1444


training loop:   2% |                                 | ETA:  38 days, 15:40:10

Episode: 1445   score: 17.58   Avg score (100e): 17.17   actor gain: -0.41   critic loss: 0.42   steps: 1445


training loop:   2% |                                 | ETA:  38 days, 15:43:32

Episode: 1446   score: 17.59   Avg score (100e): 17.18   actor gain: -0.41   critic loss: 0.42   steps: 1446


training loop:   2% |                                 | ETA:  38 days, 15:54:23

Episode: 1447   score: 17.59   Avg score (100e): 17.19   actor gain: -0.41   critic loss: 0.41   steps: 1447


training loop:   2% |                                 | ETA:  38 days, 15:59:19

Episode: 1448   score: 17.60   Avg score (100e): 17.20   actor gain: -0.40   critic loss: 0.42   steps: 1448


training loop:   2% |                                 | ETA:  38 days, 16:04:41

Episode: 1449   score: 17.60   Avg score (100e): 17.21   actor gain: -0.40   critic loss: 0.41   steps: 1449


training loop:   2% |                                 | ETA:  38 days, 16:08:29

Episode: 1450   score: 17.61   Avg score (100e): 17.21   actor gain: -0.40   critic loss: 0.41   steps: 1450


training loop:   2% |                                 | ETA:  38 days, 16:13:50

Episode: 1451   score: 17.62   Avg score (100e): 17.22   actor gain: -0.39   critic loss: 0.41   steps: 1451


training loop:   2% |                                 | ETA:  38 days, 16:17:20

Episode: 1452   score: 17.63   Avg score (100e): 17.23   actor gain: -0.39   critic loss: 0.41   steps: 1452


training loop:   2% |                                 | ETA:  38 days, 16:49:03

Episode: 1453   score: 17.63   Avg score (100e): 17.24   actor gain: -0.39   critic loss: 0.41   steps: 1453


training loop:   2% |                                 | ETA:  38 days, 17:07:32

Episode: 1454   score: 17.64   Avg score (100e): 17.25   actor gain: -0.39   critic loss: 0.41   steps: 1454


training loop:   2% |                                 | ETA:  38 days, 17:24:17

Episode: 1455   score: 17.64   Avg score (100e): 17.25   actor gain: -0.38   critic loss: 0.41   steps: 1455


training loop:   2% |                                 | ETA:  38 days, 17:21:38

Episode: 1456   score: 17.64   Avg score (100e): 17.26   actor gain: -0.39   critic loss: 0.41   steps: 1456


training loop:   2% |                                 | ETA:  38 days, 17:17:07

Episode: 1457   score: 17.65   Avg score (100e): 17.27   actor gain: -0.38   critic loss: 0.41   steps: 1457


training loop:   2% |                                 | ETA:  38 days, 17:11:07

Episode: 1458   score: 17.66   Avg score (100e): 17.28   actor gain: -0.38   critic loss: 0.41   steps: 1458


training loop:   2% |                                 | ETA:  38 days, 17:09:49

Episode: 1459   score: 17.67   Avg score (100e): 17.29   actor gain: -0.38   critic loss: 0.41   steps: 1459


training loop:   2% |                                 | ETA:  38 days, 17:05:22

Episode: 1460   score: 17.67   Avg score (100e): 17.29   actor gain: -0.38   critic loss: 0.41   steps: 1460


training loop:   2% |                                 | ETA:  38 days, 17:00:41

Episode: 1461   score: 17.68   Avg score (100e): 17.30   actor gain: -0.38   critic loss: 0.41   steps: 1461


training loop:   2% |                                 | ETA:  38 days, 16:55:51

Episode: 1462   score: 17.69   Avg score (100e): 17.31   actor gain: -0.38   critic loss: 0.41   steps: 1462


training loop:   2% |                                 | ETA:  38 days, 16:53:08

Episode: 1463   score: 17.69   Avg score (100e): 17.32   actor gain: -0.37   critic loss: 0.41   steps: 1463
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  38 days, 17:02:04

Episode: 1464   score: 17.70   Avg score (100e): 17.32   actor gain: -0.37   critic loss: 0.41   steps: 1464


training loop:   2% |                                 | ETA:  38 days, 17:05:52

Episode: 1465   score: 17.71   Avg score (100e): 17.33   actor gain: -0.37   critic loss: 0.41   steps: 1465


training loop:   2% |                                 | ETA:  38 days, 17:01:30

Episode: 1466   score: 17.72   Avg score (100e): 17.34   actor gain: -0.37   critic loss: 0.41   steps: 1466


training loop:   2% |                                 | ETA:  38 days, 16:58:43

Episode: 1467   score: 17.74   Avg score (100e): 17.35   actor gain: -0.37   critic loss: 0.41   steps: 1467


training loop:   2% |                                 | ETA:  38 days, 16:56:45

Episode: 1468   score: 17.75   Avg score (100e): 17.36   actor gain: -0.37   critic loss: 0.41   steps: 1468


training loop:   2% |                                 | ETA:  38 days, 16:53:52

Episode: 1469   score: 17.76   Avg score (100e): 17.36   actor gain: -0.37   critic loss: 0.41   steps: 1469


training loop:   2% |                                 | ETA:  38 days, 16:50:31

Episode: 1470   score: 17.78   Avg score (100e): 17.37   actor gain: -0.37   critic loss: 0.41   steps: 1470


training loop:   2% |                                 | ETA:  38 days, 16:46:35

Episode: 1471   score: 17.79   Avg score (100e): 17.38   actor gain: -0.37   critic loss: 0.41   steps: 1471


training loop:   2% |                                 | ETA:  38 days, 16:40:38

Episode: 1472   score: 17.79   Avg score (100e): 17.39   actor gain: -0.37   critic loss: 0.41   steps: 1472
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  38 days, 16:32:33

Episode: 1473   score: 17.81   Avg score (100e): 17.40   actor gain: -0.37   critic loss: 0.41   steps: 1473


training loop:   2% |                                 | ETA:  38 days, 16:25:36

Episode: 1474   score: 17.82   Avg score (100e): 17.40   actor gain: -0.37   critic loss: 0.41   steps: 1474


training loop:   2% |                                 | ETA:  38 days, 16:20:34

Episode: 1475   score: 17.82   Avg score (100e): 17.41   actor gain: -0.36   critic loss: 0.41   steps: 1475


training loop:   2% |                                 | ETA:  38 days, 16:21:51

Episode: 1476   score: 17.83   Avg score (100e): 17.42   actor gain: -0.37   critic loss: 0.41   steps: 1476


training loop:   2% |                                 | ETA:  38 days, 16:17:18

Episode: 1477   score: 17.83   Avg score (100e): 17.43   actor gain: -0.37   critic loss: 0.41   steps: 1477


training loop:   2% |                                 | ETA:  38 days, 16:16:25

Episode: 1478   score: 17.84   Avg score (100e): 17.44   actor gain: -0.37   critic loss: 0.41   steps: 1478


training loop:   2% |                                 | ETA:  38 days, 16:11:08

Episode: 1479   score: 17.84   Avg score (100e): 17.45   actor gain: -0.38   critic loss: 0.41   steps: 1479


training loop:   2% |                                 | ETA:  38 days, 16:04:37

Episode: 1480   score: 17.85   Avg score (100e): 17.45   actor gain: -0.38   critic loss: 0.41   steps: 1480


training loop:   2% |                                 | ETA:  38 days, 15:58:41

Episode: 1481   score: 17.86   Avg score (100e): 17.46   actor gain: -0.38   critic loss: 0.41   steps: 1481


training loop:   2% |                                 | ETA:  38 days, 15:52:20

Episode: 1482   score: 17.86   Avg score (100e): 17.47   actor gain: -0.38   critic loss: 0.41   steps: 1482


training loop:   2% |                                 | ETA:  38 days, 15:47:22

Episode: 1483   score: 17.87   Avg score (100e): 17.48   actor gain: -0.38   critic loss: 0.41   steps: 1483


training loop:   2% |                                 | ETA:  38 days, 15:41:31

Episode: 1484   score: 17.87   Avg score (100e): 17.49   actor gain: -0.38   critic loss: 0.41   steps: 1484


training loop:   2% |                                 | ETA:  38 days, 15:36:01

Episode: 1485   score: 17.88   Avg score (100e): 17.49   actor gain: -0.38   critic loss: 0.41   steps: 1485


training loop:   2% |                                 | ETA:  38 days, 15:29:22

Episode: 1486   score: 17.89   Avg score (100e): 17.50   actor gain: -0.38   critic loss: 0.41   steps: 1486


training loop:   2% |                                 | ETA:  38 days, 15:23:25

Episode: 1487   score: 17.89   Avg score (100e): 17.51   actor gain: -0.38   critic loss: 0.42   steps: 1487


training loop:   2% |                                 | ETA:  38 days, 15:19:28

Episode: 1488   score: 17.90   Avg score (100e): 17.52   actor gain: -0.38   critic loss: 0.41   steps: 1488


training loop:   2% |                                 | ETA:  38 days, 15:13:34

Episode: 1489   score: 17.90   Avg score (100e): 17.52   actor gain: -0.38   critic loss: 0.41   steps: 1489
np.all(done) is true! miracle!


training loop:   2% |                                 | ETA:  38 days, 15:06:24

Episode: 1490   score: 17.90   Avg score (100e): 17.53   actor gain: -0.38   critic loss: 0.41   steps: 1490


training loop:   2% |                                 | ETA:  38 days, 15:00:13

Episode: 1491   score: 17.91   Avg score (100e): 17.54   actor gain: -0.39   critic loss: 0.41   steps: 1491


training loop:   2% |                                 | ETA:  38 days, 14:54:59

Episode: 1492   score: 17.91   Avg score (100e): 17.55   actor gain: -0.39   critic loss: 0.41   steps: 1492


training loop:   2% |                                 | ETA:  38 days, 14:50:15

Episode: 1493   score: 17.91   Avg score (100e): 17.56   actor gain: -0.39   critic loss: 0.41   steps: 1493


training loop:   2% |                                 | ETA:  38 days, 14:44:33

Episode: 1494   score: 17.92   Avg score (100e): 17.56   actor gain: -0.39   critic loss: 0.41   steps: 1494


training loop:   2% |                                 | ETA:  38 days, 14:42:22

Episode: 1495   score: 17.93   Avg score (100e): 17.57   actor gain: -0.38   critic loss: 0.41   steps: 1495


training loop:   2% |                                 | ETA:  38 days, 14:40:29

Episode: 1496   score: 17.93   Avg score (100e): 17.58   actor gain: -0.39   critic loss: 0.41   steps: 1496


training loop:   2% |                                 | ETA:  38 days, 14:40:33

Episode: 1497   score: 17.95   Avg score (100e): 17.59   actor gain: -0.39   critic loss: 0.41   steps: 1497


training loop:   2% |                                 | ETA:  38 days, 14:36:48

Episode: 1498   score: 17.96   Avg score (100e): 17.59   actor gain: -0.39   critic loss: 0.41   steps: 1498


training loop:   2% |                                 | ETA:  38 days, 14:37:51

Episode: 1499   score: 17.97   Avg score (100e): 17.60   actor gain: -0.39   critic loss: 0.41   steps: 1499


training loop:   2% |                                 | ETA:  38 days, 14:34:16

Episode: 1500   score: 17.98   Avg score (100e): 17.61   actor gain: -0.39   critic loss: 0.41   steps: 1500


training loop:   3% |                                 | ETA:  38 days, 14:29:12

Episode: 1501   score: 17.98   Avg score (100e): 17.62   actor gain: -0.39   critic loss: 0.41   steps: 1501


training loop:   3% |                                 | ETA:  38 days, 14:24:35

Episode: 1502   score: 17.99   Avg score (100e): 17.62   actor gain: -0.39   critic loss: 0.41   steps: 1502


training loop:   3% |                                 | ETA:  38 days, 14:19:02

Episode: 1503   score: 17.99   Avg score (100e): 17.63   actor gain: -0.39   critic loss: 0.41   steps: 1503


training loop:   3% |                                 | ETA:  38 days, 14:12:34

Episode: 1504   score: 18.00   Avg score (100e): 17.64   actor gain: -0.38   critic loss: 0.41   steps: 1504


training loop:   3% |                                 | ETA:  38 days, 14:07:34

Episode: 1505   score: 18.01   Avg score (100e): 17.65   actor gain: -0.38   critic loss: 0.41   steps: 1505


training loop:   3% |                                 | ETA:  38 days, 14:00:42

Episode: 1506   score: 18.01   Avg score (100e): 17.65   actor gain: -0.38   critic loss: 0.41   steps: 1506


training loop:   3% |                                 | ETA:  38 days, 13:55:14

Episode: 1507   score: 18.01   Avg score (100e): 17.66   actor gain: -0.37   critic loss: 0.41   steps: 1507


training loop:   3% |                                 | ETA:  38 days, 13:50:37

Episode: 1508   score: 18.02   Avg score (100e): 17.67   actor gain: -0.38   critic loss: 0.41   steps: 1508


training loop:   3% |                                 | ETA:  38 days, 13:45:59

Episode: 1509   score: 18.03   Avg score (100e): 17.68   actor gain: -0.39   critic loss: 0.41   steps: 1509


training loop:   3% |                                 | ETA:  38 days, 13:39:20

Episode: 1510   score: 18.04   Avg score (100e): 17.69   actor gain: -0.39   critic loss: 0.41   steps: 1510


training loop:   3% |                                 | ETA:  38 days, 13:32:36

Episode: 1511   score: 18.04   Avg score (100e): 17.69   actor gain: -0.39   critic loss: 0.41   steps: 1511


training loop:   3% |                                 | ETA:  38 days, 13:27:12

Episode: 1512   score: 18.05   Avg score (100e): 17.70   actor gain: -0.39   critic loss: 0.41   steps: 1512


training loop:   3% |                                 | ETA:  38 days, 13:20:35

Episode: 1513   score: 18.07   Avg score (100e): 17.71   actor gain: -0.39   critic loss: 0.41   steps: 1513


training loop:   3% |                                 | ETA:  38 days, 13:13:23

Episode: 1514   score: 18.07   Avg score (100e): 17.72   actor gain: -0.41   critic loss: 0.41   steps: 1514
np.all(done) is true! miracle!


training loop:   3% |                                 | ETA:  38 days, 13:08:01

Episode: 1515   score: 18.08   Avg score (100e): 17.72   actor gain: -0.41   critic loss: 0.41   steps: 1515


training loop:   3% |                                 | ETA:  38 days, 13:01:54

Episode: 1516   score: 18.09   Avg score (100e): 17.73   actor gain: -0.40   critic loss: 0.41   steps: 1516


training loop:   3% |#                                | ETA:  38 days, 12:56:23

Episode: 1517   score: 18.10   Avg score (100e): 17.74   actor gain: -0.40   critic loss: 0.41   steps: 1517


training loop:   3% |#                                | ETA:  38 days, 12:49:45

Episode: 1518   score: 18.11   Avg score (100e): 17.75   actor gain: -0.40   critic loss: 0.41   steps: 1518


training loop:   3% |#                                | ETA:  38 days, 12:43:26

Episode: 1519   score: 18.11   Avg score (100e): 17.75   actor gain: -0.40   critic loss: 0.41   steps: 1519


training loop:   3% |#                                | ETA:  38 days, 12:36:08

Episode: 1520   score: 18.12   Avg score (100e): 17.76   actor gain: -0.40   critic loss: 0.41   steps: 1520


training loop:   3% |#                                | ETA:  38 days, 12:28:02

Episode: 1521   score: 18.13   Avg score (100e): 17.77   actor gain: -0.41   critic loss: 0.41   steps: 1521


training loop:   3% |#                                | ETA:  38 days, 12:19:10

Episode: 1522   score: 18.13   Avg score (100e): 17.78   actor gain: -0.40   critic loss: 0.41   steps: 1522


training loop:   3% |#                                | ETA:  38 days, 12:13:30

Episode: 1523   score: 18.14   Avg score (100e): 17.78   actor gain: -0.41   critic loss: 0.41   steps: 1523


training loop:   3% |#                                | ETA:  38 days, 12:07:39

Episode: 1524   score: 18.15   Avg score (100e): 17.79   actor gain: -0.41   critic loss: 0.41   steps: 1524


training loop:   3% |#                                | ETA:  38 days, 12:01:17

Episode: 1525   score: 18.16   Avg score (100e): 17.80   actor gain: -0.41   critic loss: 0.41   steps: 1525


training loop:   3% |#                                | ETA:  38 days, 11:55:13

Episode: 1526   score: 18.17   Avg score (100e): 17.81   actor gain: -0.41   critic loss: 0.41   steps: 1526


training loop:   3% |#                                | ETA:  38 days, 11:48:53

Episode: 1527   score: 18.18   Avg score (100e): 17.81   actor gain: -0.41   critic loss: 0.41   steps: 1527


training loop:   3% |#                                | ETA:  38 days, 11:45:57

Episode: 1528   score: 18.18   Avg score (100e): 17.82   actor gain: -0.41   critic loss: 0.41   steps: 1528


training loop:   3% |#                                | ETA:  38 days, 11:48:44

Episode: 1529   score: 18.19   Avg score (100e): 17.83   actor gain: -0.42   critic loss: 0.41   steps: 1529


training loop:   3% |#                                | ETA:  38 days, 11:43:28

Episode: 1530   score: 18.20   Avg score (100e): 17.84   actor gain: -0.41   critic loss: 0.41   steps: 1530


training loop:   3% |#                                | ETA:  38 days, 11:37:46

Episode: 1531   score: 18.20   Avg score (100e): 17.84   actor gain: -0.41   critic loss: 0.41   steps: 1531


training loop:   3% |#                                | ETA:  38 days, 11:32:46

Episode: 1532   score: 18.21   Avg score (100e): 17.85   actor gain: -0.42   critic loss: 0.41   steps: 1532


training loop:   3% |#                                | ETA:  38 days, 11:28:37

Episode: 1533   score: 18.22   Avg score (100e): 17.86   actor gain: -0.41   critic loss: 0.41   steps: 1533


training loop:   3% |#                                | ETA:  38 days, 11:22:38

Episode: 1534   score: 18.23   Avg score (100e): 17.86   actor gain: -0.39   critic loss: 0.41   steps: 1534


training loop:   3% |#                                | ETA:  38 days, 11:16:46

Episode: 1535   score: 18.24   Avg score (100e): 17.87   actor gain: -0.39   critic loss: 0.41   steps: 1535


training loop:   3% |#                                | ETA:  38 days, 11:11:50

Episode: 1536   score: 18.25   Avg score (100e): 17.88   actor gain: -0.39   critic loss: 0.41   steps: 1536


training loop:   3% |#                                | ETA:  38 days, 11:06:21

Episode: 1537   score: 18.26   Avg score (100e): 17.89   actor gain: -0.39   critic loss: 0.41   steps: 1537


training loop:   3% |#                                | ETA:  38 days, 10:59:44

Episode: 1538   score: 18.26   Avg score (100e): 17.89   actor gain: -0.39   critic loss: 0.41   steps: 1538


training loop:   3% |#                                | ETA:  38 days, 10:52:10

Episode: 1539   score: 18.28   Avg score (100e): 17.90   actor gain: -0.38   critic loss: 0.41   steps: 1539


training loop:   3% |#                                | ETA:  38 days, 10:47:45

Episode: 1540   score: 18.28   Avg score (100e): 17.91   actor gain: -0.38   critic loss: 0.41   steps: 1540
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  38 days, 10:39:40

Episode: 1541   score: 18.29   Avg score (100e): 17.92   actor gain: -0.39   critic loss: 0.41   steps: 1541


training loop:   3% |#                                | ETA:  38 days, 10:33:41

Episode: 1542   score: 18.30   Avg score (100e): 17.92   actor gain: -0.39   critic loss: 0.41   steps: 1542


training loop:   3% |#                                | ETA:  38 days, 10:28:46

Episode: 1543   score: 18.30   Avg score (100e): 17.93   actor gain: -0.39   critic loss: 0.41   steps: 1543


training loop:   3% |#                                | ETA:  38 days, 10:24:54

Episode: 1544   score: 18.31   Avg score (100e): 17.94   actor gain: -0.39   critic loss: 0.41   steps: 1544


training loop:   3% |#                                | ETA:  38 days, 10:18:41

Episode: 1545   score: 18.31   Avg score (100e): 17.95   actor gain: -0.39   critic loss: 0.41   steps: 1545


training loop:   3% |#                                | ETA:  38 days, 10:13:14

Episode: 1546   score: 18.32   Avg score (100e): 17.95   actor gain: -0.38   critic loss: 0.41   steps: 1546


training loop:   3% |#                                | ETA:  38 days, 10:08:29

Episode: 1547   score: 18.32   Avg score (100e): 17.96   actor gain: -0.38   critic loss: 0.41   steps: 1547


training loop:   3% |#                                | ETA:  38 days, 10:02:26

Episode: 1548   score: 18.33   Avg score (100e): 17.97   actor gain: -0.37   critic loss: 0.41   steps: 1548


training loop:   3% |#                                 | ETA:  38 days, 9:55:52

Episode: 1549   score: 18.34   Avg score (100e): 17.97   actor gain: -0.37   critic loss: 0.41   steps: 1549


training loop:   3% |#                                 | ETA:  38 days, 9:51:12

Episode: 1550   score: 18.35   Avg score (100e): 17.98   actor gain: -0.38   critic loss: 0.41   steps: 1550


training loop:   3% |#                                 | ETA:  38 days, 9:45:21

Episode: 1551   score: 18.36   Avg score (100e): 17.99   actor gain: -0.37   critic loss: 0.41   steps: 1551


training loop:   3% |#                                 | ETA:  38 days, 9:39:40

Episode: 1552   score: 18.36   Avg score (100e): 18.00   actor gain: -0.44   critic loss: 0.41   steps: 1552


training loop:   3% |#                                 | ETA:  38 days, 9:34:19

Episode: 1553   score: 18.36   Avg score (100e): 18.00   actor gain: -0.45   critic loss: 0.41   steps: 1553


training loop:   3% |#                                 | ETA:  38 days, 9:29:16

Episode: 1554   score: 18.37   Avg score (100e): 18.01   actor gain: -0.45   critic loss: 0.41   steps: 1554
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 9:22:19

Episode: 1555   score: 18.38   Avg score (100e): 18.02   actor gain: -0.45   critic loss: 0.41   steps: 1555


training loop:   3% |#                                 | ETA:  38 days, 9:16:33

Episode: 1556   score: 18.38   Avg score (100e): 18.03   actor gain: -0.47   critic loss: 0.41   steps: 1556


training loop:   3% |#                                 | ETA:  38 days, 9:12:16

Episode: 1557   score: 18.39   Avg score (100e): 18.03   actor gain: -0.47   critic loss: 0.41   steps: 1557


training loop:   3% |#                                 | ETA:  38 days, 9:06:18

Episode: 1558   score: 18.40   Avg score (100e): 18.04   actor gain: -0.47   critic loss: 0.41   steps: 1558
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 8:58:53

Episode: 1559   score: 18.41   Avg score (100e): 18.05   actor gain: -0.47   critic loss: 0.41   steps: 1559


training loop:   3% |#                                 | ETA:  38 days, 8:53:08

Episode: 1560   score: 18.41   Avg score (100e): 18.06   actor gain: -0.47   critic loss: 0.41   steps: 1560


training loop:   3% |#                                 | ETA:  38 days, 8:48:38

Episode: 1561   score: 18.43   Avg score (100e): 18.06   actor gain: -0.46   critic loss: 0.41   steps: 1561


training loop:   3% |#                                 | ETA:  38 days, 8:47:06

Episode: 1562   score: 18.44   Avg score (100e): 18.07   actor gain: -0.46   critic loss: 0.41   steps: 1562


training loop:   3% |#                                 | ETA:  38 days, 8:42:30

Episode: 1563   score: 18.45   Avg score (100e): 18.08   actor gain: -0.46   critic loss: 0.41   steps: 1563


training loop:   3% |#                                 | ETA:  38 days, 8:38:48

Episode: 1564   score: 18.44   Avg score (100e): 18.09   actor gain: -0.46   critic loss: 0.41   steps: 1564


training loop:   3% |#                                 | ETA:  38 days, 8:34:48

Episode: 1565   score: 18.45   Avg score (100e): 18.09   actor gain: -0.46   critic loss: 0.41   steps: 1565


training loop:   3% |#                                 | ETA:  38 days, 8:28:52

Episode: 1566   score: 18.45   Avg score (100e): 18.10   actor gain: -0.46   critic loss: 0.41   steps: 1566


training loop:   3% |#                                 | ETA:  38 days, 8:25:42

Episode: 1567   score: 18.46   Avg score (100e): 18.11   actor gain: -0.46   critic loss: 0.41   steps: 1567


training loop:   3% |#                                 | ETA:  38 days, 8:20:36

Episode: 1568   score: 18.47   Avg score (100e): 18.12   actor gain: -0.46   critic loss: 0.41   steps: 1568


training loop:   3% |#                                 | ETA:  38 days, 8:15:15

Episode: 1569   score: 18.49   Avg score (100e): 18.12   actor gain: -0.47   critic loss: 0.41   steps: 1569


training loop:   3% |#                                 | ETA:  38 days, 8:09:39

Episode: 1570   score: 18.49   Avg score (100e): 18.13   actor gain: -0.47   critic loss: 0.41   steps: 1570


training loop:   3% |#                                 | ETA:  38 days, 8:04:12

Episode: 1571   score: 18.50   Avg score (100e): 18.14   actor gain: -0.47   critic loss: 0.41   steps: 1571


training loop:   3% |#                                 | ETA:  38 days, 7:59:21

Episode: 1572   score: 18.50   Avg score (100e): 18.14   actor gain: -0.47   critic loss: 0.41   steps: 1572


training loop:   3% |#                                 | ETA:  38 days, 7:54:07

Episode: 1573   score: 18.50   Avg score (100e): 18.15   actor gain: -0.47   critic loss: 0.41   steps: 1573


training loop:   3% |#                                 | ETA:  38 days, 7:50:10

Episode: 1574   score: 18.50   Avg score (100e): 18.16   actor gain: -0.47   critic loss: 0.41   steps: 1574


training loop:   3% |#                                 | ETA:  38 days, 7:45:49

Episode: 1575   score: 18.51   Avg score (100e): 18.16   actor gain: -0.47   critic loss: 0.41   steps: 1575


training loop:   3% |#                                 | ETA:  38 days, 7:40:07

Episode: 1576   score: 18.53   Avg score (100e): 18.17   actor gain: -0.47   critic loss: 0.41   steps: 1576
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 7:34:38

Episode: 1577   score: 18.53   Avg score (100e): 18.18   actor gain: -0.41   critic loss: 0.41   steps: 1577


training loop:   3% |#                                 | ETA:  38 days, 7:29:58

Episode: 1578   score: 18.53   Avg score (100e): 18.19   actor gain: -0.40   critic loss: 0.41   steps: 1578


training loop:   3% |#                                 | ETA:  38 days, 7:23:55

Episode: 1579   score: 18.54   Avg score (100e): 18.19   actor gain: -0.39   critic loss: 0.41   steps: 1579


training loop:   3% |#                                 | ETA:  38 days, 7:17:41

Episode: 1580   score: 18.55   Avg score (100e): 18.20   actor gain: -0.39   critic loss: 0.41   steps: 1580


training loop:   3% |#                                 | ETA:  38 days, 7:13:08

Episode: 1581   score: 18.56   Avg score (100e): 18.21   actor gain: -0.38   critic loss: 0.41   steps: 1581


training loop:   3% |#                                 | ETA:  38 days, 7:08:00

Episode: 1582   score: 18.57   Avg score (100e): 18.21   actor gain: -0.38   critic loss: 0.41   steps: 1582


training loop:   3% |#                                 | ETA:  38 days, 7:02:07

Episode: 1583   score: 18.57   Avg score (100e): 18.22   actor gain: -0.38   critic loss: 0.42   steps: 1583


training loop:   3% |#                                 | ETA:  38 days, 6:56:26

Episode: 1584   score: 18.57   Avg score (100e): 18.23   actor gain: -0.38   critic loss: 0.42   steps: 1584


training loop:   3% |#                                 | ETA:  38 days, 6:51:18

Episode: 1585   score: 18.57   Avg score (100e): 18.23   actor gain: -0.38   critic loss: 0.42   steps: 1585


training loop:   3% |#                                 | ETA:  38 days, 6:46:16

Episode: 1586   score: 18.58   Avg score (100e): 18.24   actor gain: -0.38   critic loss: 0.42   steps: 1586


training loop:   3% |#                                 | ETA:  38 days, 6:41:47

Episode: 1587   score: 18.60   Avg score (100e): 18.25   actor gain: -0.38   critic loss: 0.42   steps: 1587


training loop:   3% |#                                 | ETA:  38 days, 6:39:08

Episode: 1588   score: 18.59   Avg score (100e): 18.26   actor gain: -0.38   critic loss: 0.41   steps: 1588


training loop:   3% |#                                 | ETA:  38 days, 6:35:20

Episode: 1589   score: 18.60   Avg score (100e): 18.26   actor gain: -0.38   critic loss: 0.42   steps: 1589


training loop:   3% |#                                 | ETA:  38 days, 6:31:11

Episode: 1590   score: 18.61   Avg score (100e): 18.27   actor gain: -0.38   critic loss: 0.42   steps: 1590


training loop:   3% |#                                 | ETA:  38 days, 6:27:12

Episode: 1591   score: 18.60   Avg score (100e): 18.28   actor gain: -0.38   critic loss: 0.41   steps: 1591


training loop:   3% |#                                 | ETA:  38 days, 6:25:52

Episode: 1592   score: 18.61   Avg score (100e): 18.28   actor gain: -0.38   critic loss: 0.42   steps: 1592


training loop:   3% |#                                 | ETA:  38 days, 6:19:42

Episode: 1593   score: 18.63   Avg score (100e): 18.29   actor gain: -0.38   critic loss: 0.42   steps: 1593


training loop:   3% |#                                 | ETA:  38 days, 6:20:18

Episode: 1594   score: 18.63   Avg score (100e): 18.30   actor gain: -0.38   critic loss: 0.41   steps: 1594


training loop:   3% |#                                 | ETA:  38 days, 6:17:34

Episode: 1595   score: 18.63   Avg score (100e): 18.30   actor gain: -0.37   critic loss: 0.41   steps: 1595


training loop:   3% |#                                 | ETA:  38 days, 6:12:56

Episode: 1596   score: 18.64   Avg score (100e): 18.31   actor gain: -0.37   critic loss: 0.42   steps: 1596


training loop:   3% |#                                 | ETA:  38 days, 6:08:48

Episode: 1597   score: 18.65   Avg score (100e): 18.32   actor gain: -0.37   critic loss: 0.41   steps: 1597


training loop:   3% |#                                 | ETA:  38 days, 6:03:51

Episode: 1598   score: 18.66   Avg score (100e): 18.33   actor gain: -0.37   critic loss: 0.41   steps: 1598


training loop:   3% |#                                 | ETA:  38 days, 5:58:26

Episode: 1599   score: 18.66   Avg score (100e): 18.33   actor gain: -0.37   critic loss: 0.41   steps: 1599


training loop:   3% |#                                 | ETA:  38 days, 5:53:13

Episode: 1600   score: 18.66   Avg score (100e): 18.34   actor gain: -0.37   critic loss: 0.41   steps: 1600


training loop:   3% |#                                 | ETA:  38 days, 5:48:52

Episode: 1601   score: 18.67   Avg score (100e): 18.35   actor gain: -0.37   critic loss: 0.41   steps: 1601


training loop:   3% |#                                 | ETA:  38 days, 5:46:01

Episode: 1602   score: 18.68   Avg score (100e): 18.35   actor gain: -0.37   critic loss: 0.41   steps: 1602


training loop:   3% |#                                 | ETA:  38 days, 5:41:02

Episode: 1603   score: 18.69   Avg score (100e): 18.36   actor gain: -0.36   critic loss: 0.41   steps: 1603


training loop:   3% |#                                 | ETA:  38 days, 5:34:40

Episode: 1604   score: 18.69   Avg score (100e): 18.37   actor gain: -0.36   critic loss: 0.41   steps: 1604


training loop:   3% |#                                 | ETA:  38 days, 5:30:21

Episode: 1605   score: 18.70   Avg score (100e): 18.37   actor gain: -0.37   critic loss: 0.41   steps: 1605


training loop:   3% |#                                 | ETA:  38 days, 5:26:04

Episode: 1606   score: 18.71   Avg score (100e): 18.38   actor gain: -0.37   critic loss: 0.41   steps: 1606


training loop:   3% |#                                 | ETA:  38 days, 5:20:50

Episode: 1607   score: 18.71   Avg score (100e): 18.39   actor gain: -0.37   critic loss: 0.41   steps: 1607


training loop:   3% |#                                 | ETA:  38 days, 5:17:02

Episode: 1608   score: 18.72   Avg score (100e): 18.40   actor gain: -0.40   critic loss: 0.41   steps: 1608


training loop:   3% |#                                 | ETA:  38 days, 5:12:53

Episode: 1609   score: 18.73   Avg score (100e): 18.40   actor gain: -0.40   critic loss: 0.41   steps: 1609


training loop:   3% |#                                 | ETA:  38 days, 5:07:14

Episode: 1610   score: 18.73   Avg score (100e): 18.41   actor gain: -0.40   critic loss: 0.41   steps: 1610


training loop:   3% |#                                 | ETA:  38 days, 5:02:36

Episode: 1611   score: 18.74   Avg score (100e): 18.42   actor gain: -0.40   critic loss: 0.41   steps: 1611


training loop:   3% |#                                 | ETA:  38 days, 4:58:40

Episode: 1612   score: 18.76   Avg score (100e): 18.42   actor gain: -0.40   critic loss: 0.41   steps: 1612


training loop:   3% |#                                 | ETA:  38 days, 4:54:10

Episode: 1613   score: 18.76   Avg score (100e): 18.43   actor gain: -0.40   critic loss: 0.41   steps: 1613


training loop:   3% |#                                 | ETA:  38 days, 4:47:54

Episode: 1614   score: 18.76   Avg score (100e): 18.44   actor gain: -0.40   critic loss: 0.41   steps: 1614


training loop:   3% |#                                 | ETA:  38 days, 4:43:32

Episode: 1615   score: 18.77   Avg score (100e): 18.44   actor gain: -0.62   critic loss: 0.41   steps: 1615


training loop:   3% |#                                 | ETA:  38 days, 4:38:55

Episode: 1616   score: 18.79   Avg score (100e): 18.45   actor gain: -0.62   critic loss: 0.41   steps: 1616


training loop:   3% |#                                 | ETA:  38 days, 4:32:51

Episode: 1617   score: 18.79   Avg score (100e): 18.46   actor gain: -0.62   critic loss: 0.41   steps: 1617


training loop:   3% |#                                 | ETA:  38 days, 4:28:19

Episode: 1618   score: 18.80   Avg score (100e): 18.46   actor gain: -0.62   critic loss: 0.41   steps: 1618


training loop:   3% |#                                 | ETA:  38 days, 4:24:05

Episode: 1619   score: 18.81   Avg score (100e): 18.47   actor gain: -0.62   critic loss: 0.41   steps: 1619


training loop:   3% |#                                 | ETA:  38 days, 4:18:06

Episode: 1620   score: 18.81   Avg score (100e): 18.48   actor gain: -0.62   critic loss: 0.41   steps: 1620


training loop:   3% |#                                 | ETA:  38 days, 4:12:56

Episode: 1621   score: 18.82   Avg score (100e): 18.49   actor gain: -0.62   critic loss: 0.41   steps: 1621


training loop:   3% |#                                 | ETA:  38 days, 4:09:33

Episode: 1622   score: 18.82   Avg score (100e): 18.49   actor gain: -0.63   critic loss: 0.41   steps: 1622


training loop:   3% |#                                 | ETA:  38 days, 4:04:05

Episode: 1623   score: 18.82   Avg score (100e): 18.50   actor gain: -0.63   critic loss: 0.41   steps: 1623


training loop:   3% |#                                 | ETA:  38 days, 3:58:22

Episode: 1624   score: 18.83   Avg score (100e): 18.51   actor gain: -0.64   critic loss: 0.41   steps: 1624


training loop:   3% |#                                 | ETA:  38 days, 3:52:50

Episode: 1625   score: 18.83   Avg score (100e): 18.51   actor gain: -0.63   critic loss: 0.41   steps: 1625


training loop:   3% |#                                 | ETA:  38 days, 3:53:42

Episode: 1626   score: 18.85   Avg score (100e): 18.52   actor gain: -0.64   critic loss: 0.41   steps: 1626


training loop:   3% |#                                 | ETA:  38 days, 3:49:00

Episode: 1627   score: 18.85   Avg score (100e): 18.53   actor gain: -0.65   critic loss: 0.41   steps: 1627


training loop:   3% |#                                 | ETA:  38 days, 3:44:02

Episode: 1628   score: 18.86   Avg score (100e): 18.53   actor gain: -0.65   critic loss: 0.41   steps: 1628


training loop:   3% |#                                 | ETA:  38 days, 3:40:55

Episode: 1629   score: 18.87   Avg score (100e): 18.54   actor gain: -0.65   critic loss: 0.41   steps: 1629


training loop:   3% |#                                 | ETA:  38 days, 3:36:23

Episode: 1630   score: 18.88   Avg score (100e): 18.55   actor gain: -0.65   critic loss: 0.41   steps: 1630


training loop:   3% |#                                 | ETA:  38 days, 3:31:36

Episode: 1631   score: 18.88   Avg score (100e): 18.55   actor gain: -0.64   critic loss: 0.41   steps: 1631


training loop:   3% |#                                 | ETA:  38 days, 3:28:00

Episode: 1632   score: 18.89   Avg score (100e): 18.56   actor gain: -0.64   critic loss: 0.41   steps: 1632


training loop:   3% |#                                 | ETA:  38 days, 3:22:16

Episode: 1633   score: 18.89   Avg score (100e): 18.57   actor gain: -0.62   critic loss: 0.41   steps: 1633
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 3:16:49

Episode: 1634   score: 18.90   Avg score (100e): 18.57   actor gain: -0.62   critic loss: 0.41   steps: 1634


training loop:   3% |#                                 | ETA:  38 days, 3:12:08

Episode: 1635   score: 18.91   Avg score (100e): 18.58   actor gain: -0.62   critic loss: 0.41   steps: 1635


training loop:   3% |#                                 | ETA:  38 days, 3:09:40

Episode: 1636   score: 18.92   Avg score (100e): 18.59   actor gain: -0.62   critic loss: 0.41   steps: 1636


training loop:   3% |#                                 | ETA:  38 days, 3:05:32

Episode: 1637   score: 18.92   Avg score (100e): 18.59   actor gain: -0.62   critic loss: 0.41   steps: 1637


training loop:   3% |#                                 | ETA:  38 days, 3:02:16

Episode: 1638   score: 18.92   Avg score (100e): 18.60   actor gain: -0.62   critic loss: 0.41   steps: 1638


training loop:   3% |#                                 | ETA:  38 days, 2:58:06

Episode: 1639   score: 18.93   Avg score (100e): 18.61   actor gain: -0.62   critic loss: 0.41   steps: 1639


training loop:   3% |#                                 | ETA:  38 days, 2:53:13

Episode: 1640   score: 18.94   Avg score (100e): 18.61   actor gain: -0.40   critic loss: 0.41   steps: 1640


training loop:   3% |#                                 | ETA:  38 days, 2:48:32

Episode: 1641   score: 18.94   Avg score (100e): 18.62   actor gain: -0.40   critic loss: 0.41   steps: 1641


training loop:   3% |#                                 | ETA:  38 days, 2:42:42

Episode: 1642   score: 18.95   Avg score (100e): 18.63   actor gain: -0.40   critic loss: 0.41   steps: 1642


training loop:   3% |#                                 | ETA:  38 days, 2:39:19

Episode: 1643   score: 18.95   Avg score (100e): 18.63   actor gain: -0.40   critic loss: 0.41   steps: 1643


training loop:   3% |#                                 | ETA:  38 days, 2:34:35

Episode: 1644   score: 18.96   Avg score (100e): 18.64   actor gain: -0.40   critic loss: 0.41   steps: 1644


training loop:   3% |#                                 | ETA:  38 days, 2:30:43

Episode: 1645   score: 18.96   Avg score (100e): 18.65   actor gain: -0.40   critic loss: 0.41   steps: 1645
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 2:28:11

Episode: 1646   score: 18.96   Avg score (100e): 18.65   actor gain: -0.40   critic loss: 0.41   steps: 1646


training loop:   3% |#                                 | ETA:  38 days, 2:24:39

Episode: 1647   score: 18.97   Avg score (100e): 18.66   actor gain: -0.39   critic loss: 0.41   steps: 1647


training loop:   3% |#                                 | ETA:  38 days, 2:21:52

Episode: 1648   score: 18.97   Avg score (100e): 18.67   actor gain: -0.39   critic loss: 0.41   steps: 1648


training loop:   3% |#                                 | ETA:  38 days, 2:18:35

Episode: 1649   score: 18.98   Avg score (100e): 18.67   actor gain: -0.38   critic loss: 0.41   steps: 1649
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  38 days, 2:14:39

Episode: 1650   score: 18.99   Avg score (100e): 18.68   actor gain: -0.38   critic loss: 0.41   steps: 1650


training loop:   3% |#                                 | ETA:  38 days, 2:11:06

Episode: 1651   score: 18.98   Avg score (100e): 18.68   actor gain: -0.38   critic loss: 0.41   steps: 1651


training loop:   3% |#                                 | ETA:  38 days, 2:05:46

Episode: 1652   score: 18.98   Avg score (100e): 18.69   actor gain: -0.38   critic loss: 0.41   steps: 1652


training loop:   3% |#                                 | ETA:  38 days, 2:03:14

Episode: 1653   score: 18.99   Avg score (100e): 18.70   actor gain: -0.38   critic loss: 0.41   steps: 1653


training loop:   3% |#                                 | ETA:  38 days, 1:58:22

Episode: 1654   score: 19.00   Avg score (100e): 18.70   actor gain: -0.38   critic loss: 0.41   steps: 1654


training loop:   3% |#                                 | ETA:  38 days, 1:54:45

Episode: 1655   score: 19.00   Avg score (100e): 18.71   actor gain: -0.39   critic loss: 0.41   steps: 1655


training loop:   3% |#                                 | ETA:  38 days, 1:51:06

Episode: 1656   score: 19.00   Avg score (100e): 18.72   actor gain: -0.39   critic loss: 0.41   steps: 1656


training loop:   3% |#                                 | ETA:  38 days, 1:45:58

Episode: 1657   score: 19.01   Avg score (100e): 18.72   actor gain: -0.39   critic loss: 0.41   steps: 1657


training loop:   3% |#                                 | ETA:  38 days, 1:40:12

Episode: 1658   score: 19.02   Avg score (100e): 18.73   actor gain: -0.39   critic loss: 0.41   steps: 1658


training loop:   3% |#                                 | ETA:  38 days, 1:40:24

Episode: 1659   score: 19.02   Avg score (100e): 18.73   actor gain: -0.39   critic loss: 0.41   steps: 1659


training loop:   3% |#                                 | ETA:  38 days, 1:36:32

Episode: 1660   score: 19.03   Avg score (100e): 18.74   actor gain: -0.39   critic loss: 0.41   steps: 1660


training loop:   3% |#                                 | ETA:  38 days, 1:32:43

Episode: 1661   score: 19.04   Avg score (100e): 18.75   actor gain: -0.39   critic loss: 0.41   steps: 1661


training loop:   3% |#                                 | ETA:  38 days, 1:29:21

Episode: 1662   score: 19.05   Avg score (100e): 18.75   actor gain: -0.40   critic loss: 0.41   steps: 1662


training loop:   3% |#                                 | ETA:  38 days, 1:26:35

Episode: 1663   score: 19.05   Avg score (100e): 18.76   actor gain: -0.40   critic loss: 0.41   steps: 1663


training loop:   3% |#                                 | ETA:  38 days, 1:23:58

Episode: 1664   score: 19.05   Avg score (100e): 18.76   actor gain: -0.40   critic loss: 0.41   steps: 1664


training loop:   3% |#                                 | ETA:  38 days, 1:20:11

Episode: 1665   score: 19.05   Avg score (100e): 18.77   actor gain: -0.39   critic loss: 0.41   steps: 1665


training loop:   3% |#                                 | ETA:  38 days, 1:16:46

Episode: 1666   score: 19.05   Avg score (100e): 18.78   actor gain: -0.40   critic loss: 0.41   steps: 1666


training loop:   3% |#                                 | ETA:  38 days, 1:12:09

Episode: 1667   score: 19.05   Avg score (100e): 18.78   actor gain: -0.40   critic loss: 0.41   steps: 1667


training loop:   3% |#                                 | ETA:  38 days, 1:06:38

Episode: 1668   score: 19.06   Avg score (100e): 18.79   actor gain: -0.40   critic loss: 0.41   steps: 1668


training loop:   3% |#                                 | ETA:  38 days, 1:02:32

Episode: 1669   score: 19.07   Avg score (100e): 18.79   actor gain: -0.40   critic loss: 0.41   steps: 1669


training loop:   3% |#                                 | ETA:  38 days, 0:58:46

Episode: 1670   score: 19.08   Avg score (100e): 18.80   actor gain: -0.40   critic loss: 0.41   steps: 1670


training loop:   3% |#                                 | ETA:  38 days, 0:56:41

Episode: 1671   score: 19.09   Avg score (100e): 18.81   actor gain: -0.40   critic loss: 0.41   steps: 1671


training loop:   3% |#                                 | ETA:  38 days, 0:52:10

Episode: 1672   score: 19.09   Avg score (100e): 18.81   actor gain: -0.41   critic loss: 0.41   steps: 1672


training loop:   3% |#                                 | ETA:  38 days, 0:48:32

Episode: 1673   score: 19.09   Avg score (100e): 18.82   actor gain: -0.41   critic loss: 0.41   steps: 1673


training loop:   3% |#                                 | ETA:  38 days, 0:44:22

Episode: 1674   score: 19.10   Avg score (100e): 18.82   actor gain: -0.41   critic loss: 0.41   steps: 1674


training loop:   3% |#                                 | ETA:  38 days, 0:39:55

Episode: 1675   score: 19.10   Avg score (100e): 18.83   actor gain: -0.41   critic loss: 0.41   steps: 1675


training loop:   3% |#                                 | ETA:  38 days, 0:35:27

Episode: 1676   score: 19.11   Avg score (100e): 18.84   actor gain: -0.41   critic loss: 0.41   steps: 1676


training loop:   3% |#                                 | ETA:  38 days, 0:30:48

Episode: 1677   score: 19.11   Avg score (100e): 18.84   actor gain: -0.40   critic loss: 0.41   steps: 1677


training loop:   3% |#                                 | ETA:  38 days, 0:26:13

Episode: 1678   score: 19.12   Avg score (100e): 18.85   actor gain: -0.40   critic loss: 0.41   steps: 1678


training loop:   3% |#                                 | ETA:  38 days, 0:22:41

Episode: 1679   score: 19.13   Avg score (100e): 18.85   actor gain: -0.41   critic loss: 0.41   steps: 1679


training loop:   3% |#                                 | ETA:  38 days, 0:18:48

Episode: 1680   score: 19.14   Avg score (100e): 18.86   actor gain: -0.41   critic loss: 0.41   steps: 1680


training loop:   3% |#                                 | ETA:  38 days, 0:14:12

Episode: 1681   score: 19.15   Avg score (100e): 18.86   actor gain: -0.41   critic loss: 0.41   steps: 1681


training loop:   3% |#                                 | ETA:  38 days, 0:10:18

Episode: 1682   score: 19.15   Avg score (100e): 18.87   actor gain: -0.41   critic loss: 0.41   steps: 1682


training loop:   3% |#                                 | ETA:  38 days, 0:07:00

Episode: 1683   score: 19.16   Avg score (100e): 18.88   actor gain: -0.41   critic loss: 0.41   steps: 1683


training loop:   3% |#                                 | ETA:  38 days, 0:02:45

Episode: 1684   score: 19.17   Avg score (100e): 18.88   actor gain: -0.41   critic loss: 0.41   steps: 1684


training loop:   3% |#                                | ETA:  37 days, 23:58:04

Episode: 1685   score: 19.17   Avg score (100e): 18.89   actor gain: -0.41   critic loss: 0.41   steps: 1685


training loop:   3% |#                                | ETA:  37 days, 23:55:14

Episode: 1686   score: 19.17   Avg score (100e): 18.89   actor gain: -0.41   critic loss: 0.41   steps: 1686


training loop:   3% |#                                | ETA:  37 days, 23:51:16

Episode: 1687   score: 19.18   Avg score (100e): 18.90   actor gain: -0.40   critic loss: 0.41   steps: 1687


training loop:   3% |#                                | ETA:  37 days, 23:45:19

Episode: 1688   score: 19.18   Avg score (100e): 18.91   actor gain: -0.40   critic loss: 0.41   steps: 1688


training loop:   3% |#                                | ETA:  37 days, 23:40:42

Episode: 1689   score: 19.19   Avg score (100e): 18.91   actor gain: -0.40   critic loss: 0.41   steps: 1689


training loop:   3% |#                                | ETA:  37 days, 23:36:50

Episode: 1690   score: 19.20   Avg score (100e): 18.92   actor gain: -0.40   critic loss: 0.41   steps: 1690


training loop:   3% |#                                | ETA:  37 days, 23:37:14

Episode: 1691   score: 19.21   Avg score (100e): 18.92   actor gain: -0.40   critic loss: 0.41   steps: 1691


training loop:   3% |#                                | ETA:  37 days, 23:34:08

Episode: 1692   score: 19.21   Avg score (100e): 18.93   actor gain: -0.40   critic loss: 0.41   steps: 1692


training loop:   3% |#                                | ETA:  37 days, 23:32:10

Episode: 1693   score: 19.22   Avg score (100e): 18.94   actor gain: -0.41   critic loss: 0.41   steps: 1693


training loop:   3% |#                                | ETA:  37 days, 23:28:28

Episode: 1694   score: 19.22   Avg score (100e): 18.94   actor gain: -0.41   critic loss: 0.41   steps: 1694


training loop:   3% |#                                | ETA:  37 days, 23:23:52

Episode: 1695   score: 19.22   Avg score (100e): 18.95   actor gain: -0.41   critic loss: 0.41   steps: 1695
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 23:20:19

Episode: 1696   score: 19.22   Avg score (100e): 18.95   actor gain: -0.41   critic loss: 0.41   steps: 1696


training loop:   3% |#                                | ETA:  37 days, 23:15:24

Episode: 1697   score: 19.22   Avg score (100e): 18.96   actor gain: -0.40   critic loss: 0.41   steps: 1697


training loop:   3% |#                                | ETA:  37 days, 23:11:16

Episode: 1698   score: 19.23   Avg score (100e): 18.96   actor gain: -0.40   critic loss: 0.41   steps: 1698


training loop:   3% |#                                | ETA:  37 days, 23:06:20

Episode: 1699   score: 19.24   Avg score (100e): 18.97   actor gain: -0.40   critic loss: 0.41   steps: 1699


training loop:   3% |#                                | ETA:  37 days, 23:04:23

Episode: 1700   score: 19.24   Avg score (100e): 18.98   actor gain: -0.40   critic loss: 0.41   steps: 1700


training loop:   3% |#                                | ETA:  37 days, 22:59:20

Episode: 1701   score: 19.25   Avg score (100e): 18.98   actor gain: -0.40   critic loss: 0.41   steps: 1701


training loop:   3% |#                                | ETA:  37 days, 22:54:43

Episode: 1702   score: 19.25   Avg score (100e): 18.99   actor gain: -0.40   critic loss: 0.41   steps: 1702
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 22:51:23

Episode: 1703   score: 19.25   Avg score (100e): 18.99   actor gain: -0.40   critic loss: 0.41   steps: 1703


training loop:   3% |#                                | ETA:  37 days, 22:49:41

Episode: 1704   score: 19.25   Avg score (100e): 19.00   actor gain: -0.39   critic loss: 0.41   steps: 1704


training loop:   3% |#                                | ETA:  37 days, 22:48:27

Episode: 1705   score: 19.25   Avg score (100e): 19.00   actor gain: -0.39   critic loss: 0.41   steps: 1705


training loop:   3% |#                                | ETA:  37 days, 22:46:41

Episode: 1706   score: 19.25   Avg score (100e): 19.01   actor gain: -0.39   critic loss: 0.41   steps: 1706


training loop:   3% |#                                | ETA:  37 days, 22:43:36

Episode: 1707   score: 19.25   Avg score (100e): 19.02   actor gain: -0.39   critic loss: 0.41   steps: 1707


training loop:   3% |#                                | ETA:  37 days, 22:41:16

Episode: 1708   score: 19.26   Avg score (100e): 19.02   actor gain: -0.39   critic loss: 0.41   steps: 1708


training loop:   3% |#                                | ETA:  37 days, 22:38:57

Episode: 1709   score: 19.26   Avg score (100e): 19.03   actor gain: -0.38   critic loss: 0.41   steps: 1709


training loop:   3% |#                                | ETA:  37 days, 22:36:17

Episode: 1710   score: 19.27   Avg score (100e): 19.03   actor gain: -0.38   critic loss: 0.41   steps: 1710


training loop:   3% |#                                | ETA:  37 days, 22:33:23

Episode: 1711   score: 19.26   Avg score (100e): 19.04   actor gain: -0.38   critic loss: 0.41   steps: 1711


training loop:   3% |#                                | ETA:  37 days, 22:29:11

Episode: 1712   score: 19.27   Avg score (100e): 19.04   actor gain: -0.38   critic loss: 0.41   steps: 1712


training loop:   3% |#                                | ETA:  37 days, 22:25:14

Episode: 1713   score: 19.26   Avg score (100e): 19.05   actor gain: -0.38   critic loss: 0.41   steps: 1713


training loop:   3% |#                                | ETA:  37 days, 22:20:09

Episode: 1714   score: 19.27   Avg score (100e): 19.05   actor gain: -0.38   critic loss: 0.41   steps: 1714


training loop:   3% |#                                | ETA:  37 days, 22:17:20

Episode: 1715   score: 19.27   Avg score (100e): 19.06   actor gain: -0.39   critic loss: 0.41   steps: 1715


training loop:   3% |#                                | ETA:  37 days, 22:14:24

Episode: 1716   score: 19.29   Avg score (100e): 19.06   actor gain: -0.39   critic loss: 0.41   steps: 1716


training loop:   3% |#                                | ETA:  37 days, 22:10:39

Episode: 1717   score: 19.29   Avg score (100e): 19.07   actor gain: -0.39   critic loss: 0.41   steps: 1717


training loop:   3% |#                                | ETA:  37 days, 22:06:37

Episode: 1718   score: 19.29   Avg score (100e): 19.07   actor gain: -0.37   critic loss: 0.41   steps: 1718


training loop:   3% |#                                | ETA:  37 days, 22:02:50

Episode: 1719   score: 19.30   Avg score (100e): 19.08   actor gain: -0.37   critic loss: 0.41   steps: 1719


training loop:   3% |#                                | ETA:  37 days, 22:00:02

Episode: 1720   score: 19.30   Avg score (100e): 19.08   actor gain: -0.37   critic loss: 0.41   steps: 1720


training loop:   3% |#                                | ETA:  37 days, 21:55:53

Episode: 1721   score: 19.30   Avg score (100e): 19.09   actor gain: -0.38   critic loss: 0.41   steps: 1721


training loop:   3% |#                                | ETA:  37 days, 21:52:19

Episode: 1722   score: 19.31   Avg score (100e): 19.09   actor gain: -0.39   critic loss: 0.41   steps: 1722


training loop:   3% |#                                | ETA:  37 days, 21:53:50

Episode: 1723   score: 19.30   Avg score (100e): 19.10   actor gain: -0.39   critic loss: 0.41   steps: 1723


training loop:   3% |#                                | ETA:  37 days, 21:50:21

Episode: 1724   score: 19.31   Avg score (100e): 19.10   actor gain: -0.39   critic loss: 0.41   steps: 1724


training loop:   3% |#                                | ETA:  37 days, 21:47:01

Episode: 1725   score: 19.32   Avg score (100e): 19.11   actor gain: -0.39   critic loss: 0.41   steps: 1725


training loop:   3% |#                                | ETA:  37 days, 21:46:15

Episode: 1726   score: 19.32   Avg score (100e): 19.11   actor gain: -0.39   critic loss: 0.41   steps: 1726


training loop:   3% |#                                | ETA:  37 days, 21:43:37

Episode: 1727   score: 19.32   Avg score (100e): 19.11   actor gain: -0.39   critic loss: 0.41   steps: 1727


training loop:   3% |#                                | ETA:  37 days, 21:40:16

Episode: 1728   score: 19.34   Avg score (100e): 19.12   actor gain: -0.38   critic loss: 0.41   steps: 1728


training loop:   3% |#                                | ETA:  37 days, 21:38:26

Episode: 1729   score: 19.35   Avg score (100e): 19.12   actor gain: -0.38   critic loss: 0.41   steps: 1729


training loop:   3% |#                                | ETA:  37 days, 21:34:37

Episode: 1730   score: 19.35   Avg score (100e): 19.13   actor gain: -0.38   critic loss: 0.41   steps: 1730
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 21:31:12

Episode: 1731   score: 19.35   Avg score (100e): 19.13   actor gain: -0.39   critic loss: 0.41   steps: 1731


training loop:   3% |#                                | ETA:  37 days, 21:27:42

Episode: 1732   score: 19.36   Avg score (100e): 19.14   actor gain: -0.39   critic loss: 0.41   steps: 1732
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 21:24:28

Episode: 1733   score: 19.36   Avg score (100e): 19.14   actor gain: -0.39   critic loss: 0.41   steps: 1733


training loop:   3% |#                                | ETA:  37 days, 21:20:51

Episode: 1734   score: 19.37   Avg score (100e): 19.15   actor gain: -0.39   critic loss: 0.41   steps: 1734


training loop:   3% |#                                | ETA:  37 days, 21:16:00

Episode: 1735   score: 19.38   Avg score (100e): 19.15   actor gain: -0.40   critic loss: 0.41   steps: 1735


training loop:   3% |#                                | ETA:  37 days, 21:13:15

Episode: 1736   score: 19.39   Avg score (100e): 19.16   actor gain: -0.40   critic loss: 0.41   steps: 1736


training loop:   3% |#                                | ETA:  37 days, 21:09:16

Episode: 1737   score: 19.38   Avg score (100e): 19.16   actor gain: -0.40   critic loss: 0.41   steps: 1737


training loop:   3% |#                                | ETA:  37 days, 21:05:57

Episode: 1738   score: 19.39   Avg score (100e): 19.17   actor gain: -0.40   critic loss: 0.41   steps: 1738


training loop:   3% |#                                | ETA:  37 days, 21:03:13

Episode: 1739   score: 19.39   Avg score (100e): 19.17   actor gain: -0.40   critic loss: 0.41   steps: 1739


training loop:   3% |#                                | ETA:  37 days, 21:00:12

Episode: 1740   score: 19.39   Avg score (100e): 19.18   actor gain: -0.39   critic loss: 0.41   steps: 1740


training loop:   3% |#                                | ETA:  37 days, 20:56:06

Episode: 1741   score: 19.40   Avg score (100e): 19.18   actor gain: -0.39   critic loss: 0.41   steps: 1741


training loop:   3% |#                                | ETA:  37 days, 20:52:53

Episode: 1742   score: 19.40   Avg score (100e): 19.18   actor gain: -0.39   critic loss: 0.41   steps: 1742


training loop:   3% |#                                | ETA:  37 days, 20:50:54

Episode: 1743   score: 19.40   Avg score (100e): 19.19   actor gain: -0.39   critic loss: 0.41   steps: 1743


training loop:   3% |#                                | ETA:  37 days, 20:46:56

Episode: 1744   score: 19.41   Avg score (100e): 19.19   actor gain: -0.39   critic loss: 0.41   steps: 1744


training loop:   3% |#                                | ETA:  37 days, 20:43:15

Episode: 1745   score: 19.42   Avg score (100e): 19.20   actor gain: -0.39   critic loss: 0.41   steps: 1745


training loop:   3% |#                                | ETA:  37 days, 20:41:07

Episode: 1746   score: 19.42   Avg score (100e): 19.20   actor gain: -0.38   critic loss: 0.41   steps: 1746


training loop:   3% |#                                | ETA:  37 days, 20:37:36

Episode: 1747   score: 19.42   Avg score (100e): 19.21   actor gain: -0.38   critic loss: 0.41   steps: 1747


training loop:   3% |#                                | ETA:  37 days, 20:33:43

Episode: 1748   score: 19.43   Avg score (100e): 19.21   actor gain: -0.39   critic loss: 0.41   steps: 1748


training loop:   3% |#                                | ETA:  37 days, 20:30:18

Episode: 1749   score: 19.43   Avg score (100e): 19.22   actor gain: -0.40   critic loss: 0.41   steps: 1749


training loop:   3% |#                                | ETA:  37 days, 20:26:06

Episode: 1750   score: 19.44   Avg score (100e): 19.22   actor gain: -0.40   critic loss: 0.41   steps: 1750


training loop:   3% |#                                | ETA:  37 days, 20:22:35

Episode: 1751   score: 19.45   Avg score (100e): 19.23   actor gain: -0.40   critic loss: 0.41   steps: 1751


training loop:   3% |#                                | ETA:  37 days, 20:20:19

Episode: 1752   score: 19.45   Avg score (100e): 19.23   actor gain: -0.40   critic loss: 0.41   steps: 1752


training loop:   3% |#                                | ETA:  37 days, 20:18:07

Episode: 1753   score: 19.45   Avg score (100e): 19.23   actor gain: -0.40   critic loss: 0.41   steps: 1753


training loop:   3% |#                                | ETA:  37 days, 20:13:39

Episode: 1754   score: 19.46   Avg score (100e): 19.24   actor gain: -0.40   critic loss: 0.41   steps: 1754


training loop:   3% |#                                | ETA:  37 days, 20:09:46

Episode: 1755   score: 19.46   Avg score (100e): 19.24   actor gain: -0.40   critic loss: 0.41   steps: 1755


training loop:   3% |#                                | ETA:  37 days, 20:10:50

Episode: 1756   score: 19.47   Avg score (100e): 19.25   actor gain: -0.39   critic loss: 0.41   steps: 1756


training loop:   3% |#                                | ETA:  37 days, 20:08:35

Episode: 1757   score: 19.47   Avg score (100e): 19.25   actor gain: -0.39   critic loss: 0.41   steps: 1757
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 20:05:44

Episode: 1758   score: 19.47   Avg score (100e): 19.26   actor gain: -0.43   critic loss: 0.41   steps: 1758


training loop:   3% |#                                | ETA:  37 days, 20:04:26

Episode: 1759   score: 19.48   Avg score (100e): 19.26   actor gain: -0.43   critic loss: 0.41   steps: 1759


training loop:   3% |#                                | ETA:  37 days, 20:04:05

Episode: 1760   score: 19.48   Avg score (100e): 19.27   actor gain: -0.43   critic loss: 0.41   steps: 1760
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 20:00:56

Episode: 1761   score: 19.50   Avg score (100e): 19.27   actor gain: -0.43   critic loss: 0.41   steps: 1761


training loop:   3% |#                                | ETA:  37 days, 19:59:09

Episode: 1762   score: 19.50   Avg score (100e): 19.28   actor gain: -0.43   critic loss: 0.41   steps: 1762


training loop:   3% |#                                | ETA:  37 days, 19:55:02

Episode: 1763   score: 19.51   Avg score (100e): 19.28   actor gain: -0.43   critic loss: 0.41   steps: 1763


training loop:   3% |#                                | ETA:  37 days, 19:52:31

Episode: 1764   score: 19.51   Avg score (100e): 19.29   actor gain: -0.43   critic loss: 0.41   steps: 1764


training loop:   3% |#                                | ETA:  37 days, 19:48:52

Episode: 1765   score: 19.52   Avg score (100e): 19.29   actor gain: -0.42   critic loss: 0.40   steps: 1765


training loop:   3% |#                                | ETA:  37 days, 19:46:38

Episode: 1766   score: 19.52   Avg score (100e): 19.29   actor gain: -0.42   critic loss: 0.40   steps: 1766


training loop:   3% |#                                | ETA:  37 days, 19:42:34

Episode: 1767   score: 19.52   Avg score (100e): 19.30   actor gain: -0.43   critic loss: 0.40   steps: 1767


training loop:   3% |#                                | ETA:  37 days, 19:39:28

Episode: 1768   score: 19.53   Avg score (100e): 19.30   actor gain: -0.43   critic loss: 0.41   steps: 1768
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 19:39:10

Episode: 1769   score: 19.53   Avg score (100e): 19.31   actor gain: -0.43   critic loss: 0.41   steps: 1769


training loop:   3% |#                                | ETA:  37 days, 19:34:59

Episode: 1770   score: 19.53   Avg score (100e): 19.31   actor gain: -0.45   critic loss: 0.41   steps: 1770


training loop:   3% |#                                | ETA:  37 days, 19:31:06

Episode: 1771   score: 19.53   Avg score (100e): 19.32   actor gain: -0.46   critic loss: 0.41   steps: 1771


training loop:   3% |#                                | ETA:  37 days, 19:28:59

Episode: 1772   score: 19.55   Avg score (100e): 19.32   actor gain: -0.48   critic loss: 0.41   steps: 1772


training loop:   3% |#                                | ETA:  37 days, 19:25:21

Episode: 1773   score: 19.55   Avg score (100e): 19.33   actor gain: -0.46   critic loss: 0.41   steps: 1773


training loop:   3% |#                                | ETA:  37 days, 19:22:27

Episode: 1774   score: 19.57   Avg score (100e): 19.33   actor gain: -0.46   critic loss: 0.41   steps: 1774


training loop:   3% |#                                | ETA:  37 days, 19:18:38

Episode: 1775   score: 19.57   Avg score (100e): 19.34   actor gain: -0.46   critic loss: 0.41   steps: 1775


training loop:   3% |#                                | ETA:  37 days, 19:17:54

Episode: 1776   score: 19.58   Avg score (100e): 19.34   actor gain: -0.46   critic loss: 0.41   steps: 1776


training loop:   3% |#                                | ETA:  37 days, 19:14:32

Episode: 1777   score: 19.59   Avg score (100e): 19.35   actor gain: -0.46   critic loss: 0.41   steps: 1777


training loop:   3% |#                                | ETA:  37 days, 19:11:41

Episode: 1778   score: 19.59   Avg score (100e): 19.35   actor gain: -0.46   critic loss: 0.40   steps: 1778


training loop:   3% |#                                | ETA:  37 days, 19:09:23

Episode: 1779   score: 19.60   Avg score (100e): 19.35   actor gain: -0.46   critic loss: 0.41   steps: 1779


training loop:   3% |#                                | ETA:  37 days, 19:05:23

Episode: 1780   score: 19.60   Avg score (100e): 19.36   actor gain: -0.46   critic loss: 0.40   steps: 1780


training loop:   3% |#                                | ETA:  37 days, 19:01:18

Episode: 1781   score: 19.60   Avg score (100e): 19.36   actor gain: -0.46   critic loss: 0.40   steps: 1781


training loop:   3% |#                                | ETA:  37 days, 18:58:15

Episode: 1782   score: 19.61   Avg score (100e): 19.37   actor gain: -0.46   critic loss: 0.40   steps: 1782


training loop:   3% |#                                | ETA:  37 days, 18:55:25

Episode: 1783   score: 19.62   Avg score (100e): 19.37   actor gain: -0.42   critic loss: 0.40   steps: 1783


training loop:   3% |#                                | ETA:  37 days, 18:51:35

Episode: 1784   score: 19.62   Avg score (100e): 19.38   actor gain: -0.42   critic loss: 0.41   steps: 1784
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 18:49:20

Episode: 1785   score: 19.62   Avg score (100e): 19.38   actor gain: -0.42   critic loss: 0.41   steps: 1785


training loop:   3% |#                                | ETA:  37 days, 18:46:40

Episode: 1786   score: 19.62   Avg score (100e): 19.39   actor gain: -0.42   critic loss: 0.41   steps: 1786


training loop:   3% |#                                | ETA:  37 days, 18:43:32

Episode: 1787   score: 19.62   Avg score (100e): 19.39   actor gain: -0.42   critic loss: 0.41   steps: 1787


training loop:   3% |#                                | ETA:  37 days, 18:45:06

Episode: 1788   score: 19.63   Avg score (100e): 19.40   actor gain: -0.42   critic loss: 0.41   steps: 1788


training loop:   3% |#                                | ETA:  37 days, 18:42:42

Episode: 1789   score: 19.64   Avg score (100e): 19.40   actor gain: -0.42   critic loss: 0.41   steps: 1789


training loop:   3% |#                                | ETA:  37 days, 18:39:56

Episode: 1790   score: 19.65   Avg score (100e): 19.40   actor gain: -0.42   critic loss: 0.41   steps: 1790


training loop:   3% |#                                | ETA:  37 days, 18:36:11

Episode: 1791   score: 19.65   Avg score (100e): 19.41   actor gain: -0.42   critic loss: 0.41   steps: 1791


training loop:   3% |#                                | ETA:  37 days, 18:34:37

Episode: 1792   score: 19.65   Avg score (100e): 19.41   actor gain: -0.42   critic loss: 0.41   steps: 1792


training loop:   3% |#                                | ETA:  37 days, 18:32:02

Episode: 1793   score: 19.65   Avg score (100e): 19.42   actor gain: -0.43   critic loss: 0.41   steps: 1793
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 18:28:09

Episode: 1794   score: 19.65   Avg score (100e): 19.42   actor gain: -0.43   critic loss: 0.40   steps: 1794


training loop:   3% |#                                | ETA:  37 days, 18:26:04

Episode: 1795   score: 19.66   Avg score (100e): 19.43   actor gain: -0.41   critic loss: 0.41   steps: 1795


training loop:   3% |#                                | ETA:  37 days, 18:22:02

Episode: 1796   score: 19.66   Avg score (100e): 19.43   actor gain: -0.40   critic loss: 0.40   steps: 1796


training loop:   3% |#                                | ETA:  37 days, 18:18:53

Episode: 1797   score: 19.67   Avg score (100e): 19.44   actor gain: -0.40   critic loss: 0.40   steps: 1797


training loop:   3% |#                                | ETA:  37 days, 18:16:11

Episode: 1798   score: 19.68   Avg score (100e): 19.44   actor gain: -0.40   critic loss: 0.40   steps: 1798


training loop:   3% |#                                | ETA:  37 days, 18:13:58

Episode: 1799   score: 19.68   Avg score (100e): 19.44   actor gain: -0.40   critic loss: 0.40   steps: 1799


training loop:   3% |#                                | ETA:  37 days, 18:10:31

Episode: 1800   score: 19.68   Avg score (100e): 19.45   actor gain: -0.40   critic loss: 0.40   steps: 1800


training loop:   3% |#                                | ETA:  37 days, 18:07:20

Episode: 1801   score: 19.69   Avg score (100e): 19.45   actor gain: -0.40   critic loss: 0.40   steps: 1801


training loop:   3% |#                                | ETA:  37 days, 18:05:09

Episode: 1802   score: 19.69   Avg score (100e): 19.46   actor gain: -0.40   critic loss: 0.40   steps: 1802


training loop:   3% |#                                | ETA:  37 days, 18:01:14

Episode: 1803   score: 19.69   Avg score (100e): 19.46   actor gain: -0.40   critic loss: 0.41   steps: 1803


training loop:   3% |#                                | ETA:  37 days, 17:57:41

Episode: 1804   score: 19.69   Avg score (100e): 19.47   actor gain: -0.40   critic loss: 0.41   steps: 1804


training loop:   3% |#                                | ETA:  37 days, 17:56:00

Episode: 1805   score: 19.70   Avg score (100e): 19.47   actor gain: -0.40   critic loss: 0.41   steps: 1805
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 17:53:03

Episode: 1806   score: 19.71   Avg score (100e): 19.48   actor gain: -0.40   critic loss: 0.41   steps: 1806


training loop:   3% |#                                | ETA:  37 days, 17:49:03

Episode: 1807   score: 19.71   Avg score (100e): 19.48   actor gain: -0.40   critic loss: 0.41   steps: 1807


training loop:   3% |#                                | ETA:  37 days, 17:46:58

Episode: 1808   score: 19.73   Avg score (100e): 19.48   actor gain: -0.40   critic loss: 0.41   steps: 1808


training loop:   3% |#                                | ETA:  37 days, 17:43:33

Episode: 1809   score: 19.72   Avg score (100e): 19.49   actor gain: -0.40   critic loss: 0.41   steps: 1809


training loop:   3% |#                                | ETA:  37 days, 17:39:23

Episode: 1810   score: 19.73   Avg score (100e): 19.49   actor gain: -0.40   critic loss: 0.41   steps: 1810


training loop:   3% |#                                | ETA:  37 days, 17:36:17

Episode: 1811   score: 19.73   Avg score (100e): 19.50   actor gain: -0.40   critic loss: 0.41   steps: 1811


training loop:   3% |#                                | ETA:  37 days, 17:33:48

Episode: 1812   score: 19.73   Avg score (100e): 19.50   actor gain: -0.40   critic loss: 0.41   steps: 1812


training loop:   3% |#                                | ETA:  37 days, 17:29:49

Episode: 1813   score: 19.74   Avg score (100e): 19.51   actor gain: -0.41   critic loss: 0.41   steps: 1813


training loop:   3% |#                                | ETA:  37 days, 17:24:57

Episode: 1814   score: 19.75   Avg score (100e): 19.51   actor gain: -0.41   critic loss: 0.41   steps: 1814
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  37 days, 17:21:12

Episode: 1815   score: 19.75   Avg score (100e): 19.52   actor gain: -0.41   critic loss: 0.41   steps: 1815


training loop:   3% |#                                | ETA:  37 days, 17:17:54

Episode: 1816   score: 19.76   Avg score (100e): 19.52   actor gain: -0.41   critic loss: 0.41   steps: 1816


training loop:   3% |#                                | ETA:  37 days, 17:16:38

Episode: 1817   score: 19.76   Avg score (100e): 19.53   actor gain: -0.41   critic loss: 0.41   steps: 1817


training loop:   3% |#                                | ETA:  37 days, 17:14:07

Episode: 1818   score: 19.77   Avg score (100e): 19.53   actor gain: -0.40   critic loss: 0.41   steps: 1818


training loop:   3% |#                                | ETA:  37 days, 17:10:45

Episode: 1819   score: 19.77   Avg score (100e): 19.54   actor gain: -0.40   critic loss: 0.41   steps: 1819


training loop:   3% |#                                | ETA:  37 days, 17:10:58

Episode: 1820   score: 19.78   Avg score (100e): 19.54   actor gain: -0.40   critic loss: 0.41   steps: 1820


training loop:   3% |#                                | ETA:  37 days, 17:09:31

Episode: 1821   score: 19.78   Avg score (100e): 19.55   actor gain: -0.40   critic loss: 0.41   steps: 1821


training loop:   3% |#                                | ETA:  37 days, 17:08:13

Episode: 1822   score: 19.79   Avg score (100e): 19.55   actor gain: -0.37   critic loss: 0.41   steps: 1822


training loop:   3% |#                                | ETA:  37 days, 17:05:26

Episode: 1823   score: 19.79   Avg score (100e): 19.56   actor gain: -0.37   critic loss: 0.41   steps: 1823


training loop:   3% |#                                | ETA:  37 days, 17:02:42

Episode: 1824   score: 19.79   Avg score (100e): 19.56   actor gain: -0.37   critic loss: 0.41   steps: 1824


training loop:   3% |#                                | ETA:  37 days, 17:00:39

Episode: 1825   score: 19.80   Avg score (100e): 19.57   actor gain: -0.38   critic loss: 0.41   steps: 1825


training loop:   3% |#                                | ETA:  37 days, 16:57:53

Episode: 1826   score: 19.80   Avg score (100e): 19.57   actor gain: -0.38   critic loss: 0.41   steps: 1826


training loop:   3% |#                                | ETA:  37 days, 16:54:18

Episode: 1827   score: 19.81   Avg score (100e): 19.58   actor gain: -0.41   critic loss: 0.41   steps: 1827


training loop:   3% |#                                | ETA:  37 days, 16:53:12

Episode: 1828   score: 19.83   Avg score (100e): 19.58   actor gain: -0.44   critic loss: 0.41   steps: 1828


training loop:   3% |#                                | ETA:  37 days, 16:50:29

Episode: 1829   score: 19.84   Avg score (100e): 19.58   actor gain: -0.44   critic loss: 0.41   steps: 1829


training loop:   3% |#                                | ETA:  37 days, 16:47:09

Episode: 1830   score: 19.84   Avg score (100e): 19.59   actor gain: -0.44   critic loss: 0.41   steps: 1830


training loop:   3% |#                                | ETA:  37 days, 16:45:15

Episode: 1831   score: 19.84   Avg score (100e): 19.59   actor gain: -0.44   critic loss: 0.41   steps: 1831


training loop:   3% |#                                | ETA:  37 days, 16:41:23

Episode: 1832   score: 19.85   Avg score (100e): 19.60   actor gain: -0.44   critic loss: 0.41   steps: 1832


training loop:   3% |#                                | ETA:  37 days, 16:38:18

Episode: 1833   score: 19.85   Avg score (100e): 19.60   actor gain: -0.44   critic loss: 0.41   steps: 1833


training loop:   3% |#                                | ETA:  37 days, 16:35:02

Episode: 1834   score: 19.85   Avg score (100e): 19.61   actor gain: -0.44   critic loss: 0.41   steps: 1834


training loop:   3% |#                                | ETA:  37 days, 16:33:13

Episode: 1835   score: 19.86   Avg score (100e): 19.61   actor gain: -0.47   critic loss: 0.41   steps: 1835


training loop:   3% |#                                | ETA:  37 days, 16:29:09

Episode: 1836   score: 19.87   Avg score (100e): 19.62   actor gain: -0.47   critic loss: 0.41   steps: 1836


training loop:   3% |#                                | ETA:  37 days, 16:24:49

Episode: 1837   score: 19.87   Avg score (100e): 19.62   actor gain: -0.47   critic loss: 0.41   steps: 1837


training loop:   3% |#                                | ETA:  37 days, 16:22:42

Episode: 1838   score: 19.88   Avg score (100e): 19.63   actor gain: -0.45   critic loss: 0.41   steps: 1838


training loop:   3% |#                                | ETA:  37 days, 16:19:24

Episode: 1839   score: 19.88   Avg score (100e): 19.63   actor gain: -0.45   critic loss: 0.41   steps: 1839


training loop:   3% |#                                | ETA:  37 days, 16:15:43

Episode: 1840   score: 19.89   Avg score (100e): 19.64   actor gain: -0.45   critic loss: 0.41   steps: 1840


training loop:   3% |#                                | ETA:  37 days, 16:14:46

Episode: 1841   score: 19.89   Avg score (100e): 19.64   actor gain: -0.45   critic loss: 0.41   steps: 1841


training loop:   3% |#                                | ETA:  37 days, 16:11:11

Episode: 1842   score: 19.90   Avg score (100e): 19.65   actor gain: -0.45   critic loss: 0.41   steps: 1842


training loop:   3% |#                                | ETA:  37 days, 16:05:59

Episode: 1843   score: 19.90   Avg score (100e): 19.65   actor gain: -0.45   critic loss: 0.41   steps: 1843


training loop:   3% |#                                | ETA:  37 days, 16:02:43

Episode: 1844   score: 19.90   Avg score (100e): 19.66   actor gain: -0.45   critic loss: 0.41   steps: 1844


training loop:   3% |#                                | ETA:  37 days, 16:01:00

Episode: 1845   score: 19.90   Avg score (100e): 19.66   actor gain: -0.45   critic loss: 0.41   steps: 1845


training loop:   3% |#                                | ETA:  37 days, 15:58:14

Episode: 1846   score: 19.91   Avg score (100e): 19.67   actor gain: -0.45   critic loss: 0.41   steps: 1846


training loop:   3% |#                                | ETA:  37 days, 15:55:25

Episode: 1847   score: 19.91   Avg score (100e): 19.67   actor gain: -0.45   critic loss: 0.41   steps: 1847


training loop:   3% |#                                | ETA:  37 days, 15:53:48

Episode: 1848   score: 19.92   Avg score (100e): 19.68   actor gain: -0.45   critic loss: 0.41   steps: 1848


training loop:   3% |#                                | ETA:  37 days, 15:50:38

Episode: 1849   score: 19.93   Avg score (100e): 19.68   actor gain: -0.46   critic loss: 0.41   steps: 1849


training loop:   3% |#                                | ETA:  37 days, 15:47:01

Episode: 1850   score: 19.93   Avg score (100e): 19.69   actor gain: -0.46   critic loss: 0.41   steps: 1850


training loop:   3% |#                                | ETA:  37 days, 15:45:11

Episode: 1851   score: 19.94   Avg score (100e): 19.69   actor gain: -0.46   critic loss: 0.41   steps: 1851


training loop:   3% |#                                | ETA:  37 days, 15:41:28

Episode: 1852   score: 19.95   Avg score (100e): 19.70   actor gain: -0.43   critic loss: 0.41   steps: 1852


training loop:   3% |#                                | ETA:  37 days, 15:42:37

Episode: 1853   score: 19.95   Avg score (100e): 19.70   actor gain: -0.39   critic loss: 0.41   steps: 1853


training loop:   3% |#                                | ETA:  37 days, 15:41:16

Episode: 1854   score: 19.95   Avg score (100e): 19.71   actor gain: -0.39   critic loss: 0.41   steps: 1854


training loop:   3% |#                                | ETA:  37 days, 15:38:51

Episode: 1855   score: 19.96   Avg score (100e): 19.71   actor gain: -0.39   critic loss: 0.41   steps: 1855


training loop:   3% |#                                | ETA:  37 days, 15:35:11

Episode: 1856   score: 19.96   Avg score (100e): 19.72   actor gain: -0.39   critic loss: 0.41   steps: 1856


training loop:   3% |#                                | ETA:  37 days, 15:26:52

Episode: 1857   score: 19.96   Avg score (100e): 19.72   actor gain: -0.39   critic loss: 0.41   steps: 1857


training loop:   3% |#                                | ETA:  37 days, 15:19:04

Episode: 1858   score: 19.97   Avg score (100e): 19.73   actor gain: -0.39   critic loss: 0.41   steps: 1858


training loop:   3% |#                                | ETA:  37 days, 15:09:48

Episode: 1859   score: 19.98   Avg score (100e): 19.73   actor gain: -0.39   critic loss: 0.41   steps: 1859


training loop:   3% |#                                | ETA:  37 days, 15:00:14

Episode: 1860   score: 19.98   Avg score (100e): 19.74   actor gain: -0.37   critic loss: 0.41   steps: 1860


training loop:   3% |#                                | ETA:  37 days, 14:50:51

Episode: 1861   score: 19.98   Avg score (100e): 19.74   actor gain: -0.37   critic loss: 0.41   steps: 1861


training loop:   3% |#                                | ETA:  37 days, 14:40:39

Episode: 1862   score: 19.98   Avg score (100e): 19.75   actor gain: -0.37   critic loss: 0.41   steps: 1862


training loop:   3% |#                                | ETA:  37 days, 14:30:14

Episode: 1863   score: 19.99   Avg score (100e): 19.75   actor gain: -0.37   critic loss: 0.41   steps: 1863


training loop:   3% |#                                | ETA:  37 days, 14:18:51

Episode: 1864   score: 20.00   Avg score (100e): 19.76   actor gain: -0.37   critic loss: 0.41   steps: 1864


training loop:   3% |#                                | ETA:  37 days, 14:06:50

Episode: 1865   score: 20.01   Avg score (100e): 19.76   actor gain: -0.38   critic loss: 0.41   steps: 1865


training loop:   3% |#                                | ETA:  37 days, 13:55:16

Episode: 1866   score: 20.02   Avg score (100e): 19.77   actor gain: -0.38   critic loss: 0.41   steps: 1866


training loop:   3% |#                                | ETA:  37 days, 13:43:28

Episode: 1867   score: 20.03   Avg score (100e): 19.77   actor gain: -0.38   critic loss: 0.41   steps: 1867


training loop:   3% |#                                | ETA:  37 days, 13:29:24

Episode: 1868   score: 20.02   Avg score (100e): 19.78   actor gain: -0.38   critic loss: 0.41   steps: 1868


training loop:   3% |#                                | ETA:  37 days, 13:14:58

Episode: 1869   score: 20.03   Avg score (100e): 19.78   actor gain: -0.38   critic loss: 0.41   steps: 1869


training loop:   3% |#                                | ETA:  37 days, 13:00:26

Episode: 1870   score: 20.03   Avg score (100e): 19.79   actor gain: -0.38   critic loss: 0.41   steps: 1870


training loop:   3% |#                                | ETA:  37 days, 12:46:04

Episode: 1871   score: 20.04   Avg score (100e): 19.79   actor gain: -0.38   critic loss: 0.41   steps: 1871


training loop:   3% |#                                | ETA:  37 days, 12:32:57

Episode: 1872   score: 20.04   Avg score (100e): 19.80   actor gain: -0.38   critic loss: 0.41   steps: 1872


training loop:   3% |#                                | ETA:  37 days, 12:20:47

Episode: 1873   score: 20.05   Avg score (100e): 19.80   actor gain: -0.38   critic loss: 0.41   steps: 1873


training loop:   3% |#                                | ETA:  37 days, 12:07:38

Episode: 1874   score: 20.05   Avg score (100e): 19.81   actor gain: -0.37   critic loss: 0.41   steps: 1874


training loop:   3% |#                                | ETA:  37 days, 11:52:47

Episode: 1875   score: 20.05   Avg score (100e): 19.81   actor gain: -0.40   critic loss: 0.41   steps: 1875


training loop:   3% |#                                | ETA:  37 days, 11:38:40

Episode: 1876   score: 20.05   Avg score (100e): 19.82   actor gain: -0.40   critic loss: 0.41   steps: 1876


training loop:   3% |#                                | ETA:  37 days, 11:23:21

Episode: 1877   score: 20.06   Avg score (100e): 19.82   actor gain: -0.40   critic loss: 0.41   steps: 1877


training loop:   3% |#                                | ETA:  37 days, 11:09:23

Episode: 1878   score: 20.07   Avg score (100e): 19.83   actor gain: -0.40   critic loss: 0.41   steps: 1878


training loop:   3% |#                                | ETA:  37 days, 10:54:42

Episode: 1879   score: 20.07   Avg score (100e): 19.83   actor gain: -0.40   critic loss: 0.41   steps: 1879


training loop:   3% |#                                | ETA:  37 days, 10:40:23

Episode: 1880   score: 20.07   Avg score (100e): 19.84   actor gain: -0.40   critic loss: 0.41   steps: 1880


training loop:   3% |#                                | ETA:  37 days, 10:25:56

Episode: 1881   score: 20.07   Avg score (100e): 19.84   actor gain: -0.40   critic loss: 0.41   steps: 1881


training loop:   3% |#                                | ETA:  37 days, 10:11:34

Episode: 1882   score: 20.08   Avg score (100e): 19.84   actor gain: -0.40   critic loss: 0.41   steps: 1882


training loop:   3% |#                                 | ETA:  37 days, 9:57:09

Episode: 1883   score: 20.09   Avg score (100e): 19.85   actor gain: -0.40   critic loss: 0.41   steps: 1883


training loop:   3% |#                                 | ETA:  37 days, 9:42:56

Episode: 1884   score: 20.10   Avg score (100e): 19.85   actor gain: -0.40   critic loss: 0.41   steps: 1884
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  37 days, 9:33:14

Episode: 1885   score: 20.10   Avg score (100e): 19.86   actor gain: -0.40   critic loss: 0.41   steps: 1885


training loop:   3% |#                                 | ETA:  37 days, 9:22:22

Episode: 1886   score: 20.10   Avg score (100e): 19.86   actor gain: -0.40   critic loss: 0.41   steps: 1886


training loop:   3% |#                                 | ETA:  37 days, 9:09:20

Episode: 1887   score: 20.10   Avg score (100e): 19.87   actor gain: -0.40   critic loss: 0.41   steps: 1887


training loop:   3% |#                                 | ETA:  37 days, 8:59:34

Episode: 1888   score: 20.10   Avg score (100e): 19.87   actor gain: -0.40   critic loss: 0.41   steps: 1888


training loop:   3% |#                                 | ETA:  37 days, 8:47:34

Episode: 1889   score: 20.10   Avg score (100e): 19.88   actor gain: -0.40   critic loss: 0.41   steps: 1889
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  37 days, 8:34:24

Episode: 1890   score: 20.11   Avg score (100e): 19.88   actor gain: -0.39   critic loss: 0.41   steps: 1890


training loop:   3% |#                                 | ETA:  37 days, 8:21:13

Episode: 1891   score: 20.12   Avg score (100e): 19.89   actor gain: -0.39   critic loss: 0.41   steps: 1891


training loop:   3% |#                                 | ETA:  37 days, 8:08:41

Episode: 1892   score: 20.12   Avg score (100e): 19.89   actor gain: -0.39   critic loss: 0.41   steps: 1892


training loop:   3% |#                                 | ETA:  37 days, 7:54:56

Episode: 1893   score: 20.13   Avg score (100e): 19.90   actor gain: -0.39   critic loss: 0.41   steps: 1893


training loop:   3% |#                                 | ETA:  37 days, 7:42:11

Episode: 1894   score: 20.14   Avg score (100e): 19.90   actor gain: -0.39   critic loss: 0.41   steps: 1894


training loop:   3% |#                                 | ETA:  37 days, 7:28:49

Episode: 1895   score: 20.14   Avg score (100e): 19.91   actor gain: -0.39   critic loss: 0.41   steps: 1895


training loop:   3% |#                                 | ETA:  37 days, 7:15:05

Episode: 1896   score: 20.15   Avg score (100e): 19.91   actor gain: -0.39   critic loss: 0.41   steps: 1896


training loop:   3% |#                                 | ETA:  37 days, 7:01:09

Episode: 1897   score: 20.16   Avg score (100e): 19.92   actor gain: -0.39   critic loss: 0.41   steps: 1897
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  37 days, 6:46:44

Episode: 1898   score: 20.16   Avg score (100e): 19.92   actor gain: -0.39   critic loss: 0.41   steps: 1898


training loop:   3% |#                                 | ETA:  37 days, 6:33:10

Episode: 1899   score: 20.16   Avg score (100e): 19.93   actor gain: -0.39   critic loss: 0.41   steps: 1899


training loop:   3% |#                                 | ETA:  37 days, 6:19:47

Episode: 1900   score: 20.17   Avg score (100e): 19.93   actor gain: -0.36   critic loss: 0.41   steps: 1900


training loop:   3% |#                                 | ETA:  37 days, 6:05:39

Episode: 1901   score: 20.17   Avg score (100e): 19.94   actor gain: -0.36   critic loss: 0.41   steps: 1901


training loop:   3% |#                                 | ETA:  37 days, 5:51:05

Episode: 1902   score: 20.17   Avg score (100e): 19.94   actor gain: -0.36   critic loss: 0.41   steps: 1902


training loop:   3% |#                                 | ETA:  37 days, 5:37:09

Episode: 1903   score: 20.18   Avg score (100e): 19.95   actor gain: -0.36   critic loss: 0.41   steps: 1903


training loop:   3% |#                                 | ETA:  37 days, 5:22:47

Episode: 1904   score: 20.19   Avg score (100e): 19.95   actor gain: -0.36   critic loss: 0.41   steps: 1904


training loop:   3% |#                                 | ETA:  37 days, 5:08:12

Episode: 1905   score: 20.20   Avg score (100e): 19.95   actor gain: -0.36   critic loss: 0.41   steps: 1905


training loop:   3% |#                                 | ETA:  37 days, 4:55:33

Episode: 1906   score: 20.20   Avg score (100e): 19.96   actor gain: -0.36   critic loss: 0.41   steps: 1906
np.all(done) is true! miracle!


training loop:   3% |#                                 | ETA:  37 days, 4:41:46

Episode: 1907   score: 20.21   Avg score (100e): 19.96   actor gain: -0.35   critic loss: 0.41   steps: 1907


training loop:   3% |#                                 | ETA:  37 days, 4:28:03

Episode: 1908   score: 20.22   Avg score (100e): 19.97   actor gain: -0.35   critic loss: 0.41   steps: 1908


training loop:   3% |#                                 | ETA:  37 days, 4:13:50

Episode: 1909   score: 20.22   Avg score (100e): 19.97   actor gain: -0.35   critic loss: 0.41   steps: 1909


training loop:   3% |#                                 | ETA:  37 days, 3:59:24

Episode: 1910   score: 20.23   Avg score (100e): 19.98   actor gain: -0.35   critic loss: 0.41   steps: 1910


training loop:   3% |#                                 | ETA:  37 days, 3:45:12

Episode: 1911   score: 20.24   Avg score (100e): 19.98   actor gain: -0.35   critic loss: 0.41   steps: 1911


training loop:   3% |#                                 | ETA:  37 days, 3:34:10

Episode: 1912   score: 20.25   Avg score (100e): 19.99   actor gain: -0.35   critic loss: 0.41   steps: 1912


training loop:   3% |#                                 | ETA:  37 days, 3:20:19

Episode: 1913   score: 20.25   Avg score (100e): 20.00   actor gain: -0.35   critic loss: 0.41   steps: 1913


training loop:   3% |#                                 | ETA:  37 days, 3:06:10

Episode: 1914   score: 20.26   Avg score (100e): 20.00   actor gain: -0.35   critic loss: 0.41   steps: 1914


training loop:   3% |#                                 | ETA:  37 days, 2:52:16

Episode: 1915   score: 20.26   Avg score (100e): 20.01   actor gain: -0.35   critic loss: 0.41   steps: 1915


training loop:   3% |#                                 | ETA:  37 days, 2:40:10

Episode: 1916   score: 20.27   Avg score (100e): 20.01   actor gain: -0.35   critic loss: 0.41   steps: 1916


training loop:   3% |#                                 | ETA:  37 days, 2:32:01

Episode: 1917   score: 20.27   Avg score (100e): 20.02   actor gain: -0.35   critic loss: 0.41   steps: 1917


training loop:   3% |#                                 | ETA:  37 days, 2:19:52

Episode: 1918   score: 20.28   Avg score (100e): 20.02   actor gain: -0.35   critic loss: 0.41   steps: 1918


training loop:   3% |#                                 | ETA:  37 days, 2:07:57

Episode: 1919   score: 20.28   Avg score (100e): 20.03   actor gain: -0.35   critic loss: 0.41   steps: 1919


training loop:   3% |#                                 | ETA:  37 days, 1:55:19

Episode: 1920   score: 20.29   Avg score (100e): 20.03   actor gain: -0.34   critic loss: 0.41   steps: 1920


training loop:   3% |#                                 | ETA:  37 days, 1:41:19

Episode: 1921   score: 20.30   Avg score (100e): 20.04   actor gain: -0.34   critic loss: 0.41   steps: 1921


training loop:   3% |#                                 | ETA:  37 days, 1:28:19

Episode: 1922   score: 20.31   Avg score (100e): 20.04   actor gain: -0.34   critic loss: 0.41   steps: 1922


training loop:   3% |#                                 | ETA:  37 days, 1:15:32

Episode: 1923   score: 20.31   Avg score (100e): 20.05   actor gain: -0.34   critic loss: 0.41   steps: 1923


training loop:   3% |#                                 | ETA:  37 days, 1:02:07

Episode: 1924   score: 20.33   Avg score (100e): 20.05   actor gain: -0.34   critic loss: 0.41   steps: 1924


training loop:   3% |#                                 | ETA:  37 days, 0:48:36

Episode: 1925   score: 20.33   Avg score (100e): 20.06   actor gain: -0.34   critic loss: 0.41   steps: 1925


training loop:   3% |#                                 | ETA:  37 days, 0:34:59

Episode: 1926   score: 20.34   Avg score (100e): 20.06   actor gain: -0.34   critic loss: 0.41   steps: 1926


training loop:   3% |#                                 | ETA:  37 days, 0:21:37

Episode: 1927   score: 20.34   Avg score (100e): 20.07   actor gain: -0.34   critic loss: 0.41   steps: 1927


training loop:   3% |#                                 | ETA:  37 days, 0:09:09

Episode: 1928   score: 20.36   Avg score (100e): 20.07   actor gain: -0.34   critic loss: 0.41   steps: 1928


training loop:   3% |#                                | ETA:  36 days, 23:55:22

Episode: 1929   score: 20.35   Avg score (100e): 20.08   actor gain: -0.34   critic loss: 0.41   steps: 1929


training loop:   3% |#                                | ETA:  36 days, 23:41:19

Episode: 1930   score: 20.36   Avg score (100e): 20.08   actor gain: -0.34   critic loss: 0.41   steps: 1930


training loop:   3% |#                                | ETA:  36 days, 23:27:34

Episode: 1931   score: 20.37   Avg score (100e): 20.09   actor gain: -0.34   critic loss: 0.41   steps: 1931


training loop:   3% |#                                | ETA:  36 days, 23:16:16

Episode: 1932   score: 20.37   Avg score (100e): 20.09   actor gain: -0.34   critic loss: 0.42   steps: 1932


training loop:   3% |#                                | ETA:  36 days, 23:03:56

Episode: 1933   score: 20.38   Avg score (100e): 20.10   actor gain: -0.34   critic loss: 0.42   steps: 1933


training loop:   3% |#                                | ETA:  36 days, 22:50:02

Episode: 1934   score: 20.39   Avg score (100e): 20.10   actor gain: -0.36   critic loss: 0.42   steps: 1934


training loop:   3% |#                                | ETA:  36 days, 22:36:36

Episode: 1935   score: 20.40   Avg score (100e): 20.11   actor gain: -0.36   critic loss: 0.42   steps: 1935


training loop:   3% |#                                | ETA:  36 days, 22:22:44

Episode: 1936   score: 20.40   Avg score (100e): 20.12   actor gain: -0.36   critic loss: 0.42   steps: 1936


training loop:   3% |#                                | ETA:  36 days, 22:09:02

Episode: 1937   score: 20.40   Avg score (100e): 20.12   actor gain: -0.36   critic loss: 0.41   steps: 1937


training loop:   3% |#                                | ETA:  36 days, 21:54:52

Episode: 1938   score: 20.40   Avg score (100e): 20.13   actor gain: -0.36   critic loss: 0.41   steps: 1938


training loop:   3% |#                                | ETA:  36 days, 21:42:03

Episode: 1939   score: 20.40   Avg score (100e): 20.13   actor gain: -0.36   critic loss: 0.41   steps: 1939


training loop:   3% |#                                | ETA:  36 days, 21:28:47

Episode: 1940   score: 20.40   Avg score (100e): 20.14   actor gain: -0.36   critic loss: 0.41   steps: 1940


training loop:   3% |#                                | ETA:  36 days, 21:15:31

Episode: 1941   score: 20.41   Avg score (100e): 20.14   actor gain: -0.36   critic loss: 0.41   steps: 1941


training loop:   3% |#                                | ETA:  36 days, 21:01:45

Episode: 1942   score: 20.42   Avg score (100e): 20.15   actor gain: -0.37   critic loss: 0.41   steps: 1942


training loop:   3% |#                                | ETA:  36 days, 20:48:25

Episode: 1943   score: 20.42   Avg score (100e): 20.15   actor gain: -0.37   critic loss: 0.41   steps: 1943


training loop:   3% |#                                | ETA:  36 days, 20:35:14

Episode: 1944   score: 20.43   Avg score (100e): 20.16   actor gain: -0.37   critic loss: 0.41   steps: 1944


training loop:   3% |#                                | ETA:  36 days, 20:22:38

Episode: 1945   score: 20.43   Avg score (100e): 20.16   actor gain: -0.37   critic loss: 0.41   steps: 1945


training loop:   3% |#                                | ETA:  36 days, 20:08:46

Episode: 1946   score: 20.43   Avg score (100e): 20.17   actor gain: -0.37   critic loss: 0.41   steps: 1946
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  36 days, 19:54:46

Episode: 1947   score: 20.43   Avg score (100e): 20.17   actor gain: -0.37   critic loss: 0.41   steps: 1947


training loop:   3% |#                                | ETA:  36 days, 19:41:06

Episode: 1948   score: 20.44   Avg score (100e): 20.18   actor gain: -0.45   critic loss: 0.41   steps: 1948


training loop:   3% |#                                | ETA:  36 days, 19:27:38

Episode: 1949   score: 20.45   Avg score (100e): 20.18   actor gain: -0.51   critic loss: 0.41   steps: 1949


training loop:   3% |#                                | ETA:  36 days, 19:16:55

Episode: 1950   score: 20.45   Avg score (100e): 20.19   actor gain: -0.51   critic loss: 0.41   steps: 1950


training loop:   3% |#                                | ETA:  36 days, 19:05:37

Episode: 1951   score: 20.46   Avg score (100e): 20.19   actor gain: -0.51   critic loss: 0.41   steps: 1951


training loop:   3% |#                                | ETA:  36 days, 18:52:08

Episode: 1952   score: 20.46   Avg score (100e): 20.20   actor gain: -0.51   critic loss: 0.41   steps: 1952


training loop:   3% |#                                | ETA:  36 days, 18:38:46

Episode: 1953   score: 20.47   Avg score (100e): 20.20   actor gain: -0.51   critic loss: 0.41   steps: 1953


training loop:   3% |#                                | ETA:  36 days, 18:25:27

Episode: 1954   score: 20.48   Avg score (100e): 20.21   actor gain: -0.51   critic loss: 0.41   steps: 1954


training loop:   3% |#                                | ETA:  36 days, 18:12:34

Episode: 1955   score: 20.48   Avg score (100e): 20.21   actor gain: -0.51   critic loss: 0.41   steps: 1955


training loop:   3% |#                                | ETA:  36 days, 18:00:23

Episode: 1956   score: 20.49   Avg score (100e): 20.22   actor gain: -0.51   critic loss: 0.40   steps: 1956


training loop:   3% |#                                | ETA:  36 days, 17:47:06

Episode: 1957   score: 20.49   Avg score (100e): 20.22   actor gain: -0.51   critic loss: 0.40   steps: 1957


training loop:   3% |#                                | ETA:  36 days, 17:33:26

Episode: 1958   score: 20.50   Avg score (100e): 20.23   actor gain: -0.51   critic loss: 0.40   steps: 1958


training loop:   3% |#                                | ETA:  36 days, 17:19:24

Episode: 1959   score: 20.50   Avg score (100e): 20.24   actor gain: -0.49   critic loss: 0.40   steps: 1959


training loop:   3% |#                                | ETA:  36 days, 17:08:45

Episode: 1960   score: 20.51   Avg score (100e): 20.24   actor gain: -0.49   critic loss: 0.40   steps: 1960


training loop:   3% |#                                | ETA:  36 days, 16:56:23

Episode: 1961   score: 20.52   Avg score (100e): 20.25   actor gain: -0.49   critic loss: 0.40   steps: 1961


training loop:   3% |#                                | ETA:  36 days, 16:44:53

Episode: 1962   score: 20.52   Avg score (100e): 20.25   actor gain: -0.49   critic loss: 0.40   steps: 1962


training loop:   3% |#                                | ETA:  36 days, 16:32:43

Episode: 1963   score: 20.53   Avg score (100e): 20.26   actor gain: -0.49   critic loss: 0.40   steps: 1963


training loop:   3% |#                                | ETA:  36 days, 16:21:51

Episode: 1964   score: 20.54   Avg score (100e): 20.26   actor gain: -0.49   critic loss: 0.40   steps: 1964


training loop:   3% |#                                | ETA:  36 days, 16:09:09

Episode: 1965   score: 20.54   Avg score (100e): 20.27   actor gain: -0.48   critic loss: 0.40   steps: 1965


training loop:   3% |#                                | ETA:  36 days, 15:55:44

Episode: 1966   score: 20.55   Avg score (100e): 20.27   actor gain: -0.48   critic loss: 0.41   steps: 1966
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  36 days, 15:44:00

Episode: 1967   score: 20.55   Avg score (100e): 20.28   actor gain: -0.48   critic loss: 0.41   steps: 1967


training loop:   3% |#                                | ETA:  36 days, 15:33:32

Episode: 1968   score: 20.55   Avg score (100e): 20.28   actor gain: -0.48   critic loss: 0.41   steps: 1968


training loop:   3% |#                                | ETA:  36 days, 15:21:57

Episode: 1969   score: 20.55   Avg score (100e): 20.29   actor gain: -0.48   critic loss: 0.41   steps: 1969


training loop:   3% |#                                | ETA:  36 days, 15:08:57

Episode: 1970   score: 20.55   Avg score (100e): 20.29   actor gain: -0.48   critic loss: 0.41   steps: 1970


training loop:   3% |#                                | ETA:  36 days, 14:57:06

Episode: 1971   score: 20.56   Avg score (100e): 20.30   actor gain: -0.48   critic loss: 0.41   steps: 1971


training loop:   3% |#                                | ETA:  36 days, 14:44:48

Episode: 1972   score: 20.56   Avg score (100e): 20.30   actor gain: -0.48   critic loss: 0.41   steps: 1972


training loop:   3% |#                                | ETA:  36 days, 14:33:05

Episode: 1973   score: 20.56   Avg score (100e): 20.31   actor gain: -0.40   critic loss: 0.41   steps: 1973


training loop:   3% |#                                | ETA:  36 days, 14:19:46

Episode: 1974   score: 20.57   Avg score (100e): 20.31   actor gain: -0.34   critic loss: 0.41   steps: 1974


training loop:   3% |#                                | ETA:  36 days, 14:08:33

Episode: 1975   score: 20.56   Avg score (100e): 20.32   actor gain: -0.34   critic loss: 0.41   steps: 1975


training loop:   3% |#                                | ETA:  36 days, 13:58:47

Episode: 1976   score: 20.56   Avg score (100e): 20.32   actor gain: -0.34   critic loss: 0.41   steps: 1976


training loop:   3% |#                                | ETA:  36 days, 13:47:54

Episode: 1977   score: 20.57   Avg score (100e): 20.33   actor gain: -0.34   critic loss: 0.41   steps: 1977


training loop:   3% |#                                | ETA:  36 days, 13:36:22

Episode: 1978   score: 20.58   Avg score (100e): 20.33   actor gain: -0.34   critic loss: 0.41   steps: 1978


training loop:   3% |#                                | ETA:  36 days, 13:24:39

Episode: 1979   score: 20.58   Avg score (100e): 20.34   actor gain: -0.34   critic loss: 0.41   steps: 1979


training loop:   3% |#                                | ETA:  36 days, 13:13:41

Episode: 1980   score: 20.59   Avg score (100e): 20.35   actor gain: -0.34   critic loss: 0.41   steps: 1980


training loop:   3% |#                                | ETA:  36 days, 13:02:04

Episode: 1981   score: 20.59   Avg score (100e): 20.35   actor gain: -0.34   critic loss: 0.41   steps: 1981


training loop:   3% |#                                | ETA:  36 days, 12:52:38

Episode: 1982   score: 20.58   Avg score (100e): 20.36   actor gain: -0.34   critic loss: 0.41   steps: 1982


training loop:   3% |#                                | ETA:  36 days, 12:44:12

Episode: 1983   score: 20.58   Avg score (100e): 20.36   actor gain: -0.34   critic loss: 0.40   steps: 1983


training loop:   3% |#                                | ETA:  36 days, 12:32:45

Episode: 1984   score: 20.58   Avg score (100e): 20.36   actor gain: -0.34   critic loss: 0.40   steps: 1984


training loop:   3% |#                                | ETA:  36 days, 12:21:34

Episode: 1985   score: 20.59   Avg score (100e): 20.37   actor gain: -0.34   critic loss: 0.40   steps: 1985


training loop:   3% |#                                | ETA:  36 days, 12:11:15

Episode: 1986   score: 20.60   Avg score (100e): 20.37   actor gain: -0.34   critic loss: 0.40   steps: 1986


training loop:   3% |#                                | ETA:  36 days, 12:00:26

Episode: 1987   score: 20.60   Avg score (100e): 20.38   actor gain: -0.34   critic loss: 0.40   steps: 1987
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  36 days, 11:50:18

Episode: 1988   score: 20.61   Avg score (100e): 20.38   actor gain: -0.34   critic loss: 0.40   steps: 1988


training loop:   3% |#                                | ETA:  36 days, 11:40:33

Episode: 1989   score: 20.61   Avg score (100e): 20.39   actor gain: -0.34   critic loss: 0.40   steps: 1989


training loop:   3% |#                                | ETA:  36 days, 11:28:49

Episode: 1990   score: 20.62   Avg score (100e): 20.40   actor gain: -0.34   critic loss: 0.40   steps: 1990
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  36 days, 11:16:05

Episode: 1991   score: 20.63   Avg score (100e): 20.40   actor gain: -0.34   critic loss: 0.40   steps: 1991


training loop:   3% |#                                | ETA:  36 days, 11:03:42

Episode: 1992   score: 20.63   Avg score (100e): 20.41   actor gain: -0.34   critic loss: 0.40   steps: 1992


training loop:   3% |#                                | ETA:  36 days, 10:52:40

Episode: 1993   score: 20.63   Avg score (100e): 20.41   actor gain: -0.34   critic loss: 0.40   steps: 1993


training loop:   3% |#                                | ETA:  36 days, 10:39:55

Episode: 1994   score: 20.63   Avg score (100e): 20.42   actor gain: -0.34   critic loss: 0.40   steps: 1994
np.all(done) is true! miracle!


training loop:   3% |#                                | ETA:  36 days, 10:26:46

Episode: 1995   score: 20.64   Avg score (100e): 20.42   actor gain: -0.34   critic loss: 0.40   steps: 1995


training loop:   3% |#                                | ETA:  36 days, 10:15:35

Episode: 1996   score: 20.65   Avg score (100e): 20.43   actor gain: -0.36   critic loss: 0.40   steps: 1996


training loop:   3% |#                                | ETA:  36 days, 10:05:54

Episode: 1997   score: 20.65   Avg score (100e): 20.43   actor gain: -0.36   critic loss: 0.40   steps: 1997


training loop:   3% |#                                 | ETA:  36 days, 9:56:01

Episode: 1998   score: 20.66   Avg score (100e): 20.44   actor gain: -0.36   critic loss: 0.40   steps: 1998


training loop:   3% |#                                 | ETA:  36 days, 9:44:11

Episode: 1999   score: 20.66   Avg score (100e): 20.44   actor gain: -0.36   critic loss: 0.40   steps: 1999


training loop:   3% |#                                 | ETA:  36 days, 9:31:36

Episode: 2000   score: 20.67   Avg score (100e): 20.45   actor gain: -0.36   critic loss: 0.40   steps: 2000


training loop:   4% |#                                 | ETA:  36 days, 9:18:56

Episode: 2001   score: 20.67   Avg score (100e): 20.45   actor gain: -0.36   critic loss: 0.40   steps: 2001


training loop:   4% |#                                 | ETA:  36 days, 9:06:37

Episode: 2002   score: 20.68   Avg score (100e): 20.46   actor gain: -0.36   critic loss: 0.40   steps: 2002


training loop:   4% |#                                 | ETA:  36 days, 8:53:58

Episode: 2003   score: 20.69   Avg score (100e): 20.46   actor gain: -0.36   critic loss: 0.40   steps: 2003


training loop:   4% |#                                 | ETA:  36 days, 8:43:51

Episode: 2004   score: 20.69   Avg score (100e): 20.47   actor gain: -0.36   critic loss: 0.40   steps: 2004


training loop:   4% |#                                 | ETA:  36 days, 8:32:47

Episode: 2005   score: 20.70   Avg score (100e): 20.47   actor gain: -0.36   critic loss: 0.40   steps: 2005


training loop:   4% |#                                 | ETA:  36 days, 8:19:59

Episode: 2006   score: 20.70   Avg score (100e): 20.48   actor gain: -0.36   critic loss: 0.40   steps: 2006


training loop:   4% |#                                 | ETA:  36 days, 8:07:15

Episode: 2007   score: 20.71   Avg score (100e): 20.48   actor gain: -0.36   critic loss: 0.41   steps: 2007


training loop:   4% |#                                 | ETA:  36 days, 7:54:14

Episode: 2008   score: 20.72   Avg score (100e): 20.49   actor gain: -0.36   critic loss: 0.41   steps: 2008


training loop:   4% |#                                 | ETA:  36 days, 7:41:47

Episode: 2009   score: 20.72   Avg score (100e): 20.49   actor gain: -0.37   critic loss: 0.41   steps: 2009


training loop:   4% |#                                 | ETA:  36 days, 7:28:57

Episode: 2010   score: 20.72   Avg score (100e): 20.50   actor gain: -0.37   critic loss: 0.41   steps: 2010


training loop:   4% |#                                 | ETA:  36 days, 7:16:08

Episode: 2011   score: 20.72   Avg score (100e): 20.50   actor gain: -0.37   critic loss: 0.41   steps: 2011


training loop:   4% |#                                 | ETA:  36 days, 7:02:50

Episode: 2012   score: 20.72   Avg score (100e): 20.50   actor gain: -0.37   critic loss: 0.41   steps: 2012


training loop:   4% |#                                 | ETA:  36 days, 6:51:18

Episode: 2013   score: 20.72   Avg score (100e): 20.51   actor gain: -0.37   critic loss: 0.41   steps: 2013
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 6:40:14

Episode: 2014   score: 20.73   Avg score (100e): 20.51   actor gain: -0.37   critic loss: 0.41   steps: 2014


training loop:   4% |#                                 | ETA:  36 days, 6:29:25

Episode: 2015   score: 20.73   Avg score (100e): 20.52   actor gain: -0.37   critic loss: 0.41   steps: 2015


training loop:   4% |#                                 | ETA:  36 days, 6:17:10

Episode: 2016   score: 20.74   Avg score (100e): 20.52   actor gain: -0.37   critic loss: 0.41   steps: 2016


training loop:   4% |#                                 | ETA:  36 days, 6:04:32

Episode: 2017   score: 20.74   Avg score (100e): 20.53   actor gain: -0.37   critic loss: 0.41   steps: 2017
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 5:51:08

Episode: 2018   score: 20.76   Avg score (100e): 20.53   actor gain: -0.37   critic loss: 0.41   steps: 2018


training loop:   4% |#                                 | ETA:  36 days, 5:40:28

Episode: 2019   score: 20.76   Avg score (100e): 20.54   actor gain: -0.37   critic loss: 0.41   steps: 2019


training loop:   4% |#                                 | ETA:  36 days, 5:30:54

Episode: 2020   score: 20.76   Avg score (100e): 20.54   actor gain: -0.37   critic loss: 0.41   steps: 2020


training loop:   4% |#                                 | ETA:  36 days, 5:21:22

Episode: 2021   score: 20.77   Avg score (100e): 20.55   actor gain: -0.36   critic loss: 0.41   steps: 2021


training loop:   4% |#                                 | ETA:  36 days, 5:11:41

Episode: 2022   score: 20.77   Avg score (100e): 20.55   actor gain: -0.36   critic loss: 0.41   steps: 2022


training loop:   4% |#                                 | ETA:  36 days, 5:01:12

Episode: 2023   score: 20.77   Avg score (100e): 20.56   actor gain: -0.36   critic loss: 0.41   steps: 2023
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 4:49:57

Episode: 2024   score: 20.77   Avg score (100e): 20.56   actor gain: -0.36   critic loss: 0.41   steps: 2024


training loop:   4% |#                                 | ETA:  36 days, 4:41:31

Episode: 2025   score: 20.77   Avg score (100e): 20.57   actor gain: -0.36   critic loss: 0.41   steps: 2025


training loop:   4% |#                                 | ETA:  36 days, 4:31:36

Episode: 2026   score: 20.78   Avg score (100e): 20.57   actor gain: -0.36   critic loss: 0.41   steps: 2026


training loop:   4% |#                                 | ETA:  36 days, 4:21:27

Episode: 2027   score: 20.78   Avg score (100e): 20.57   actor gain: -0.36   critic loss: 0.41   steps: 2027


training loop:   4% |#                                 | ETA:  36 days, 4:11:30

Episode: 2028   score: 20.78   Avg score (100e): 20.58   actor gain: -0.36   critic loss: 0.41   steps: 2028


training loop:   4% |#                                 | ETA:  36 days, 4:00:41

Episode: 2029   score: 20.79   Avg score (100e): 20.58   actor gain: -0.36   critic loss: 0.41   steps: 2029


training loop:   4% |#                                 | ETA:  36 days, 3:50:19

Episode: 2030   score: 20.80   Avg score (100e): 20.59   actor gain: -0.36   critic loss: 0.41   steps: 2030
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 3:39:13

Episode: 2031   score: 20.80   Avg score (100e): 20.59   actor gain: -0.36   critic loss: 0.41   steps: 2031


training loop:   4% |#                                 | ETA:  36 days, 3:28:35

Episode: 2032   score: 20.81   Avg score (100e): 20.60   actor gain: -0.36   critic loss: 0.41   steps: 2032
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 3:17:46

Episode: 2033   score: 20.81   Avg score (100e): 20.60   actor gain: -0.36   critic loss: 0.41   steps: 2033


training loop:   4% |#                                 | ETA:  36 days, 3:07:02

Episode: 2034   score: 20.82   Avg score (100e): 20.60   actor gain: -0.36   critic loss: 0.41   steps: 2034
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 2:57:48

Episode: 2035   score: 20.82   Avg score (100e): 20.61   actor gain: -0.35   critic loss: 0.41   steps: 2035


training loop:   4% |#                                 | ETA:  36 days, 2:46:58

Episode: 2036   score: 20.83   Avg score (100e): 20.61   actor gain: -0.35   critic loss: 0.41   steps: 2036


training loop:   4% |#                                 | ETA:  36 days, 2:37:14

Episode: 2037   score: 20.84   Avg score (100e): 20.62   actor gain: -0.35   critic loss: 0.41   steps: 2037


training loop:   4% |#                                 | ETA:  36 days, 2:27:59

Episode: 2038   score: 20.84   Avg score (100e): 20.62   actor gain: -0.35   critic loss: 0.41   steps: 2038


training loop:   4% |#                                 | ETA:  36 days, 2:19:16

Episode: 2039   score: 20.84   Avg score (100e): 20.63   actor gain: -0.35   critic loss: 0.41   steps: 2039


training loop:   4% |#                                 | ETA:  36 days, 2:09:37

Episode: 2040   score: 20.85   Avg score (100e): 20.63   actor gain: -0.35   critic loss: 0.41   steps: 2040


training loop:   4% |#                                 | ETA:  36 days, 1:59:32

Episode: 2041   score: 20.86   Avg score (100e): 20.64   actor gain: -0.35   critic loss: 0.41   steps: 2041


training loop:   4% |#                                 | ETA:  36 days, 1:50:26

Episode: 2042   score: 20.86   Avg score (100e): 20.64   actor gain: -0.35   critic loss: 0.41   steps: 2042


training loop:   4% |#                                 | ETA:  36 days, 1:39:55

Episode: 2043   score: 20.87   Avg score (100e): 20.64   actor gain: -0.35   critic loss: 0.41   steps: 2043


training loop:   4% |#                                 | ETA:  36 days, 1:28:44

Episode: 2044   score: 20.87   Avg score (100e): 20.65   actor gain: -0.35   critic loss: 0.41   steps: 2044


training loop:   4% |#                                 | ETA:  36 days, 1:19:45

Episode: 2045   score: 20.88   Avg score (100e): 20.65   actor gain: -0.35   critic loss: 0.41   steps: 2045


training loop:   4% |#                                 | ETA:  36 days, 1:08:53

Episode: 2046   score: 20.89   Avg score (100e): 20.66   actor gain: -0.34   critic loss: 0.41   steps: 2046


training loop:   4% |#                                 | ETA:  36 days, 1:00:13

Episode: 2047   score: 20.90   Avg score (100e): 20.66   actor gain: -0.35   critic loss: 0.41   steps: 2047


training loop:   4% |#                                 | ETA:  36 days, 0:48:33

Episode: 2048   score: 20.90   Avg score (100e): 20.67   actor gain: -0.35   critic loss: 0.41   steps: 2048


training loop:   4% |#                                 | ETA:  36 days, 0:38:07

Episode: 2049   score: 20.90   Avg score (100e): 20.67   actor gain: -0.35   critic loss: 0.41   steps: 2049
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  36 days, 0:27:57

Episode: 2050   score: 20.90   Avg score (100e): 20.68   actor gain: -0.35   critic loss: 0.41   steps: 2050


training loop:   4% |#                                 | ETA:  36 days, 0:17:17

Episode: 2051   score: 20.90   Avg score (100e): 20.68   actor gain: -0.34   critic loss: 0.41   steps: 2051


training loop:   4% |#                                 | ETA:  36 days, 0:07:26

Episode: 2052   score: 20.91   Avg score (100e): 20.68   actor gain: -0.35   critic loss: 0.41   steps: 2052


training loop:   4% |#                                | ETA:  35 days, 23:57:22

Episode: 2053   score: 20.91   Avg score (100e): 20.69   actor gain: -0.34   critic loss: 0.41   steps: 2053


training loop:   4% |#                                | ETA:  35 days, 23:48:03

Episode: 2054   score: 20.91   Avg score (100e): 20.69   actor gain: -0.34   critic loss: 0.41   steps: 2054
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 23:37:20

Episode: 2055   score: 20.91   Avg score (100e): 20.70   actor gain: -0.34   critic loss: 0.41   steps: 2055


training loop:   4% |#                                | ETA:  35 days, 23:26:05

Episode: 2056   score: 20.92   Avg score (100e): 20.70   actor gain: -0.34   critic loss: 0.41   steps: 2056
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 23:13:32

Episode: 2057   score: 20.93   Avg score (100e): 20.71   actor gain: -0.34   critic loss: 0.41   steps: 2057


training loop:   4% |#                                | ETA:  35 days, 23:02:37

Episode: 2058   score: 20.93   Avg score (100e): 20.71   actor gain: -0.34   critic loss: 0.41   steps: 2058
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 22:51:20

Episode: 2059   score: 20.94   Avg score (100e): 20.72   actor gain: -0.34   critic loss: 0.41   steps: 2059
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 22:39:09

Episode: 2060   score: 20.95   Avg score (100e): 20.72   actor gain: -0.34   critic loss: 0.41   steps: 2060


training loop:   4% |#                                | ETA:  35 days, 22:28:31

Episode: 2061   score: 20.95   Avg score (100e): 20.72   actor gain: -0.34   critic loss: 0.41   steps: 2061


training loop:   4% |#                                | ETA:  35 days, 22:17:08

Episode: 2062   score: 20.96   Avg score (100e): 20.73   actor gain: -0.34   critic loss: 0.41   steps: 2062


training loop:   4% |#                                | ETA:  35 days, 22:05:36

Episode: 2063   score: 20.95   Avg score (100e): 20.73   actor gain: -0.34   critic loss: 0.41   steps: 2063


training loop:   4% |#                                | ETA:  35 days, 21:54:12

Episode: 2064   score: 20.96   Avg score (100e): 20.74   actor gain: -0.34   critic loss: 0.41   steps: 2064
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 21:43:39

Episode: 2065   score: 20.97   Avg score (100e): 20.74   actor gain: -0.34   critic loss: 0.41   steps: 2065


training loop:   4% |#                                | ETA:  35 days, 21:33:05

Episode: 2066   score: 20.97   Avg score (100e): 20.75   actor gain: -0.34   critic loss: 0.41   steps: 2066


training loop:   4% |#                                | ETA:  35 days, 21:21:37

Episode: 2067   score: 20.98   Avg score (100e): 20.75   actor gain: -0.34   critic loss: 0.41   steps: 2067


training loop:   4% |#                                | ETA:  35 days, 21:09:27

Episode: 2068   score: 20.98   Avg score (100e): 20.75   actor gain: -0.34   critic loss: 0.41   steps: 2068


training loop:   4% |#                                | ETA:  35 days, 20:57:36

Episode: 2069   score: 20.98   Avg score (100e): 20.76   actor gain: -0.34   critic loss: 0.41   steps: 2069


training loop:   4% |#                                | ETA:  35 days, 20:47:10

Episode: 2070   score: 20.98   Avg score (100e): 20.76   actor gain: -0.34   critic loss: 0.41   steps: 2070


training loop:   4% |#                                | ETA:  35 days, 20:36:26

Episode: 2071   score: 20.99   Avg score (100e): 20.77   actor gain: -0.34   critic loss: 0.41   steps: 2071


training loop:   4% |#                                | ETA:  35 days, 20:26:01

Episode: 2072   score: 20.99   Avg score (100e): 20.77   actor gain: -0.34   critic loss: 0.41   steps: 2072


training loop:   4% |#                                | ETA:  35 days, 20:14:08

Episode: 2073   score: 20.99   Avg score (100e): 20.78   actor gain: -0.33   critic loss: 0.41   steps: 2073


training loop:   4% |#                                | ETA:  35 days, 20:01:59

Episode: 2074   score: 20.99   Avg score (100e): 20.78   actor gain: -0.33   critic loss: 0.41   steps: 2074


training loop:   4% |#                                | ETA:  35 days, 19:49:58

Episode: 2075   score: 20.99   Avg score (100e): 20.78   actor gain: -0.34   critic loss: 0.41   steps: 2075
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 19:37:57

Episode: 2076   score: 20.99   Avg score (100e): 20.79   actor gain: -0.34   critic loss: 0.41   steps: 2076


training loop:   4% |#                                | ETA:  35 days, 19:26:47

Episode: 2077   score: 20.99   Avg score (100e): 20.79   actor gain: -0.34   critic loss: 0.41   steps: 2077


training loop:   4% |#                                | ETA:  35 days, 19:14:49

Episode: 2078   score: 21.00   Avg score (100e): 20.80   actor gain: -0.34   critic loss: 0.41   steps: 2078


training loop:   4% |#                                | ETA:  35 days, 19:05:38

Episode: 2079   score: 21.00   Avg score (100e): 20.80   actor gain: -0.34   critic loss: 0.41   steps: 2079
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 18:55:02

Episode: 2080   score: 21.01   Avg score (100e): 20.80   actor gain: -0.34   critic loss: 0.41   steps: 2080


training loop:   4% |#                                | ETA:  35 days, 18:43:58

Episode: 2081   score: 21.01   Avg score (100e): 20.81   actor gain: -0.34   critic loss: 0.40   steps: 2081


training loop:   4% |#                                | ETA:  35 days, 18:31:46

Episode: 2082   score: 21.01   Avg score (100e): 20.81   actor gain: -0.34   critic loss: 0.40   steps: 2082


training loop:   4% |#                                | ETA:  35 days, 18:21:31

Episode: 2083   score: 21.02   Avg score (100e): 20.82   actor gain: -0.33   critic loss: 0.40   steps: 2083


training loop:   4% |#                                | ETA:  35 days, 18:09:23

Episode: 2084   score: 21.02   Avg score (100e): 20.82   actor gain: -0.33   critic loss: 0.40   steps: 2084


training loop:   4% |#                                | ETA:  35 days, 17:57:47

Episode: 2085   score: 21.02   Avg score (100e): 20.83   actor gain: -0.33   critic loss: 0.40   steps: 2085


training loop:   4% |#                                | ETA:  35 days, 17:46:54

Episode: 2086   score: 21.03   Avg score (100e): 20.83   actor gain: -0.33   critic loss: 0.40   steps: 2086


training loop:   4% |#                                | ETA:  35 days, 17:37:14

Episode: 2087   score: 21.04   Avg score (100e): 20.83   actor gain: -0.33   critic loss: 0.40   steps: 2087
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  35 days, 17:27:20

Episode: 2088   score: 21.04   Avg score (100e): 20.84   actor gain: -0.33   critic loss: 0.40   steps: 2088


training loop:   4% |#                                | ETA:  35 days, 17:16:00

Episode: 2089   score: 21.05   Avg score (100e): 20.84   actor gain: -0.33   critic loss: 0.40   steps: 2089


training loop:   4% |#                                | ETA:  35 days, 17:04:08

Episode: 2090   score: 21.05   Avg score (100e): 20.85   actor gain: -0.33   critic loss: 0.40   steps: 2090


training loop:   4% |#                                | ETA:  35 days, 16:52:29

Episode: 2091   score: 21.05   Avg score (100e): 20.85   actor gain: -0.33   critic loss: 0.40   steps: 2091


training loop:   4% |#                                | ETA:  35 days, 16:42:01

Episode: 2092   score: 21.06   Avg score (100e): 20.86   actor gain: -0.33   critic loss: 0.40   steps: 2092


training loop:   4% |#                                | ETA:  35 days, 16:30:09

Episode: 2093   score: 21.07   Avg score (100e): 20.86   actor gain: -0.33   critic loss: 0.40   steps: 2093


training loop:   4% |#                                | ETA:  35 days, 16:20:06

Episode: 2094   score: 21.07   Avg score (100e): 20.87   actor gain: -0.33   critic loss: 0.40   steps: 2094


training loop:   4% |#                                | ETA:  35 days, 16:09:28

Episode: 2095   score: 21.07   Avg score (100e): 20.87   actor gain: -0.34   critic loss: 0.40   steps: 2095


training loop:   4% |#                                | ETA:  35 days, 15:59:00

Episode: 2096   score: 21.08   Avg score (100e): 20.87   actor gain: -0.34   critic loss: 0.40   steps: 2096


training loop:   4% |#                                | ETA:  35 days, 15:47:01

Episode: 2097   score: 21.08   Avg score (100e): 20.88   actor gain: -0.34   critic loss: 0.40   steps: 2097


training loop:   4% |#                                | ETA:  35 days, 15:35:16

Episode: 2098   score: 21.09   Avg score (100e): 20.88   actor gain: -0.34   critic loss: 0.40   steps: 2098


training loop:   4% |#                                | ETA:  35 days, 15:24:13

Episode: 2099   score: 21.10   Avg score (100e): 20.89   actor gain: -0.34   critic loss: 0.40   steps: 2099


training loop:   4% |#                                | ETA:  35 days, 15:12:41

Episode: 2100   score: 21.10   Avg score (100e): 20.89   actor gain: -0.34   critic loss: 0.40   steps: 2100


training loop:   4% |#                                | ETA:  35 days, 15:00:28

Episode: 2101   score: 21.11   Avg score (100e): 20.90   actor gain: -0.34   critic loss: 0.40   steps: 2101


training loop:   4% |#                                | ETA:  35 days, 14:48:31

Episode: 2102   score: 21.11   Avg score (100e): 20.90   actor gain: -0.33   critic loss: 0.40   steps: 2102


training loop:   4% |#                                | ETA:  35 days, 14:37:50

Episode: 2103   score: 21.12   Avg score (100e): 20.90   actor gain: -0.33   critic loss: 0.41   steps: 2103


training loop:   4% |#                                | ETA:  35 days, 14:26:10

Episode: 2104   score: 21.13   Avg score (100e): 20.91   actor gain: -0.33   critic loss: 0.41   steps: 2104


training loop:   4% |#                                | ETA:  35 days, 14:15:44

Episode: 2105   score: 21.13   Avg score (100e): 20.91   actor gain: -0.33   critic loss: 0.41   steps: 2105


training loop:   4% |#                                | ETA:  35 days, 14:03:43

Episode: 2106   score: 21.14   Avg score (100e): 20.92   actor gain: -0.33   critic loss: 0.41   steps: 2106


training loop:   4% |#                                | ETA:  35 days, 13:51:59

Episode: 2107   score: 21.14   Avg score (100e): 20.92   actor gain: -0.33   critic loss: 0.41   steps: 2107


training loop:   4% |#                                | ETA:  35 days, 13:40:02

Episode: 2108   score: 21.15   Avg score (100e): 20.93   actor gain: -0.33   critic loss: 0.41   steps: 2108


training loop:   4% |#                                | ETA:  35 days, 13:28:01

Episode: 2109   score: 21.15   Avg score (100e): 20.93   actor gain: -0.33   critic loss: 0.41   steps: 2109


training loop:   4% |#                                | ETA:  35 days, 13:16:43

Episode: 2110   score: 21.16   Avg score (100e): 20.93   actor gain: -0.33   critic loss: 0.41   steps: 2110


training loop:   4% |#                                | ETA:  35 days, 13:07:08

Episode: 2111   score: 21.16   Avg score (100e): 20.94   actor gain: -0.33   critic loss: 0.41   steps: 2111


training loop:   4% |#                                | ETA:  35 days, 12:58:08

Episode: 2112   score: 21.17   Avg score (100e): 20.94   actor gain: -0.33   critic loss: 0.41   steps: 2112


training loop:   4% |#                                | ETA:  35 days, 12:47:01

Episode: 2113   score: 21.18   Avg score (100e): 20.95   actor gain: -0.33   critic loss: 0.41   steps: 2113


training loop:   4% |#                                | ETA:  35 days, 12:35:36

Episode: 2114   score: 21.18   Avg score (100e): 20.95   actor gain: -0.33   critic loss: 0.41   steps: 2114


training loop:   4% |#                                | ETA:  35 days, 12:23:54

Episode: 2115   score: 21.18   Avg score (100e): 20.96   actor gain: -0.33   critic loss: 0.41   steps: 2115


training loop:   4% |#                                | ETA:  35 days, 12:13:00

Episode: 2116   score: 21.18   Avg score (100e): 20.96   actor gain: -0.33   critic loss: 0.41   steps: 2116


training loop:   4% |#                                | ETA:  35 days, 12:01:29

Episode: 2117   score: 21.19   Avg score (100e): 20.97   actor gain: -0.33   critic loss: 0.41   steps: 2117


training loop:   4% |#                                | ETA:  35 days, 11:50:17

Episode: 2118   score: 21.20   Avg score (100e): 20.97   actor gain: -0.33   critic loss: 0.41   steps: 2118


training loop:   4% |#                                | ETA:  35 days, 11:39:45

Episode: 2119   score: 21.20   Avg score (100e): 20.97   actor gain: -0.33   critic loss: 0.41   steps: 2119


training loop:   4% |#                                | ETA:  35 days, 11:28:17

Episode: 2120   score: 21.20   Avg score (100e): 20.98   actor gain: -0.33   critic loss: 0.41   steps: 2120


training loop:   4% |#                                | ETA:  35 days, 11:16:12

Episode: 2121   score: 21.21   Avg score (100e): 20.98   actor gain: -0.33   critic loss: 0.41   steps: 2121


training loop:   4% |#                                | ETA:  35 days, 11:06:27

Episode: 2122   score: 21.22   Avg score (100e): 20.99   actor gain: -0.33   critic loss: 0.41   steps: 2122


training loop:   4% |#                                | ETA:  35 days, 10:54:52

Episode: 2123   score: 21.22   Avg score (100e): 20.99   actor gain: -0.33   critic loss: 0.41   steps: 2123


training loop:   4% |#                                | ETA:  35 days, 10:43:16

Episode: 2124   score: 21.22   Avg score (100e): 21.00   actor gain: -0.33   critic loss: 0.41   steps: 2124


training loop:   4% |#                                | ETA:  35 days, 10:31:38

Episode: 2125   score: 21.22   Avg score (100e): 21.00   actor gain: -0.33   critic loss: 0.41   steps: 2125


training loop:   4% |#                                | ETA:  35 days, 10:19:39

Episode: 2126   score: 21.22   Avg score (100e): 21.01   actor gain: -0.33   critic loss: 0.41   steps: 2126


training loop:   4% |#                                | ETA:  35 days, 10:08:48

Episode: 2127   score: 21.22   Avg score (100e): 21.01   actor gain: -0.33   critic loss: 0.41   steps: 2127


training loop:   4% |#                                 | ETA:  35 days, 9:59:15

Episode: 2128   score: 21.23   Avg score (100e): 21.01   actor gain: -0.33   critic loss: 0.41   steps: 2128


training loop:   4% |#                                 | ETA:  35 days, 9:47:34

Episode: 2129   score: 21.23   Avg score (100e): 21.02   actor gain: -0.33   critic loss: 0.41   steps: 2129


training loop:   4% |#                                 | ETA:  35 days, 9:36:15

Episode: 2130   score: 21.24   Avg score (100e): 21.02   actor gain: -0.33   critic loss: 0.41   steps: 2130


training loop:   4% |#                                 | ETA:  35 days, 9:24:27

Episode: 2131   score: 21.25   Avg score (100e): 21.03   actor gain: -0.33   critic loss: 0.41   steps: 2131


training loop:   4% |#                                 | ETA:  35 days, 9:12:45

Episode: 2132   score: 21.25   Avg score (100e): 21.03   actor gain: -0.33   critic loss: 0.41   steps: 2132


training loop:   4% |#                                 | ETA:  35 days, 9:01:17

Episode: 2133   score: 21.25   Avg score (100e): 21.04   actor gain: -0.33   critic loss: 0.41   steps: 2133


training loop:   4% |#                                 | ETA:  35 days, 8:50:54

Episode: 2134   score: 21.26   Avg score (100e): 21.04   actor gain: -0.33   critic loss: 0.41   steps: 2134
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 8:39:03

Episode: 2135   score: 21.27   Avg score (100e): 21.05   actor gain: -0.33   critic loss: 0.41   steps: 2135


training loop:   4% |#                                 | ETA:  35 days, 8:29:24

Episode: 2136   score: 21.27   Avg score (100e): 21.05   actor gain: -0.33   critic loss: 0.41   steps: 2136
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 8:18:07

Episode: 2137   score: 21.28   Avg score (100e): 21.05   actor gain: -0.33   critic loss: 0.40   steps: 2137
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 8:05:42

Episode: 2138   score: 21.28   Avg score (100e): 21.06   actor gain: -0.33   critic loss: 0.40   steps: 2138


training loop:   4% |#                                 | ETA:  35 days, 7:55:40

Episode: 2139   score: 21.28   Avg score (100e): 21.06   actor gain: -0.33   critic loss: 0.40   steps: 2139


training loop:   4% |#                                 | ETA:  35 days, 7:44:20

Episode: 2140   score: 21.29   Avg score (100e): 21.07   actor gain: -0.33   critic loss: 0.40   steps: 2140


training loop:   4% |#                                 | ETA:  35 days, 7:32:37

Episode: 2141   score: 21.29   Avg score (100e): 21.07   actor gain: -0.33   critic loss: 0.40   steps: 2141


training loop:   4% |#                                 | ETA:  35 days, 7:21:04

Episode: 2142   score: 21.29   Avg score (100e): 21.08   actor gain: -0.33   critic loss: 0.40   steps: 2142


training loop:   4% |#                                 | ETA:  35 days, 7:09:37

Episode: 2143   score: 21.30   Avg score (100e): 21.08   actor gain: -0.33   critic loss: 0.40   steps: 2143


training loop:   4% |#                                 | ETA:  35 days, 7:03:51

Episode: 2144   score: 21.31   Avg score (100e): 21.09   actor gain: -0.33   critic loss: 0.41   steps: 2144
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 7:00:11

Episode: 2145   score: 21.31   Avg score (100e): 21.09   actor gain: -0.33   critic loss: 0.40   steps: 2145
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 6:49:54

Episode: 2146   score: 21.31   Avg score (100e): 21.09   actor gain: -0.34   critic loss: 0.40   steps: 2146


training loop:   4% |#                                 | ETA:  35 days, 6:39:03

Episode: 2147   score: 21.31   Avg score (100e): 21.10   actor gain: -0.34   critic loss: 0.40   steps: 2147


training loop:   4% |#                                 | ETA:  35 days, 6:27:59

Episode: 2148   score: 21.32   Avg score (100e): 21.10   actor gain: -0.33   critic loss: 0.40   steps: 2148


training loop:   4% |#                                 | ETA:  35 days, 6:17:51

Episode: 2149   score: 21.31   Avg score (100e): 21.11   actor gain: -0.33   critic loss: 0.40   steps: 2149


training loop:   4% |#                                 | ETA:  35 days, 6:07:00

Episode: 2150   score: 21.32   Avg score (100e): 21.11   actor gain: -0.33   critic loss: 0.41   steps: 2150
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 5:57:17

Episode: 2151   score: 21.33   Avg score (100e): 21.11   actor gain: -0.33   critic loss: 0.41   steps: 2151


training loop:   4% |#                                 | ETA:  35 days, 5:47:55

Episode: 2152   score: 21.33   Avg score (100e): 21.12   actor gain: -0.33   critic loss: 0.41   steps: 2152


training loop:   4% |#                                 | ETA:  35 days, 5:37:03

Episode: 2153   score: 21.33   Avg score (100e): 21.12   actor gain: -0.33   critic loss: 0.41   steps: 2153


training loop:   4% |#                                 | ETA:  35 days, 5:27:33

Episode: 2154   score: 21.34   Avg score (100e): 21.13   actor gain: -0.33   critic loss: 0.41   steps: 2154
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 5:16:35

Episode: 2155   score: 21.35   Avg score (100e): 21.13   actor gain: -0.33   critic loss: 0.41   steps: 2155


training loop:   4% |#                                 | ETA:  35 days, 5:05:50

Episode: 2156   score: 21.35   Avg score (100e): 21.14   actor gain: -0.33   critic loss: 0.41   steps: 2156


training loop:   4% |#                                 | ETA:  35 days, 4:55:53

Episode: 2157   score: 21.36   Avg score (100e): 21.14   actor gain: -0.33   critic loss: 0.41   steps: 2157


training loop:   4% |#                                 | ETA:  35 days, 4:47:47

Episode: 2158   score: 21.37   Avg score (100e): 21.14   actor gain: -0.33   critic loss: 0.41   steps: 2158


training loop:   4% |#                                 | ETA:  35 days, 4:38:41

Episode: 2159   score: 21.38   Avg score (100e): 21.15   actor gain: -0.34   critic loss: 0.41   steps: 2159


training loop:   4% |#                                 | ETA:  35 days, 4:32:21

Episode: 2160   score: 21.38   Avg score (100e): 21.15   actor gain: -0.33   critic loss: 0.41   steps: 2160
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 4:22:07

Episode: 2161   score: 21.38   Avg score (100e): 21.16   actor gain: -0.36   critic loss: 0.41   steps: 2161


training loop:   4% |#                                 | ETA:  35 days, 4:11:00

Episode: 2162   score: 21.39   Avg score (100e): 21.16   actor gain: -0.36   critic loss: 0.41   steps: 2162


training loop:   4% |#                                 | ETA:  35 days, 4:00:06

Episode: 2163   score: 21.40   Avg score (100e): 21.17   actor gain: -0.36   critic loss: 0.41   steps: 2163


training loop:   4% |#                                 | ETA:  35 days, 3:49:38

Episode: 2164   score: 21.40   Avg score (100e): 21.17   actor gain: -0.36   critic loss: 0.41   steps: 2164


training loop:   4% |#                                 | ETA:  35 days, 3:39:34

Episode: 2165   score: 21.41   Avg score (100e): 21.18   actor gain: -0.36   critic loss: 0.41   steps: 2165


training loop:   4% |#                                 | ETA:  35 days, 3:30:07

Episode: 2166   score: 21.42   Avg score (100e): 21.18   actor gain: -0.36   critic loss: 0.41   steps: 2166


training loop:   4% |#                                 | ETA:  35 days, 3:20:19

Episode: 2167   score: 21.42   Avg score (100e): 21.18   actor gain: -0.36   critic loss: 0.41   steps: 2167


training loop:   4% |#                                 | ETA:  35 days, 3:10:10

Episode: 2168   score: 21.42   Avg score (100e): 21.19   actor gain: -0.36   critic loss: 0.41   steps: 2168


training loop:   4% |#                                 | ETA:  35 days, 2:59:14

Episode: 2169   score: 21.42   Avg score (100e): 21.19   actor gain: -0.35   critic loss: 0.41   steps: 2169
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 2:49:45

Episode: 2170   score: 21.43   Avg score (100e): 21.20   actor gain: -0.35   critic loss: 0.41   steps: 2170


training loop:   4% |#                                 | ETA:  35 days, 2:40:18

Episode: 2171   score: 21.43   Avg score (100e): 21.20   actor gain: -0.35   critic loss: 0.41   steps: 2171


training loop:   4% |#                                 | ETA:  35 days, 2:29:30

Episode: 2172   score: 21.43   Avg score (100e): 21.21   actor gain: -0.35   critic loss: 0.41   steps: 2172


training loop:   4% |#                                 | ETA:  35 days, 2:18:40

Episode: 2173   score: 21.44   Avg score (100e): 21.21   actor gain: -0.35   critic loss: 0.41   steps: 2173


training loop:   4% |#                                 | ETA:  35 days, 2:09:06

Episode: 2174   score: 21.44   Avg score (100e): 21.22   actor gain: -0.35   critic loss: 0.41   steps: 2174


training loop:   4% |#                                 | ETA:  35 days, 1:59:39

Episode: 2175   score: 21.44   Avg score (100e): 21.22   actor gain: -0.35   critic loss: 0.41   steps: 2175
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 1:51:13

Episode: 2176   score: 21.45   Avg score (100e): 21.22   actor gain: -0.35   critic loss: 0.40   steps: 2176


training loop:   4% |#                                 | ETA:  35 days, 1:43:29

Episode: 2177   score: 21.45   Avg score (100e): 21.23   actor gain: -0.35   critic loss: 0.40   steps: 2177
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 1:33:13

Episode: 2178   score: 21.45   Avg score (100e): 21.23   actor gain: -0.35   critic loss: 0.40   steps: 2178
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 1:23:28

Episode: 2179   score: 21.46   Avg score (100e): 21.24   actor gain: -0.35   critic loss: 0.40   steps: 2179


training loop:   4% |#                                 | ETA:  35 days, 1:13:49

Episode: 2180   score: 21.46   Avg score (100e): 21.24   actor gain: -0.35   critic loss: 0.41   steps: 2180


training loop:   4% |#                                 | ETA:  35 days, 1:06:57

Episode: 2181   score: 21.47   Avg score (100e): 21.25   actor gain: -0.35   critic loss: 0.41   steps: 2181


training loop:   4% |#                                 | ETA:  35 days, 0:57:38

Episode: 2182   score: 21.48   Avg score (100e): 21.25   actor gain: -0.35   critic loss: 0.41   steps: 2182


training loop:   4% |#                                 | ETA:  35 days, 0:47:47

Episode: 2183   score: 21.48   Avg score (100e): 21.26   actor gain: -0.35   critic loss: 0.41   steps: 2183


training loop:   4% |#                                 | ETA:  35 days, 0:38:21

Episode: 2184   score: 21.48   Avg score (100e): 21.26   actor gain: -0.35   critic loss: 0.41   steps: 2184
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  35 days, 0:28:22

Episode: 2185   score: 21.49   Avg score (100e): 21.27   actor gain: -0.35   critic loss: 0.41   steps: 2185


training loop:   4% |#                                 | ETA:  35 days, 0:19:23

Episode: 2186   score: 21.49   Avg score (100e): 21.27   actor gain: -0.33   critic loss: 0.41   steps: 2186


training loop:   4% |#                                 | ETA:  35 days, 0:09:06

Episode: 2187   score: 21.50   Avg score (100e): 21.28   actor gain: -0.33   critic loss: 0.41   steps: 2187


training loop:   4% |#                                | ETA:  34 days, 23:59:01

Episode: 2188   score: 21.50   Avg score (100e): 21.28   actor gain: -0.33   critic loss: 0.40   steps: 2188


training loop:   4% |#                                | ETA:  34 days, 23:48:54

Episode: 2189   score: 21.50   Avg score (100e): 21.28   actor gain: -0.33   critic loss: 0.40   steps: 2189


training loop:   4% |#                                | ETA:  34 days, 23:38:21

Episode: 2190   score: 21.50   Avg score (100e): 21.29   actor gain: -0.33   critic loss: 0.40   steps: 2190


training loop:   4% |#                                | ETA:  34 days, 23:28:33

Episode: 2191   score: 21.50   Avg score (100e): 21.29   actor gain: -0.33   critic loss: 0.40   steps: 2191


training loop:   4% |#                                | ETA:  34 days, 23:18:20

Episode: 2192   score: 21.51   Avg score (100e): 21.30   actor gain: -0.33   critic loss: 0.40   steps: 2192


training loop:   4% |#                                | ETA:  34 days, 23:08:14

Episode: 2193   score: 21.51   Avg score (100e): 21.30   actor gain: -0.33   critic loss: 0.40   steps: 2193


training loop:   4% |#                                | ETA:  34 days, 22:57:08

Episode: 2194   score: 21.52   Avg score (100e): 21.31   actor gain: -0.34   critic loss: 0.40   steps: 2194


training loop:   4% |#                                | ETA:  34 days, 22:46:11

Episode: 2195   score: 21.53   Avg score (100e): 21.31   actor gain: -0.34   critic loss: 0.40   steps: 2195


training loop:   4% |#                                | ETA:  34 days, 22:36:25

Episode: 2196   score: 21.53   Avg score (100e): 21.32   actor gain: -0.33   critic loss: 0.40   steps: 2196


training loop:   4% |#                                | ETA:  34 days, 22:28:02

Episode: 2197   score: 21.53   Avg score (100e): 21.32   actor gain: -0.33   critic loss: 0.40   steps: 2197


training loop:   4% |#                                | ETA:  34 days, 22:17:10

Episode: 2198   score: 21.53   Avg score (100e): 21.32   actor gain: -0.33   critic loss: 0.40   steps: 2198
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 22:05:54

Episode: 2199   score: 21.54   Avg score (100e): 21.33   actor gain: -0.33   critic loss: 0.40   steps: 2199


training loop:   4% |#                                | ETA:  34 days, 21:55:23

Episode: 2200   score: 21.54   Avg score (100e): 21.33   actor gain: -0.33   critic loss: 0.40   steps: 2200


training loop:   4% |#                                | ETA:  34 days, 21:44:40

Episode: 2201   score: 21.55   Avg score (100e): 21.34   actor gain: -0.33   critic loss: 0.40   steps: 2201


training loop:   4% |#                                | ETA:  34 days, 21:34:29

Episode: 2202   score: 21.56   Avg score (100e): 21.34   actor gain: -0.33   critic loss: 0.40   steps: 2202
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 21:24:11

Episode: 2203   score: 21.56   Avg score (100e): 21.35   actor gain: -0.33   critic loss: 0.40   steps: 2203


training loop:   4% |#                                | ETA:  34 days, 21:14:08

Episode: 2204   score: 21.56   Avg score (100e): 21.35   actor gain: -0.33   critic loss: 0.40   steps: 2204
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 21:03:39

Episode: 2205   score: 21.57   Avg score (100e): 21.36   actor gain: -0.33   critic loss: 0.40   steps: 2205


training loop:   4% |#                                | ETA:  34 days, 20:53:04

Episode: 2206   score: 21.57   Avg score (100e): 21.36   actor gain: -0.34   critic loss: 0.40   steps: 2206


training loop:   4% |#                                | ETA:  34 days, 20:41:57

Episode: 2207   score: 21.57   Avg score (100e): 21.36   actor gain: -0.33   critic loss: 0.40   steps: 2207


training loop:   4% |#                                | ETA:  34 days, 20:31:45

Episode: 2208   score: 21.57   Avg score (100e): 21.37   actor gain: -0.34   critic loss: 0.40   steps: 2208


training loop:   4% |#                                | ETA:  34 days, 20:25:03

Episode: 2209   score: 21.57   Avg score (100e): 21.37   actor gain: -0.34   critic loss: 0.40   steps: 2209


training loop:   4% |#                                | ETA:  34 days, 20:16:15

Episode: 2210   score: 21.58   Avg score (100e): 21.38   actor gain: -0.34   critic loss: 0.40   steps: 2210


training loop:   4% |#                                | ETA:  34 days, 20:06:46

Episode: 2211   score: 21.58   Avg score (100e): 21.38   actor gain: -0.34   critic loss: 0.40   steps: 2211


training loop:   4% |#                                | ETA:  34 days, 19:57:39

Episode: 2212   score: 21.58   Avg score (100e): 21.39   actor gain: -0.34   critic loss: 0.40   steps: 2212


training loop:   4% |#                                | ETA:  34 days, 19:47:46

Episode: 2213   score: 21.58   Avg score (100e): 21.39   actor gain: -0.34   critic loss: 0.40   steps: 2213
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 19:37:19

Episode: 2214   score: 21.59   Avg score (100e): 21.39   actor gain: -0.34   critic loss: 0.40   steps: 2214


training loop:   4% |#                                | ETA:  34 days, 19:27:15

Episode: 2215   score: 21.59   Avg score (100e): 21.40   actor gain: -0.34   critic loss: 0.40   steps: 2215


training loop:   4% |#                                | ETA:  34 days, 19:17:05

Episode: 2216   score: 21.60   Avg score (100e): 21.40   actor gain: -0.34   critic loss: 0.40   steps: 2216
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 19:07:10

Episode: 2217   score: 21.60   Avg score (100e): 21.41   actor gain: -0.34   critic loss: 0.40   steps: 2217


training loop:   4% |#                                | ETA:  34 days, 18:56:46

Episode: 2218   score: 21.60   Avg score (100e): 21.41   actor gain: -0.34   critic loss: 0.40   steps: 2218


training loop:   4% |#                                | ETA:  34 days, 18:48:30

Episode: 2219   score: 21.61   Avg score (100e): 21.41   actor gain: -0.34   critic loss: 0.40   steps: 2219
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 18:38:56

Episode: 2220   score: 21.61   Avg score (100e): 21.42   actor gain: -0.34   critic loss: 0.40   steps: 2220


training loop:   4% |#                                | ETA:  34 days, 18:27:58

Episode: 2221   score: 21.62   Avg score (100e): 21.42   actor gain: -0.34   critic loss: 0.40   steps: 2221
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 18:17:02

Episode: 2222   score: 21.61   Avg score (100e): 21.43   actor gain: -0.34   critic loss: 0.40   steps: 2222


training loop:   4% |#                                | ETA:  34 days, 18:06:51

Episode: 2223   score: 21.61   Avg score (100e): 21.43   actor gain: -0.34   critic loss: 0.40   steps: 2223


training loop:   4% |#                                | ETA:  34 days, 17:56:55

Episode: 2224   score: 21.61   Avg score (100e): 21.43   actor gain: -0.34   critic loss: 0.40   steps: 2224


training loop:   4% |#                                | ETA:  34 days, 17:46:49

Episode: 2225   score: 21.61   Avg score (100e): 21.44   actor gain: -0.34   critic loss: 0.40   steps: 2225


training loop:   4% |#                                | ETA:  34 days, 17:36:25

Episode: 2226   score: 21.61   Avg score (100e): 21.44   actor gain: -0.34   critic loss: 0.40   steps: 2226


training loop:   4% |#                                | ETA:  34 days, 17:25:52

Episode: 2227   score: 21.62   Avg score (100e): 21.45   actor gain: -0.34   critic loss: 0.40   steps: 2227


training loop:   4% |#                                | ETA:  34 days, 17:15:58

Episode: 2228   score: 21.63   Avg score (100e): 21.45   actor gain: -0.34   critic loss: 0.40   steps: 2228


training loop:   4% |#                                | ETA:  34 days, 17:05:32

Episode: 2229   score: 21.63   Avg score (100e): 21.45   actor gain: -0.34   critic loss: 0.40   steps: 2229


training loop:   4% |#                                | ETA:  34 days, 16:56:27

Episode: 2230   score: 21.63   Avg score (100e): 21.46   actor gain: -0.34   critic loss: 0.40   steps: 2230


training loop:   4% |#                                | ETA:  34 days, 16:46:04

Episode: 2231   score: 21.64   Avg score (100e): 21.46   actor gain: -0.34   critic loss: 0.40   steps: 2231


training loop:   4% |#                                | ETA:  34 days, 16:35:18

Episode: 2232   score: 21.64   Avg score (100e): 21.47   actor gain: -0.34   critic loss: 0.40   steps: 2232


training loop:   4% |#                                | ETA:  34 days, 16:25:32

Episode: 2233   score: 21.64   Avg score (100e): 21.47   actor gain: -0.34   critic loss: 0.40   steps: 2233


training loop:   4% |#                                | ETA:  34 days, 16:14:51

Episode: 2234   score: 21.65   Avg score (100e): 21.47   actor gain: -0.34   critic loss: 0.40   steps: 2234


training loop:   4% |#                                | ETA:  34 days, 16:05:35

Episode: 2235   score: 21.65   Avg score (100e): 21.48   actor gain: -0.34   critic loss: 0.40   steps: 2235


training loop:   4% |#                                | ETA:  34 days, 15:57:08

Episode: 2236   score: 21.65   Avg score (100e): 21.48   actor gain: -0.34   critic loss: 0.40   steps: 2236


training loop:   4% |#                                | ETA:  34 days, 15:46:50

Episode: 2237   score: 21.65   Avg score (100e): 21.48   actor gain: -0.34   critic loss: 0.40   steps: 2237


training loop:   4% |#                                | ETA:  34 days, 15:36:19

Episode: 2238   score: 21.66   Avg score (100e): 21.49   actor gain: -0.33   critic loss: 0.40   steps: 2238


training loop:   4% |#                                | ETA:  34 days, 15:25:46

Episode: 2239   score: 21.66   Avg score (100e): 21.49   actor gain: -0.34   critic loss: 0.40   steps: 2239


training loop:   4% |#                                | ETA:  34 days, 15:15:30

Episode: 2240   score: 21.66   Avg score (100e): 21.50   actor gain: -0.33   critic loss: 0.40   steps: 2240


training loop:   4% |#                                | ETA:  34 days, 15:06:01

Episode: 2241   score: 21.66   Avg score (100e): 21.50   actor gain: -0.33   critic loss: 0.40   steps: 2241


training loop:   4% |#                                | ETA:  34 days, 14:57:58

Episode: 2242   score: 21.66   Avg score (100e): 21.50   actor gain: -0.33   critic loss: 0.40   steps: 2242


training loop:   4% |#                                | ETA:  34 days, 14:47:47

Episode: 2243   score: 21.66   Avg score (100e): 21.51   actor gain: -0.33   critic loss: 0.40   steps: 2243


training loop:   4% |#                                | ETA:  34 days, 14:39:15

Episode: 2244   score: 21.67   Avg score (100e): 21.51   actor gain: -0.33   critic loss: 0.40   steps: 2244


training loop:   4% |#                                | ETA:  34 days, 14:30:54

Episode: 2245   score: 21.67   Avg score (100e): 21.51   actor gain: -0.33   critic loss: 0.40   steps: 2245


training loop:   4% |#                                | ETA:  34 days, 14:22:22

Episode: 2246   score: 21.67   Avg score (100e): 21.52   actor gain: -0.33   critic loss: 0.40   steps: 2246


training loop:   4% |#                                | ETA:  34 days, 14:12:40

Episode: 2247   score: 21.68   Avg score (100e): 21.52   actor gain: -0.33   critic loss: 0.40   steps: 2247


training loop:   4% |#                                | ETA:  34 days, 14:03:32

Episode: 2248   score: 21.68   Avg score (100e): 21.52   actor gain: -0.33   critic loss: 0.40   steps: 2248
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 13:55:17

Episode: 2249   score: 21.69   Avg score (100e): 21.53   actor gain: -0.33   critic loss: 0.40   steps: 2249


training loop:   4% |#                                | ETA:  34 days, 13:46:57

Episode: 2250   score: 21.69   Avg score (100e): 21.53   actor gain: -0.33   critic loss: 0.40   steps: 2250


training loop:   4% |#                                | ETA:  34 days, 13:39:35

Episode: 2251   score: 21.70   Avg score (100e): 21.54   actor gain: -0.33   critic loss: 0.40   steps: 2251


training loop:   4% |#                                | ETA:  34 days, 13:32:51

Episode: 2252   score: 21.70   Avg score (100e): 21.54   actor gain: -0.33   critic loss: 0.40   steps: 2252


training loop:   4% |#                                | ETA:  34 days, 13:24:30

Episode: 2253   score: 21.71   Avg score (100e): 21.54   actor gain: -0.33   critic loss: 0.40   steps: 2253


training loop:   4% |#                                | ETA:  34 days, 13:15:53

Episode: 2254   score: 21.70   Avg score (100e): 21.55   actor gain: -0.33   critic loss: 0.40   steps: 2254


training loop:   4% |#                                | ETA:  34 days, 13:05:42

Episode: 2255   score: 21.70   Avg score (100e): 21.55   actor gain: -0.33   critic loss: 0.40   steps: 2255


training loop:   4% |#                                | ETA:  34 days, 12:56:52

Episode: 2256   score: 21.71   Avg score (100e): 21.55   actor gain: -0.33   critic loss: 0.40   steps: 2256


training loop:   4% |#                                | ETA:  34 days, 12:49:05

Episode: 2257   score: 21.71   Avg score (100e): 21.56   actor gain: -0.33   critic loss: 0.40   steps: 2257


training loop:   4% |#                                | ETA:  34 days, 12:41:21

Episode: 2258   score: 21.71   Avg score (100e): 21.56   actor gain: -0.33   critic loss: 0.40   steps: 2258


training loop:   4% |#                                | ETA:  34 days, 12:32:10

Episode: 2259   score: 21.71   Avg score (100e): 21.56   actor gain: -0.33   critic loss: 0.40   steps: 2259


training loop:   4% |#                                | ETA:  34 days, 12:22:56

Episode: 2260   score: 21.72   Avg score (100e): 21.57   actor gain: -0.33   critic loss: 0.40   steps: 2260


training loop:   4% |#                                | ETA:  34 days, 12:13:23

Episode: 2261   score: 21.73   Avg score (100e): 21.57   actor gain: -0.33   critic loss: 0.40   steps: 2261


training loop:   4% |#                                | ETA:  34 days, 12:05:31

Episode: 2262   score: 21.72   Avg score (100e): 21.57   actor gain: -0.33   critic loss: 0.40   steps: 2262
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 11:56:31

Episode: 2263   score: 21.73   Avg score (100e): 21.58   actor gain: -0.33   critic loss: 0.40   steps: 2263


training loop:   4% |#                                | ETA:  34 days, 11:47:47

Episode: 2264   score: 21.73   Avg score (100e): 21.58   actor gain: -0.33   critic loss: 0.40   steps: 2264


training loop:   4% |#                                | ETA:  34 days, 11:39:35

Episode: 2265   score: 21.74   Avg score (100e): 21.58   actor gain: -0.33   critic loss: 0.40   steps: 2265
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 11:31:19

Episode: 2266   score: 21.74   Avg score (100e): 21.59   actor gain: -0.33   critic loss: 0.40   steps: 2266
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 11:23:31

Episode: 2267   score: 21.75   Avg score (100e): 21.59   actor gain: -0.33   critic loss: 0.40   steps: 2267


training loop:   4% |#                                | ETA:  34 days, 11:13:50

Episode: 2268   score: 21.75   Avg score (100e): 21.59   actor gain: -0.33   critic loss: 0.40   steps: 2268


training loop:   4% |#                                | ETA:  34 days, 11:04:43

Episode: 2269   score: 21.74   Avg score (100e): 21.60   actor gain: -0.33   critic loss: 0.40   steps: 2269


training loop:   4% |#                                | ETA:  34 days, 10:54:46

Episode: 2270   score: 21.74   Avg score (100e): 21.60   actor gain: -0.33   critic loss: 0.40   steps: 2270
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 10:45:17

Episode: 2271   score: 21.75   Avg score (100e): 21.60   actor gain: -0.33   critic loss: 0.40   steps: 2271


training loop:   4% |#                                | ETA:  34 days, 10:36:26

Episode: 2272   score: 21.75   Avg score (100e): 21.61   actor gain: -0.33   critic loss: 0.40   steps: 2272


training loop:   4% |#                                | ETA:  34 days, 10:28:06

Episode: 2273   score: 21.75   Avg score (100e): 21.61   actor gain: -0.33   critic loss: 0.40   steps: 2273


training loop:   4% |#                                | ETA:  34 days, 10:20:41

Episode: 2274   score: 21.76   Avg score (100e): 21.61   actor gain: -0.33   critic loss: 0.40   steps: 2274


training loop:   4% |#                                | ETA:  34 days, 10:11:04

Episode: 2275   score: 21.76   Avg score (100e): 21.62   actor gain: -0.33   critic loss: 0.40   steps: 2275
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  34 days, 10:01:03

Episode: 2276   score: 21.76   Avg score (100e): 21.62   actor gain: -0.33   critic loss: 0.40   steps: 2276


training loop:   4% |#                                 | ETA:  34 days, 9:51:43

Episode: 2277   score: 21.77   Avg score (100e): 21.62   actor gain: -0.33   critic loss: 0.40   steps: 2277


training loop:   4% |#                                 | ETA:  34 days, 9:42:19

Episode: 2278   score: 21.78   Avg score (100e): 21.63   actor gain: -0.33   critic loss: 0.40   steps: 2278


training loop:   4% |#                                 | ETA:  34 days, 9:32:33

Episode: 2279   score: 21.78   Avg score (100e): 21.63   actor gain: -0.33   critic loss: 0.40   steps: 2279


training loop:   4% |#                                 | ETA:  34 days, 9:22:48

Episode: 2280   score: 21.78   Avg score (100e): 21.63   actor gain: -0.33   critic loss: 0.40   steps: 2280


training loop:   4% |#                                 | ETA:  34 days, 9:14:15

Episode: 2281   score: 21.78   Avg score (100e): 21.64   actor gain: -0.33   critic loss: 0.40   steps: 2281


training loop:   4% |#                                 | ETA:  34 days, 9:05:49

Episode: 2282   score: 21.79   Avg score (100e): 21.64   actor gain: -0.33   critic loss: 0.40   steps: 2282
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 8:57:20

Episode: 2283   score: 21.79   Avg score (100e): 21.64   actor gain: -0.33   critic loss: 0.40   steps: 2283


training loop:   4% |#                                 | ETA:  34 days, 8:47:12

Episode: 2284   score: 21.79   Avg score (100e): 21.64   actor gain: -0.33   critic loss: 0.40   steps: 2284
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 8:37:04

Episode: 2285   score: 21.79   Avg score (100e): 21.65   actor gain: -0.34   critic loss: 0.40   steps: 2285


training loop:   4% |#                                 | ETA:  34 days, 8:27:12

Episode: 2286   score: 21.80   Avg score (100e): 21.65   actor gain: -0.34   critic loss: 0.40   steps: 2286
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 8:16:35

Episode: 2287   score: 21.80   Avg score (100e): 21.65   actor gain: -0.34   critic loss: 0.40   steps: 2287


training loop:   4% |#                                 | ETA:  34 days, 8:07:37

Episode: 2288   score: 21.80   Avg score (100e): 21.66   actor gain: -0.34   critic loss: 0.40   steps: 2288
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 7:59:55

Episode: 2289   score: 21.80   Avg score (100e): 21.66   actor gain: -0.34   critic loss: 0.40   steps: 2289


training loop:   4% |#                                 | ETA:  34 days, 7:51:58

Episode: 2290   score: 21.81   Avg score (100e): 21.66   actor gain: -0.34   critic loss: 0.40   steps: 2290


training loop:   4% |#                                 | ETA:  34 days, 7:43:25

Episode: 2291   score: 21.81   Avg score (100e): 21.67   actor gain: -0.34   critic loss: 0.40   steps: 2291
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 7:34:05

Episode: 2292   score: 21.81   Avg score (100e): 21.67   actor gain: -0.34   critic loss: 0.40   steps: 2292
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 7:23:45

Episode: 2293   score: 21.82   Avg score (100e): 21.67   actor gain: -0.34   critic loss: 0.40   steps: 2293


training loop:   4% |#                                 | ETA:  34 days, 7:14:39

Episode: 2294   score: 21.82   Avg score (100e): 21.67   actor gain: -0.34   critic loss: 0.40   steps: 2294
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 7:05:35

Episode: 2295   score: 21.82   Avg score (100e): 21.68   actor gain: -0.34   critic loss: 0.40   steps: 2295


training loop:   4% |#                                 | ETA:  34 days, 6:57:46

Episode: 2296   score: 21.82   Avg score (100e): 21.68   actor gain: -0.34   critic loss: 0.40   steps: 2296


training loop:   4% |#                                 | ETA:  34 days, 6:49:36

Episode: 2297   score: 21.81   Avg score (100e): 21.68   actor gain: -0.34   critic loss: 0.40   steps: 2297


training loop:   4% |#                                 | ETA:  34 days, 6:40:52

Episode: 2298   score: 21.81   Avg score (100e): 21.69   actor gain: -0.34   critic loss: 0.40   steps: 2298
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 6:32:34

Episode: 2299   score: 21.81   Avg score (100e): 21.69   actor gain: -0.34   critic loss: 0.40   steps: 2299


training loop:   4% |#                                 | ETA:  34 days, 6:23:16

Episode: 2300   score: 21.82   Avg score (100e): 21.69   actor gain: -0.34   critic loss: 0.40   steps: 2300
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 6:13:42

Episode: 2301   score: 21.82   Avg score (100e): 21.69   actor gain: -0.34   critic loss: 0.40   steps: 2301


training loop:   4% |#                                 | ETA:  34 days, 6:04:35

Episode: 2302   score: 21.82   Avg score (100e): 21.70   actor gain: -0.34   critic loss: 0.40   steps: 2302
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 5:55:24

Episode: 2303   score: 21.82   Avg score (100e): 21.70   actor gain: -0.34   critic loss: 0.40   steps: 2303


training loop:   4% |#                                 | ETA:  34 days, 5:47:17

Episode: 2304   score: 21.82   Avg score (100e): 21.70   actor gain: -0.34   critic loss: 0.40   steps: 2304


training loop:   4% |#                                 | ETA:  34 days, 5:37:32

Episode: 2305   score: 21.82   Avg score (100e): 21.70   actor gain: -0.34   critic loss: 0.40   steps: 2305


training loop:   4% |#                                 | ETA:  34 days, 5:30:10

Episode: 2306   score: 21.83   Avg score (100e): 21.71   actor gain: -0.34   critic loss: 0.40   steps: 2306


training loop:   4% |#                                 | ETA:  34 days, 5:21:22

Episode: 2307   score: 21.83   Avg score (100e): 21.71   actor gain: -0.34   critic loss: 0.40   steps: 2307


training loop:   4% |#                                 | ETA:  34 days, 5:12:04

Episode: 2308   score: 21.84   Avg score (100e): 21.71   actor gain: -0.34   critic loss: 0.40   steps: 2308


training loop:   4% |#                                 | ETA:  34 days, 5:02:48

Episode: 2309   score: 21.85   Avg score (100e): 21.72   actor gain: -0.34   critic loss: 0.40   steps: 2309


training loop:   4% |#                                 | ETA:  34 days, 4:54:18

Episode: 2310   score: 21.85   Avg score (100e): 21.72   actor gain: -0.34   critic loss: 0.39   steps: 2310


training loop:   4% |#                                 | ETA:  34 days, 4:45:13

Episode: 2311   score: 21.85   Avg score (100e): 21.72   actor gain: -0.34   critic loss: 0.39   steps: 2311


training loop:   4% |#                                 | ETA:  34 days, 4:36:23

Episode: 2312   score: 21.86   Avg score (100e): 21.72   actor gain: -0.33   critic loss: 0.39   steps: 2312


training loop:   4% |#                                 | ETA:  34 days, 4:26:59

Episode: 2313   score: 21.86   Avg score (100e): 21.73   actor gain: -0.34   critic loss: 0.39   steps: 2313
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 4:16:58

Episode: 2314   score: 21.87   Avg score (100e): 21.73   actor gain: -0.33   critic loss: 0.40   steps: 2314


training loop:   4% |#                                 | ETA:  34 days, 4:08:57

Episode: 2315   score: 21.86   Avg score (100e): 21.73   actor gain: -0.33   critic loss: 0.40   steps: 2315


training loop:   4% |#                                 | ETA:  34 days, 3:59:19

Episode: 2316   score: 21.87   Avg score (100e): 21.73   actor gain: -0.33   critic loss: 0.40   steps: 2316


training loop:   4% |#                                 | ETA:  34 days, 3:49:29

Episode: 2317   score: 21.87   Avg score (100e): 21.74   actor gain: -0.33   critic loss: 0.40   steps: 2317


training loop:   4% |#                                 | ETA:  34 days, 3:40:08

Episode: 2318   score: 21.87   Avg score (100e): 21.74   actor gain: -0.33   critic loss: 0.39   steps: 2318


training loop:   4% |#                                 | ETA:  34 days, 3:32:06

Episode: 2319   score: 21.87   Avg score (100e): 21.74   actor gain: -0.33   critic loss: 0.39   steps: 2319


training loop:   4% |#                                 | ETA:  34 days, 3:24:11

Episode: 2320   score: 21.87   Avg score (100e): 21.75   actor gain: -0.33   critic loss: 0.39   steps: 2320
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 3:14:08

Episode: 2321   score: 21.88   Avg score (100e): 21.75   actor gain: -0.33   critic loss: 0.39   steps: 2321
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 3:04:07

Episode: 2322   score: 21.88   Avg score (100e): 21.75   actor gain: -0.33   critic loss: 0.39   steps: 2322


training loop:   4% |#                                 | ETA:  34 days, 2:54:47

Episode: 2323   score: 21.88   Avg score (100e): 21.75   actor gain: -0.33   critic loss: 0.39   steps: 2323


training loop:   4% |#                                 | ETA:  34 days, 2:45:02

Episode: 2324   score: 21.89   Avg score (100e): 21.76   actor gain: -0.33   critic loss: 0.39   steps: 2324


training loop:   4% |#                                 | ETA:  34 days, 2:35:05

Episode: 2325   score: 21.89   Avg score (100e): 21.76   actor gain: -0.33   critic loss: 0.39   steps: 2325
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 2:26:01

Episode: 2326   score: 21.89   Avg score (100e): 21.76   actor gain: -0.33   critic loss: 0.39   steps: 2326


training loop:   4% |#                                 | ETA:  34 days, 2:16:52

Episode: 2327   score: 21.89   Avg score (100e): 21.76   actor gain: -0.33   critic loss: 0.39   steps: 2327


training loop:   4% |#                                 | ETA:  34 days, 2:07:23

Episode: 2328   score: 21.89   Avg score (100e): 21.77   actor gain: -0.33   critic loss: 0.39   steps: 2328


training loop:   4% |#                                 | ETA:  34 days, 1:57:20

Episode: 2329   score: 21.89   Avg score (100e): 21.77   actor gain: -0.33   critic loss: 0.39   steps: 2329
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 1:47:03

Episode: 2330   score: 21.90   Avg score (100e): 21.77   actor gain: -0.33   critic loss: 0.39   steps: 2330


training loop:   4% |#                                 | ETA:  34 days, 1:37:30

Episode: 2331   score: 21.90   Avg score (100e): 21.77   actor gain: -0.34   critic loss: 0.39   steps: 2331


training loop:   4% |#                                 | ETA:  34 days, 1:28:58

Episode: 2332   score: 21.90   Avg score (100e): 21.78   actor gain: -0.34   critic loss: 0.39   steps: 2332


training loop:   4% |#                                 | ETA:  34 days, 1:19:20

Episode: 2333   score: 21.90   Avg score (100e): 21.78   actor gain: -0.34   critic loss: 0.39   steps: 2333
np.all(done) is true! miracle!


training loop:   4% |#                                 | ETA:  34 days, 1:09:08

Episode: 2334   score: 21.90   Avg score (100e): 21.78   actor gain: -0.34   critic loss: 0.39   steps: 2334


training loop:   4% |#                                 | ETA:  34 days, 1:00:02

Episode: 2335   score: 21.90   Avg score (100e): 21.78   actor gain: -0.34   critic loss: 0.40   steps: 2335


training loop:   4% |#                                 | ETA:  34 days, 0:50:37

Episode: 2336   score: 21.91   Avg score (100e): 21.79   actor gain: -0.34   critic loss: 0.40   steps: 2336


training loop:   4% |#                                 | ETA:  34 days, 0:41:15

Episode: 2337   score: 21.90   Avg score (100e): 21.79   actor gain: -0.34   critic loss: 0.39   steps: 2337


training loop:   4% |#                                 | ETA:  34 days, 0:32:16

Episode: 2338   score: 21.91   Avg score (100e): 21.79   actor gain: -0.34   critic loss: 0.40   steps: 2338


training loop:   4% |#                                 | ETA:  34 days, 0:25:15

Episode: 2339   score: 21.91   Avg score (100e): 21.80   actor gain: -0.34   critic loss: 0.40   steps: 2339


training loop:   4% |#                                 | ETA:  34 days, 0:15:48

Episode: 2340   score: 21.92   Avg score (100e): 21.80   actor gain: -0.34   critic loss: 0.40   steps: 2340


training loop:   4% |#                                 | ETA:  34 days, 0:06:36

Episode: 2341   score: 21.92   Avg score (100e): 21.80   actor gain: -0.34   critic loss: 0.40   steps: 2341


training loop:   4% |#                                | ETA:  33 days, 23:56:54

Episode: 2342   score: 21.93   Avg score (100e): 21.80   actor gain: -0.34   critic loss: 0.40   steps: 2342


training loop:   4% |#                                | ETA:  33 days, 23:48:37

Episode: 2343   score: 21.93   Avg score (100e): 21.81   actor gain: -0.34   critic loss: 0.40   steps: 2343


training loop:   4% |#                                | ETA:  33 days, 23:41:50

Episode: 2344   score: 21.94   Avg score (100e): 21.81   actor gain: -0.34   critic loss: 0.40   steps: 2344


training loop:   4% |#                                | ETA:  33 days, 23:33:42

Episode: 2345   score: 21.94   Avg score (100e): 21.81   actor gain: -0.34   critic loss: 0.40   steps: 2345


training loop:   4% |#                                | ETA:  33 days, 23:24:26

Episode: 2346   score: 21.94   Avg score (100e): 21.81   actor gain: -0.34   critic loss: 0.40   steps: 2346
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 23:14:53

Episode: 2347   score: 21.94   Avg score (100e): 21.82   actor gain: -0.34   critic loss: 0.40   steps: 2347


training loop:   4% |#                                | ETA:  33 days, 23:06:34

Episode: 2348   score: 21.95   Avg score (100e): 21.82   actor gain: -0.34   critic loss: 0.40   steps: 2348


training loop:   4% |#                                | ETA:  33 days, 22:57:28

Episode: 2349   score: 21.95   Avg score (100e): 21.82   actor gain: -0.34   critic loss: 0.40   steps: 2349


training loop:   4% |#                                | ETA:  33 days, 22:48:09

Episode: 2350   score: 21.94   Avg score (100e): 21.82   actor gain: -0.34   critic loss: 0.40   steps: 2350


training loop:   4% |#                                | ETA:  33 days, 22:39:14

Episode: 2351   score: 21.95   Avg score (100e): 21.83   actor gain: -0.34   critic loss: 0.40   steps: 2351


training loop:   4% |#                                | ETA:  33 days, 22:29:57

Episode: 2352   score: 21.96   Avg score (100e): 21.83   actor gain: -0.34   critic loss: 0.40   steps: 2352
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:20:10

Episode: 2353   score: 21.96   Avg score (100e): 21.83   actor gain: -0.34   critic loss: 0.40   steps: 2353


training loop:   4% |#                                | ETA:  33 days, 22:11:21

Episode: 2354   score: 21.96   Avg score (100e): 21.83   actor gain: -0.33   critic loss: 0.40   steps: 2354


training loop:   4% |#                                | ETA:  33 days, 22:02:19

Episode: 2355   score: 21.97   Avg score (100e): 21.84   actor gain: -0.33   critic loss: 0.40   steps: 2355


training loop:   4% |#                                | ETA:  33 days, 21:53:04

Episode: 2356   score: 21.98   Avg score (100e): 21.84   actor gain: -0.33   critic loss: 0.40   steps: 2356


training loop:   4% |#                                | ETA:  33 days, 21:43:33

Episode: 2357   score: 21.98   Avg score (100e): 21.84   actor gain: -0.33   critic loss: 0.40   steps: 2357
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:33:36

Episode: 2358   score: 21.98   Avg score (100e): 21.85   actor gain: -0.33   critic loss: 0.40   steps: 2358


training loop:   4% |#                                | ETA:  33 days, 21:24:41

Episode: 2359   score: 21.98   Avg score (100e): 21.85   actor gain: -0.33   critic loss: 0.40   steps: 2359


training loop:   4% |#                                | ETA:  33 days, 21:16:38

Episode: 2360   score: 21.98   Avg score (100e): 21.85   actor gain: -0.33   critic loss: 0.40   steps: 2360
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:06:30

Episode: 2361   score: 21.99   Avg score (100e): 21.85   actor gain: -0.33   critic loss: 0.40   steps: 2361
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:01:12

Episode: 2362   score: 21.99   Avg score (100e): 21.86   actor gain: -0.33   critic loss: 0.40   steps: 2362


training loop:   4% |#                                | ETA:  33 days, 21:04:22

Episode: 2363   score: 22.00   Avg score (100e): 21.86   actor gain: -0.33   critic loss: 0.40   steps: 2363
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:06:02

Episode: 2364   score: 22.00   Avg score (100e): 21.86   actor gain: -0.33   critic loss: 0.40   steps: 2364


training loop:   4% |#                                | ETA:  33 days, 21:07:30

Episode: 2365   score: 22.00   Avg score (100e): 21.86   actor gain: -0.33   critic loss: 0.40   steps: 2365


training loop:   4% |#                                | ETA:  33 days, 21:07:25

Episode: 2366   score: 22.01   Avg score (100e): 21.87   actor gain: -0.33   critic loss: 0.40   steps: 2366
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:07:38

Episode: 2367   score: 22.02   Avg score (100e): 21.87   actor gain: -0.33   critic loss: 0.40   steps: 2367
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:05:54

Episode: 2368   score: 22.02   Avg score (100e): 21.87   actor gain: -0.33   critic loss: 0.40   steps: 2368


training loop:   4% |#                                | ETA:  33 days, 21:05:42

Episode: 2369   score: 22.02   Avg score (100e): 21.88   actor gain: -0.33   critic loss: 0.40   steps: 2369


training loop:   4% |#                                | ETA:  33 days, 21:05:50

Episode: 2370   score: 22.02   Avg score (100e): 21.88   actor gain: -0.33   critic loss: 0.41   steps: 2370
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:08:34

Episode: 2371   score: 22.02   Avg score (100e): 21.88   actor gain: -0.33   critic loss: 0.41   steps: 2371


training loop:   4% |#                                | ETA:  33 days, 21:08:10

Episode: 2372   score: 22.02   Avg score (100e): 21.88   actor gain: -0.33   critic loss: 0.41   steps: 2372
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:09:45

Episode: 2373   score: 22.02   Avg score (100e): 21.89   actor gain: -0.33   critic loss: 0.41   steps: 2373


training loop:   4% |#                                | ETA:  33 days, 21:10:49

Episode: 2374   score: 22.02   Avg score (100e): 21.89   actor gain: -0.33   critic loss: 0.41   steps: 2374


training loop:   4% |#                                | ETA:  33 days, 21:18:27

Episode: 2375   score: 22.03   Avg score (100e): 21.89   actor gain: -0.33   critic loss: 0.40   steps: 2375


training loop:   4% |#                                | ETA:  33 days, 21:24:48

Episode: 2376   score: 22.03   Avg score (100e): 21.89   actor gain: -0.33   critic loss: 0.40   steps: 2376


training loop:   4% |#                                | ETA:  33 days, 21:33:56

Episode: 2377   score: 22.03   Avg score (100e): 21.90   actor gain: -0.33   critic loss: 0.40   steps: 2377


training loop:   4% |#                                | ETA:  33 days, 21:39:36

Episode: 2378   score: 22.04   Avg score (100e): 21.90   actor gain: -0.33   critic loss: 0.40   steps: 2378
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:40:26

Episode: 2379   score: 22.04   Avg score (100e): 21.90   actor gain: -0.33   critic loss: 0.40   steps: 2379


training loop:   4% |#                                | ETA:  33 days, 21:42:29

Episode: 2380   score: 22.05   Avg score (100e): 21.90   actor gain: -0.33   critic loss: 0.40   steps: 2380


training loop:   4% |#                                | ETA:  33 days, 21:44:53

Episode: 2381   score: 22.04   Avg score (100e): 21.91   actor gain: -0.33   critic loss: 0.40   steps: 2381


training loop:   4% |#                                | ETA:  33 days, 21:47:02

Episode: 2382   score: 22.03   Avg score (100e): 21.91   actor gain: -0.33   critic loss: 0.40   steps: 2382


training loop:   4% |#                                | ETA:  33 days, 21:47:22

Episode: 2383   score: 22.04   Avg score (100e): 21.91   actor gain: -0.33   critic loss: 0.40   steps: 2383


training loop:   4% |#                                | ETA:  33 days, 21:48:08

Episode: 2384   score: 22.05   Avg score (100e): 21.91   actor gain: -0.33   critic loss: 0.40   steps: 2384


training loop:   4% |#                                | ETA:  33 days, 21:49:31

Episode: 2385   score: 22.05   Avg score (100e): 21.92   actor gain: -0.33   critic loss: 0.40   steps: 2385
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:49:47

Episode: 2386   score: 22.05   Avg score (100e): 21.92   actor gain: -0.33   critic loss: 0.40   steps: 2386


training loop:   4% |#                                | ETA:  33 days, 21:50:49

Episode: 2387   score: 22.05   Avg score (100e): 21.92   actor gain: -0.33   critic loss: 0.40   steps: 2387


training loop:   4% |#                                | ETA:  33 days, 21:53:39

Episode: 2388   score: 22.06   Avg score (100e): 21.92   actor gain: -0.33   critic loss: 0.40   steps: 2388


training loop:   4% |#                                | ETA:  33 days, 21:55:41

Episode: 2389   score: 22.06   Avg score (100e): 21.93   actor gain: -0.33   critic loss: 0.40   steps: 2389
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 21:55:54

Episode: 2390   score: 22.07   Avg score (100e): 21.93   actor gain: -0.33   critic loss: 0.40   steps: 2390


training loop:   4% |#                                | ETA:  33 days, 22:00:10

Episode: 2391   score: 22.07   Avg score (100e): 21.93   actor gain: -0.33   critic loss: 0.40   steps: 2391


training loop:   4% |#                                | ETA:  33 days, 22:01:17

Episode: 2392   score: 22.08   Avg score (100e): 21.94   actor gain: -0.33   critic loss: 0.40   steps: 2392


training loop:   4% |#                                | ETA:  33 days, 22:01:58

Episode: 2393   score: 22.08   Avg score (100e): 21.94   actor gain: -0.33   critic loss: 0.40   steps: 2393


training loop:   4% |#                                | ETA:  33 days, 22:03:24

Episode: 2394   score: 22.08   Avg score (100e): 21.94   actor gain: -0.33   critic loss: 0.40   steps: 2394


training loop:   4% |#                                | ETA:  33 days, 22:05:09

Episode: 2395   score: 22.08   Avg score (100e): 21.94   actor gain: -0.33   critic loss: 0.40   steps: 2395
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:07:21

Episode: 2396   score: 22.09   Avg score (100e): 21.95   actor gain: -0.33   critic loss: 0.40   steps: 2396


training loop:   4% |#                                | ETA:  33 days, 22:10:24

Episode: 2397   score: 22.09   Avg score (100e): 21.95   actor gain: -0.33   critic loss: 0.40   steps: 2397


training loop:   4% |#                                | ETA:  33 days, 22:09:34

Episode: 2398   score: 22.10   Avg score (100e): 21.95   actor gain: -0.33   critic loss: 0.40   steps: 2398


training loop:   4% |#                                | ETA:  33 days, 22:09:33

Episode: 2399   score: 22.10   Avg score (100e): 21.95   actor gain: -0.33   critic loss: 0.40   steps: 2399


training loop:   4% |#                                | ETA:  33 days, 22:09:46

Episode: 2400   score: 22.10   Avg score (100e): 21.96   actor gain: -0.33   critic loss: 0.40   steps: 2400


training loop:   4% |#                                | ETA:  33 days, 22:09:22

Episode: 2401   score: 22.11   Avg score (100e): 21.96   actor gain: -0.33   critic loss: 0.40   steps: 2401


training loop:   4% |#                                | ETA:  33 days, 22:08:08

Episode: 2402   score: 22.12   Avg score (100e): 21.96   actor gain: -0.33   critic loss: 0.40   steps: 2402


training loop:   4% |#                                | ETA:  33 days, 22:11:39

Episode: 2403   score: 22.13   Avg score (100e): 21.97   actor gain: -0.33   critic loss: 0.40   steps: 2403


training loop:   4% |#                                | ETA:  33 days, 22:13:12

Episode: 2404   score: 22.12   Avg score (100e): 21.97   actor gain: -0.33   critic loss: 0.40   steps: 2404


training loop:   4% |#                                | ETA:  33 days, 22:13:26

Episode: 2405   score: 22.12   Avg score (100e): 21.97   actor gain: -0.33   critic loss: 0.40   steps: 2405


training loop:   4% |#                                | ETA:  33 days, 22:13:19

Episode: 2406   score: 22.12   Avg score (100e): 21.98   actor gain: -0.33   critic loss: 0.40   steps: 2406


training loop:   4% |#                                | ETA:  33 days, 22:14:12

Episode: 2407   score: 22.12   Avg score (100e): 21.98   actor gain: -0.33   critic loss: 0.40   steps: 2407


training loop:   4% |#                                | ETA:  33 days, 22:13:22

Episode: 2408   score: 22.12   Avg score (100e): 21.98   actor gain: -0.33   critic loss: 0.40   steps: 2408


training loop:   4% |#                                | ETA:  33 days, 22:13:57

Episode: 2409   score: 22.13   Avg score (100e): 21.98   actor gain: -0.33   critic loss: 0.40   steps: 2409


training loop:   4% |#                                | ETA:  33 days, 22:13:42

Episode: 2410   score: 22.13   Avg score (100e): 21.99   actor gain: -0.33   critic loss: 0.40   steps: 2410


training loop:   4% |#                                | ETA:  33 days, 22:13:58

Episode: 2411   score: 22.14   Avg score (100e): 21.99   actor gain: -0.33   critic loss: 0.40   steps: 2411
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:12:37

Episode: 2412   score: 22.15   Avg score (100e): 21.99   actor gain: -0.33   critic loss: 0.40   steps: 2412


training loop:   4% |#                                | ETA:  33 days, 22:15:33

Episode: 2413   score: 22.16   Avg score (100e): 22.00   actor gain: -0.33   critic loss: 0.40   steps: 2413
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:14:47

Episode: 2414   score: 22.16   Avg score (100e): 22.00   actor gain: -0.33   critic loss: 0.40   steps: 2414


training loop:   4% |#                                | ETA:  33 days, 22:14:11

Episode: 2415   score: 22.16   Avg score (100e): 22.00   actor gain: -0.33   critic loss: 0.40   steps: 2415


training loop:   4% |#                                | ETA:  33 days, 22:13:34

Episode: 2416   score: 22.16   Avg score (100e): 22.00   actor gain: -0.33   critic loss: 0.40   steps: 2416


training loop:   4% |#                                | ETA:  33 days, 22:12:37

Episode: 2417   score: 22.16   Avg score (100e): 22.01   actor gain: -0.33   critic loss: 0.40   steps: 2417


training loop:   4% |#                                | ETA:  33 days, 22:11:53

Episode: 2418   score: 22.17   Avg score (100e): 22.01   actor gain: -0.33   critic loss: 0.40   steps: 2418


training loop:   4% |#                                | ETA:  33 days, 22:11:34

Episode: 2419   score: 22.17   Avg score (100e): 22.01   actor gain: -0.33   critic loss: 0.40   steps: 2419
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:10:26

Episode: 2420   score: 22.18   Avg score (100e): 22.02   actor gain: -0.33   critic loss: 0.40   steps: 2420
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:09:49

Episode: 2421   score: 22.18   Avg score (100e): 22.02   actor gain: -0.33   critic loss: 0.40   steps: 2421


training loop:   4% |#                                | ETA:  33 days, 22:10:39

Episode: 2422   score: 22.18   Avg score (100e): 22.02   actor gain: -0.33   critic loss: 0.40   steps: 2422


training loop:   4% |#                                | ETA:  33 days, 22:09:49

Episode: 2423   score: 22.19   Avg score (100e): 22.03   actor gain: -0.33   critic loss: 0.40   steps: 2423


training loop:   4% |#                                | ETA:  33 days, 22:08:39

Episode: 2424   score: 22.19   Avg score (100e): 22.03   actor gain: -0.33   critic loss: 0.40   steps: 2424


training loop:   4% |#                                | ETA:  33 days, 22:07:53

Episode: 2425   score: 22.19   Avg score (100e): 22.03   actor gain: -0.33   critic loss: 0.40   steps: 2425


training loop:   4% |#                                | ETA:  33 days, 22:07:26

Episode: 2426   score: 22.19   Avg score (100e): 22.03   actor gain: -0.33   critic loss: 0.40   steps: 2426


training loop:   4% |#                                | ETA:  33 days, 22:06:46

Episode: 2427   score: 22.19   Avg score (100e): 22.04   actor gain: -0.33   critic loss: 0.40   steps: 2427


training loop:   4% |#                                | ETA:  33 days, 22:05:39

Episode: 2428   score: 22.20   Avg score (100e): 22.04   actor gain: -0.33   critic loss: 0.40   steps: 2428


training loop:   4% |#                                | ETA:  33 days, 22:05:47

Episode: 2429   score: 22.20   Avg score (100e): 22.04   actor gain: -0.33   critic loss: 0.40   steps: 2429


training loop:   4% |#                                | ETA:  33 days, 22:04:34

Episode: 2430   score: 22.20   Avg score (100e): 22.05   actor gain: -0.33   critic loss: 0.40   steps: 2430
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:02:53

Episode: 2431   score: 22.21   Avg score (100e): 22.05   actor gain: -0.33   critic loss: 0.40   steps: 2431


training loop:   4% |#                                | ETA:  33 days, 22:02:08

Episode: 2432   score: 22.21   Avg score (100e): 22.05   actor gain: -0.33   critic loss: 0.40   steps: 2432


training loop:   4% |#                                | ETA:  33 days, 22:03:11

Episode: 2433   score: 22.22   Avg score (100e): 22.06   actor gain: -0.33   critic loss: 0.40   steps: 2433
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:01:38

Episode: 2434   score: 22.22   Avg score (100e): 22.06   actor gain: -0.33   critic loss: 0.40   steps: 2434


training loop:   4% |#                                | ETA:  33 days, 22:01:15

Episode: 2435   score: 22.22   Avg score (100e): 22.06   actor gain: -0.33   critic loss: 0.41   steps: 2435


training loop:   4% |#                                | ETA:  33 days, 22:05:08

Episode: 2436   score: 22.22   Avg score (100e): 22.07   actor gain: -0.33   critic loss: 0.41   steps: 2436


training loop:   4% |#                                | ETA:  33 days, 22:05:14

Episode: 2437   score: 22.23   Avg score (100e): 22.07   actor gain: -0.33   critic loss: 0.41   steps: 2437
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:04:29

Episode: 2438   score: 22.23   Avg score (100e): 22.07   actor gain: -0.33   critic loss: 0.41   steps: 2438


training loop:   4% |#                                | ETA:  33 days, 22:05:27

Episode: 2439   score: 22.23   Avg score (100e): 22.08   actor gain: -0.33   critic loss: 0.41   steps: 2439


training loop:   4% |#                                | ETA:  33 days, 22:05:14

Episode: 2440   score: 22.23   Avg score (100e): 22.08   actor gain: -0.33   critic loss: 0.41   steps: 2440


training loop:   4% |#                                | ETA:  33 days, 22:12:44

Episode: 2441   score: 22.24   Avg score (100e): 22.08   actor gain: -0.33   critic loss: 0.41   steps: 2441


training loop:   4% |#                                | ETA:  33 days, 22:15:04

Episode: 2442   score: 22.24   Avg score (100e): 22.08   actor gain: -0.33   critic loss: 0.41   steps: 2442
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:15:41

Episode: 2443   score: 22.24   Avg score (100e): 22.09   actor gain: -0.33   critic loss: 0.41   steps: 2443


training loop:   4% |#                                | ETA:  33 days, 22:17:29

Episode: 2444   score: 22.25   Avg score (100e): 22.09   actor gain: -0.33   critic loss: 0.41   steps: 2444
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:17:48

Episode: 2445   score: 22.26   Avg score (100e): 22.09   actor gain: -0.33   critic loss: 0.41   steps: 2445
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:16:42

Episode: 2446   score: 22.25   Avg score (100e): 22.10   actor gain: -0.33   critic loss: 0.41   steps: 2446


training loop:   4% |#                                | ETA:  33 days, 22:17:14

Episode: 2447   score: 22.26   Avg score (100e): 22.10   actor gain: -0.33   critic loss: 0.41   steps: 2447


training loop:   4% |#                                | ETA:  33 days, 22:18:52

Episode: 2448   score: 22.27   Avg score (100e): 22.10   actor gain: -0.33   critic loss: 0.41   steps: 2448


training loop:   4% |#                                | ETA:  33 days, 22:18:47

Episode: 2449   score: 22.26   Avg score (100e): 22.11   actor gain: -0.33   critic loss: 0.41   steps: 2449


training loop:   4% |#                                | ETA:  33 days, 22:18:32

Episode: 2450   score: 22.26   Avg score (100e): 22.11   actor gain: -0.33   critic loss: 0.41   steps: 2450


training loop:   4% |#                                | ETA:  33 days, 22:18:35

Episode: 2451   score: 22.27   Avg score (100e): 22.11   actor gain: -0.33   critic loss: 0.41   steps: 2451
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:17:06

Episode: 2452   score: 22.27   Avg score (100e): 22.12   actor gain: -0.33   critic loss: 0.41   steps: 2452


training loop:   4% |#                                | ETA:  33 days, 22:16:32

Episode: 2453   score: 22.28   Avg score (100e): 22.12   actor gain: -0.33   critic loss: 0.41   steps: 2453


training loop:   4% |#                                | ETA:  33 days, 22:16:16

Episode: 2454   score: 22.28   Avg score (100e): 22.12   actor gain: -0.33   critic loss: 0.41   steps: 2454


training loop:   4% |#                                | ETA:  33 days, 22:16:41

Episode: 2455   score: 22.28   Avg score (100e): 22.13   actor gain: -0.33   critic loss: 0.41   steps: 2455
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:15:22

Episode: 2456   score: 22.29   Avg score (100e): 22.13   actor gain: -0.33   critic loss: 0.41   steps: 2456


training loop:   4% |#                                | ETA:  33 days, 22:14:45

Episode: 2457   score: 22.29   Avg score (100e): 22.13   actor gain: -0.33   critic loss: 0.40   steps: 2457
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:14:18

Episode: 2458   score: 22.30   Avg score (100e): 22.13   actor gain: -0.33   critic loss: 0.40   steps: 2458
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:13:30

Episode: 2459   score: 22.30   Avg score (100e): 22.14   actor gain: -0.33   critic loss: 0.40   steps: 2459


training loop:   4% |#                                | ETA:  33 days, 22:14:17

Episode: 2460   score: 22.31   Avg score (100e): 22.14   actor gain: -0.33   critic loss: 0.40   steps: 2460


training loop:   4% |#                                | ETA:  33 days, 22:14:35

Episode: 2461   score: 22.31   Avg score (100e): 22.14   actor gain: -0.33   critic loss: 0.40   steps: 2461
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:14:04

Episode: 2462   score: 22.31   Avg score (100e): 22.15   actor gain: -0.33   critic loss: 0.40   steps: 2462
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:13:13

Episode: 2463   score: 22.32   Avg score (100e): 22.15   actor gain: -0.33   critic loss: 0.40   steps: 2463


training loop:   4% |#                                | ETA:  33 days, 22:13:58

Episode: 2464   score: 22.32   Avg score (100e): 22.15   actor gain: -0.33   critic loss: 0.40   steps: 2464


training loop:   4% |#                                | ETA:  33 days, 22:13:52

Episode: 2465   score: 22.32   Avg score (100e): 22.16   actor gain: -0.33   critic loss: 0.40   steps: 2465
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:13:04

Episode: 2466   score: 22.33   Avg score (100e): 22.16   actor gain: -0.33   critic loss: 0.40   steps: 2466


training loop:   4% |#                                | ETA:  33 days, 22:12:36

Episode: 2467   score: 22.33   Avg score (100e): 22.16   actor gain: -0.33   critic loss: 0.40   steps: 2467
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:16:24

Episode: 2468   score: 22.34   Avg score (100e): 22.17   actor gain: -0.33   critic loss: 0.40   steps: 2468


training loop:   4% |#                                | ETA:  33 days, 22:16:51

Episode: 2469   score: 22.34   Avg score (100e): 22.17   actor gain: -0.33   critic loss: 0.40   steps: 2469
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:16:44

Episode: 2470   score: 22.34   Avg score (100e): 22.17   actor gain: -0.33   critic loss: 0.40   steps: 2470


training loop:   4% |#                                | ETA:  33 days, 22:20:07

Episode: 2471   score: 22.34   Avg score (100e): 22.18   actor gain: -0.33   critic loss: 0.40   steps: 2471


training loop:   4% |#                                | ETA:  33 days, 22:22:35

Episode: 2472   score: 22.35   Avg score (100e): 22.18   actor gain: -0.33   critic loss: 0.40   steps: 2472


training loop:   4% |#                                | ETA:  33 days, 22:22:44

Episode: 2473   score: 22.35   Avg score (100e): 22.18   actor gain: -0.33   critic loss: 0.40   steps: 2473


training loop:   4% |#                                | ETA:  33 days, 22:24:02

Episode: 2474   score: 22.36   Avg score (100e): 22.19   actor gain: -0.33   critic loss: 0.40   steps: 2474


training loop:   4% |#                                | ETA:  33 days, 22:23:44

Episode: 2475   score: 22.36   Avg score (100e): 22.19   actor gain: -0.33   critic loss: 0.40   steps: 2475


training loop:   4% |#                                | ETA:  33 days, 22:23:47

Episode: 2476   score: 22.37   Avg score (100e): 22.19   actor gain: -0.33   critic loss: 0.40   steps: 2476


training loop:   4% |#                                | ETA:  33 days, 22:24:41

Episode: 2477   score: 22.37   Avg score (100e): 22.20   actor gain: -0.33   critic loss: 0.40   steps: 2477
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:23:51

Episode: 2478   score: 22.37   Avg score (100e): 22.20   actor gain: -0.33   critic loss: 0.40   steps: 2478


training loop:   4% |#                                | ETA:  33 days, 22:23:51

Episode: 2479   score: 22.37   Avg score (100e): 22.20   actor gain: -0.33   critic loss: 0.40   steps: 2479


training loop:   4% |#                                | ETA:  33 days, 22:23:16

Episode: 2480   score: 22.38   Avg score (100e): 22.21   actor gain: -0.33   critic loss: 0.40   steps: 2480


training loop:   4% |#                                | ETA:  33 days, 22:22:12

Episode: 2481   score: 22.38   Avg score (100e): 22.21   actor gain: -0.33   critic loss: 0.40   steps: 2481


training loop:   4% |#                                | ETA:  33 days, 22:22:46

Episode: 2482   score: 22.39   Avg score (100e): 22.21   actor gain: -0.33   critic loss: 0.40   steps: 2482


training loop:   4% |#                                | ETA:  33 days, 22:24:00

Episode: 2483   score: 22.39   Avg score (100e): 22.22   actor gain: -0.33   critic loss: 0.40   steps: 2483


training loop:   4% |#                                | ETA:  33 days, 22:23:34

Episode: 2484   score: 22.39   Avg score (100e): 22.22   actor gain: -0.33   critic loss: 0.40   steps: 2484


training loop:   4% |#                                | ETA:  33 days, 22:24:03

Episode: 2485   score: 22.40   Avg score (100e): 22.22   actor gain: -0.33   critic loss: 0.40   steps: 2485


training loop:   4% |#                                | ETA:  33 days, 22:23:56

Episode: 2486   score: 22.41   Avg score (100e): 22.23   actor gain: -0.33   critic loss: 0.40   steps: 2486
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:24:07

Episode: 2487   score: 22.41   Avg score (100e): 22.23   actor gain: -0.33   critic loss: 0.40   steps: 2487


training loop:   4% |#                                | ETA:  33 days, 22:24:59

Episode: 2488   score: 22.41   Avg score (100e): 22.23   actor gain: -0.33   critic loss: 0.40   steps: 2488


training loop:   4% |#                                | ETA:  33 days, 22:25:59

Episode: 2489   score: 22.41   Avg score (100e): 22.24   actor gain: -0.33   critic loss: 0.40   steps: 2489
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:28:01

Episode: 2490   score: 22.41   Avg score (100e): 22.24   actor gain: -0.33   critic loss: 0.40   steps: 2490
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:28:41

Episode: 2491   score: 22.42   Avg score (100e): 22.24   actor gain: -0.33   critic loss: 0.40   steps: 2491


training loop:   4% |#                                | ETA:  33 days, 22:29:10

Episode: 2492   score: 22.42   Avg score (100e): 22.25   actor gain: -0.33   critic loss: 0.40   steps: 2492


training loop:   4% |#                                | ETA:  33 days, 22:29:47

Episode: 2493   score: 22.43   Avg score (100e): 22.25   actor gain: -0.33   critic loss: 0.41   steps: 2493


training loop:   4% |#                                | ETA:  33 days, 22:29:36

Episode: 2494   score: 22.43   Avg score (100e): 22.26   actor gain: -0.33   critic loss: 0.41   steps: 2494
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:28:58

Episode: 2495   score: 22.43   Avg score (100e): 22.26   actor gain: -0.33   critic loss: 0.41   steps: 2495


training loop:   4% |#                                | ETA:  33 days, 22:29:25

Episode: 2496   score: 22.43   Avg score (100e): 22.26   actor gain: -0.33   critic loss: 0.41   steps: 2496


training loop:   4% |#                                | ETA:  33 days, 22:28:30

Episode: 2497   score: 22.43   Avg score (100e): 22.27   actor gain: -0.33   critic loss: 0.41   steps: 2497
np.all(done) is true! miracle!


training loop:   4% |#                                | ETA:  33 days, 22:27:27

Episode: 2498   score: 22.44   Avg score (100e): 22.27   actor gain: -0.33   critic loss: 0.41   steps: 2498


training loop:   4% |#                                | ETA:  33 days, 22:28:33

Episode: 2499   score: 22.44   Avg score (100e): 22.27   actor gain: -0.33   critic loss: 0.40   steps: 2499


training loop:   4% |#                                | ETA:  33 days, 22:32:04

Episode: 2500   score: 22.44   Avg score (100e): 22.28   actor gain: -0.33   critic loss: 0.40   steps: 2500


training loop:   5% |#                                | ETA:  33 days, 22:31:31

Episode: 2501   score: 22.44   Avg score (100e): 22.28   actor gain: -0.33   critic loss: 0.40   steps: 2501


training loop:   5% |#                                | ETA:  33 days, 22:32:21

Episode: 2502   score: 22.44   Avg score (100e): 22.28   actor gain: -0.33   critic loss: 0.40   steps: 2502


training loop:   5% |#                                | ETA:  33 days, 22:33:07

Episode: 2503   score: 22.45   Avg score (100e): 22.29   actor gain: -0.33   critic loss: 0.40   steps: 2503
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 22:32:29

Episode: 2504   score: 22.45   Avg score (100e): 22.29   actor gain: -0.33   critic loss: 0.40   steps: 2504
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 22:32:37

Episode: 2505   score: 22.45   Avg score (100e): 22.29   actor gain: -0.33   critic loss: 0.40   steps: 2505
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 22:32:56

Episode: 2506   score: 22.46   Avg score (100e): 22.30   actor gain: -0.33   critic loss: 0.40   steps: 2506


training loop:   5% |#                                | ETA:  33 days, 22:33:17

Episode: 2507   score: 22.46   Avg score (100e): 22.30   actor gain: -0.33   critic loss: 0.40   steps: 2507


training loop:   5% |#                                | ETA:  33 days, 22:32:49

Episode: 2508   score: 22.47   Avg score (100e): 22.30   actor gain: -0.33   critic loss: 0.40   steps: 2508


training loop:   5% |#                                | ETA:  33 days, 22:34:35

Episode: 2509   score: 22.47   Avg score (100e): 22.31   actor gain: -0.33   critic loss: 0.40   steps: 2509
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 22:36:40

Episode: 2510   score: 22.48   Avg score (100e): 22.31   actor gain: -0.33   critic loss: 0.40   steps: 2510
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 22:36:54

Episode: 2511   score: 22.48   Avg score (100e): 22.31   actor gain: -0.33   critic loss: 0.40   steps: 2511


training loop:   5% |#                                | ETA:  33 days, 22:37:01

Episode: 2512   score: 22.48   Avg score (100e): 22.32   actor gain: -0.33   critic loss: 0.40   steps: 2512


training loop:   5% |#                                | ETA:  33 days, 22:37:06

Episode: 2513   score: 22.48   Avg score (100e): 22.32   actor gain: -0.33   critic loss: 0.40   steps: 2513


training loop:   5% |#                                | ETA:  33 days, 22:38:42

Episode: 2514   score: 22.48   Avg score (100e): 22.32   actor gain: -0.33   critic loss: 0.40   steps: 2514


training loop:   5% |#                                | ETA:  33 days, 22:39:48

Episode: 2515   score: 22.48   Avg score (100e): 22.33   actor gain: -0.33   critic loss: 0.40   steps: 2515


training loop:   5% |#                                | ETA:  33 days, 22:39:36

Episode: 2516   score: 22.49   Avg score (100e): 22.33   actor gain: -0.33   critic loss: 0.40   steps: 2516


training loop:   5% |#                                | ETA:  33 days, 22:45:00

Episode: 2517   score: 22.50   Avg score (100e): 22.33   actor gain: -0.33   critic loss: 0.40   steps: 2517


training loop:   5% |#                                | ETA:  33 days, 22:52:54

Episode: 2518   score: 22.50   Avg score (100e): 22.34   actor gain: -0.33   critic loss: 0.40   steps: 2518


training loop:   5% |#                                | ETA:  33 days, 22:54:34

Episode: 2519   score: 22.50   Avg score (100e): 22.34   actor gain: -0.33   critic loss: 0.40   steps: 2519


training loop:   5% |#                                | ETA:  33 days, 22:56:58

Episode: 2520   score: 22.51   Avg score (100e): 22.34   actor gain: -0.33   critic loss: 0.40   steps: 2520


training loop:   5% |#                                | ETA:  33 days, 23:00:23

Episode: 2521   score: 22.51   Avg score (100e): 22.35   actor gain: -0.33   critic loss: 0.40   steps: 2521
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 23:02:54

Episode: 2522   score: 22.51   Avg score (100e): 22.35   actor gain: -0.33   critic loss: 0.40   steps: 2522


training loop:   5% |#                                | ETA:  33 days, 23:05:33

Episode: 2523   score: 22.51   Avg score (100e): 22.35   actor gain: -0.33   critic loss: 0.40   steps: 2523
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 23:13:05

Episode: 2524   score: 22.52   Avg score (100e): 22.36   actor gain: -0.33   critic loss: 0.40   steps: 2524


training loop:   5% |#                                | ETA:  33 days, 23:20:05

Episode: 2525   score: 22.53   Avg score (100e): 22.36   actor gain: -0.33   critic loss: 0.40   steps: 2525


training loop:   5% |#                                | ETA:  33 days, 23:24:44

Episode: 2526   score: 22.53   Avg score (100e): 22.36   actor gain: -0.33   critic loss: 0.40   steps: 2526


training loop:   5% |#                                | ETA:  33 days, 23:28:44

Episode: 2527   score: 22.54   Avg score (100e): 22.37   actor gain: -0.33   critic loss: 0.40   steps: 2527


training loop:   5% |#                                | ETA:  33 days, 23:31:48

Episode: 2528   score: 22.54   Avg score (100e): 22.37   actor gain: -0.33   critic loss: 0.40   steps: 2528


training loop:   5% |#                                | ETA:  33 days, 23:35:38

Episode: 2529   score: 22.54   Avg score (100e): 22.37   actor gain: -0.33   critic loss: 0.40   steps: 2529
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 23:37:56

Episode: 2530   score: 22.54   Avg score (100e): 22.38   actor gain: -0.33   critic loss: 0.40   steps: 2530


training loop:   5% |#                                | ETA:  33 days, 23:40:28

Episode: 2531   score: 22.54   Avg score (100e): 22.38   actor gain: -0.33   critic loss: 0.40   steps: 2531


training loop:   5% |#                                | ETA:  33 days, 23:43:07

Episode: 2532   score: 22.55   Avg score (100e): 22.38   actor gain: -0.33   critic loss: 0.40   steps: 2532
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 23:48:00

Episode: 2533   score: 22.55   Avg score (100e): 22.39   actor gain: -0.33   critic loss: 0.40   steps: 2533


training loop:   5% |#                                | ETA:  33 days, 23:52:27

Episode: 2534   score: 22.55   Avg score (100e): 22.39   actor gain: -0.33   critic loss: 0.40   steps: 2534
np.all(done) is true! miracle!


training loop:   5% |#                                | ETA:  33 days, 23:51:30

Episode: 2535   score: 22.55   Avg score (100e): 22.39   actor gain: -0.33   critic loss: 0.40   steps: 2535


training loop:   5% |#                                | ETA:  33 days, 23:56:08

Episode: 2536   score: 22.56   Avg score (100e): 22.40   actor gain: -0.33   critic loss: 0.40   steps: 2536


training loop:   5% |#                                 | ETA:  34 days, 0:01:08

Episode: 2537   score: 22.56   Avg score (100e): 22.40   actor gain: -0.33   critic loss: 0.40   steps: 2537


training loop:   5% |#                                 | ETA:  34 days, 0:05:39

Episode: 2538   score: 22.56   Avg score (100e): 22.40   actor gain: -0.33   critic loss: 0.40   steps: 2538


training loop:   5% |#                                 | ETA:  34 days, 0:07:49

Episode: 2539   score: 22.56   Avg score (100e): 22.41   actor gain: -0.33   critic loss: 0.40   steps: 2539


training loop:   5% |#                                 | ETA:  34 days, 0:13:22

Episode: 2540   score: 22.56   Avg score (100e): 22.41   actor gain: -0.33   critic loss: 0.40   steps: 2540


training loop:   5% |#                                 | ETA:  34 days, 0:23:36

Episode: 2541   score: 22.57   Avg score (100e): 22.41   actor gain: -0.33   critic loss: 0.40   steps: 2541


training loop:   5% |#                                 | ETA:  34 days, 0:27:11

Episode: 2542   score: 22.57   Avg score (100e): 22.42   actor gain: -0.33   critic loss: 0.40   steps: 2542


training loop:   5% |#                                 | ETA:  34 days, 0:34:02

Episode: 2543   score: 22.58   Avg score (100e): 22.42   actor gain: -0.33   critic loss: 0.40   steps: 2543


training loop:   5% |#                                 | ETA:  34 days, 0:35:31

Episode: 2544   score: 22.58   Avg score (100e): 22.42   actor gain: -0.33   critic loss: 0.40   steps: 2544


training loop:   5% |#                                 | ETA:  34 days, 0:36:13

Episode: 2545   score: 22.59   Avg score (100e): 22.43   actor gain: -0.33   critic loss: 0.40   steps: 2545


training loop:   5% |#                                 | ETA:  34 days, 0:36:11

Episode: 2546   score: 22.58   Avg score (100e): 22.43   actor gain: -0.33   critic loss: 0.40   steps: 2546


training loop:   5% |#                                 | ETA:  34 days, 0:37:03

Episode: 2547   score: 22.58   Avg score (100e): 22.43   actor gain: -0.33   critic loss: 0.40   steps: 2547


training loop:   5% |#                                 | ETA:  34 days, 0:38:10

Episode: 2548   score: 22.59   Avg score (100e): 22.44   actor gain: -0.33   critic loss: 0.40   steps: 2548


training loop:   5% |#                                 | ETA:  34 days, 0:37:35

Episode: 2549   score: 22.59   Avg score (100e): 22.44   actor gain: -0.33   critic loss: 0.40   steps: 2549


training loop:   5% |#                                 | ETA:  34 days, 0:38:44

Episode: 2550   score: 22.59   Avg score (100e): 22.44   actor gain: -0.33   critic loss: 0.40   steps: 2550
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 0:37:55

Episode: 2551   score: 22.60   Avg score (100e): 22.44   actor gain: -0.33   critic loss: 0.40   steps: 2551


training loop:   5% |#                                 | ETA:  34 days, 0:38:08

Episode: 2552   score: 22.60   Avg score (100e): 22.45   actor gain: -0.33   critic loss: 0.40   steps: 2552


training loop:   5% |#                                 | ETA:  34 days, 0:38:19

Episode: 2553   score: 22.60   Avg score (100e): 22.45   actor gain: -0.33   critic loss: 0.40   steps: 2553


training loop:   5% |#                                 | ETA:  34 days, 0:40:48

Episode: 2554   score: 22.61   Avg score (100e): 22.45   actor gain: -0.33   critic loss: 0.40   steps: 2554


training loop:   5% |#                                 | ETA:  34 days, 0:42:46

Episode: 2555   score: 22.61   Avg score (100e): 22.46   actor gain: -0.33   critic loss: 0.40   steps: 2555


training loop:   5% |#                                 | ETA:  34 days, 0:43:31

Episode: 2556   score: 22.62   Avg score (100e): 22.46   actor gain: -0.33   critic loss: 0.40   steps: 2556


training loop:   5% |#                                 | ETA:  34 days, 0:44:33

Episode: 2557   score: 22.62   Avg score (100e): 22.46   actor gain: -0.33   critic loss: 0.40   steps: 2557


training loop:   5% |#                                 | ETA:  34 days, 0:44:49

Episode: 2558   score: 22.62   Avg score (100e): 22.47   actor gain: -0.33   critic loss: 0.40   steps: 2558


training loop:   5% |#                                 | ETA:  34 days, 0:45:23

Episode: 2559   score: 22.62   Avg score (100e): 22.47   actor gain: -0.33   critic loss: 0.40   steps: 2559


training loop:   5% |#                                 | ETA:  34 days, 0:46:35

Episode: 2560   score: 22.62   Avg score (100e): 22.47   actor gain: -0.33   critic loss: 0.40   steps: 2560


training loop:   5% |#                                 | ETA:  34 days, 0:47:31

Episode: 2561   score: 22.62   Avg score (100e): 22.48   actor gain: -0.33   critic loss: 0.40   steps: 2561


training loop:   5% |#                                 | ETA:  34 days, 0:56:13

Episode: 2562   score: 22.62   Avg score (100e): 22.48   actor gain: -0.33   critic loss: 0.40   steps: 2562


training loop:   5% |#                                 | ETA:  34 days, 1:03:10

Episode: 2563   score: 22.63   Avg score (100e): 22.48   actor gain: -0.33   critic loss: 0.40   steps: 2563


training loop:   5% |#                                 | ETA:  34 days, 1:03:13

Episode: 2564   score: 22.63   Avg score (100e): 22.49   actor gain: -0.33   critic loss: 0.40   steps: 2564


training loop:   5% |#                                 | ETA:  34 days, 1:08:04

Episode: 2565   score: 22.63   Avg score (100e): 22.49   actor gain: -0.33   critic loss: 0.40   steps: 2565


training loop:   5% |#                                 | ETA:  34 days, 1:12:36

Episode: 2566   score: 22.63   Avg score (100e): 22.49   actor gain: -0.33   critic loss: 0.40   steps: 2566


training loop:   5% |#                                 | ETA:  34 days, 1:13:23

Episode: 2567   score: 22.63   Avg score (100e): 22.50   actor gain: -0.33   critic loss: 0.40   steps: 2567
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:17:03

Episode: 2568   score: 22.63   Avg score (100e): 22.50   actor gain: -0.33   critic loss: 0.40   steps: 2568


training loop:   5% |#                                 | ETA:  34 days, 1:20:25

Episode: 2569   score: 22.63   Avg score (100e): 22.50   actor gain: -0.33   critic loss: 0.40   steps: 2569


training loop:   5% |#                                 | ETA:  34 days, 1:22:58

Episode: 2570   score: 22.64   Avg score (100e): 22.50   actor gain: -0.33   critic loss: 0.40   steps: 2570


training loop:   5% |#                                 | ETA:  34 days, 1:22:56

Episode: 2571   score: 22.64   Avg score (100e): 22.51   actor gain: -0.33   critic loss: 0.40   steps: 2571


training loop:   5% |#                                 | ETA:  34 days, 1:22:29

Episode: 2572   score: 22.64   Avg score (100e): 22.51   actor gain: -0.33   critic loss: 0.40   steps: 2572


training loop:   5% |#                                 | ETA:  34 days, 1:23:22

Episode: 2573   score: 22.64   Avg score (100e): 22.51   actor gain: -0.33   critic loss: 0.40   steps: 2573
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:24:06

Episode: 2574   score: 22.65   Avg score (100e): 22.52   actor gain: -0.33   critic loss: 0.40   steps: 2574
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:24:27

Episode: 2575   score: 22.65   Avg score (100e): 22.52   actor gain: -0.33   critic loss: 0.40   steps: 2575
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:25:39

Episode: 2576   score: 22.66   Avg score (100e): 22.52   actor gain: -0.33   critic loss: 0.40   steps: 2576
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:33:13

Episode: 2577   score: 22.66   Avg score (100e): 22.52   actor gain: -0.33   critic loss: 0.40   steps: 2577
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:34:33

Episode: 2578   score: 22.66   Avg score (100e): 22.53   actor gain: -0.33   critic loss: 0.40   steps: 2578


training loop:   5% |#                                 | ETA:  34 days, 1:35:23

Episode: 2579   score: 22.67   Avg score (100e): 22.53   actor gain: -0.33   critic loss: 0.40   steps: 2579
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:38:09

Episode: 2580   score: 22.67   Avg score (100e): 22.53   actor gain: -0.33   critic loss: 0.40   steps: 2580


training loop:   5% |#                                 | ETA:  34 days, 1:39:33

Episode: 2581   score: 22.67   Avg score (100e): 22.54   actor gain: -0.33   critic loss: 0.40   steps: 2581


training loop:   5% |#                                 | ETA:  34 days, 1:41:02

Episode: 2582   score: 22.67   Avg score (100e): 22.54   actor gain: -0.33   critic loss: 0.40   steps: 2582
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:39:47

Episode: 2583   score: 22.67   Avg score (100e): 22.54   actor gain: -0.33   critic loss: 0.40   steps: 2583
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:39:03

Episode: 2584   score: 22.67   Avg score (100e): 22.54   actor gain: -0.33   critic loss: 0.40   steps: 2584


training loop:   5% |#                                 | ETA:  34 days, 1:39:21

Episode: 2585   score: 22.67   Avg score (100e): 22.55   actor gain: -0.33   critic loss: 0.40   steps: 2585


training loop:   5% |#                                 | ETA:  34 days, 1:40:33

Episode: 2586   score: 22.67   Avg score (100e): 22.55   actor gain: -0.33   critic loss: 0.40   steps: 2586
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:39:29

Episode: 2587   score: 22.67   Avg score (100e): 22.55   actor gain: -0.33   critic loss: 0.40   steps: 2587


training loop:   5% |#                                 | ETA:  34 days, 1:40:01

Episode: 2588   score: 22.68   Avg score (100e): 22.56   actor gain: -0.33   critic loss: 0.40   steps: 2588
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:39:24

Episode: 2589   score: 22.68   Avg score (100e): 22.56   actor gain: -0.33   critic loss: 0.40   steps: 2589


training loop:   5% |#                                 | ETA:  34 days, 1:40:59

Episode: 2590   score: 22.69   Avg score (100e): 22.56   actor gain: -0.33   critic loss: 0.40   steps: 2590


training loop:   5% |#                                 | ETA:  34 days, 1:41:32

Episode: 2591   score: 22.70   Avg score (100e): 22.56   actor gain: -0.33   critic loss: 0.40   steps: 2591


training loop:   5% |#                                 | ETA:  34 days, 1:43:52

Episode: 2592   score: 22.70   Avg score (100e): 22.57   actor gain: -0.33   critic loss: 0.40   steps: 2592
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:45:34

Episode: 2593   score: 22.71   Avg score (100e): 22.57   actor gain: -0.33   critic loss: 0.40   steps: 2593


training loop:   5% |#                                 | ETA:  34 days, 1:46:08

Episode: 2594   score: 22.71   Avg score (100e): 22.57   actor gain: -0.33   critic loss: 0.40   steps: 2594


training loop:   5% |#                                 | ETA:  34 days, 1:45:41

Episode: 2595   score: 22.71   Avg score (100e): 22.57   actor gain: -0.33   critic loss: 0.40   steps: 2595


training loop:   5% |#                                 | ETA:  34 days, 1:46:48

Episode: 2596   score: 22.72   Avg score (100e): 22.58   actor gain: -0.33   critic loss: 0.40   steps: 2596
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:48:44

Episode: 2597   score: 22.72   Avg score (100e): 22.58   actor gain: -0.33   critic loss: 0.40   steps: 2597


training loop:   5% |#                                 | ETA:  34 days, 1:48:51

Episode: 2598   score: 22.72   Avg score (100e): 22.58   actor gain: -0.33   critic loss: 0.40   steps: 2598


training loop:   5% |#                                 | ETA:  34 days, 1:51:16

Episode: 2599   score: 22.72   Avg score (100e): 22.59   actor gain: -0.33   critic loss: 0.40   steps: 2599


training loop:   5% |#                                 | ETA:  34 days, 1:51:35

Episode: 2600   score: 22.73   Avg score (100e): 22.59   actor gain: -0.33   critic loss: 0.40   steps: 2600


training loop:   5% |#                                 | ETA:  34 days, 1:51:01

Episode: 2601   score: 22.73   Avg score (100e): 22.59   actor gain: -0.33   critic loss: 0.41   steps: 2601


training loop:   5% |#                                 | ETA:  34 days, 1:49:42

Episode: 2602   score: 22.74   Avg score (100e): 22.60   actor gain: -0.33   critic loss: 0.40   steps: 2602


training loop:   5% |#                                 | ETA:  34 days, 1:49:38

Episode: 2603   score: 22.74   Avg score (100e): 22.60   actor gain: -0.33   critic loss: 0.40   steps: 2603


training loop:   5% |#                                 | ETA:  34 days, 1:48:59

Episode: 2604   score: 22.74   Avg score (100e): 22.60   actor gain: -0.33   critic loss: 0.40   steps: 2604


training loop:   5% |#                                 | ETA:  34 days, 1:48:42

Episode: 2605   score: 22.74   Avg score (100e): 22.60   actor gain: -0.33   critic loss: 0.40   steps: 2605


training loop:   5% |#                                 | ETA:  34 days, 1:47:30

Episode: 2606   score: 22.74   Avg score (100e): 22.61   actor gain: -0.33   critic loss: 0.40   steps: 2606


training loop:   5% |#                                 | ETA:  34 days, 1:46:06

Episode: 2607   score: 22.74   Avg score (100e): 22.61   actor gain: -0.33   critic loss: 0.40   steps: 2607


training loop:   5% |#                                 | ETA:  34 days, 1:45:23

Episode: 2608   score: 22.75   Avg score (100e): 22.61   actor gain: -0.33   critic loss: 0.40   steps: 2608


training loop:   5% |#                                 | ETA:  34 days, 1:44:42

Episode: 2609   score: 22.75   Avg score (100e): 22.61   actor gain: -0.33   critic loss: 0.40   steps: 2609


training loop:   5% |#                                 | ETA:  34 days, 1:44:05

Episode: 2610   score: 22.75   Avg score (100e): 22.62   actor gain: -0.33   critic loss: 0.40   steps: 2610


training loop:   5% |#                                 | ETA:  34 days, 1:42:16

Episode: 2611   score: 22.75   Avg score (100e): 22.62   actor gain: -0.33   critic loss: 0.40   steps: 2611


training loop:   5% |#                                 | ETA:  34 days, 1:46:58

Episode: 2612   score: 22.75   Avg score (100e): 22.62   actor gain: -0.33   critic loss: 0.40   steps: 2612
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:51:28

Episode: 2613   score: 22.76   Avg score (100e): 22.63   actor gain: -0.33   critic loss: 0.40   steps: 2613


training loop:   5% |#                                 | ETA:  34 days, 1:52:23

Episode: 2614   score: 22.76   Avg score (100e): 22.63   actor gain: -0.33   critic loss: 0.40   steps: 2614


training loop:   5% |#                                 | ETA:  34 days, 1:52:46

Episode: 2615   score: 22.76   Avg score (100e): 22.63   actor gain: -0.32   critic loss: 0.40   steps: 2615


training loop:   5% |#                                 | ETA:  34 days, 1:53:31

Episode: 2616   score: 22.77   Avg score (100e): 22.63   actor gain: -0.32   critic loss: 0.40   steps: 2616


training loop:   5% |#                                 | ETA:  34 days, 1:53:09

Episode: 2617   score: 22.77   Avg score (100e): 22.64   actor gain: -0.32   critic loss: 0.40   steps: 2617


training loop:   5% |#                                 | ETA:  34 days, 1:54:02

Episode: 2618   score: 22.77   Avg score (100e): 22.64   actor gain: -0.33   critic loss: 0.40   steps: 2618


training loop:   5% |#                                 | ETA:  34 days, 1:52:15

Episode: 2619   score: 22.77   Avg score (100e): 22.64   actor gain: -0.32   critic loss: 0.40   steps: 2619


training loop:   5% |#                                 | ETA:  34 days, 1:52:26

Episode: 2620   score: 22.77   Avg score (100e): 22.65   actor gain: -0.32   critic loss: 0.40   steps: 2620


training loop:   5% |#                                 | ETA:  34 days, 1:53:03

Episode: 2621   score: 22.76   Avg score (100e): 22.65   actor gain: -0.32   critic loss: 0.40   steps: 2621


training loop:   5% |#                                 | ETA:  34 days, 1:53:19

Episode: 2622   score: 22.77   Avg score (100e): 22.65   actor gain: -0.32   critic loss: 0.40   steps: 2622


training loop:   5% |#                                 | ETA:  34 days, 1:52:20

Episode: 2623   score: 22.78   Avg score (100e): 22.65   actor gain: -0.32   critic loss: 0.40   steps: 2623


training loop:   5% |#                                 | ETA:  34 days, 1:52:28

Episode: 2624   score: 22.78   Avg score (100e): 22.66   actor gain: -0.32   critic loss: 0.40   steps: 2624
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:53:40

Episode: 2625   score: 22.78   Avg score (100e): 22.66   actor gain: -0.32   critic loss: 0.40   steps: 2625


training loop:   5% |#                                 | ETA:  34 days, 1:52:53

Episode: 2626   score: 22.78   Avg score (100e): 22.66   actor gain: -0.32   critic loss: 0.40   steps: 2626
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:54:55

Episode: 2627   score: 22.78   Avg score (100e): 22.66   actor gain: -0.33   critic loss: 0.40   steps: 2627


training loop:   5% |#                                 | ETA:  34 days, 1:54:21

Episode: 2628   score: 22.79   Avg score (100e): 22.67   actor gain: -0.33   critic loss: 0.40   steps: 2628
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:53:59

Episode: 2629   score: 22.79   Avg score (100e): 22.67   actor gain: -0.33   critic loss: 0.40   steps: 2629


training loop:   5% |#                                 | ETA:  34 days, 1:56:20

Episode: 2630   score: 22.80   Avg score (100e): 22.67   actor gain: -0.33   critic loss: 0.40   steps: 2630
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:55:29

Episode: 2631   score: 22.80   Avg score (100e): 22.67   actor gain: -0.33   critic loss: 0.40   steps: 2631


training loop:   5% |#                                 | ETA:  34 days, 1:56:30

Episode: 2632   score: 22.80   Avg score (100e): 22.68   actor gain: -0.33   critic loss: 0.40   steps: 2632


training loop:   5% |#                                 | ETA:  34 days, 1:55:16

Episode: 2633   score: 22.80   Avg score (100e): 22.68   actor gain: -0.33   critic loss: 0.40   steps: 2633


training loop:   5% |#                                 | ETA:  34 days, 1:55:02

Episode: 2634   score: 22.80   Avg score (100e): 22.68   actor gain: -0.33   critic loss: 0.40   steps: 2634


training loop:   5% |#                                 | ETA:  34 days, 1:57:36

Episode: 2635   score: 22.80   Avg score (100e): 22.68   actor gain: -0.33   critic loss: 0.40   steps: 2635


training loop:   5% |#                                 | ETA:  34 days, 1:58:09

Episode: 2636   score: 22.80   Avg score (100e): 22.69   actor gain: -0.33   critic loss: 0.40   steps: 2636
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:57:46

Episode: 2637   score: 22.81   Avg score (100e): 22.69   actor gain: -0.33   critic loss: 0.40   steps: 2637
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 1:56:36

Episode: 2638   score: 22.82   Avg score (100e): 22.69   actor gain: -0.33   critic loss: 0.40   steps: 2638


training loop:   5% |#                                 | ETA:  34 days, 1:57:20

Episode: 2639   score: 22.82   Avg score (100e): 22.69   actor gain: -0.33   critic loss: 0.40   steps: 2639


training loop:   5% |#                                 | ETA:  34 days, 1:59:12

Episode: 2640   score: 22.82   Avg score (100e): 22.70   actor gain: -0.33   critic loss: 0.40   steps: 2640


training loop:   5% |#                                 | ETA:  34 days, 2:02:35

Episode: 2641   score: 22.82   Avg score (100e): 22.70   actor gain: -0.33   critic loss: 0.40   steps: 2641


training loop:   5% |#                                 | ETA:  34 days, 2:01:52

Episode: 2642   score: 22.82   Avg score (100e): 22.70   actor gain: -0.33   critic loss: 0.40   steps: 2642


training loop:   5% |#                                 | ETA:  34 days, 2:02:45

Episode: 2643   score: 22.83   Avg score (100e): 22.70   actor gain: -0.33   critic loss: 0.40   steps: 2643


training loop:   5% |#                                 | ETA:  34 days, 2:02:49

Episode: 2644   score: 22.82   Avg score (100e): 22.71   actor gain: -0.33   critic loss: 0.40   steps: 2644


training loop:   5% |#                                 | ETA:  34 days, 2:02:36

Episode: 2645   score: 22.82   Avg score (100e): 22.71   actor gain: -0.33   critic loss: 0.40   steps: 2645


training loop:   5% |#                                 | ETA:  34 days, 2:02:54

Episode: 2646   score: 22.83   Avg score (100e): 22.71   actor gain: -0.33   critic loss: 0.40   steps: 2646


training loop:   5% |#                                 | ETA:  34 days, 2:02:44

Episode: 2647   score: 22.83   Avg score (100e): 22.71   actor gain: -0.33   critic loss: 0.40   steps: 2647
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:00:41

Episode: 2648   score: 22.83   Avg score (100e): 22.72   actor gain: -0.33   critic loss: 0.40   steps: 2648
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:00:49

Episode: 2649   score: 22.83   Avg score (100e): 22.72   actor gain: -0.33   critic loss: 0.39   steps: 2649
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:01:41

Episode: 2650   score: 22.84   Avg score (100e): 22.72   actor gain: -0.33   critic loss: 0.39   steps: 2650
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:02:54

Episode: 2651   score: 22.84   Avg score (100e): 22.72   actor gain: -0.33   critic loss: 0.39   steps: 2651


training loop:   5% |#                                 | ETA:  34 days, 2:04:13

Episode: 2652   score: 22.84   Avg score (100e): 22.73   actor gain: -0.33   critic loss: 0.39   steps: 2652
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:05:30

Episode: 2653   score: 22.85   Avg score (100e): 22.73   actor gain: -0.33   critic loss: 0.39   steps: 2653


training loop:   5% |#                                 | ETA:  34 days, 2:08:09

Episode: 2654   score: 22.84   Avg score (100e): 22.73   actor gain: -0.33   critic loss: 0.39   steps: 2654


training loop:   5% |#                                 | ETA:  34 days, 2:11:40

Episode: 2655   score: 22.85   Avg score (100e): 22.73   actor gain: -0.33   critic loss: 0.39   steps: 2655
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:15:13

Episode: 2656   score: 22.85   Avg score (100e): 22.73   actor gain: -0.33   critic loss: 0.39   steps: 2656


training loop:   5% |#                                 | ETA:  34 days, 2:17:46

Episode: 2657   score: 22.85   Avg score (100e): 22.74   actor gain: -0.33   critic loss: 0.39   steps: 2657


training loop:   5% |#                                 | ETA:  34 days, 2:20:46

Episode: 2658   score: 22.85   Avg score (100e): 22.74   actor gain: -0.33   critic loss: 0.39   steps: 2658


training loop:   5% |#                                 | ETA:  34 days, 2:21:31

Episode: 2659   score: 22.85   Avg score (100e): 22.74   actor gain: -0.33   critic loss: 0.39   steps: 2659


training loop:   5% |#                                 | ETA:  34 days, 2:21:35

Episode: 2660   score: 22.85   Avg score (100e): 22.74   actor gain: -0.33   critic loss: 0.39   steps: 2660


training loop:   5% |#                                 | ETA:  34 days, 2:23:47

Episode: 2661   score: 22.86   Avg score (100e): 22.75   actor gain: -0.33   critic loss: 0.39   steps: 2661


training loop:   5% |#                                 | ETA:  34 days, 2:27:51

Episode: 2662   score: 22.86   Avg score (100e): 22.75   actor gain: -0.33   critic loss: 0.39   steps: 2662


training loop:   5% |#                                 | ETA:  34 days, 2:30:50

Episode: 2663   score: 22.86   Avg score (100e): 22.75   actor gain: -0.33   critic loss: 0.39   steps: 2663


training loop:   5% |#                                 | ETA:  34 days, 2:31:33

Episode: 2664   score: 22.87   Avg score (100e): 22.75   actor gain: -0.33   critic loss: 0.39   steps: 2664


training loop:   5% |#                                 | ETA:  34 days, 2:31:03

Episode: 2665   score: 22.87   Avg score (100e): 22.76   actor gain: -0.33   critic loss: 0.39   steps: 2665


training loop:   5% |#                                 | ETA:  34 days, 2:31:13

Episode: 2666   score: 22.87   Avg score (100e): 22.76   actor gain: -0.33   critic loss: 0.39   steps: 2666
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:30:51

Episode: 2667   score: 22.87   Avg score (100e): 22.76   actor gain: -0.33   critic loss: 0.39   steps: 2667
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:27:24

Episode: 2668   score: 22.88   Avg score (100e): 22.76   actor gain: -0.33   critic loss: 0.39   steps: 2668


training loop:   5% |#                                 | ETA:  34 days, 2:26:23

Episode: 2669   score: 22.88   Avg score (100e): 22.77   actor gain: -0.33   critic loss: 0.39   steps: 2669


training loop:   5% |#                                 | ETA:  34 days, 2:25:04

Episode: 2670   score: 22.88   Avg score (100e): 22.77   actor gain: -0.33   critic loss: 0.39   steps: 2670


training loop:   5% |#                                 | ETA:  34 days, 2:23:24

Episode: 2671   score: 22.89   Avg score (100e): 22.77   actor gain: -0.33   critic loss: 0.39   steps: 2671
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:21:57

Episode: 2672   score: 22.89   Avg score (100e): 22.77   actor gain: -0.33   critic loss: 0.39   steps: 2672


training loop:   5% |#                                 | ETA:  34 days, 2:17:47

Episode: 2673   score: 22.89   Avg score (100e): 22.78   actor gain: -0.33   critic loss: 0.39   steps: 2673


training loop:   5% |#                                 | ETA:  34 days, 2:18:59

Episode: 2674   score: 22.90   Avg score (100e): 22.78   actor gain: -0.33   critic loss: 0.39   steps: 2674


training loop:   5% |#                                 | ETA:  34 days, 2:14:58

Episode: 2675   score: 22.90   Avg score (100e): 22.78   actor gain: -0.33   critic loss: 0.39   steps: 2675


training loop:   5% |#                                 | ETA:  34 days, 2:09:50

Episode: 2676   score: 22.90   Avg score (100e): 22.78   actor gain: -0.33   critic loss: 0.39   steps: 2676


training loop:   5% |#                                 | ETA:  34 days, 2:05:52

Episode: 2677   score: 22.90   Avg score (100e): 22.79   actor gain: -0.33   critic loss: 0.39   steps: 2677


training loop:   5% |#                                 | ETA:  34 days, 2:06:27

Episode: 2678   score: 22.91   Avg score (100e): 22.79   actor gain: -0.33   critic loss: 0.39   steps: 2678


training loop:   5% |#                                 | ETA:  34 days, 2:06:49

Episode: 2679   score: 22.91   Avg score (100e): 22.79   actor gain: -0.33   critic loss: 0.39   steps: 2679


training loop:   5% |#                                 | ETA:  34 days, 2:01:49

Episode: 2680   score: 22.92   Avg score (100e): 22.79   actor gain: -0.33   critic loss: 0.39   steps: 2680


training loop:   5% |#                                 | ETA:  34 days, 2:00:06

Episode: 2681   score: 22.92   Avg score (100e): 22.80   actor gain: -0.33   critic loss: 0.39   steps: 2681
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:03:02

Episode: 2682   score: 22.93   Avg score (100e): 22.80   actor gain: -0.33   critic loss: 0.39   steps: 2682


training loop:   5% |#                                 | ETA:  34 days, 2:10:28

Episode: 2683   score: 22.93   Avg score (100e): 22.80   actor gain: -0.33   critic loss: 0.39   steps: 2683
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:15:17

Episode: 2684   score: 22.93   Avg score (100e): 22.80   actor gain: -0.32   critic loss: 0.39   steps: 2684


training loop:   5% |#                                 | ETA:  34 days, 2:17:45

Episode: 2685   score: 22.93   Avg score (100e): 22.81   actor gain: -0.32   critic loss: 0.39   steps: 2685


training loop:   5% |#                                 | ETA:  34 days, 2:20:45

Episode: 2686   score: 22.94   Avg score (100e): 22.81   actor gain: -0.32   critic loss: 0.40   steps: 2686


training loop:   5% |#                                 | ETA:  34 days, 2:21:23

Episode: 2687   score: 22.94   Avg score (100e): 22.81   actor gain: -0.32   critic loss: 0.40   steps: 2687
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:21:07

Episode: 2688   score: 22.94   Avg score (100e): 22.81   actor gain: -0.32   critic loss: 0.40   steps: 2688


training loop:   5% |#                                 | ETA:  34 days, 2:21:58

Episode: 2689   score: 22.94   Avg score (100e): 22.82   actor gain: -0.32   critic loss: 0.40   steps: 2689


training loop:   5% |#                                 | ETA:  34 days, 2:24:20

Episode: 2690   score: 22.95   Avg score (100e): 22.82   actor gain: -0.32   critic loss: 0.40   steps: 2690
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:24:21

Episode: 2691   score: 22.95   Avg score (100e): 22.82   actor gain: -0.32   critic loss: 0.40   steps: 2691


training loop:   5% |#                                 | ETA:  34 days, 2:24:44

Episode: 2692   score: 22.95   Avg score (100e): 22.82   actor gain: -0.32   critic loss: 0.40   steps: 2692


training loop:   5% |#                                 | ETA:  34 days, 2:23:58

Episode: 2693   score: 22.95   Avg score (100e): 22.83   actor gain: -0.32   critic loss: 0.40   steps: 2693


training loop:   5% |#                                 | ETA:  34 days, 2:24:58

Episode: 2694   score: 22.96   Avg score (100e): 22.83   actor gain: -0.32   critic loss: 0.40   steps: 2694


training loop:   5% |#                                 | ETA:  34 days, 2:28:28

Episode: 2695   score: 22.96   Avg score (100e): 22.83   actor gain: -0.32   critic loss: 0.40   steps: 2695


training loop:   5% |#                                 | ETA:  34 days, 2:28:09

Episode: 2696   score: 22.96   Avg score (100e): 22.83   actor gain: -0.32   critic loss: 0.40   steps: 2696


training loop:   5% |#                                 | ETA:  34 days, 2:30:01

Episode: 2697   score: 22.97   Avg score (100e): 22.84   actor gain: -0.32   critic loss: 0.40   steps: 2697
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:30:07

Episode: 2698   score: 22.97   Avg score (100e): 22.84   actor gain: -0.32   critic loss: 0.40   steps: 2698


training loop:   5% |#                                 | ETA:  34 days, 2:30:54

Episode: 2699   score: 22.98   Avg score (100e): 22.84   actor gain: -0.32   critic loss: 0.40   steps: 2699


training loop:   5% |#                                 | ETA:  34 days, 2:31:29

Episode: 2700   score: 22.98   Avg score (100e): 22.84   actor gain: -0.32   critic loss: 0.40   steps: 2700


training loop:   5% |#                                 | ETA:  34 days, 2:31:19

Episode: 2701   score: 22.98   Avg score (100e): 22.85   actor gain: -0.32   critic loss: 0.40   steps: 2701
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:32:20

Episode: 2702   score: 22.98   Avg score (100e): 22.85   actor gain: -0.32   critic loss: 0.40   steps: 2702
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:32:06

Episode: 2703   score: 22.98   Avg score (100e): 22.85   actor gain: -0.32   critic loss: 0.40   steps: 2703


training loop:   5% |#                                 | ETA:  34 days, 2:32:35

Episode: 2704   score: 22.98   Avg score (100e): 22.85   actor gain: -0.32   critic loss: 0.40   steps: 2704


training loop:   5% |#                                 | ETA:  34 days, 2:33:18

Episode: 2705   score: 22.98   Avg score (100e): 22.86   actor gain: -0.32   critic loss: 0.40   steps: 2705


training loop:   5% |#                                 | ETA:  34 days, 2:33:57

Episode: 2706   score: 22.98   Avg score (100e): 22.86   actor gain: -0.33   critic loss: 0.40   steps: 2706


training loop:   5% |#                                 | ETA:  34 days, 2:35:50

Episode: 2707   score: 22.98   Avg score (100e): 22.86   actor gain: -0.33   critic loss: 0.40   steps: 2707


training loop:   5% |#                                 | ETA:  34 days, 2:35:51

Episode: 2708   score: 22.98   Avg score (100e): 22.86   actor gain: -0.33   critic loss: 0.40   steps: 2708


training loop:   5% |#                                 | ETA:  34 days, 2:34:55

Episode: 2709   score: 22.99   Avg score (100e): 22.87   actor gain: -0.33   critic loss: 0.40   steps: 2709


training loop:   5% |#                                 | ETA:  34 days, 2:35:01

Episode: 2710   score: 22.99   Avg score (100e): 22.87   actor gain: -0.33   critic loss: 0.40   steps: 2710
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:33:56

Episode: 2711   score: 22.99   Avg score (100e): 22.87   actor gain: -0.33   critic loss: 0.40   steps: 2711
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:33:32

Episode: 2712   score: 22.99   Avg score (100e): 22.87   actor gain: -0.33   critic loss: 0.40   steps: 2712
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:33:38

Episode: 2713   score: 22.99   Avg score (100e): 22.87   actor gain: -0.33   critic loss: 0.40   steps: 2713


training loop:   5% |#                                 | ETA:  34 days, 2:36:41

Episode: 2714   score: 23.00   Avg score (100e): 22.88   actor gain: -0.33   critic loss: 0.40   steps: 2714


training loop:   5% |#                                 | ETA:  34 days, 2:38:13

Episode: 2715   score: 22.99   Avg score (100e): 22.88   actor gain: -0.33   critic loss: 0.40   steps: 2715
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:41:29

Episode: 2716   score: 23.00   Avg score (100e): 22.88   actor gain: -0.33   critic loss: 0.40   steps: 2716


training loop:   5% |#                                 | ETA:  34 days, 2:43:38

Episode: 2717   score: 23.00   Avg score (100e): 22.88   actor gain: -0.33   critic loss: 0.40   steps: 2717
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:43:32

Episode: 2718   score: 23.01   Avg score (100e): 22.89   actor gain: -0.33   critic loss: 0.40   steps: 2718
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:43:20

Episode: 2719   score: 23.01   Avg score (100e): 22.89   actor gain: -0.33   critic loss: 0.40   steps: 2719
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:44:01

Episode: 2720   score: 23.01   Avg score (100e): 22.89   actor gain: -0.33   critic loss: 0.40   steps: 2720


training loop:   5% |#                                 | ETA:  34 days, 2:43:21

Episode: 2721   score: 23.01   Avg score (100e): 22.89   actor gain: -0.33   critic loss: 0.40   steps: 2721


training loop:   5% |#                                 | ETA:  34 days, 2:42:29

Episode: 2722   score: 23.01   Avg score (100e): 22.90   actor gain: -0.33   critic loss: 0.40   steps: 2722


training loop:   5% |#                                 | ETA:  34 days, 2:41:40

Episode: 2723   score: 23.02   Avg score (100e): 22.90   actor gain: -0.33   critic loss: 0.40   steps: 2723
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:41:45

Episode: 2724   score: 23.02   Avg score (100e): 22.90   actor gain: -0.33   critic loss: 0.40   steps: 2724


training loop:   5% |#                                 | ETA:  34 days, 2:43:08

Episode: 2725   score: 23.02   Avg score (100e): 22.90   actor gain: -0.33   critic loss: 0.40   steps: 2725


training loop:   5% |#                                 | ETA:  34 days, 2:46:39

Episode: 2726   score: 23.02   Avg score (100e): 22.91   actor gain: -0.33   critic loss: 0.40   steps: 2726
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:52:25

Episode: 2727   score: 23.03   Avg score (100e): 22.91   actor gain: -0.33   critic loss: 0.40   steps: 2727


training loop:   5% |#                                 | ETA:  34 days, 2:52:57

Episode: 2728   score: 23.03   Avg score (100e): 22.91   actor gain: -0.33   critic loss: 0.40   steps: 2728


training loop:   5% |#                                 | ETA:  34 days, 2:54:54

Episode: 2729   score: 23.03   Avg score (100e): 22.91   actor gain: -0.33   critic loss: 0.40   steps: 2729


training loop:   5% |#                                 | ETA:  34 days, 2:53:24

Episode: 2730   score: 23.04   Avg score (100e): 22.92   actor gain: -0.33   critic loss: 0.40   steps: 2730


training loop:   5% |#                                 | ETA:  34 days, 2:53:52

Episode: 2731   score: 23.04   Avg score (100e): 22.92   actor gain: -0.33   critic loss: 0.40   steps: 2731


training loop:   5% |#                                 | ETA:  34 days, 2:53:57

Episode: 2732   score: 23.04   Avg score (100e): 22.92   actor gain: -0.33   critic loss: 0.40   steps: 2732


training loop:   5% |#                                 | ETA:  34 days, 2:55:40

Episode: 2733   score: 23.05   Avg score (100e): 22.92   actor gain: -0.33   critic loss: 0.40   steps: 2733


training loop:   5% |#                                 | ETA:  34 days, 2:56:53

Episode: 2734   score: 23.05   Avg score (100e): 22.92   actor gain: -0.33   critic loss: 0.40   steps: 2734


training loop:   5% |#                                 | ETA:  34 days, 2:57:02

Episode: 2735   score: 23.05   Avg score (100e): 22.93   actor gain: -0.33   critic loss: 0.40   steps: 2735


training loop:   5% |#                                 | ETA:  34 days, 2:57:34

Episode: 2736   score: 23.05   Avg score (100e): 22.93   actor gain: -0.33   critic loss: 0.40   steps: 2736


training loop:   5% |#                                 | ETA:  34 days, 2:56:53

Episode: 2737   score: 23.05   Avg score (100e): 22.93   actor gain: -0.33   critic loss: 0.40   steps: 2737
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 2:56:16

Episode: 2738   score: 23.05   Avg score (100e): 22.93   actor gain: -0.33   critic loss: 0.40   steps: 2738


training loop:   5% |#                                 | ETA:  34 days, 2:59:22

Episode: 2739   score: 23.05   Avg score (100e): 22.94   actor gain: -0.33   critic loss: 0.40   steps: 2739
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:00:22

Episode: 2740   score: 23.05   Avg score (100e): 22.94   actor gain: -0.33   critic loss: 0.40   steps: 2740


training loop:   5% |#                                 | ETA:  34 days, 3:01:40

Episode: 2741   score: 23.05   Avg score (100e): 22.94   actor gain: -0.33   critic loss: 0.40   steps: 2741


training loop:   5% |#                                 | ETA:  34 days, 3:02:41

Episode: 2742   score: 23.05   Avg score (100e): 22.94   actor gain: -0.33   critic loss: 0.40   steps: 2742
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:02:29

Episode: 2743   score: 23.05   Avg score (100e): 22.95   actor gain: -0.33   critic loss: 0.40   steps: 2743


training loop:   5% |#                                 | ETA:  34 days, 3:02:23

Episode: 2744   score: 23.05   Avg score (100e): 22.95   actor gain: -0.33   critic loss: 0.40   steps: 2744


training loop:   5% |#                                 | ETA:  34 days, 3:02:19

Episode: 2745   score: 23.06   Avg score (100e): 22.95   actor gain: -0.33   critic loss: 0.40   steps: 2745


training loop:   5% |#                                 | ETA:  34 days, 3:02:28

Episode: 2746   score: 23.05   Avg score (100e): 22.95   actor gain: -0.33   critic loss: 0.40   steps: 2746


training loop:   5% |#                                 | ETA:  34 days, 3:02:09

Episode: 2747   score: 23.06   Avg score (100e): 22.96   actor gain: -0.33   critic loss: 0.40   steps: 2747


training loop:   5% |#                                 | ETA:  34 days, 3:02:10

Episode: 2748   score: 23.06   Avg score (100e): 22.96   actor gain: -0.33   critic loss: 0.40   steps: 2748


training loop:   5% |#                                 | ETA:  34 days, 3:02:12

Episode: 2749   score: 23.06   Avg score (100e): 22.96   actor gain: -0.33   critic loss: 0.40   steps: 2749


training loop:   5% |#                                 | ETA:  34 days, 3:03:14

Episode: 2750   score: 23.06   Avg score (100e): 22.96   actor gain: -0.33   critic loss: 0.40   steps: 2750
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:02:19

Episode: 2751   score: 23.07   Avg score (100e): 22.96   actor gain: -0.33   critic loss: 0.40   steps: 2751


training loop:   5% |#                                 | ETA:  34 days, 3:02:19

Episode: 2752   score: 23.07   Avg score (100e): 22.97   actor gain: -0.33   critic loss: 0.40   steps: 2752


training loop:   5% |#                                 | ETA:  34 days, 3:01:50

Episode: 2753   score: 23.07   Avg score (100e): 22.97   actor gain: -0.33   critic loss: 0.40   steps: 2753
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:03:26

Episode: 2754   score: 23.08   Avg score (100e): 22.97   actor gain: -0.33   critic loss: 0.40   steps: 2754


training loop:   5% |#                                 | ETA:  34 days, 3:05:27

Episode: 2755   score: 23.08   Avg score (100e): 22.97   actor gain: -0.33   critic loss: 0.40   steps: 2755


training loop:   5% |#                                 | ETA:  34 days, 3:05:25

Episode: 2756   score: 23.08   Avg score (100e): 22.98   actor gain: -0.33   critic loss: 0.40   steps: 2756
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:06:19

Episode: 2757   score: 23.08   Avg score (100e): 22.98   actor gain: -0.33   critic loss: 0.40   steps: 2757


training loop:   5% |#                                 | ETA:  34 days, 3:06:20

Episode: 2758   score: 23.09   Avg score (100e): 22.98   actor gain: -0.33   critic loss: 0.40   steps: 2758


training loop:   5% |#                                 | ETA:  34 days, 3:08:23

Episode: 2759   score: 23.08   Avg score (100e): 22.98   actor gain: -0.33   critic loss: 0.40   steps: 2759


training loop:   5% |#                                 | ETA:  34 days, 3:09:59

Episode: 2760   score: 23.09   Avg score (100e): 22.99   actor gain: -0.33   critic loss: 0.40   steps: 2760


training loop:   5% |#                                 | ETA:  34 days, 3:10:08

Episode: 2761   score: 23.09   Avg score (100e): 22.99   actor gain: -0.33   critic loss: 0.40   steps: 2761


training loop:   5% |#                                 | ETA:  34 days, 3:12:07

Episode: 2762   score: 23.08   Avg score (100e): 22.99   actor gain: -0.33   critic loss: 0.40   steps: 2762
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:13:38

Episode: 2763   score: 23.08   Avg score (100e): 22.99   actor gain: -0.33   critic loss: 0.40   steps: 2763
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:13:33

Episode: 2764   score: 23.08   Avg score (100e): 22.99   actor gain: -0.33   critic loss: 0.40   steps: 2764


training loop:   5% |#                                 | ETA:  34 days, 3:15:30

Episode: 2765   score: 23.09   Avg score (100e): 23.00   actor gain: -0.33   critic loss: 0.40   steps: 2765


training loop:   5% |#                                 | ETA:  34 days, 3:15:53

Episode: 2766   score: 23.09   Avg score (100e): 23.00   actor gain: -0.33   critic loss: 0.40   steps: 2766


training loop:   5% |#                                 | ETA:  34 days, 3:15:50

Episode: 2767   score: 23.09   Avg score (100e): 23.00   actor gain: -0.33   critic loss: 0.40   steps: 2767


training loop:   5% |#                                 | ETA:  34 days, 3:14:58

Episode: 2768   score: 23.09   Avg score (100e): 23.00   actor gain: -0.33   critic loss: 0.40   steps: 2768


training loop:   5% |#                                 | ETA:  34 days, 3:15:45

Episode: 2769   score: 23.09   Avg score (100e): 23.00   actor gain: -0.33   critic loss: 0.40   steps: 2769


training loop:   5% |#                                 | ETA:  34 days, 3:16:11

Episode: 2770   score: 23.10   Avg score (100e): 23.01   actor gain: -0.33   critic loss: 0.40   steps: 2770


training loop:   5% |#                                 | ETA:  34 days, 3:15:44

Episode: 2771   score: 23.11   Avg score (100e): 23.01   actor gain: -0.33   critic loss: 0.40   steps: 2771


training loop:   5% |#                                 | ETA:  34 days, 3:18:12

Episode: 2772   score: 23.12   Avg score (100e): 23.01   actor gain: -0.33   critic loss: 0.40   steps: 2772


training loop:   5% |#                                 | ETA:  34 days, 3:19:46

Episode: 2773   score: 23.12   Avg score (100e): 23.01   actor gain: -0.33   critic loss: 0.40   steps: 2773
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:19:44

Episode: 2774   score: 23.12   Avg score (100e): 23.02   actor gain: -0.33   critic loss: 0.40   steps: 2774


training loop:   5% |#                                 | ETA:  34 days, 3:19:20

Episode: 2775   score: 23.12   Avg score (100e): 23.02   actor gain: -0.33   critic loss: 0.40   steps: 2775


training loop:   5% |#                                 | ETA:  34 days, 3:17:59

Episode: 2776   score: 23.12   Avg score (100e): 23.02   actor gain: -0.33   critic loss: 0.40   steps: 2776


training loop:   5% |#                                 | ETA:  34 days, 3:18:41

Episode: 2777   score: 23.12   Avg score (100e): 23.02   actor gain: -0.33   critic loss: 0.40   steps: 2777


training loop:   5% |#                                 | ETA:  34 days, 3:18:31

Episode: 2778   score: 23.12   Avg score (100e): 23.02   actor gain: -0.33   critic loss: 0.40   steps: 2778


training loop:   5% |#                                 | ETA:  34 days, 3:18:04

Episode: 2779   score: 23.13   Avg score (100e): 23.03   actor gain: -0.33   critic loss: 0.40   steps: 2779


training loop:   5% |#                                 | ETA:  34 days, 3:19:03

Episode: 2780   score: 23.13   Avg score (100e): 23.03   actor gain: -0.33   critic loss: 0.40   steps: 2780


training loop:   5% |#                                 | ETA:  34 days, 3:20:37

Episode: 2781   score: 23.13   Avg score (100e): 23.03   actor gain: -0.33   critic loss: 0.40   steps: 2781


training loop:   5% |#                                 | ETA:  34 days, 3:20:33

Episode: 2782   score: 23.13   Avg score (100e): 23.03   actor gain: -0.33   critic loss: 0.40   steps: 2782


training loop:   5% |#                                 | ETA:  34 days, 3:21:09

Episode: 2783   score: 23.13   Avg score (100e): 23.04   actor gain: -0.33   critic loss: 0.40   steps: 2783
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:20:19

Episode: 2784   score: 23.14   Avg score (100e): 23.04   actor gain: -0.33   critic loss: 0.40   steps: 2784


training loop:   5% |#                                 | ETA:  34 days, 3:21:02

Episode: 2785   score: 23.14   Avg score (100e): 23.04   actor gain: -0.32   critic loss: 0.40   steps: 2785


training loop:   5% |#                                 | ETA:  34 days, 3:20:56

Episode: 2786   score: 23.15   Avg score (100e): 23.04   actor gain: -0.32   critic loss: 0.40   steps: 2786


training loop:   5% |#                                 | ETA:  34 days, 3:20:38

Episode: 2787   score: 23.15   Avg score (100e): 23.04   actor gain: -0.32   critic loss: 0.40   steps: 2787
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:20:00

Episode: 2788   score: 23.15   Avg score (100e): 23.05   actor gain: -0.32   critic loss: 0.40   steps: 2788


training loop:   5% |#                                 | ETA:  34 days, 3:20:52

Episode: 2789   score: 23.16   Avg score (100e): 23.05   actor gain: -0.32   critic loss: 0.40   steps: 2789


training loop:   5% |#                                 | ETA:  34 days, 3:22:23

Episode: 2790   score: 23.16   Avg score (100e): 23.05   actor gain: -0.32   critic loss: 0.40   steps: 2790
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:23:00

Episode: 2791   score: 23.16   Avg score (100e): 23.05   actor gain: -0.32   critic loss: 0.40   steps: 2791
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:25:12

Episode: 2792   score: 23.16   Avg score (100e): 23.05   actor gain: -0.32   critic loss: 0.40   steps: 2792
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:24:45

Episode: 2793   score: 23.17   Avg score (100e): 23.06   actor gain: -0.32   critic loss: 0.40   steps: 2793


training loop:   5% |#                                 | ETA:  34 days, 3:25:04

Episode: 2794   score: 23.17   Avg score (100e): 23.06   actor gain: -0.32   critic loss: 0.40   steps: 2794


training loop:   5% |#                                 | ETA:  34 days, 3:24:48

Episode: 2795   score: 23.16   Avg score (100e): 23.06   actor gain: -0.32   critic loss: 0.40   steps: 2795


training loop:   5% |#                                 | ETA:  34 days, 3:24:19

Episode: 2796   score: 23.17   Avg score (100e): 23.06   actor gain: -0.32   critic loss: 0.40   steps: 2796


training loop:   5% |#                                 | ETA:  34 days, 3:25:11

Episode: 2797   score: 23.17   Avg score (100e): 23.06   actor gain: -0.32   critic loss: 0.40   steps: 2797
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:26:13

Episode: 2798   score: 23.18   Avg score (100e): 23.07   actor gain: -0.32   critic loss: 0.40   steps: 2798


training loop:   5% |#                                 | ETA:  34 days, 3:26:53

Episode: 2799   score: 23.18   Avg score (100e): 23.07   actor gain: -0.32   critic loss: 0.40   steps: 2799


training loop:   5% |#                                 | ETA:  34 days, 3:28:12

Episode: 2800   score: 23.18   Avg score (100e): 23.07   actor gain: -0.32   critic loss: 0.40   steps: 2800


training loop:   5% |#                                 | ETA:  34 days, 3:28:26

Episode: 2801   score: 23.19   Avg score (100e): 23.07   actor gain: -0.32   critic loss: 0.40   steps: 2801
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:28:04

Episode: 2802   score: 23.19   Avg score (100e): 23.07   actor gain: -0.32   critic loss: 0.40   steps: 2802


training loop:   5% |#                                 | ETA:  34 days, 3:28:55

Episode: 2803   score: 23.18   Avg score (100e): 23.08   actor gain: -0.33   critic loss: 0.40   steps: 2803
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:29:16

Episode: 2804   score: 23.19   Avg score (100e): 23.08   actor gain: -0.33   critic loss: 0.40   steps: 2804
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:31:17

Episode: 2805   score: 23.19   Avg score (100e): 23.08   actor gain: -0.33   critic loss: 0.40   steps: 2805
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:31:08

Episode: 2806   score: 23.19   Avg score (100e): 23.08   actor gain: -0.33   critic loss: 0.40   steps: 2806


training loop:   5% |#                                 | ETA:  34 days, 3:31:32

Episode: 2807   score: 23.20   Avg score (100e): 23.09   actor gain: -0.33   critic loss: 0.40   steps: 2807


training loop:   5% |#                                 | ETA:  34 days, 3:33:08

Episode: 2808   score: 23.20   Avg score (100e): 23.09   actor gain: -0.33   critic loss: 0.40   steps: 2808
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:33:00

Episode: 2809   score: 23.20   Avg score (100e): 23.09   actor gain: -0.33   critic loss: 0.40   steps: 2809


training loop:   5% |#                                 | ETA:  34 days, 3:33:15

Episode: 2810   score: 23.20   Avg score (100e): 23.09   actor gain: -0.33   critic loss: 0.40   steps: 2810


training loop:   5% |#                                 | ETA:  34 days, 3:34:01

Episode: 2811   score: 23.20   Avg score (100e): 23.09   actor gain: -0.33   critic loss: 0.40   steps: 2811


training loop:   5% |#                                 | ETA:  34 days, 3:37:46

Episode: 2812   score: 23.20   Avg score (100e): 23.10   actor gain: -0.33   critic loss: 0.40   steps: 2812


training loop:   5% |#                                 | ETA:  34 days, 3:38:59

Episode: 2813   score: 23.20   Avg score (100e): 23.10   actor gain: -0.33   critic loss: 0.40   steps: 2813


training loop:   5% |#                                 | ETA:  34 days, 3:39:16

Episode: 2814   score: 23.21   Avg score (100e): 23.10   actor gain: -0.33   critic loss: 0.40   steps: 2814
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:41:08

Episode: 2815   score: 23.21   Avg score (100e): 23.10   actor gain: -0.33   critic loss: 0.40   steps: 2815


training loop:   5% |#                                 | ETA:  34 days, 3:43:55

Episode: 2816   score: 23.21   Avg score (100e): 23.10   actor gain: -0.33   critic loss: 0.40   steps: 2816
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:44:47

Episode: 2817   score: 23.21   Avg score (100e): 23.11   actor gain: -0.33   critic loss: 0.40   steps: 2817


training loop:   5% |#                                 | ETA:  34 days, 3:45:17

Episode: 2818   score: 23.21   Avg score (100e): 23.11   actor gain: -0.33   critic loss: 0.40   steps: 2818


training loop:   5% |#                                 | ETA:  34 days, 3:46:58

Episode: 2819   score: 23.22   Avg score (100e): 23.11   actor gain: -0.32   critic loss: 0.40   steps: 2819
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:47:58

Episode: 2820   score: 23.22   Avg score (100e): 23.11   actor gain: -0.32   critic loss: 0.40   steps: 2820
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:47:58

Episode: 2821   score: 23.22   Avg score (100e): 23.12   actor gain: -0.32   critic loss: 0.40   steps: 2821


training loop:   5% |#                                 | ETA:  34 days, 3:50:07

Episode: 2822   score: 23.23   Avg score (100e): 23.12   actor gain: -0.32   critic loss: 0.40   steps: 2822


training loop:   5% |#                                 | ETA:  34 days, 3:50:15

Episode: 2823   score: 23.23   Avg score (100e): 23.12   actor gain: -0.33   critic loss: 0.40   steps: 2823


training loop:   5% |#                                 | ETA:  34 days, 3:53:58

Episode: 2824   score: 23.23   Avg score (100e): 23.12   actor gain: -0.32   critic loss: 0.40   steps: 2824


training loop:   5% |#                                 | ETA:  34 days, 3:54:33

Episode: 2825   score: 23.23   Avg score (100e): 23.12   actor gain: -0.32   critic loss: 0.40   steps: 2825
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:55:21

Episode: 2826   score: 23.24   Avg score (100e): 23.13   actor gain: -0.32   critic loss: 0.40   steps: 2826


training loop:   5% |#                                 | ETA:  34 days, 3:56:46

Episode: 2827   score: 23.24   Avg score (100e): 23.13   actor gain: -0.32   critic loss: 0.40   steps: 2827


training loop:   5% |#                                 | ETA:  34 days, 3:57:50

Episode: 2828   score: 23.24   Avg score (100e): 23.13   actor gain: -0.33   critic loss: 0.40   steps: 2828
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 3:57:40

Episode: 2829   score: 23.25   Avg score (100e): 23.13   actor gain: -0.32   critic loss: 0.40   steps: 2829


training loop:   5% |#                                 | ETA:  34 days, 4:00:19

Episode: 2830   score: 23.24   Avg score (100e): 23.13   actor gain: -0.32   critic loss: 0.40   steps: 2830


training loop:   5% |#                                 | ETA:  34 days, 4:01:32

Episode: 2831   score: 23.25   Avg score (100e): 23.14   actor gain: -0.32   critic loss: 0.40   steps: 2831


training loop:   5% |#                                 | ETA:  34 days, 4:01:33

Episode: 2832   score: 23.25   Avg score (100e): 23.14   actor gain: -0.32   critic loss: 0.40   steps: 2832


training loop:   5% |#                                 | ETA:  34 days, 4:02:44

Episode: 2833   score: 23.25   Avg score (100e): 23.14   actor gain: -0.32   critic loss: 0.40   steps: 2833


training loop:   5% |#                                 | ETA:  34 days, 4:03:42

Episode: 2834   score: 23.25   Avg score (100e): 23.14   actor gain: -0.32   critic loss: 0.40   steps: 2834


training loop:   5% |#                                 | ETA:  34 days, 4:05:17

Episode: 2835   score: 23.25   Avg score (100e): 23.14   actor gain: -0.32   critic loss: 0.40   steps: 2835


training loop:   5% |#                                 | ETA:  34 days, 4:04:22

Episode: 2836   score: 23.26   Avg score (100e): 23.15   actor gain: -0.32   critic loss: 0.40   steps: 2836


training loop:   5% |#                                 | ETA:  34 days, 4:06:20

Episode: 2837   score: 23.26   Avg score (100e): 23.15   actor gain: -0.32   critic loss: 0.40   steps: 2837
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:07:16

Episode: 2838   score: 23.26   Avg score (100e): 23.15   actor gain: -0.32   critic loss: 0.40   steps: 2838


training loop:   5% |#                                 | ETA:  34 days, 4:08:45

Episode: 2839   score: 23.26   Avg score (100e): 23.15   actor gain: -0.32   critic loss: 0.40   steps: 2839


training loop:   5% |#                                 | ETA:  34 days, 4:09:05

Episode: 2840   score: 23.27   Avg score (100e): 23.16   actor gain: -0.32   critic loss: 0.40   steps: 2840


training loop:   5% |#                                 | ETA:  34 days, 4:11:56

Episode: 2841   score: 23.27   Avg score (100e): 23.16   actor gain: -0.32   critic loss: 0.40   steps: 2841


training loop:   5% |#                                 | ETA:  34 days, 4:12:21

Episode: 2842   score: 23.28   Avg score (100e): 23.16   actor gain: -0.32   critic loss: 0.40   steps: 2842


training loop:   5% |#                                 | ETA:  34 days, 4:12:33

Episode: 2843   score: 23.28   Avg score (100e): 23.16   actor gain: -0.32   critic loss: 0.40   steps: 2843


training loop:   5% |#                                 | ETA:  34 days, 4:12:44

Episode: 2844   score: 23.28   Avg score (100e): 23.16   actor gain: -0.33   critic loss: 0.40   steps: 2844


training loop:   5% |#                                 | ETA:  34 days, 4:14:51

Episode: 2845   score: 23.29   Avg score (100e): 23.17   actor gain: -0.33   critic loss: 0.40   steps: 2845
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:15:12

Episode: 2846   score: 23.29   Avg score (100e): 23.17   actor gain: -0.33   critic loss: 0.40   steps: 2846


training loop:   5% |#                                 | ETA:  34 days, 4:16:12

Episode: 2847   score: 23.29   Avg score (100e): 23.17   actor gain: -0.33   critic loss: 0.40   steps: 2847


training loop:   5% |#                                 | ETA:  34 days, 4:17:41

Episode: 2848   score: 23.30   Avg score (100e): 23.17   actor gain: -0.33   critic loss: 0.40   steps: 2848


training loop:   5% |#                                 | ETA:  34 days, 4:18:12

Episode: 2849   score: 23.30   Avg score (100e): 23.18   actor gain: -0.33   critic loss: 0.40   steps: 2849


training loop:   5% |#                                 | ETA:  34 days, 4:17:52

Episode: 2850   score: 23.30   Avg score (100e): 23.18   actor gain: -0.33   critic loss: 0.40   steps: 2850


training loop:   5% |#                                 | ETA:  34 days, 4:17:51

Episode: 2851   score: 23.31   Avg score (100e): 23.18   actor gain: -0.33   critic loss: 0.40   steps: 2851
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:19:13

Episode: 2852   score: 23.31   Avg score (100e): 23.18   actor gain: -0.33   critic loss: 0.40   steps: 2852
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:18:47

Episode: 2853   score: 23.32   Avg score (100e): 23.19   actor gain: -0.33   critic loss: 0.40   steps: 2853


training loop:   5% |#                                 | ETA:  34 days, 4:19:13

Episode: 2854   score: 23.32   Avg score (100e): 23.19   actor gain: -0.33   critic loss: 0.40   steps: 2854


training loop:   5% |#                                 | ETA:  34 days, 4:19:52

Episode: 2855   score: 23.32   Avg score (100e): 23.19   actor gain: -0.33   critic loss: 0.41   steps: 2855


training loop:   5% |#                                 | ETA:  34 days, 4:21:28

Episode: 2856   score: 23.32   Avg score (100e): 23.19   actor gain: -0.33   critic loss: 0.41   steps: 2856
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:24:18

Episode: 2857   score: 23.33   Avg score (100e): 23.20   actor gain: -0.33   critic loss: 0.41   steps: 2857


training loop:   5% |#                                 | ETA:  34 days, 4:24:38

Episode: 2858   score: 23.33   Avg score (100e): 23.20   actor gain: -0.33   critic loss: 0.41   steps: 2858


training loop:   5% |#                                 | ETA:  34 days, 4:24:41

Episode: 2859   score: 23.33   Avg score (100e): 23.20   actor gain: -0.33   critic loss: 0.41   steps: 2859


training loop:   5% |#                                 | ETA:  34 days, 4:26:09

Episode: 2860   score: 23.34   Avg score (100e): 23.20   actor gain: -0.33   critic loss: 0.41   steps: 2860


training loop:   5% |#                                 | ETA:  34 days, 4:26:38

Episode: 2861   score: 23.34   Avg score (100e): 23.21   actor gain: -0.33   critic loss: 0.41   steps: 2861


training loop:   5% |#                                 | ETA:  34 days, 4:28:08

Episode: 2862   score: 23.35   Avg score (100e): 23.21   actor gain: -0.33   critic loss: 0.41   steps: 2862
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:29:50

Episode: 2863   score: 23.35   Avg score (100e): 23.21   actor gain: -0.32   critic loss: 0.41   steps: 2863


training loop:   5% |#                                 | ETA:  34 days, 4:31:23

Episode: 2864   score: 23.35   Avg score (100e): 23.21   actor gain: -0.32   critic loss: 0.41   steps: 2864


training loop:   5% |#                                 | ETA:  34 days, 4:31:40

Episode: 2865   score: 23.36   Avg score (100e): 23.22   actor gain: -0.32   critic loss: 0.41   steps: 2865


training loop:   5% |#                                 | ETA:  34 days, 4:30:57

Episode: 2866   score: 23.36   Avg score (100e): 23.22   actor gain: -0.32   critic loss: 0.41   steps: 2866


training loop:   5% |#                                 | ETA:  34 days, 4:32:08

Episode: 2867   score: 23.37   Avg score (100e): 23.22   actor gain: -0.32   critic loss: 0.41   steps: 2867
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:32:36

Episode: 2868   score: 23.37   Avg score (100e): 23.22   actor gain: -0.32   critic loss: 0.41   steps: 2868


training loop:   5% |#                                 | ETA:  34 days, 4:33:56

Episode: 2869   score: 23.37   Avg score (100e): 23.23   actor gain: -0.32   critic loss: 0.41   steps: 2869


training loop:   5% |#                                 | ETA:  34 days, 4:33:49

Episode: 2870   score: 23.38   Avg score (100e): 23.23   actor gain: -0.32   critic loss: 0.40   steps: 2870


training loop:   5% |#                                 | ETA:  34 days, 4:35:32

Episode: 2871   score: 23.38   Avg score (100e): 23.23   actor gain: -0.32   critic loss: 0.40   steps: 2871


training loop:   5% |#                                 | ETA:  34 days, 4:36:26

Episode: 2872   score: 23.38   Avg score (100e): 23.24   actor gain: -0.32   critic loss: 0.40   steps: 2872


training loop:   5% |#                                 | ETA:  34 days, 4:36:31

Episode: 2873   score: 23.38   Avg score (100e): 23.24   actor gain: -0.32   critic loss: 0.40   steps: 2873


training loop:   5% |#                                 | ETA:  34 days, 4:35:47

Episode: 2874   score: 23.39   Avg score (100e): 23.24   actor gain: -0.32   critic loss: 0.40   steps: 2874


training loop:   5% |#                                 | ETA:  34 days, 4:38:16

Episode: 2875   score: 23.39   Avg score (100e): 23.24   actor gain: -0.32   critic loss: 0.40   steps: 2875


training loop:   5% |#                                 | ETA:  34 days, 4:38:48

Episode: 2876   score: 23.39   Avg score (100e): 23.25   actor gain: -0.32   critic loss: 0.40   steps: 2876


training loop:   5% |#                                 | ETA:  34 days, 4:40:02

Episode: 2877   score: 23.39   Avg score (100e): 23.25   actor gain: -0.32   critic loss: 0.40   steps: 2877


training loop:   5% |#                                 | ETA:  34 days, 4:41:10

Episode: 2878   score: 23.40   Avg score (100e): 23.25   actor gain: -0.32   critic loss: 0.40   steps: 2878


training loop:   5% |#                                 | ETA:  34 days, 4:42:23

Episode: 2879   score: 23.40   Avg score (100e): 23.25   actor gain: -0.32   critic loss: 0.40   steps: 2879
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:42:10

Episode: 2880   score: 23.41   Avg score (100e): 23.26   actor gain: -0.32   critic loss: 0.40   steps: 2880


training loop:   5% |#                                 | ETA:  34 days, 4:42:25

Episode: 2881   score: 23.40   Avg score (100e): 23.26   actor gain: -0.32   critic loss: 0.40   steps: 2881


training loop:   5% |#                                 | ETA:  34 days, 4:43:17

Episode: 2882   score: 23.41   Avg score (100e): 23.26   actor gain: -0.32   critic loss: 0.40   steps: 2882


training loop:   5% |#                                 | ETA:  34 days, 4:45:56

Episode: 2883   score: 23.41   Avg score (100e): 23.27   actor gain: -0.32   critic loss: 0.40   steps: 2883


training loop:   5% |#                                 | ETA:  34 days, 4:45:39

Episode: 2884   score: 23.41   Avg score (100e): 23.27   actor gain: -0.32   critic loss: 0.40   steps: 2884


training loop:   5% |#                                 | ETA:  34 days, 4:46:00

Episode: 2885   score: 23.41   Avg score (100e): 23.27   actor gain: -0.32   critic loss: 0.40   steps: 2885


training loop:   5% |#                                 | ETA:  34 days, 4:47:28

Episode: 2886   score: 23.41   Avg score (100e): 23.27   actor gain: -0.32   critic loss: 0.40   steps: 2886
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:47:22

Episode: 2887   score: 23.41   Avg score (100e): 23.28   actor gain: -0.32   critic loss: 0.40   steps: 2887
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:47:40

Episode: 2888   score: 23.42   Avg score (100e): 23.28   actor gain: -0.32   critic loss: 0.40   steps: 2888


training loop:   5% |#                                 | ETA:  34 days, 4:51:18

Episode: 2889   score: 23.42   Avg score (100e): 23.28   actor gain: -0.32   critic loss: 0.40   steps: 2889
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:52:03

Episode: 2890   score: 23.42   Avg score (100e): 23.28   actor gain: -0.32   critic loss: 0.40   steps: 2890


training loop:   5% |#                                 | ETA:  34 days, 4:52:35

Episode: 2891   score: 23.43   Avg score (100e): 23.29   actor gain: -0.32   critic loss: 0.40   steps: 2891


training loop:   5% |#                                 | ETA:  34 days, 4:52:36

Episode: 2892   score: 23.42   Avg score (100e): 23.29   actor gain: -0.32   critic loss: 0.40   steps: 2892


training loop:   5% |#                                 | ETA:  34 days, 4:54:12

Episode: 2893   score: 23.42   Avg score (100e): 23.29   actor gain: -0.32   critic loss: 0.40   steps: 2893


training loop:   5% |#                                 | ETA:  34 days, 4:54:33

Episode: 2894   score: 23.43   Avg score (100e): 23.29   actor gain: -0.32   critic loss: 0.40   steps: 2894


training loop:   5% |#                                 | ETA:  34 days, 4:55:38

Episode: 2895   score: 23.43   Avg score (100e): 23.30   actor gain: -0.32   critic loss: 0.40   steps: 2895
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:55:23

Episode: 2896   score: 23.43   Avg score (100e): 23.30   actor gain: -0.32   critic loss: 0.41   steps: 2896
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:56:01

Episode: 2897   score: 23.44   Avg score (100e): 23.30   actor gain: -0.32   critic loss: 0.41   steps: 2897


training loop:   5% |#                                 | ETA:  34 days, 4:55:43

Episode: 2898   score: 23.43   Avg score (100e): 23.30   actor gain: -0.32   critic loss: 0.41   steps: 2898


training loop:   5% |#                                 | ETA:  34 days, 4:55:13

Episode: 2899   score: 23.44   Avg score (100e): 23.31   actor gain: -0.32   critic loss: 0.40   steps: 2899


training loop:   5% |#                                 | ETA:  34 days, 4:54:26

Episode: 2900   score: 23.44   Avg score (100e): 23.31   actor gain: -0.32   critic loss: 0.40   steps: 2900


training loop:   5% |#                                 | ETA:  34 days, 4:54:43

Episode: 2901   score: 23.44   Avg score (100e): 23.31   actor gain: -0.32   critic loss: 0.40   steps: 2901
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:53:58

Episode: 2902   score: 23.44   Avg score (100e): 23.32   actor gain: -0.32   critic loss: 0.40   steps: 2902
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:52:47

Episode: 2903   score: 23.45   Avg score (100e): 23.32   actor gain: -0.32   critic loss: 0.40   steps: 2903
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:51:56

Episode: 2904   score: 23.45   Avg score (100e): 23.32   actor gain: -0.32   critic loss: 0.40   steps: 2904


training loop:   5% |#                                 | ETA:  34 days, 4:52:15

Episode: 2905   score: 23.46   Avg score (100e): 23.32   actor gain: -0.32   critic loss: 0.40   steps: 2905


training loop:   5% |#                                 | ETA:  34 days, 4:51:48

Episode: 2906   score: 23.47   Avg score (100e): 23.33   actor gain: -0.32   critic loss: 0.40   steps: 2906


training loop:   5% |#                                 | ETA:  34 days, 4:51:17

Episode: 2907   score: 23.47   Avg score (100e): 23.33   actor gain: -0.32   critic loss: 0.40   steps: 2907


training loop:   5% |#                                 | ETA:  34 days, 4:50:39

Episode: 2908   score: 23.47   Avg score (100e): 23.33   actor gain: -0.32   critic loss: 0.40   steps: 2908
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:50:20

Episode: 2909   score: 23.48   Avg score (100e): 23.33   actor gain: -0.32   critic loss: 0.40   steps: 2909


training loop:   5% |#                                 | ETA:  34 days, 4:49:41

Episode: 2910   score: 23.48   Avg score (100e): 23.34   actor gain: -0.32   critic loss: 0.40   steps: 2910


training loop:   5% |#                                 | ETA:  34 days, 4:48:47

Episode: 2911   score: 23.49   Avg score (100e): 23.34   actor gain: -0.32   critic loss: 0.40   steps: 2911
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:48:03

Episode: 2912   score: 23.49   Avg score (100e): 23.34   actor gain: -0.32   critic loss: 0.41   steps: 2912


training loop:   5% |#                                 | ETA:  34 days, 4:48:27

Episode: 2913   score: 23.49   Avg score (100e): 23.35   actor gain: -0.32   critic loss: 0.40   steps: 2913


training loop:   5% |#                                 | ETA:  34 days, 4:47:56

Episode: 2914   score: 23.49   Avg score (100e): 23.35   actor gain: -0.32   critic loss: 0.40   steps: 2914


training loop:   5% |#                                 | ETA:  34 days, 4:47:10

Episode: 2915   score: 23.50   Avg score (100e): 23.35   actor gain: -0.32   critic loss: 0.40   steps: 2915


training loop:   5% |#                                 | ETA:  34 days, 4:46:06

Episode: 2916   score: 23.50   Avg score (100e): 23.35   actor gain: -0.32   critic loss: 0.40   steps: 2916


training loop:   5% |#                                 | ETA:  34 days, 4:46:17

Episode: 2917   score: 23.51   Avg score (100e): 23.36   actor gain: -0.32   critic loss: 0.41   steps: 2917


training loop:   5% |#                                 | ETA:  34 days, 4:46:07

Episode: 2918   score: 23.51   Avg score (100e): 23.36   actor gain: -0.32   critic loss: 0.41   steps: 2918


training loop:   5% |#                                 | ETA:  34 days, 4:45:27

Episode: 2919   score: 23.51   Avg score (100e): 23.36   actor gain: -0.32   critic loss: 0.40   steps: 2919


training loop:   5% |#                                 | ETA:  34 days, 4:45:03

Episode: 2920   score: 23.51   Avg score (100e): 23.37   actor gain: -0.32   critic loss: 0.41   steps: 2920


training loop:   5% |#                                 | ETA:  34 days, 4:44:54

Episode: 2921   score: 23.51   Avg score (100e): 23.37   actor gain: -0.32   critic loss: 0.40   steps: 2921


training loop:   5% |#                                 | ETA:  34 days, 4:47:06

Episode: 2922   score: 23.51   Avg score (100e): 23.37   actor gain: -0.32   critic loss: 0.40   steps: 2922
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:46:19

Episode: 2923   score: 23.51   Avg score (100e): 23.37   actor gain: -0.32   critic loss: 0.40   steps: 2923


training loop:   5% |#                                 | ETA:  34 days, 4:45:43

Episode: 2924   score: 23.52   Avg score (100e): 23.38   actor gain: -0.32   critic loss: 0.40   steps: 2924
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:45:14

Episode: 2925   score: 23.52   Avg score (100e): 23.38   actor gain: -0.32   critic loss: 0.40   steps: 2925


training loop:   5% |#                                 | ETA:  34 days, 4:45:19

Episode: 2926   score: 23.52   Avg score (100e): 23.38   actor gain: -0.32   critic loss: 0.40   steps: 2926


training loop:   5% |#                                 | ETA:  34 days, 4:44:50

Episode: 2927   score: 23.52   Avg score (100e): 23.39   actor gain: -0.32   critic loss: 0.40   steps: 2927
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:44:31

Episode: 2928   score: 23.53   Avg score (100e): 23.39   actor gain: -0.32   critic loss: 0.40   steps: 2928


training loop:   5% |#                                 | ETA:  34 days, 4:45:37

Episode: 2929   score: 23.53   Avg score (100e): 23.39   actor gain: -0.32   critic loss: 0.40   steps: 2929


training loop:   5% |#                                 | ETA:  34 days, 4:45:33

Episode: 2930   score: 23.53   Avg score (100e): 23.39   actor gain: -0.32   critic loss: 0.40   steps: 2930


training loop:   5% |#                                 | ETA:  34 days, 4:45:20

Episode: 2931   score: 23.54   Avg score (100e): 23.40   actor gain: -0.32   critic loss: 0.40   steps: 2931


training loop:   5% |#                                 | ETA:  34 days, 4:44:01

Episode: 2932   score: 23.54   Avg score (100e): 23.40   actor gain: -0.32   critic loss: 0.40   steps: 2932


training loop:   5% |#                                 | ETA:  34 days, 4:45:30

Episode: 2933   score: 23.55   Avg score (100e): 23.40   actor gain: -0.32   critic loss: 0.40   steps: 2933
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:44:57

Episode: 2934   score: 23.55   Avg score (100e): 23.41   actor gain: -0.32   critic loss: 0.40   steps: 2934


training loop:   5% |#                                 | ETA:  34 days, 4:44:26

Episode: 2935   score: 23.56   Avg score (100e): 23.41   actor gain: -0.32   critic loss: 0.40   steps: 2935
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:43:46

Episode: 2936   score: 23.56   Avg score (100e): 23.41   actor gain: -0.32   critic loss: 0.40   steps: 2936
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:43:59

Episode: 2937   score: 23.56   Avg score (100e): 23.42   actor gain: -0.32   critic loss: 0.39   steps: 2937
np.all(done) is true! miracle!


training loop:   5% |#                                 | ETA:  34 days, 4:42:58

Episode: 2938   score: 23.56   Avg score (100e): 23.42   actor gain: -0.32   critic loss: 0.39   steps: 2938


training loop:   5% |#                                 | ETA:  34 days, 4:42:28

Episode: 2939   score: 23.57   Avg score (100e): 23.42   actor gain: -0.32   critic loss: 0.40   steps: 2939


training loop:   5% |#                                 | ETA:  34 days, 4:42:05

Episode: 2940   score: 23.58   Avg score (100e): 23.42   actor gain: -0.32   critic loss: 0.39   steps: 2940


training loop:   5% |#                                 | ETA:  34 days, 4:42:48

Episode: 2941   score: 23.58   Avg score (100e): 23.43   actor gain: -0.32   critic loss: 0.40   steps: 2941


training loop:   5% |#                                 | ETA:  34 days, 4:42:17

Episode: 2942   score: 23.59   Avg score (100e): 23.43   actor gain: -0.32   critic loss: 0.40   steps: 2942
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:41:55

Episode: 2943   score: 23.59   Avg score (100e): 23.43   actor gain: -0.32   critic loss: 0.39   steps: 2943


training loop:   5% |##                                | ETA:  34 days, 4:41:56

Episode: 2944   score: 23.59   Avg score (100e): 23.44   actor gain: -0.32   critic loss: 0.40   steps: 2944


training loop:   5% |##                                | ETA:  34 days, 4:43:37

Episode: 2945   score: 23.59   Avg score (100e): 23.44   actor gain: -0.32   critic loss: 0.40   steps: 2945


training loop:   5% |##                                | ETA:  34 days, 4:43:14

Episode: 2946   score: 23.59   Avg score (100e): 23.44   actor gain: -0.32   critic loss: 0.40   steps: 2946


training loop:   5% |##                                | ETA:  34 days, 4:42:38

Episode: 2947   score: 23.60   Avg score (100e): 23.45   actor gain: -0.32   critic loss: 0.40   steps: 2947


training loop:   5% |##                                | ETA:  34 days, 4:41:45

Episode: 2948   score: 23.60   Avg score (100e): 23.45   actor gain: -0.32   critic loss: 0.40   steps: 2948


training loop:   5% |##                                | ETA:  34 days, 4:43:19

Episode: 2949   score: 23.60   Avg score (100e): 23.45   actor gain: -0.32   critic loss: 0.40   steps: 2949


training loop:   5% |##                                | ETA:  34 days, 4:42:46

Episode: 2950   score: 23.61   Avg score (100e): 23.45   actor gain: -0.32   critic loss: 0.40   steps: 2950
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:41:32

Episode: 2951   score: 23.61   Avg score (100e): 23.46   actor gain: -0.32   critic loss: 0.40   steps: 2951


training loop:   5% |##                                | ETA:  34 days, 4:40:43

Episode: 2952   score: 23.62   Avg score (100e): 23.46   actor gain: -0.32   critic loss: 0.40   steps: 2952


training loop:   5% |##                                | ETA:  34 days, 4:40:59

Episode: 2953   score: 23.62   Avg score (100e): 23.46   actor gain: -0.32   critic loss: 0.40   steps: 2953


training loop:   5% |##                                | ETA:  34 days, 4:43:05

Episode: 2954   score: 23.62   Avg score (100e): 23.47   actor gain: -0.32   critic loss: 0.40   steps: 2954
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:42:21

Episode: 2955   score: 23.62   Avg score (100e): 23.47   actor gain: -0.32   critic loss: 0.40   steps: 2955


training loop:   5% |##                                | ETA:  34 days, 4:43:55

Episode: 2956   score: 23.62   Avg score (100e): 23.47   actor gain: -0.32   critic loss: 0.40   steps: 2956


training loop:   5% |##                                | ETA:  34 days, 4:44:49

Episode: 2957   score: 23.62   Avg score (100e): 23.48   actor gain: -0.32   critic loss: 0.40   steps: 2957


training loop:   5% |##                                | ETA:  34 days, 4:45:03

Episode: 2958   score: 23.62   Avg score (100e): 23.48   actor gain: -0.32   critic loss: 0.40   steps: 2958
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:43:49

Episode: 2959   score: 23.62   Avg score (100e): 23.48   actor gain: -0.32   critic loss: 0.40   steps: 2959
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:44:28

Episode: 2960   score: 23.63   Avg score (100e): 23.48   actor gain: -0.32   critic loss: 0.40   steps: 2960


training loop:   5% |##                                | ETA:  34 days, 4:44:22

Episode: 2961   score: 23.63   Avg score (100e): 23.49   actor gain: -0.32   critic loss: 0.40   steps: 2961


training loop:   5% |##                                | ETA:  34 days, 4:44:16

Episode: 2962   score: 23.63   Avg score (100e): 23.49   actor gain: -0.32   critic loss: 0.40   steps: 2962


training loop:   5% |##                                | ETA:  34 days, 4:43:01

Episode: 2963   score: 23.63   Avg score (100e): 23.49   actor gain: -0.32   critic loss: 0.40   steps: 2963


training loop:   5% |##                                | ETA:  34 days, 4:42:29

Episode: 2964   score: 23.64   Avg score (100e): 23.50   actor gain: -0.32   critic loss: 0.40   steps: 2964
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:41:10

Episode: 2965   score: 23.64   Avg score (100e): 23.50   actor gain: -0.32   critic loss: 0.40   steps: 2965
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:39:37

Episode: 2966   score: 23.64   Avg score (100e): 23.50   actor gain: -0.32   critic loss: 0.40   steps: 2966


training loop:   5% |##                                | ETA:  34 days, 4:43:50

Episode: 2967   score: 23.64   Avg score (100e): 23.50   actor gain: -0.32   critic loss: 0.39   steps: 2967
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:44:41

Episode: 2968   score: 23.64   Avg score (100e): 23.51   actor gain: -0.32   critic loss: 0.39   steps: 2968
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:44:37

Episode: 2969   score: 23.64   Avg score (100e): 23.51   actor gain: -0.32   critic loss: 0.39   steps: 2969


training loop:   5% |##                                | ETA:  34 days, 4:44:04

Episode: 2970   score: 23.65   Avg score (100e): 23.51   actor gain: -0.32   critic loss: 0.39   steps: 2970


training loop:   5% |##                                | ETA:  34 days, 4:45:20

Episode: 2971   score: 23.64   Avg score (100e): 23.51   actor gain: -0.32   critic loss: 0.39   steps: 2971


training loop:   5% |##                                | ETA:  34 days, 4:46:02

Episode: 2972   score: 23.64   Avg score (100e): 23.52   actor gain: -0.32   critic loss: 0.39   steps: 2972


training loop:   5% |##                                | ETA:  34 days, 4:45:35

Episode: 2973   score: 23.64   Avg score (100e): 23.52   actor gain: -0.32   critic loss: 0.39   steps: 2973
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:44:24

Episode: 2974   score: 23.64   Avg score (100e): 23.52   actor gain: -0.32   critic loss: 0.39   steps: 2974


training loop:   5% |##                                | ETA:  34 days, 4:44:21

Episode: 2975   score: 23.64   Avg score (100e): 23.53   actor gain: -0.32   critic loss: 0.39   steps: 2975


training loop:   5% |##                                | ETA:  34 days, 4:46:12

Episode: 2976   score: 23.65   Avg score (100e): 23.53   actor gain: -0.32   critic loss: 0.39   steps: 2976
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:45:50

Episode: 2977   score: 23.65   Avg score (100e): 23.53   actor gain: -0.32   critic loss: 0.39   steps: 2977


training loop:   5% |##                                | ETA:  34 days, 4:45:58

Episode: 2978   score: 23.66   Avg score (100e): 23.53   actor gain: -0.32   critic loss: 0.39   steps: 2978


training loop:   5% |##                                | ETA:  34 days, 4:46:18

Episode: 2979   score: 23.66   Avg score (100e): 23.54   actor gain: -0.33   critic loss: 0.39   steps: 2979


training loop:   5% |##                                | ETA:  34 days, 4:46:12

Episode: 2980   score: 23.66   Avg score (100e): 23.54   actor gain: -0.33   critic loss: 0.39   steps: 2980


training loop:   5% |##                                | ETA:  34 days, 4:46:24

Episode: 2981   score: 23.66   Avg score (100e): 23.54   actor gain: -0.33   critic loss: 0.39   steps: 2981


training loop:   5% |##                                | ETA:  34 days, 4:45:40

Episode: 2982   score: 23.66   Avg score (100e): 23.54   actor gain: -0.33   critic loss: 0.39   steps: 2982


training loop:   5% |##                                | ETA:  34 days, 4:45:59

Episode: 2983   score: 23.67   Avg score (100e): 23.55   actor gain: -0.33   critic loss: 0.39   steps: 2983


training loop:   5% |##                                | ETA:  34 days, 4:45:55

Episode: 2984   score: 23.67   Avg score (100e): 23.55   actor gain: -0.33   critic loss: 0.40   steps: 2984


training loop:   5% |##                                | ETA:  34 days, 4:46:20

Episode: 2985   score: 23.67   Avg score (100e): 23.55   actor gain: -0.32   critic loss: 0.40   steps: 2985
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:46:57

Episode: 2986   score: 23.67   Avg score (100e): 23.55   actor gain: -0.32   critic loss: 0.40   steps: 2986


training loop:   5% |##                                | ETA:  34 days, 4:51:06

Episode: 2987   score: 23.68   Avg score (100e): 23.56   actor gain: -0.32   critic loss: 0.40   steps: 2987


training loop:   5% |##                                | ETA:  34 days, 4:51:24

Episode: 2988   score: 23.68   Avg score (100e): 23.56   actor gain: -0.32   critic loss: 0.40   steps: 2988


training loop:   5% |##                                | ETA:  34 days, 4:52:15

Episode: 2989   score: 23.68   Avg score (100e): 23.56   actor gain: -0.32   critic loss: 0.40   steps: 2989
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:50:46

Episode: 2990   score: 23.67   Avg score (100e): 23.56   actor gain: -0.33   critic loss: 0.40   steps: 2990


training loop:   5% |##                                | ETA:  34 days, 4:52:22

Episode: 2991   score: 23.67   Avg score (100e): 23.57   actor gain: -0.33   critic loss: 0.40   steps: 2991
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:52:42

Episode: 2992   score: 23.68   Avg score (100e): 23.57   actor gain: -0.33   critic loss: 0.40   steps: 2992


training loop:   5% |##                                | ETA:  34 days, 4:52:54

Episode: 2993   score: 23.68   Avg score (100e): 23.57   actor gain: -0.33   critic loss: 0.40   steps: 2993


training loop:   5% |##                                | ETA:  34 days, 4:52:53

Episode: 2994   score: 23.68   Avg score (100e): 23.57   actor gain: -0.33   critic loss: 0.40   steps: 2994


training loop:   5% |##                                | ETA:  34 days, 4:53:42

Episode: 2995   score: 23.69   Avg score (100e): 23.58   actor gain: -0.33   critic loss: 0.40   steps: 2995


training loop:   5% |##                                | ETA:  34 days, 4:53:22

Episode: 2996   score: 23.69   Avg score (100e): 23.58   actor gain: -0.33   critic loss: 0.40   steps: 2996


training loop:   5% |##                                | ETA:  34 days, 4:53:28

Episode: 2997   score: 23.69   Avg score (100e): 23.58   actor gain: -0.33   critic loss: 0.40   steps: 2997
np.all(done) is true! miracle!


training loop:   5% |##                                | ETA:  34 days, 4:54:41

Episode: 2998   score: 23.70   Avg score (100e): 23.58   actor gain: -0.33   critic loss: 0.40   steps: 2998


training loop:   5% |##                                | ETA:  34 days, 4:54:55

Episode: 2999   score: 23.70   Avg score (100e): 23.59   actor gain: -0.33   critic loss: 0.40   steps: 2999


training loop:   5% |##                                | ETA:  34 days, 4:55:22

Episode: 3000   score: 23.71   Avg score (100e): 23.59   actor gain: -0.33   critic loss: 0.40   steps: 3000
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:54:11

Episode: 3001   score: 23.71   Avg score (100e): 23.59   actor gain: -0.33   critic loss: 0.40   steps: 3001
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:54:22

Episode: 3002   score: 23.71   Avg score (100e): 23.60   actor gain: -0.33   critic loss: 0.40   steps: 3002


training loop:   6% |##                                | ETA:  34 days, 4:54:58

Episode: 3003   score: 23.71   Avg score (100e): 23.60   actor gain: -0.33   critic loss: 0.40   steps: 3003
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:54:09

Episode: 3004   score: 23.71   Avg score (100e): 23.60   actor gain: -0.33   critic loss: 0.40   steps: 3004
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:53:21

Episode: 3005   score: 23.71   Avg score (100e): 23.60   actor gain: -0.33   critic loss: 0.40   steps: 3005
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:53:27

Episode: 3006   score: 23.71   Avg score (100e): 23.61   actor gain: -0.33   critic loss: 0.40   steps: 3006


training loop:   6% |##                                | ETA:  34 days, 4:53:39

Episode: 3007   score: 23.71   Avg score (100e): 23.61   actor gain: -0.33   critic loss: 0.40   steps: 3007
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:52:37

Episode: 3008   score: 23.72   Avg score (100e): 23.61   actor gain: -0.33   critic loss: 0.40   steps: 3008


training loop:   6% |##                                | ETA:  34 days, 4:52:26

Episode: 3009   score: 23.72   Avg score (100e): 23.61   actor gain: -0.33   critic loss: 0.40   steps: 3009


training loop:   6% |##                                | ETA:  34 days, 4:52:53

Episode: 3010   score: 23.72   Avg score (100e): 23.61   actor gain: -0.33   critic loss: 0.40   steps: 3010
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:51:19

Episode: 3011   score: 23.73   Avg score (100e): 23.62   actor gain: -0.33   critic loss: 0.40   steps: 3011
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:50:47

Episode: 3012   score: 23.73   Avg score (100e): 23.62   actor gain: -0.33   critic loss: 0.40   steps: 3012


training loop:   6% |##                                | ETA:  34 days, 4:50:21

Episode: 3013   score: 23.73   Avg score (100e): 23.62   actor gain: -0.33   critic loss: 0.40   steps: 3013
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:49:55

Episode: 3014   score: 23.73   Avg score (100e): 23.62   actor gain: -0.33   critic loss: 0.40   steps: 3014
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:37

Episode: 3015   score: 23.74   Avg score (100e): 23.63   actor gain: -0.33   critic loss: 0.40   steps: 3015
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:10

Episode: 3016   score: 23.74   Avg score (100e): 23.63   actor gain: -0.33   critic loss: 0.40   steps: 3016


training loop:   6% |##                                | ETA:  34 days, 4:46:49

Episode: 3017   score: 23.74   Avg score (100e): 23.63   actor gain: -0.33   critic loss: 0.40   steps: 3017
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:46:01

Episode: 3018   score: 23.74   Avg score (100e): 23.63   actor gain: -0.33   critic loss: 0.40   steps: 3018


training loop:   6% |##                                | ETA:  34 days, 4:49:10

Episode: 3019   score: 23.74   Avg score (100e): 23.64   actor gain: -0.33   critic loss: 0.40   steps: 3019
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:55

Episode: 3020   score: 23.74   Avg score (100e): 23.64   actor gain: -0.33   critic loss: 0.40   steps: 3020


training loop:   6% |##                                | ETA:  34 days, 4:48:50

Episode: 3021   score: 23.74   Avg score (100e): 23.64   actor gain: -0.33   critic loss: 0.40   steps: 3021
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:34

Episode: 3022   score: 23.74   Avg score (100e): 23.64   actor gain: -0.33   critic loss: 0.40   steps: 3022


training loop:   6% |##                                | ETA:  34 days, 4:48:39

Episode: 3023   score: 23.74   Avg score (100e): 23.65   actor gain: -0.33   critic loss: 0.40   steps: 3023


training loop:   6% |##                                | ETA:  34 days, 4:48:34

Episode: 3024   score: 23.74   Avg score (100e): 23.65   actor gain: -0.33   critic loss: 0.40   steps: 3024
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:51

Episode: 3025   score: 23.75   Avg score (100e): 23.65   actor gain: -0.33   critic loss: 0.40   steps: 3025


training loop:   6% |##                                | ETA:  34 days, 4:49:37

Episode: 3026   score: 23.75   Avg score (100e): 23.65   actor gain: -0.33   critic loss: 0.40   steps: 3026


training loop:   6% |##                                | ETA:  34 days, 4:49:58

Episode: 3027   score: 23.75   Avg score (100e): 23.65   actor gain: -0.33   critic loss: 0.40   steps: 3027


training loop:   6% |##                                | ETA:  34 days, 4:49:41

Episode: 3028   score: 23.76   Avg score (100e): 23.66   actor gain: -0.33   critic loss: 0.40   steps: 3028
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:37

Episode: 3029   score: 23.77   Avg score (100e): 23.66   actor gain: -0.33   critic loss: 0.40   steps: 3029


training loop:   6% |##                                | ETA:  34 days, 4:49:26

Episode: 3030   score: 23.77   Avg score (100e): 23.66   actor gain: -0.33   critic loss: 0.40   steps: 3030
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:45

Episode: 3031   score: 23.78   Avg score (100e): 23.66   actor gain: -0.33   critic loss: 0.40   steps: 3031


training loop:   6% |##                                | ETA:  34 days, 4:48:25

Episode: 3032   score: 23.78   Avg score (100e): 23.67   actor gain: -0.33   critic loss: 0.40   steps: 3032
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:20

Episode: 3033   score: 23.78   Avg score (100e): 23.67   actor gain: -0.33   critic loss: 0.40   steps: 3033
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:10

Episode: 3034   score: 23.78   Avg score (100e): 23.67   actor gain: -0.33   critic loss: 0.40   steps: 3034
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:38

Episode: 3035   score: 23.79   Avg score (100e): 23.67   actor gain: -0.33   critic loss: 0.40   steps: 3035
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:05

Episode: 3036   score: 23.79   Avg score (100e): 23.68   actor gain: -0.33   critic loss: 0.40   steps: 3036


training loop:   6% |##                                | ETA:  34 days, 4:46:17

Episode: 3037   score: 23.80   Avg score (100e): 23.68   actor gain: -0.33   critic loss: 0.40   steps: 3037


training loop:   6% |##                                | ETA:  34 days, 4:46:45

Episode: 3038   score: 23.80   Avg score (100e): 23.68   actor gain: -0.33   critic loss: 0.40   steps: 3038


training loop:   6% |##                                | ETA:  34 days, 4:46:24

Episode: 3039   score: 23.80   Avg score (100e): 23.68   actor gain: -0.33   critic loss: 0.40   steps: 3039


training loop:   6% |##                                | ETA:  34 days, 4:47:11

Episode: 3040   score: 23.81   Avg score (100e): 23.69   actor gain: -0.33   critic loss: 0.40   steps: 3040


training loop:   6% |##                                | ETA:  34 days, 4:47:23

Episode: 3041   score: 23.81   Avg score (100e): 23.69   actor gain: -0.33   critic loss: 0.40   steps: 3041


training loop:   6% |##                                | ETA:  34 days, 4:47:36

Episode: 3042   score: 23.81   Avg score (100e): 23.69   actor gain: -0.33   critic loss: 0.40   steps: 3042
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:29

Episode: 3043   score: 23.82   Avg score (100e): 23.69   actor gain: -0.32   critic loss: 0.40   steps: 3043


training loop:   6% |##                                | ETA:  34 days, 4:47:56

Episode: 3044   score: 23.82   Avg score (100e): 23.69   actor gain: -0.32   critic loss: 0.40   steps: 3044


training loop:   6% |##                                | ETA:  34 days, 4:48:23

Episode: 3045   score: 23.82   Avg score (100e): 23.70   actor gain: -0.32   critic loss: 0.40   steps: 3045
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:31

Episode: 3046   score: 23.83   Avg score (100e): 23.70   actor gain: -0.32   critic loss: 0.40   steps: 3046
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:48:19

Episode: 3047   score: 23.83   Avg score (100e): 23.70   actor gain: -0.32   critic loss: 0.40   steps: 3047
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:23

Episode: 3048   score: 23.83   Avg score (100e): 23.70   actor gain: -0.32   critic loss: 0.40   steps: 3048


training loop:   6% |##                                | ETA:  34 days, 4:47:53

Episode: 3049   score: 23.83   Avg score (100e): 23.71   actor gain: -0.32   critic loss: 0.40   steps: 3049


training loop:   6% |##                                | ETA:  34 days, 4:47:49

Episode: 3050   score: 23.83   Avg score (100e): 23.71   actor gain: -0.32   critic loss: 0.40   steps: 3050


training loop:   6% |##                                | ETA:  34 days, 4:47:56

Episode: 3051   score: 23.83   Avg score (100e): 23.71   actor gain: -0.32   critic loss: 0.40   steps: 3051
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:51:33

Episode: 3052   score: 23.84   Avg score (100e): 23.71   actor gain: -0.32   critic loss: 0.40   steps: 3052


training loop:   6% |##                                | ETA:  34 days, 4:52:45

Episode: 3053   score: 23.84   Avg score (100e): 23.71   actor gain: -0.32   critic loss: 0.40   steps: 3053
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:52:53

Episode: 3054   score: 23.84   Avg score (100e): 23.72   actor gain: -0.32   critic loss: 0.40   steps: 3054


training loop:   6% |##                                | ETA:  34 days, 4:52:07

Episode: 3055   score: 23.84   Avg score (100e): 23.72   actor gain: -0.32   critic loss: 0.40   steps: 3055
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:59:31

Episode: 3056   score: 23.83   Avg score (100e): 23.72   actor gain: -0.32   critic loss: 0.40   steps: 3056


training loop:   6% |##                                | ETA:  34 days, 5:04:21

Episode: 3057   score: 23.83   Avg score (100e): 23.72   actor gain: -0.32   critic loss: 0.40   steps: 3057


training loop:   6% |##                                | ETA:  34 days, 5:03:56

Episode: 3058   score: 23.84   Avg score (100e): 23.73   actor gain: -0.32   critic loss: 0.40   steps: 3058


training loop:   6% |##                                | ETA:  34 days, 5:03:38

Episode: 3059   score: 23.84   Avg score (100e): 23.73   actor gain: -0.32   critic loss: 0.40   steps: 3059


training loop:   6% |##                                | ETA:  34 days, 5:03:39

Episode: 3060   score: 23.84   Avg score (100e): 23.73   actor gain: -0.32   critic loss: 0.40   steps: 3060
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 5:02:56

Episode: 3061   score: 23.84   Avg score (100e): 23.73   actor gain: -0.32   critic loss: 0.40   steps: 3061


training loop:   6% |##                                | ETA:  34 days, 5:02:09

Episode: 3062   score: 23.85   Avg score (100e): 23.73   actor gain: -0.32   critic loss: 0.40   steps: 3062
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 5:00:08

Episode: 3063   score: 23.85   Avg score (100e): 23.74   actor gain: -0.32   critic loss: 0.40   steps: 3063


training loop:   6% |##                                | ETA:  34 days, 4:59:10

Episode: 3064   score: 23.85   Avg score (100e): 23.74   actor gain: -0.32   critic loss: 0.40   steps: 3064
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:59:25

Episode: 3065   score: 23.85   Avg score (100e): 23.74   actor gain: -0.32   critic loss: 0.40   steps: 3065


training loop:   6% |##                                | ETA:  34 days, 4:58:19

Episode: 3066   score: 23.85   Avg score (100e): 23.74   actor gain: -0.32   critic loss: 0.40   steps: 3066
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:56:48

Episode: 3067   score: 23.86   Avg score (100e): 23.74   actor gain: -0.32   critic loss: 0.40   steps: 3067
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:55:38

Episode: 3068   score: 23.86   Avg score (100e): 23.75   actor gain: -0.33   critic loss: 0.40   steps: 3068


training loop:   6% |##                                | ETA:  34 days, 4:55:17

Episode: 3069   score: 23.86   Avg score (100e): 23.75   actor gain: -0.33   critic loss: 0.40   steps: 3069


training loop:   6% |##                                | ETA:  34 days, 4:54:31

Episode: 3070   score: 23.86   Avg score (100e): 23.75   actor gain: -0.33   critic loss: 0.40   steps: 3070


training loop:   6% |##                                | ETA:  34 days, 4:53:02

Episode: 3071   score: 23.86   Avg score (100e): 23.75   actor gain: -0.33   critic loss: 0.40   steps: 3071


training loop:   6% |##                                | ETA:  34 days, 4:52:14

Episode: 3072   score: 23.86   Avg score (100e): 23.76   actor gain: -0.33   critic loss: 0.40   steps: 3072


training loop:   6% |##                                | ETA:  34 days, 4:51:48

Episode: 3073   score: 23.86   Avg score (100e): 23.76   actor gain: -0.33   critic loss: 0.40   steps: 3073
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:51:23

Episode: 3074   score: 23.86   Avg score (100e): 23.76   actor gain: -0.33   critic loss: 0.40   steps: 3074


training loop:   6% |##                                | ETA:  34 days, 4:50:14

Episode: 3075   score: 23.86   Avg score (100e): 23.76   actor gain: -0.33   critic loss: 0.39   steps: 3075
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:49:14

Episode: 3076   score: 23.87   Avg score (100e): 23.76   actor gain: -0.33   critic loss: 0.39   steps: 3076


training loop:   6% |##                                | ETA:  34 days, 4:48:55

Episode: 3077   score: 23.86   Avg score (100e): 23.77   actor gain: -0.33   critic loss: 0.39   steps: 3077


training loop:   6% |##                                | ETA:  34 days, 4:48:35

Episode: 3078   score: 23.87   Avg score (100e): 23.77   actor gain: -0.33   critic loss: 0.39   steps: 3078


training loop:   6% |##                                | ETA:  34 days, 4:48:05

Episode: 3079   score: 23.87   Avg score (100e): 23.77   actor gain: -0.33   critic loss: 0.39   steps: 3079
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:47:09

Episode: 3080   score: 23.87   Avg score (100e): 23.77   actor gain: -0.33   critic loss: 0.39   steps: 3080


training loop:   6% |##                                | ETA:  34 days, 4:46:23

Episode: 3081   score: 23.87   Avg score (100e): 23.77   actor gain: -0.33   critic loss: 0.39   steps: 3081


training loop:   6% |##                                | ETA:  34 days, 4:45:53

Episode: 3082   score: 23.88   Avg score (100e): 23.78   actor gain: -0.33   critic loss: 0.39   steps: 3082
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:44:34

Episode: 3083   score: 23.88   Avg score (100e): 23.78   actor gain: -0.33   critic loss: 0.39   steps: 3083


training loop:   6% |##                                | ETA:  34 days, 4:46:52

Episode: 3084   score: 23.87   Avg score (100e): 23.78   actor gain: -0.33   critic loss: 0.39   steps: 3084


training loop:   6% |##                                | ETA:  34 days, 4:47:02

Episode: 3085   score: 23.87   Avg score (100e): 23.78   actor gain: -0.33   critic loss: 0.39   steps: 3085


training loop:   6% |##                                | ETA:  34 days, 4:47:11

Episode: 3086   score: 23.88   Avg score (100e): 23.78   actor gain: -0.33   critic loss: 0.39   steps: 3086


training loop:   6% |##                                | ETA:  34 days, 4:46:40

Episode: 3087   score: 23.88   Avg score (100e): 23.79   actor gain: -0.33   critic loss: 0.39   steps: 3087
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:46:10

Episode: 3088   score: 23.89   Avg score (100e): 23.79   actor gain: -0.33   critic loss: 0.39   steps: 3088
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:44:55

Episode: 3089   score: 23.89   Avg score (100e): 23.79   actor gain: -0.33   critic loss: 0.39   steps: 3089
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:44:33

Episode: 3090   score: 23.89   Avg score (100e): 23.79   actor gain: -0.33   critic loss: 0.39   steps: 3090


training loop:   6% |##                                | ETA:  34 days, 4:44:18

Episode: 3091   score: 23.89   Avg score (100e): 23.80   actor gain: -0.33   critic loss: 0.39   steps: 3091
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:43:21

Episode: 3092   score: 23.89   Avg score (100e): 23.80   actor gain: -0.33   critic loss: 0.39   steps: 3092


training loop:   6% |##                                | ETA:  34 days, 4:43:31

Episode: 3093   score: 23.89   Avg score (100e): 23.80   actor gain: -0.33   critic loss: 0.39   steps: 3093
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:43:33

Episode: 3094   score: 23.89   Avg score (100e): 23.80   actor gain: -0.33   critic loss: 0.39   steps: 3094


training loop:   6% |##                                | ETA:  34 days, 4:42:58

Episode: 3095   score: 23.89   Avg score (100e): 23.80   actor gain: -0.33   critic loss: 0.39   steps: 3095


training loop:   6% |##                                | ETA:  34 days, 4:42:46

Episode: 3096   score: 23.89   Avg score (100e): 23.81   actor gain: -0.33   critic loss: 0.39   steps: 3096
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:41:31

Episode: 3097   score: 23.90   Avg score (100e): 23.81   actor gain: -0.33   critic loss: 0.39   steps: 3097


training loop:   6% |##                                | ETA:  34 days, 4:41:05

Episode: 3098   score: 23.90   Avg score (100e): 23.81   actor gain: -0.33   critic loss: 0.39   steps: 3098


training loop:   6% |##                                | ETA:  34 days, 4:39:57

Episode: 3099   score: 23.90   Avg score (100e): 23.81   actor gain: -0.33   critic loss: 0.39   steps: 3099


training loop:   6% |##                                | ETA:  34 days, 4:39:04

Episode: 3100   score: 23.90   Avg score (100e): 23.81   actor gain: -0.33   critic loss: 0.39   steps: 3100


training loop:   6% |##                                | ETA:  34 days, 4:38:26

Episode: 3101   score: 23.90   Avg score (100e): 23.82   actor gain: -0.33   critic loss: 0.39   steps: 3101


training loop:   6% |##                                | ETA:  34 days, 4:38:10

Episode: 3102   score: 23.90   Avg score (100e): 23.82   actor gain: -0.33   critic loss: 0.40   steps: 3102


training loop:   6% |##                                | ETA:  34 days, 4:37:14

Episode: 3103   score: 23.90   Avg score (100e): 23.82   actor gain: -0.33   critic loss: 0.40   steps: 3103


training loop:   6% |##                                | ETA:  34 days, 4:36:15

Episode: 3104   score: 23.90   Avg score (100e): 23.82   actor gain: -0.33   critic loss: 0.40   steps: 3104
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:34:43

Episode: 3105   score: 23.91   Avg score (100e): 23.82   actor gain: -0.33   critic loss: 0.40   steps: 3105
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:34:11

Episode: 3106   score: 23.91   Avg score (100e): 23.83   actor gain: -0.32   critic loss: 0.40   steps: 3106


training loop:   6% |##                                | ETA:  34 days, 4:34:48

Episode: 3107   score: 23.91   Avg score (100e): 23.83   actor gain: -0.32   critic loss: 0.40   steps: 3107


training loop:   6% |##                                | ETA:  34 days, 4:34:38

Episode: 3108   score: 23.91   Avg score (100e): 23.83   actor gain: -0.32   critic loss: 0.40   steps: 3108
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:33:45

Episode: 3109   score: 23.91   Avg score (100e): 23.83   actor gain: -0.32   critic loss: 0.40   steps: 3109


training loop:   6% |##                                | ETA:  34 days, 4:33:04

Episode: 3110   score: 23.91   Avg score (100e): 23.83   actor gain: -0.32   critic loss: 0.40   steps: 3110


training loop:   6% |##                                | ETA:  34 days, 4:33:00

Episode: 3111   score: 23.91   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3111


training loop:   6% |##                                | ETA:  34 days, 4:32:07

Episode: 3112   score: 23.92   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3112
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:30:34

Episode: 3113   score: 23.92   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3113


training loop:   6% |##                                | ETA:  34 days, 4:29:42

Episode: 3114   score: 23.92   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3114
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:29:09

Episode: 3115   score: 23.92   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3115


training loop:   6% |##                                | ETA:  34 days, 4:28:06

Episode: 3116   score: 23.92   Avg score (100e): 23.84   actor gain: -0.32   critic loss: 0.40   steps: 3116


training loop:   6% |##                                | ETA:  34 days, 4:29:30

Episode: 3117   score: 23.92   Avg score (100e): 23.85   actor gain: -0.32   critic loss: 0.40   steps: 3117


training loop:   6% |##                                | ETA:  34 days, 4:29:33

Episode: 3118   score: 23.93   Avg score (100e): 23.85   actor gain: -0.32   critic loss: 0.40   steps: 3118


training loop:   6% |##                                | ETA:  34 days, 4:29:54

Episode: 3119   score: 23.93   Avg score (100e): 23.85   actor gain: -0.32   critic loss: 0.40   steps: 3119
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:29:03

Episode: 3120   score: 23.93   Avg score (100e): 23.85   actor gain: -0.32   critic loss: 0.40   steps: 3120


training loop:   6% |##                                | ETA:  34 days, 4:28:44

Episode: 3121   score: 23.93   Avg score (100e): 23.85   actor gain: -0.32   critic loss: 0.40   steps: 3121


training loop:   6% |##                                | ETA:  34 days, 4:28:14

Episode: 3122   score: 23.93   Avg score (100e): 23.86   actor gain: -0.32   critic loss: 0.40   steps: 3122


training loop:   6% |##                                | ETA:  34 days, 4:27:36

Episode: 3123   score: 23.94   Avg score (100e): 23.86   actor gain: -0.32   critic loss: 0.40   steps: 3123


training loop:   6% |##                                | ETA:  34 days, 4:26:56

Episode: 3124   score: 23.94   Avg score (100e): 23.86   actor gain: -0.32   critic loss: 0.40   steps: 3124


training loop:   6% |##                                | ETA:  34 days, 4:26:59

Episode: 3125   score: 23.94   Avg score (100e): 23.86   actor gain: -0.32   critic loss: 0.40   steps: 3125


training loop:   6% |##                                | ETA:  34 days, 4:26:37

Episode: 3126   score: 23.94   Avg score (100e): 23.86   actor gain: -0.32   critic loss: 0.40   steps: 3126


training loop:   6% |##                                | ETA:  34 days, 4:26:13

Episode: 3127   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3127


training loop:   6% |##                                | ETA:  34 days, 4:25:32

Episode: 3128   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3128
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:23:45

Episode: 3129   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3129
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:22:59

Episode: 3130   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3130


training loop:   6% |##                                | ETA:  34 days, 4:22:05

Episode: 3131   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3131


training loop:   6% |##                                | ETA:  34 days, 4:21:42

Episode: 3132   score: 23.95   Avg score (100e): 23.87   actor gain: -0.32   critic loss: 0.40   steps: 3132
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:20:28

Episode: 3133   score: 23.96   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3133


training loop:   6% |##                                | ETA:  34 days, 4:20:15

Episode: 3134   score: 23.96   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3134


training loop:   6% |##                                | ETA:  34 days, 4:19:38

Episode: 3135   score: 23.96   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3135


training loop:   6% |##                                | ETA:  34 days, 4:19:04

Episode: 3136   score: 23.96   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3136
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:18:04

Episode: 3137   score: 23.97   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3137


training loop:   6% |##                                | ETA:  34 days, 4:16:51

Episode: 3138   score: 23.97   Avg score (100e): 23.88   actor gain: -0.32   critic loss: 0.40   steps: 3138


training loop:   6% |##                                | ETA:  34 days, 4:16:11

Episode: 3139   score: 23.97   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.40   steps: 3139
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:15:45

Episode: 3140   score: 23.97   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.40   steps: 3140


training loop:   6% |##                                | ETA:  34 days, 4:14:46

Episode: 3141   score: 23.97   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.40   steps: 3141


training loop:   6% |##                                | ETA:  34 days, 4:13:54

Episode: 3142   score: 23.98   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.39   steps: 3142


training loop:   6% |##                                | ETA:  34 days, 4:13:46

Episode: 3143   score: 23.98   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.39   steps: 3143


training loop:   6% |##                                | ETA:  34 days, 4:13:44

Episode: 3144   score: 23.99   Avg score (100e): 23.89   actor gain: -0.32   critic loss: 0.39   steps: 3144


training loop:   6% |##                                | ETA:  34 days, 4:13:06

Episode: 3145   score: 23.99   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.39   steps: 3145


training loop:   6% |##                                | ETA:  34 days, 4:12:21

Episode: 3146   score: 23.99   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.39   steps: 3146


training loop:   6% |##                                | ETA:  34 days, 4:12:01

Episode: 3147   score: 23.99   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.40   steps: 3147


training loop:   6% |##                                | ETA:  34 days, 4:12:08

Episode: 3148   score: 23.99   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.40   steps: 3148


training loop:   6% |##                                | ETA:  34 days, 4:14:24

Episode: 3149   score: 24.00   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.40   steps: 3149


training loop:   6% |##                                | ETA:  34 days, 4:15:36

Episode: 3150   score: 24.01   Avg score (100e): 23.90   actor gain: -0.32   critic loss: 0.40   steps: 3150
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:15:09

Episode: 3151   score: 24.02   Avg score (100e): 23.91   actor gain: -0.32   critic loss: 0.40   steps: 3151


training loop:   6% |##                                | ETA:  34 days, 4:15:23

Episode: 3152   score: 24.02   Avg score (100e): 23.91   actor gain: -0.32   critic loss: 0.40   steps: 3152


training loop:   6% |##                                | ETA:  34 days, 4:15:08

Episode: 3153   score: 24.02   Avg score (100e): 23.91   actor gain: -0.32   critic loss: 0.40   steps: 3153
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:13:56

Episode: 3154   score: 24.03   Avg score (100e): 23.91   actor gain: -0.32   critic loss: 0.40   steps: 3154
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:13:00

Episode: 3155   score: 24.03   Avg score (100e): 23.91   actor gain: -0.32   critic loss: 0.40   steps: 3155


training loop:   6% |##                                | ETA:  34 days, 4:13:00

Episode: 3156   score: 24.03   Avg score (100e): 23.92   actor gain: -0.32   critic loss: 0.40   steps: 3156


training loop:   6% |##                                | ETA:  34 days, 4:12:27

Episode: 3157   score: 24.04   Avg score (100e): 23.92   actor gain: -0.32   critic loss: 0.40   steps: 3157


training loop:   6% |##                                | ETA:  34 days, 4:12:03

Episode: 3158   score: 24.05   Avg score (100e): 23.92   actor gain: -0.32   critic loss: 0.40   steps: 3158
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:10:38

Episode: 3159   score: 24.06   Avg score (100e): 23.92   actor gain: -0.32   critic loss: 0.40   steps: 3159


training loop:   6% |##                                | ETA:  34 days, 4:10:27

Episode: 3160   score: 24.08   Avg score (100e): 23.93   actor gain: -0.32   critic loss: 0.40   steps: 3160


training loop:   6% |##                                | ETA:  34 days, 4:09:47

Episode: 3161   score: 24.09   Avg score (100e): 23.93   actor gain: -0.32   critic loss: 0.40   steps: 3161


training loop:   6% |##                                | ETA:  34 days, 4:09:31

Episode: 3162   score: 24.10   Avg score (100e): 23.93   actor gain: -0.32   critic loss: 0.40   steps: 3162
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:08:00

Episode: 3163   score: 24.10   Avg score (100e): 23.93   actor gain: -0.32   critic loss: 0.40   steps: 3163


training loop:   6% |##                                | ETA:  34 days, 4:06:01

Episode: 3164   score: 24.11   Avg score (100e): 23.94   actor gain: -0.32   critic loss: 0.40   steps: 3164
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 4:01:36

Episode: 3165   score: 24.11   Avg score (100e): 23.94   actor gain: -0.32   critic loss: 0.40   steps: 3165


training loop:   6% |##                                | ETA:  34 days, 3:57:08

Episode: 3166   score: 24.12   Avg score (100e): 23.94   actor gain: -0.32   critic loss: 0.40   steps: 3166
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 3:52:39

Episode: 3167   score: 24.12   Avg score (100e): 23.94   actor gain: -0.32   critic loss: 0.40   steps: 3167
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 3:47:52

Episode: 3168   score: 24.13   Avg score (100e): 23.95   actor gain: -0.32   critic loss: 0.40   steps: 3168


training loop:   6% |##                                | ETA:  34 days, 3:42:57

Episode: 3169   score: 24.14   Avg score (100e): 23.95   actor gain: -0.32   critic loss: 0.40   steps: 3169
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 3:37:35

Episode: 3170   score: 24.15   Avg score (100e): 23.95   actor gain: -0.32   critic loss: 0.40   steps: 3170


training loop:   6% |##                                | ETA:  34 days, 3:32:17

Episode: 3171   score: 24.16   Avg score (100e): 23.95   actor gain: -0.32   critic loss: 0.40   steps: 3171


training loop:   6% |##                                | ETA:  34 days, 3:26:48

Episode: 3172   score: 24.17   Avg score (100e): 23.96   actor gain: -0.32   critic loss: 0.40   steps: 3172


training loop:   6% |##                                | ETA:  34 days, 3:20:44

Episode: 3173   score: 24.17   Avg score (100e): 23.96   actor gain: -0.32   critic loss: 0.40   steps: 3173


training loop:   6% |##                                | ETA:  34 days, 3:15:50

Episode: 3174   score: 24.18   Avg score (100e): 23.96   actor gain: -0.32   critic loss: 0.40   steps: 3174


training loop:   6% |##                                | ETA:  34 days, 3:10:13

Episode: 3175   score: 24.19   Avg score (100e): 23.97   actor gain: -0.32   critic loss: 0.40   steps: 3175


training loop:   6% |##                                | ETA:  34 days, 3:04:05

Episode: 3176   score: 24.20   Avg score (100e): 23.97   actor gain: -0.32   critic loss: 0.40   steps: 3176


training loop:   6% |##                                | ETA:  34 days, 2:56:57

Episode: 3177   score: 24.20   Avg score (100e): 23.97   actor gain: -0.32   critic loss: 0.40   steps: 3177
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 2:49:20

Episode: 3178   score: 24.21   Avg score (100e): 23.98   actor gain: -0.32   critic loss: 0.40   steps: 3178
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 2:41:43

Episode: 3179   score: 24.21   Avg score (100e): 23.98   actor gain: -0.32   critic loss: 0.40   steps: 3179
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 2:34:06

Episode: 3180   score: 24.22   Avg score (100e): 23.98   actor gain: -0.32   critic loss: 0.40   steps: 3180


training loop:   6% |##                                | ETA:  34 days, 2:28:46

Episode: 3181   score: 24.23   Avg score (100e): 23.99   actor gain: -0.32   critic loss: 0.40   steps: 3181


training loop:   6% |##                                | ETA:  34 days, 2:22:02

Episode: 3182   score: 24.23   Avg score (100e): 23.99   actor gain: -0.32   critic loss: 0.40   steps: 3182


training loop:   6% |##                                | ETA:  34 days, 2:16:16

Episode: 3183   score: 24.23   Avg score (100e): 23.99   actor gain: -0.32   critic loss: 0.40   steps: 3183
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 2:09:51

Episode: 3184   score: 24.25   Avg score (100e): 24.00   actor gain: -0.32   critic loss: 0.40   steps: 3184


training loop:   6% |##                                | ETA:  34 days, 2:02:55

Episode: 3185   score: 24.26   Avg score (100e): 24.00   actor gain: -0.32   critic loss: 0.40   steps: 3185


training loop:   6% |##                                | ETA:  34 days, 1:55:31

Episode: 3186   score: 24.27   Avg score (100e): 24.01   actor gain: -0.32   critic loss: 0.40   steps: 3186
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 1:48:12

Episode: 3187   score: 24.27   Avg score (100e): 24.01   actor gain: -0.32   critic loss: 0.40   steps: 3187


training loop:   6% |##                                | ETA:  34 days, 1:40:55

Episode: 3188   score: 24.28   Avg score (100e): 24.01   actor gain: -0.32   critic loss: 0.40   steps: 3188


training loop:   6% |##                                | ETA:  34 days, 1:33:47

Episode: 3189   score: 24.29   Avg score (100e): 24.02   actor gain: -0.32   critic loss: 0.40   steps: 3189


training loop:   6% |##                                | ETA:  34 days, 1:27:03

Episode: 3190   score: 24.29   Avg score (100e): 24.02   actor gain: -0.32   critic loss: 0.40   steps: 3190


training loop:   6% |##                                | ETA:  34 days, 1:19:56

Episode: 3191   score: 24.30   Avg score (100e): 24.03   actor gain: -0.32   critic loss: 0.40   steps: 3191
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 1:12:19

Episode: 3192   score: 24.31   Avg score (100e): 24.03   actor gain: -0.32   critic loss: 0.40   steps: 3192


training loop:   6% |##                                | ETA:  34 days, 1:04:59

Episode: 3193   score: 24.32   Avg score (100e): 24.04   actor gain: -0.32   critic loss: 0.40   steps: 3193


training loop:   6% |##                                | ETA:  34 days, 0:57:35

Episode: 3194   score: 24.32   Avg score (100e): 24.04   actor gain: -0.32   critic loss: 0.40   steps: 3194
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 0:49:56

Episode: 3195   score: 24.33   Avg score (100e): 24.04   actor gain: -0.32   critic loss: 0.40   steps: 3195


training loop:   6% |##                                | ETA:  34 days, 0:42:44

Episode: 3196   score: 24.34   Avg score (100e): 24.05   actor gain: -0.32   critic loss: 0.40   steps: 3196
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 0:36:27

Episode: 3197   score: 24.34   Avg score (100e): 24.05   actor gain: -0.32   critic loss: 0.40   steps: 3197


training loop:   6% |##                                | ETA:  34 days, 0:29:15

Episode: 3198   score: 24.35   Avg score (100e): 24.06   actor gain: -0.32   critic loss: 0.40   steps: 3198
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  34 days, 0:21:38

Episode: 3199   score: 24.36   Avg score (100e): 24.06   actor gain: -0.32   critic loss: 0.40   steps: 3199


training loop:   6% |##                                | ETA:  34 days, 0:14:09

Episode: 3200   score: 24.37   Avg score (100e): 24.07   actor gain: -0.32   critic loss: 0.40   steps: 3200


training loop:   6% |##                                | ETA:  34 days, 0:06:46

Episode: 3201   score: 24.38   Avg score (100e): 24.07   actor gain: -0.32   critic loss: 0.40   steps: 3201
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 23:59:19

Episode: 3202   score: 24.39   Avg score (100e): 24.08   actor gain: -0.32   critic loss: 0.40   steps: 3202


training loop:   6% |##                               | ETA:  33 days, 23:51:51

Episode: 3203   score: 24.39   Avg score (100e): 24.08   actor gain: -0.32   critic loss: 0.40   steps: 3203
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 23:46:02

Episode: 3204   score: 24.40   Avg score (100e): 24.09   actor gain: -0.32   critic loss: 0.40   steps: 3204
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 23:40:18

Episode: 3205   score: 24.41   Avg score (100e): 24.09   actor gain: -0.32   critic loss: 0.40   steps: 3205


training loop:   6% |##                               | ETA:  33 days, 23:32:54

Episode: 3206   score: 24.41   Avg score (100e): 24.10   actor gain: -0.32   critic loss: 0.40   steps: 3206


training loop:   6% |##                               | ETA:  33 days, 23:25:59

Episode: 3207   score: 24.42   Avg score (100e): 24.10   actor gain: -0.32   critic loss: 0.40   steps: 3207


training loop:   6% |##                               | ETA:  33 days, 23:19:40

Episode: 3208   score: 24.43   Avg score (100e): 24.11   actor gain: -0.32   critic loss: 0.40   steps: 3208


training loop:   6% |##                               | ETA:  33 days, 23:12:45

Episode: 3209   score: 24.44   Avg score (100e): 24.11   actor gain: -0.32   critic loss: 0.40   steps: 3209


training loop:   6% |##                               | ETA:  33 days, 23:05:38

Episode: 3210   score: 24.45   Avg score (100e): 24.12   actor gain: -0.32   critic loss: 0.40   steps: 3210
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 22:58:33

Episode: 3211   score: 24.46   Avg score (100e): 24.12   actor gain: -0.32   critic loss: 0.40   steps: 3211


training loop:   6% |##                               | ETA:  33 days, 22:51:53

Episode: 3212   score: 24.47   Avg score (100e): 24.13   actor gain: -0.32   critic loss: 0.40   steps: 3212
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 22:44:58

Episode: 3213   score: 24.48   Avg score (100e): 24.13   actor gain: -0.32   critic loss: 0.40   steps: 3213
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 22:39:57

Episode: 3214   score: 24.49   Avg score (100e): 24.14   actor gain: -0.32   critic loss: 0.40   steps: 3214


training loop:   6% |##                               | ETA:  33 days, 22:33:00

Episode: 3215   score: 24.49   Avg score (100e): 24.15   actor gain: -0.32   critic loss: 0.40   steps: 3215
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 22:25:53

Episode: 3216   score: 24.50   Avg score (100e): 24.15   actor gain: -0.32   critic loss: 0.40   steps: 3216


training loop:   6% |##                               | ETA:  33 days, 22:18:57

Episode: 3217   score: 24.51   Avg score (100e): 24.16   actor gain: -0.32   critic loss: 0.40   steps: 3217


training loop:   6% |##                               | ETA:  33 days, 22:12:29

Episode: 3218   score: 24.52   Avg score (100e): 24.16   actor gain: -0.32   critic loss: 0.40   steps: 3218


training loop:   6% |##                               | ETA:  33 days, 22:06:45

Episode: 3219   score: 24.52   Avg score (100e): 24.17   actor gain: -0.32   critic loss: 0.40   steps: 3219
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 22:00:35

Episode: 3220   score: 24.53   Avg score (100e): 24.17   actor gain: -0.32   critic loss: 0.40   steps: 3220
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 21:54:10

Episode: 3221   score: 24.53   Avg score (100e): 24.18   actor gain: -0.32   critic loss: 0.40   steps: 3221


training loop:   6% |##                               | ETA:  33 days, 21:47:44

Episode: 3222   score: 24.53   Avg score (100e): 24.19   actor gain: -0.32   critic loss: 0.40   steps: 3222
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 21:40:50

Episode: 3223   score: 24.54   Avg score (100e): 24.19   actor gain: -0.32   critic loss: 0.40   steps: 3223


training loop:   6% |##                               | ETA:  33 days, 21:33:53

Episode: 3224   score: 24.55   Avg score (100e): 24.20   actor gain: -0.32   critic loss: 0.40   steps: 3224


training loop:   6% |##                               | ETA:  33 days, 21:28:24

Episode: 3225   score: 24.56   Avg score (100e): 24.20   actor gain: -0.32   critic loss: 0.40   steps: 3225


training loop:   6% |##                               | ETA:  33 days, 21:22:03

Episode: 3226   score: 24.57   Avg score (100e): 24.21   actor gain: -0.32   critic loss: 0.40   steps: 3226
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 21:15:33

Episode: 3227   score: 24.58   Avg score (100e): 24.22   actor gain: -0.32   critic loss: 0.40   steps: 3227
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 21:09:49

Episode: 3228   score: 24.59   Avg score (100e): 24.22   actor gain: -0.32   critic loss: 0.40   steps: 3228
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 21:04:40

Episode: 3229   score: 24.60   Avg score (100e): 24.23   actor gain: -0.32   critic loss: 0.40   steps: 3229


training loop:   6% |##                               | ETA:  33 days, 20:59:22

Episode: 3230   score: 24.60   Avg score (100e): 24.24   actor gain: -0.32   critic loss: 0.40   steps: 3230


training loop:   6% |##                               | ETA:  33 days, 20:53:15

Episode: 3231   score: 24.61   Avg score (100e): 24.24   actor gain: -0.32   critic loss: 0.40   steps: 3231
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 20:46:29

Episode: 3232   score: 24.62   Avg score (100e): 24.25   actor gain: -0.32   critic loss: 0.40   steps: 3232
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 20:39:54

Episode: 3233   score: 24.62   Avg score (100e): 24.26   actor gain: -0.32   critic loss: 0.40   steps: 3233
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 20:33:11

Episode: 3234   score: 24.62   Avg score (100e): 24.26   actor gain: -0.32   critic loss: 0.40   steps: 3234


training loop:   6% |##                               | ETA:  33 days, 20:26:23

Episode: 3235   score: 24.63   Avg score (100e): 24.27   actor gain: -0.32   critic loss: 0.40   steps: 3235
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 20:19:16

Episode: 3236   score: 24.64   Avg score (100e): 24.28   actor gain: -0.32   critic loss: 0.39   steps: 3236


training loop:   6% |##                               | ETA:  33 days, 20:12:36

Episode: 3237   score: 24.65   Avg score (100e): 24.28   actor gain: -0.32   critic loss: 0.39   steps: 3237


training loop:   6% |##                               | ETA:  33 days, 20:06:42

Episode: 3238   score: 24.65   Avg score (100e): 24.29   actor gain: -0.32   critic loss: 0.39   steps: 3238


training loop:   6% |##                               | ETA:  33 days, 20:00:14

Episode: 3239   score: 24.66   Avg score (100e): 24.30   actor gain: -0.32   critic loss: 0.39   steps: 3239


training loop:   6% |##                               | ETA:  33 days, 19:53:23

Episode: 3240   score: 24.67   Avg score (100e): 24.30   actor gain: -0.32   critic loss: 0.39   steps: 3240


training loop:   6% |##                               | ETA:  33 days, 19:46:27

Episode: 3241   score: 24.68   Avg score (100e): 24.31   actor gain: -0.32   critic loss: 0.39   steps: 3241
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 19:39:34

Episode: 3242   score: 24.69   Avg score (100e): 24.32   actor gain: -0.32   critic loss: 0.39   steps: 3242


training loop:   6% |##                               | ETA:  33 days, 19:32:31

Episode: 3243   score: 24.70   Avg score (100e): 24.33   actor gain: -0.32   critic loss: 0.39   steps: 3243
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 19:25:19

Episode: 3244   score: 24.71   Avg score (100e): 24.33   actor gain: -0.32   critic loss: 0.39   steps: 3244
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 19:19:00

Episode: 3245   score: 24.71   Avg score (100e): 24.34   actor gain: -0.32   critic loss: 0.39   steps: 3245
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 19:14:37

Episode: 3246   score: 24.72   Avg score (100e): 24.35   actor gain: -0.32   critic loss: 0.39   steps: 3246


training loop:   6% |##                               | ETA:  33 days, 19:08:28

Episode: 3247   score: 24.73   Avg score (100e): 24.35   actor gain: -0.32   critic loss: 0.39   steps: 3247


training loop:   6% |##                               | ETA:  33 days, 19:02:23

Episode: 3248   score: 24.73   Avg score (100e): 24.36   actor gain: -0.32   critic loss: 0.39   steps: 3248


training loop:   6% |##                               | ETA:  33 days, 18:57:11

Episode: 3249   score: 24.74   Avg score (100e): 24.37   actor gain: -0.32   critic loss: 0.39   steps: 3249


training loop:   6% |##                               | ETA:  33 days, 18:51:19

Episode: 3250   score: 24.75   Avg score (100e): 24.38   actor gain: -0.32   critic loss: 0.40   steps: 3250


training loop:   6% |##                               | ETA:  33 days, 18:45:35

Episode: 3251   score: 24.76   Avg score (100e): 24.38   actor gain: -0.32   critic loss: 0.40   steps: 3251


training loop:   6% |##                               | ETA:  33 days, 18:38:59

Episode: 3252   score: 24.77   Avg score (100e): 24.39   actor gain: -0.32   critic loss: 0.40   steps: 3252


training loop:   6% |##                               | ETA:  33 days, 18:32:44

Episode: 3253   score: 24.77   Avg score (100e): 24.40   actor gain: -0.32   critic loss: 0.40   steps: 3253


training loop:   6% |##                               | ETA:  33 days, 18:26:22

Episode: 3254   score: 24.78   Avg score (100e): 24.41   actor gain: -0.32   critic loss: 0.40   steps: 3254


training loop:   6% |##                               | ETA:  33 days, 18:19:45

Episode: 3255   score: 24.78   Avg score (100e): 24.41   actor gain: -0.32   critic loss: 0.40   steps: 3255
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 18:12:57

Episode: 3256   score: 24.79   Avg score (100e): 24.42   actor gain: -0.32   critic loss: 0.40   steps: 3256


training loop:   6% |##                               | ETA:  33 days, 18:07:14

Episode: 3257   score: 24.79   Avg score (100e): 24.43   actor gain: -0.32   critic loss: 0.40   steps: 3257


training loop:   6% |##                               | ETA:  33 days, 18:01:58

Episode: 3258   score: 24.80   Avg score (100e): 24.44   actor gain: -0.32   critic loss: 0.40   steps: 3258


training loop:   6% |##                               | ETA:  33 days, 17:55:49

Episode: 3259   score: 24.82   Avg score (100e): 24.44   actor gain: -0.32   critic loss: 0.40   steps: 3259


training loop:   6% |##                               | ETA:  33 days, 17:48:48

Episode: 3260   score: 24.83   Avg score (100e): 24.45   actor gain: -0.32   critic loss: 0.40   steps: 3260


training loop:   6% |##                               | ETA:  33 days, 17:42:01

Episode: 3261   score: 24.83   Avg score (100e): 24.46   actor gain: -0.32   critic loss: 0.40   steps: 3261


training loop:   6% |##                               | ETA:  33 days, 17:35:01

Episode: 3262   score: 24.84   Avg score (100e): 24.47   actor gain: -0.32   critic loss: 0.40   steps: 3262
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 17:27:40

Episode: 3263   score: 24.85   Avg score (100e): 24.47   actor gain: -0.32   critic loss: 0.40   steps: 3263


training loop:   6% |##                               | ETA:  33 days, 17:21:05

Episode: 3264   score: 24.85   Avg score (100e): 24.48   actor gain: -0.32   critic loss: 0.40   steps: 3264


training loop:   6% |##                               | ETA:  33 days, 17:16:21

Episode: 3265   score: 24.86   Avg score (100e): 24.49   actor gain: -0.32   critic loss: 0.40   steps: 3265


training loop:   6% |##                               | ETA:  33 days, 17:09:47

Episode: 3266   score: 24.87   Avg score (100e): 24.50   actor gain: -0.32   critic loss: 0.40   steps: 3266


training loop:   6% |##                               | ETA:  33 days, 17:03:42

Episode: 3267   score: 24.88   Avg score (100e): 24.50   actor gain: -0.32   critic loss: 0.40   steps: 3267


training loop:   6% |##                               | ETA:  33 days, 16:57:40

Episode: 3268   score: 24.89   Avg score (100e): 24.51   actor gain: -0.32   critic loss: 0.40   steps: 3268
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 16:50:43

Episode: 3269   score: 24.90   Avg score (100e): 24.52   actor gain: -0.32   critic loss: 0.40   steps: 3269
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 16:44:14

Episode: 3270   score: 24.91   Avg score (100e): 24.53   actor gain: -0.32   critic loss: 0.40   steps: 3270


training loop:   6% |##                               | ETA:  33 days, 16:39:58

Episode: 3271   score: 24.92   Avg score (100e): 24.53   actor gain: -0.32   critic loss: 0.40   steps: 3271


training loop:   6% |##                               | ETA:  33 days, 16:34:25

Episode: 3272   score: 24.92   Avg score (100e): 24.54   actor gain: -0.32   critic loss: 0.40   steps: 3272


training loop:   6% |##                               | ETA:  33 days, 16:29:01

Episode: 3273   score: 24.93   Avg score (100e): 24.55   actor gain: -0.32   critic loss: 0.40   steps: 3273
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 16:22:55

Episode: 3274   score: 24.94   Avg score (100e): 24.56   actor gain: -0.32   critic loss: 0.40   steps: 3274


training loop:   6% |##                               | ETA:  33 days, 16:16:32

Episode: 3275   score: 24.94   Avg score (100e): 24.56   actor gain: -0.32   critic loss: 0.40   steps: 3275


training loop:   6% |##                               | ETA:  33 days, 16:10:08

Episode: 3276   score: 24.95   Avg score (100e): 24.57   actor gain: -0.32   critic loss: 0.40   steps: 3276


training loop:   6% |##                               | ETA:  33 days, 16:03:26

Episode: 3277   score: 24.96   Avg score (100e): 24.58   actor gain: -0.32   critic loss: 0.40   steps: 3277
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 15:59:14

Episode: 3278   score: 24.97   Avg score (100e): 24.59   actor gain: -0.32   critic loss: 0.40   steps: 3278


training loop:   6% |##                               | ETA:  33 days, 15:53:14

Episode: 3279   score: 24.97   Avg score (100e): 24.60   actor gain: -0.32   critic loss: 0.40   steps: 3279


training loop:   6% |##                               | ETA:  33 days, 15:47:20

Episode: 3280   score: 24.98   Avg score (100e): 24.60   actor gain: -0.32   critic loss: 0.40   steps: 3280


training loop:   6% |##                               | ETA:  33 days, 15:41:40

Episode: 3281   score: 24.99   Avg score (100e): 24.61   actor gain: -0.32   critic loss: 0.40   steps: 3281


training loop:   6% |##                               | ETA:  33 days, 15:36:49

Episode: 3282   score: 25.00   Avg score (100e): 24.62   actor gain: -0.32   critic loss: 0.40   steps: 3282


training loop:   6% |##                               | ETA:  33 days, 15:30:26

Episode: 3283   score: 25.01   Avg score (100e): 24.63   actor gain: -0.32   critic loss: 0.40   steps: 3283


training loop:   6% |##                               | ETA:  33 days, 15:24:55

Episode: 3284   score: 25.02   Avg score (100e): 24.63   actor gain: -0.32   critic loss: 0.40   steps: 3284


training loop:   6% |##                               | ETA:  33 days, 15:18:34

Episode: 3285   score: 25.02   Avg score (100e): 24.64   actor gain: -0.32   critic loss: 0.40   steps: 3285


training loop:   6% |##                               | ETA:  33 days, 15:12:15

Episode: 3286   score: 25.03   Avg score (100e): 24.65   actor gain: -0.32   critic loss: 0.40   steps: 3286


training loop:   6% |##                               | ETA:  33 days, 15:05:32

Episode: 3287   score: 25.04   Avg score (100e): 24.66   actor gain: -0.32   critic loss: 0.40   steps: 3287


training loop:   6% |##                               | ETA:  33 days, 14:58:36

Episode: 3288   score: 25.05   Avg score (100e): 24.66   actor gain: -0.32   critic loss: 0.40   steps: 3288
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 14:51:30

Episode: 3289   score: 25.06   Avg score (100e): 24.67   actor gain: -0.32   critic loss: 0.40   steps: 3289


training loop:   6% |##                               | ETA:  33 days, 14:45:03

Episode: 3290   score: 25.06   Avg score (100e): 24.68   actor gain: -0.32   critic loss: 0.40   steps: 3290


training loop:   6% |##                               | ETA:  33 days, 14:39:45

Episode: 3291   score: 25.07   Avg score (100e): 24.69   actor gain: -0.32   critic loss: 0.40   steps: 3291
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 14:32:58

Episode: 3292   score: 25.08   Avg score (100e): 24.69   actor gain: -0.32   critic loss: 0.40   steps: 3292


training loop:   6% |##                               | ETA:  33 days, 14:27:00

Episode: 3293   score: 25.09   Avg score (100e): 24.70   actor gain: -0.32   critic loss: 0.40   steps: 3293


training loop:   6% |##                               | ETA:  33 days, 14:20:37

Episode: 3294   score: 25.09   Avg score (100e): 24.71   actor gain: -0.32   critic loss: 0.40   steps: 3294
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 14:13:26

Episode: 3295   score: 25.10   Avg score (100e): 24.72   actor gain: -0.32   critic loss: 0.40   steps: 3295


training loop:   6% |##                               | ETA:  33 days, 14:06:33

Episode: 3296   score: 25.12   Avg score (100e): 24.73   actor gain: -0.32   critic loss: 0.40   steps: 3296
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 14:00:08

Episode: 3297   score: 25.12   Avg score (100e): 24.73   actor gain: -0.32   critic loss: 0.40   steps: 3297


training loop:   6% |##                               | ETA:  33 days, 13:54:48

Episode: 3298   score: 25.13   Avg score (100e): 24.74   actor gain: -0.32   critic loss: 0.40   steps: 3298


training loop:   6% |##                               | ETA:  33 days, 13:48:51

Episode: 3299   score: 25.14   Avg score (100e): 24.75   actor gain: -0.32   critic loss: 0.40   steps: 3299
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 13:41:49

Episode: 3300   score: 25.15   Avg score (100e): 24.76   actor gain: -0.32   critic loss: 0.40   steps: 3300
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 13:35:01

Episode: 3301   score: 25.15   Avg score (100e): 24.76   actor gain: -0.32   critic loss: 0.40   steps: 3301
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 13:28:03

Episode: 3302   score: 25.16   Avg score (100e): 24.77   actor gain: -0.32   critic loss: 0.40   steps: 3302


training loop:   6% |##                               | ETA:  33 days, 13:21:13

Episode: 3303   score: 25.17   Avg score (100e): 24.78   actor gain: -0.32   critic loss: 0.40   steps: 3303
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 13:13:49

Episode: 3304   score: 25.18   Avg score (100e): 24.79   actor gain: -0.32   critic loss: 0.40   steps: 3304


training loop:   6% |##                               | ETA:  33 days, 13:08:59

Episode: 3305   score: 25.19   Avg score (100e): 24.80   actor gain: -0.32   critic loss: 0.40   steps: 3305


training loop:   6% |##                               | ETA:  33 days, 13:02:57

Episode: 3306   score: 25.19   Avg score (100e): 24.80   actor gain: -0.32   critic loss: 0.40   steps: 3306


training loop:   6% |##                               | ETA:  33 days, 12:57:00

Episode: 3307   score: 25.20   Avg score (100e): 24.81   actor gain: -0.32   critic loss: 0.40   steps: 3307
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 12:50:26

Episode: 3308   score: 25.21   Avg score (100e): 24.82   actor gain: -0.32   critic loss: 0.40   steps: 3308


training loop:   6% |##                               | ETA:  33 days, 12:43:58

Episode: 3309   score: 25.22   Avg score (100e): 24.83   actor gain: -0.32   critic loss: 0.40   steps: 3309


training loop:   6% |##                               | ETA:  33 days, 12:37:20

Episode: 3310   score: 25.22   Avg score (100e): 24.83   actor gain: -0.32   critic loss: 0.40   steps: 3310


training loop:   6% |##                               | ETA:  33 days, 12:32:31

Episode: 3311   score: 25.23   Avg score (100e): 24.84   actor gain: -0.32   critic loss: 0.40   steps: 3311
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 12:26:27

Episode: 3312   score: 25.24   Avg score (100e): 24.85   actor gain: -0.32   critic loss: 0.40   steps: 3312


training loop:   6% |##                               | ETA:  33 days, 12:20:27

Episode: 3313   score: 25.25   Avg score (100e): 24.86   actor gain: -0.32   critic loss: 0.40   steps: 3313


training loop:   6% |##                               | ETA:  33 days, 12:15:15

Episode: 3314   score: 25.25   Avg score (100e): 24.87   actor gain: -0.32   critic loss: 0.40   steps: 3314


training loop:   6% |##                               | ETA:  33 days, 12:09:37

Episode: 3315   score: 25.26   Avg score (100e): 24.87   actor gain: -0.32   critic loss: 0.40   steps: 3315


training loop:   6% |##                               | ETA:  33 days, 12:04:34

Episode: 3316   score: 25.27   Avg score (100e): 24.88   actor gain: -0.32   critic loss: 0.40   steps: 3316
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:59:50

Episode: 3317   score: 25.28   Avg score (100e): 24.89   actor gain: -0.33   critic loss: 0.40   steps: 3317


training loop:   6% |##                               | ETA:  33 days, 11:55:43

Episode: 3318   score: 25.28   Avg score (100e): 24.90   actor gain: -0.33   critic loss: 0.40   steps: 3318


training loop:   6% |##                               | ETA:  33 days, 11:50:31

Episode: 3319   score: 25.29   Avg score (100e): 24.90   actor gain: -0.33   critic loss: 0.40   steps: 3319
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:44:29

Episode: 3320   score: 25.30   Avg score (100e): 24.91   actor gain: -0.33   critic loss: 0.40   steps: 3320


training loop:   6% |##                               | ETA:  33 days, 11:38:03

Episode: 3321   score: 25.31   Avg score (100e): 24.92   actor gain: -0.33   critic loss: 0.40   steps: 3321
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:31:29

Episode: 3322   score: 25.32   Avg score (100e): 24.93   actor gain: -0.33   critic loss: 0.40   steps: 3322


training loop:   6% |##                               | ETA:  33 days, 11:25:56

Episode: 3323   score: 25.33   Avg score (100e): 24.93   actor gain: -0.33   critic loss: 0.40   steps: 3323
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:20:57

Episode: 3324   score: 25.33   Avg score (100e): 24.94   actor gain: -0.33   critic loss: 0.40   steps: 3324
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:14:00

Episode: 3325   score: 25.34   Avg score (100e): 24.95   actor gain: -0.33   critic loss: 0.40   steps: 3325
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 11:07:23

Episode: 3326   score: 25.35   Avg score (100e): 24.96   actor gain: -0.33   critic loss: 0.40   steps: 3326


training loop:   6% |##                               | ETA:  33 days, 11:01:29

Episode: 3327   score: 25.35   Avg score (100e): 24.97   actor gain: -0.33   critic loss: 0.40   steps: 3327


training loop:   6% |##                               | ETA:  33 days, 10:55:07

Episode: 3328   score: 25.36   Avg score (100e): 24.97   actor gain: -0.33   critic loss: 0.40   steps: 3328


training loop:   6% |##                               | ETA:  33 days, 10:48:11

Episode: 3329   score: 25.37   Avg score (100e): 24.98   actor gain: -0.33   critic loss: 0.40   steps: 3329


training loop:   6% |##                               | ETA:  33 days, 10:41:56

Episode: 3330   score: 25.38   Avg score (100e): 24.99   actor gain: -0.33   critic loss: 0.40   steps: 3330
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 10:37:13

Episode: 3331   score: 25.39   Avg score (100e): 25.00   actor gain: -0.32   critic loss: 0.40   steps: 3331


training loop:   6% |##                               | ETA:  33 days, 10:31:11

Episode: 3332   score: 25.40   Avg score (100e): 25.00   actor gain: -0.32   critic loss: 0.40   steps: 3332


training loop:   6% |##                               | ETA:  33 days, 10:24:25

Episode: 3333   score: 25.40   Avg score (100e): 25.01   actor gain: -0.32   critic loss: 0.40   steps: 3333
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 10:17:16

Episode: 3334   score: 25.41   Avg score (100e): 25.02   actor gain: -0.33   critic loss: 0.40   steps: 3334


training loop:   6% |##                               | ETA:  33 days, 10:10:48

Episode: 3335   score: 25.42   Avg score (100e): 25.03   actor gain: -0.32   critic loss: 0.40   steps: 3335
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  33 days, 10:04:03

Episode: 3336   score: 25.42   Avg score (100e): 25.04   actor gain: -0.32   critic loss: 0.40   steps: 3336


training loop:   6% |##                                | ETA:  33 days, 9:57:20

Episode: 3337   score: 25.43   Avg score (100e): 25.04   actor gain: -0.32   critic loss: 0.40   steps: 3337
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 9:53:06

Episode: 3338   score: 25.43   Avg score (100e): 25.05   actor gain: -0.32   critic loss: 0.40   steps: 3338


training loop:   6% |##                                | ETA:  33 days, 9:46:45

Episode: 3339   score: 25.43   Avg score (100e): 25.06   actor gain: -0.32   critic loss: 0.40   steps: 3339
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 9:40:03

Episode: 3340   score: 25.44   Avg score (100e): 25.07   actor gain: -0.32   critic loss: 0.40   steps: 3340


training loop:   6% |##                                | ETA:  33 days, 9:35:32

Episode: 3341   score: 25.45   Avg score (100e): 25.07   actor gain: -0.32   critic loss: 0.40   steps: 3341


training loop:   6% |##                                | ETA:  33 days, 9:29:10

Episode: 3342   score: 25.46   Avg score (100e): 25.08   actor gain: -0.32   critic loss: 0.40   steps: 3342


training loop:   6% |##                                | ETA:  33 days, 9:23:36

Episode: 3343   score: 25.47   Avg score (100e): 25.09   actor gain: -0.32   critic loss: 0.40   steps: 3343
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 9:17:30

Episode: 3344   score: 25.48   Avg score (100e): 25.10   actor gain: -0.32   critic loss: 0.40   steps: 3344


training loop:   6% |##                                | ETA:  33 days, 9:12:26

Episode: 3345   score: 25.48   Avg score (100e): 25.11   actor gain: -0.32   critic loss: 0.40   steps: 3345
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 9:06:32

Episode: 3346   score: 25.49   Avg score (100e): 25.11   actor gain: -0.32   critic loss: 0.40   steps: 3346


training loop:   6% |##                                | ETA:  33 days, 9:00:35

Episode: 3347   score: 25.50   Avg score (100e): 25.12   actor gain: -0.32   critic loss: 0.40   steps: 3347


training loop:   6% |##                                | ETA:  33 days, 8:55:02

Episode: 3348   score: 25.51   Avg score (100e): 25.13   actor gain: -0.32   critic loss: 0.40   steps: 3348
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 8:48:28

Episode: 3349   score: 25.52   Avg score (100e): 25.14   actor gain: -0.32   critic loss: 0.40   steps: 3349


training loop:   6% |##                                | ETA:  33 days, 8:42:05

Episode: 3350   score: 25.53   Avg score (100e): 25.14   actor gain: -0.32   critic loss: 0.40   steps: 3350


training loop:   6% |##                                | ETA:  33 days, 8:36:00

Episode: 3351   score: 25.53   Avg score (100e): 25.15   actor gain: -0.32   critic loss: 0.40   steps: 3351


training loop:   6% |##                                | ETA:  33 days, 8:29:01

Episode: 3352   score: 25.54   Avg score (100e): 25.16   actor gain: -0.32   critic loss: 0.40   steps: 3352


training loop:   6% |##                                | ETA:  33 days, 8:22:07

Episode: 3353   score: 25.55   Avg score (100e): 25.17   actor gain: -0.32   critic loss: 0.40   steps: 3353


training loop:   6% |##                                | ETA:  33 days, 8:15:05

Episode: 3354   score: 25.56   Avg score (100e): 25.18   actor gain: -0.32   critic loss: 0.40   steps: 3354


training loop:   6% |##                                | ETA:  33 days, 8:08:05

Episode: 3355   score: 25.57   Avg score (100e): 25.18   actor gain: -0.32   critic loss: 0.40   steps: 3355
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 8:01:00

Episode: 3356   score: 25.57   Avg score (100e): 25.19   actor gain: -0.32   critic loss: 0.40   steps: 3356


training loop:   6% |##                                | ETA:  33 days, 7:54:27

Episode: 3357   score: 25.57   Avg score (100e): 25.20   actor gain: -0.32   critic loss: 0.40   steps: 3357


training loop:   6% |##                                | ETA:  33 days, 7:48:14

Episode: 3358   score: 25.58   Avg score (100e): 25.21   actor gain: -0.32   critic loss: 0.40   steps: 3358


training loop:   6% |##                                | ETA:  33 days, 7:41:20

Episode: 3359   score: 25.59   Avg score (100e): 25.21   actor gain: -0.32   critic loss: 0.40   steps: 3359


training loop:   6% |##                                | ETA:  33 days, 7:34:45

Episode: 3360   score: 25.60   Avg score (100e): 25.22   actor gain: -0.32   critic loss: 0.40   steps: 3360


training loop:   6% |##                                | ETA:  33 days, 7:28:16

Episode: 3361   score: 25.61   Avg score (100e): 25.23   actor gain: -0.32   critic loss: 0.40   steps: 3361


training loop:   6% |##                                | ETA:  33 days, 7:21:28

Episode: 3362   score: 25.62   Avg score (100e): 25.24   actor gain: -0.32   critic loss: 0.40   steps: 3362


training loop:   6% |##                                | ETA:  33 days, 7:14:41

Episode: 3363   score: 25.62   Avg score (100e): 25.25   actor gain: -0.32   critic loss: 0.40   steps: 3363


training loop:   6% |##                                | ETA:  33 days, 7:08:25

Episode: 3364   score: 25.63   Avg score (100e): 25.25   actor gain: -0.32   critic loss: 0.40   steps: 3364
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 7:04:35

Episode: 3365   score: 25.64   Avg score (100e): 25.26   actor gain: -0.32   critic loss: 0.40   steps: 3365


training loop:   6% |##                                | ETA:  33 days, 6:59:00

Episode: 3366   score: 25.64   Avg score (100e): 25.27   actor gain: -0.32   critic loss: 0.40   steps: 3366


training loop:   6% |##                                | ETA:  33 days, 6:52:16

Episode: 3367   score: 25.65   Avg score (100e): 25.28   actor gain: -0.32   critic loss: 0.40   steps: 3367


training loop:   6% |##                                | ETA:  33 days, 6:45:40

Episode: 3368   score: 25.66   Avg score (100e): 25.28   actor gain: -0.32   critic loss: 0.40   steps: 3368
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 6:38:55

Episode: 3369   score: 25.67   Avg score (100e): 25.29   actor gain: -0.32   critic loss: 0.40   steps: 3369
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 6:32:08

Episode: 3370   score: 25.67   Avg score (100e): 25.30   actor gain: -0.32   critic loss: 0.40   steps: 3370


training loop:   6% |##                                | ETA:  33 days, 6:25:23

Episode: 3371   score: 25.68   Avg score (100e): 25.31   actor gain: -0.32   critic loss: 0.40   steps: 3371


training loop:   6% |##                                | ETA:  33 days, 6:19:33

Episode: 3372   score: 25.68   Avg score (100e): 25.31   actor gain: -0.32   critic loss: 0.40   steps: 3372


training loop:   6% |##                                | ETA:  33 days, 6:12:44

Episode: 3373   score: 25.68   Avg score (100e): 25.32   actor gain: -0.32   critic loss: 0.40   steps: 3373
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 6:05:43

Episode: 3374   score: 25.69   Avg score (100e): 25.33   actor gain: -0.32   critic loss: 0.40   steps: 3374


training loop:   6% |##                                | ETA:  33 days, 5:59:11

Episode: 3375   score: 25.70   Avg score (100e): 25.34   actor gain: -0.32   critic loss: 0.40   steps: 3375


training loop:   6% |##                                | ETA:  33 days, 5:54:10

Episode: 3376   score: 25.71   Avg score (100e): 25.34   actor gain: -0.32   critic loss: 0.40   steps: 3376


training loop:   6% |##                                | ETA:  33 days, 5:47:36

Episode: 3377   score: 25.72   Avg score (100e): 25.35   actor gain: -0.32   critic loss: 0.40   steps: 3377
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 5:40:42

Episode: 3378   score: 25.72   Avg score (100e): 25.36   actor gain: -0.32   critic loss: 0.40   steps: 3378


training loop:   6% |##                                | ETA:  33 days, 5:34:44

Episode: 3379   score: 25.72   Avg score (100e): 25.37   actor gain: -0.32   critic loss: 0.40   steps: 3379
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 5:28:08

Episode: 3380   score: 25.73   Avg score (100e): 25.37   actor gain: -0.32   critic loss: 0.40   steps: 3380


training loop:   6% |##                                | ETA:  33 days, 5:21:35

Episode: 3381   score: 25.73   Avg score (100e): 25.38   actor gain: -0.32   critic loss: 0.40   steps: 3381
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 5:14:49

Episode: 3382   score: 25.74   Avg score (100e): 25.39   actor gain: -0.32   critic loss: 0.40   steps: 3382


training loop:   6% |##                                | ETA:  33 days, 5:08:01

Episode: 3383   score: 25.75   Avg score (100e): 25.40   actor gain: -0.32   critic loss: 0.40   steps: 3383


training loop:   6% |##                                | ETA:  33 days, 5:01:25

Episode: 3384   score: 25.76   Avg score (100e): 25.40   actor gain: -0.32   critic loss: 0.40   steps: 3384


training loop:   6% |##                                | ETA:  33 days, 4:54:41

Episode: 3385   score: 25.77   Avg score (100e): 25.41   actor gain: -0.32   critic loss: 0.40   steps: 3385
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 4:48:12

Episode: 3386   score: 25.78   Avg score (100e): 25.42   actor gain: -0.32   critic loss: 0.40   steps: 3386


training loop:   6% |##                                | ETA:  33 days, 4:41:39

Episode: 3387   score: 25.78   Avg score (100e): 25.43   actor gain: -0.32   critic loss: 0.39   steps: 3387
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 4:34:55

Episode: 3388   score: 25.79   Avg score (100e): 25.43   actor gain: -0.32   critic loss: 0.39   steps: 3388


training loop:   6% |##                                | ETA:  33 days, 4:28:24

Episode: 3389   score: 25.80   Avg score (100e): 25.44   actor gain: -0.32   critic loss: 0.39   steps: 3389


training loop:   6% |##                                | ETA:  33 days, 4:21:42

Episode: 3390   score: 25.80   Avg score (100e): 25.45   actor gain: -0.32   critic loss: 0.39   steps: 3390
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 4:14:36

Episode: 3391   score: 25.81   Avg score (100e): 25.46   actor gain: -0.32   critic loss: 0.39   steps: 3391
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 4:07:50

Episode: 3392   score: 25.82   Avg score (100e): 25.46   actor gain: -0.32   critic loss: 0.39   steps: 3392


training loop:   6% |##                                | ETA:  33 days, 4:01:23

Episode: 3393   score: 25.83   Avg score (100e): 25.47   actor gain: -0.32   critic loss: 0.40   steps: 3393


training loop:   6% |##                                | ETA:  33 days, 3:55:54

Episode: 3394   score: 25.84   Avg score (100e): 25.48   actor gain: -0.32   critic loss: 0.40   steps: 3394


training loop:   6% |##                                | ETA:  33 days, 3:50:12

Episode: 3395   score: 25.85   Avg score (100e): 25.49   actor gain: -0.32   critic loss: 0.40   steps: 3395


training loop:   6% |##                                | ETA:  33 days, 3:43:49

Episode: 3396   score: 25.85   Avg score (100e): 25.49   actor gain: -0.32   critic loss: 0.40   steps: 3396


training loop:   6% |##                                | ETA:  33 days, 3:37:04

Episode: 3397   score: 25.86   Avg score (100e): 25.50   actor gain: -0.32   critic loss: 0.40   steps: 3397


training loop:   6% |##                                | ETA:  33 days, 3:31:01

Episode: 3398   score: 25.87   Avg score (100e): 25.51   actor gain: -0.32   critic loss: 0.40   steps: 3398


training loop:   6% |##                                | ETA:  33 days, 3:24:18

Episode: 3399   score: 25.88   Avg score (100e): 25.52   actor gain: -0.32   critic loss: 0.40   steps: 3399
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 3:18:18

Episode: 3400   score: 25.89   Avg score (100e): 25.52   actor gain: -0.32   critic loss: 0.40   steps: 3400


training loop:   6% |##                                | ETA:  33 days, 3:12:35

Episode: 3401   score: 25.89   Avg score (100e): 25.53   actor gain: -0.32   critic loss: 0.40   steps: 3401


training loop:   6% |##                                | ETA:  33 days, 3:05:41

Episode: 3402   score: 25.90   Avg score (100e): 25.54   actor gain: -0.32   critic loss: 0.40   steps: 3402


training loop:   6% |##                                | ETA:  33 days, 2:59:06

Episode: 3403   score: 25.90   Avg score (100e): 25.55   actor gain: -0.32   critic loss: 0.40   steps: 3403
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 2:53:25

Episode: 3404   score: 25.91   Avg score (100e): 25.55   actor gain: -0.32   critic loss: 0.40   steps: 3404


training loop:   6% |##                                | ETA:  33 days, 2:47:37

Episode: 3405   score: 25.91   Avg score (100e): 25.56   actor gain: -0.32   critic loss: 0.40   steps: 3405


training loop:   6% |##                                | ETA:  33 days, 2:41:15

Episode: 3406   score: 25.92   Avg score (100e): 25.57   actor gain: -0.32   critic loss: 0.40   steps: 3406


training loop:   6% |##                                | ETA:  33 days, 2:34:21

Episode: 3407   score: 25.93   Avg score (100e): 25.57   actor gain: -0.32   critic loss: 0.40   steps: 3407


training loop:   6% |##                                | ETA:  33 days, 2:31:42

Episode: 3408   score: 25.94   Avg score (100e): 25.58   actor gain: -0.32   critic loss: 0.40   steps: 3408
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 2:27:07

Episode: 3409   score: 25.94   Avg score (100e): 25.59   actor gain: -0.32   critic loss: 0.40   steps: 3409


training loop:   6% |##                                | ETA:  33 days, 2:22:19

Episode: 3410   score: 25.95   Avg score (100e): 25.60   actor gain: -0.32   critic loss: 0.40   steps: 3410


training loop:   6% |##                                | ETA:  33 days, 2:16:38

Episode: 3411   score: 25.95   Avg score (100e): 25.60   actor gain: -0.32   critic loss: 0.40   steps: 3411
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 2:10:20

Episode: 3412   score: 25.96   Avg score (100e): 25.61   actor gain: -0.32   critic loss: 0.40   steps: 3412


training loop:   6% |##                                | ETA:  33 days, 2:04:07

Episode: 3413   score: 25.96   Avg score (100e): 25.62   actor gain: -0.32   critic loss: 0.40   steps: 3413


training loop:   6% |##                                | ETA:  33 days, 1:59:22

Episode: 3414   score: 25.97   Avg score (100e): 25.62   actor gain: -0.32   critic loss: 0.40   steps: 3414


training loop:   6% |##                                | ETA:  33 days, 1:53:50

Episode: 3415   score: 25.98   Avg score (100e): 25.63   actor gain: -0.33   critic loss: 0.40   steps: 3415


training loop:   6% |##                                | ETA:  33 days, 1:47:32

Episode: 3416   score: 25.98   Avg score (100e): 25.64   actor gain: -0.33   critic loss: 0.40   steps: 3416
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 1:41:02

Episode: 3417   score: 25.99   Avg score (100e): 25.65   actor gain: -0.33   critic loss: 0.40   steps: 3417
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 1:34:19

Episode: 3418   score: 26.00   Avg score (100e): 25.65   actor gain: -0.33   critic loss: 0.39   steps: 3418


training loop:   6% |##                                | ETA:  33 days, 1:28:07

Episode: 3419   score: 26.00   Avg score (100e): 25.66   actor gain: -0.32   critic loss: 0.39   steps: 3419


training loop:   6% |##                                | ETA:  33 days, 1:21:25

Episode: 3420   score: 26.01   Avg score (100e): 25.67   actor gain: -0.32   critic loss: 0.39   steps: 3420
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 1:16:21

Episode: 3421   score: 26.02   Avg score (100e): 25.67   actor gain: -0.33   critic loss: 0.39   steps: 3421
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 1:09:56

Episode: 3422   score: 26.03   Avg score (100e): 25.68   actor gain: -0.32   critic loss: 0.39   steps: 3422
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 1:03:28

Episode: 3423   score: 26.04   Avg score (100e): 25.69   actor gain: -0.32   critic loss: 0.39   steps: 3423


training loop:   6% |##                                | ETA:  33 days, 0:57:58

Episode: 3424   score: 26.04   Avg score (100e): 25.70   actor gain: -0.32   critic loss: 0.39   steps: 3424


training loop:   6% |##                                | ETA:  33 days, 0:51:33

Episode: 3425   score: 26.05   Avg score (100e): 25.70   actor gain: -0.32   critic loss: 0.39   steps: 3425


training loop:   6% |##                                | ETA:  33 days, 0:44:51

Episode: 3426   score: 26.06   Avg score (100e): 25.71   actor gain: -0.33   critic loss: 0.39   steps: 3426


training loop:   6% |##                                | ETA:  33 days, 0:38:15

Episode: 3427   score: 26.06   Avg score (100e): 25.72   actor gain: -0.33   critic loss: 0.39   steps: 3427
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 0:33:05

Episode: 3428   score: 26.07   Avg score (100e): 25.72   actor gain: -0.32   critic loss: 0.39   steps: 3428


training loop:   6% |##                                | ETA:  33 days, 0:26:46

Episode: 3429   score: 26.07   Avg score (100e): 25.73   actor gain: -0.33   critic loss: 0.39   steps: 3429
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 0:21:29

Episode: 3430   score: 26.08   Avg score (100e): 25.74   actor gain: -0.33   critic loss: 0.39   steps: 3430
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 0:15:14

Episode: 3431   score: 26.09   Avg score (100e): 25.75   actor gain: -0.33   critic loss: 0.39   steps: 3431


training loop:   6% |##                                | ETA:  33 days, 0:08:38

Episode: 3432   score: 26.09   Avg score (100e): 25.75   actor gain: -0.32   critic loss: 0.39   steps: 3432
np.all(done) is true! miracle!


training loop:   6% |##                                | ETA:  33 days, 0:02:12

Episode: 3433   score: 26.10   Avg score (100e): 25.76   actor gain: -0.32   critic loss: 0.39   steps: 3433


training loop:   6% |##                               | ETA:  32 days, 23:56:18

Episode: 3434   score: 26.11   Avg score (100e): 25.77   actor gain: -0.32   critic loss: 0.39   steps: 3434
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 23:49:56

Episode: 3435   score: 26.11   Avg score (100e): 25.77   actor gain: -0.32   critic loss: 0.39   steps: 3435
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 23:42:55

Episode: 3436   score: 26.12   Avg score (100e): 25.78   actor gain: -0.32   critic loss: 0.39   steps: 3436


training loop:   6% |##                               | ETA:  32 days, 23:36:31

Episode: 3437   score: 26.13   Avg score (100e): 25.79   actor gain: -0.32   critic loss: 0.39   steps: 3437


training loop:   6% |##                               | ETA:  32 days, 23:30:32

Episode: 3438   score: 26.14   Avg score (100e): 25.79   actor gain: -0.32   critic loss: 0.39   steps: 3438


training loop:   6% |##                               | ETA:  32 days, 23:24:11

Episode: 3439   score: 26.15   Avg score (100e): 25.80   actor gain: -0.32   critic loss: 0.39   steps: 3439


training loop:   6% |##                               | ETA:  32 days, 23:18:00

Episode: 3440   score: 26.15   Avg score (100e): 25.81   actor gain: -0.32   critic loss: 0.39   steps: 3440
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 23:14:08

Episode: 3441   score: 26.17   Avg score (100e): 25.82   actor gain: -0.32   critic loss: 0.39   steps: 3441


training loop:   6% |##                               | ETA:  32 days, 23:08:44

Episode: 3442   score: 26.18   Avg score (100e): 25.82   actor gain: -0.32   critic loss: 0.39   steps: 3442
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 23:03:03

Episode: 3443   score: 26.18   Avg score (100e): 25.83   actor gain: -0.32   critic loss: 0.39   steps: 3443


training loop:   6% |##                               | ETA:  32 days, 22:56:41

Episode: 3444   score: 26.19   Avg score (100e): 25.84   actor gain: -0.32   critic loss: 0.40   steps: 3444
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 22:51:04

Episode: 3445   score: 26.19   Avg score (100e): 25.84   actor gain: -0.32   critic loss: 0.40   steps: 3445


training loop:   6% |##                               | ETA:  32 days, 22:45:37

Episode: 3446   score: 26.21   Avg score (100e): 25.85   actor gain: -0.32   critic loss: 0.40   steps: 3446


training loop:   6% |##                               | ETA:  32 days, 22:39:17

Episode: 3447   score: 26.21   Avg score (100e): 25.86   actor gain: -0.32   critic loss: 0.40   steps: 3447


training loop:   6% |##                               | ETA:  32 days, 22:33:20

Episode: 3448   score: 26.22   Avg score (100e): 25.87   actor gain: -0.32   critic loss: 0.40   steps: 3448


training loop:   6% |##                               | ETA:  32 days, 22:27:15

Episode: 3449   score: 26.22   Avg score (100e): 25.87   actor gain: -0.32   critic loss: 0.40   steps: 3449
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 22:20:40

Episode: 3450   score: 26.23   Avg score (100e): 25.88   actor gain: -0.32   critic loss: 0.40   steps: 3450


training loop:   6% |##                               | ETA:  32 days, 22:14:33

Episode: 3451   score: 26.23   Avg score (100e): 25.89   actor gain: -0.32   critic loss: 0.40   steps: 3451
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 22:07:40

Episode: 3452   score: 26.23   Avg score (100e): 25.89   actor gain: -0.32   critic loss: 0.40   steps: 3452


training loop:   6% |##                               | ETA:  32 days, 22:01:35

Episode: 3453   score: 26.24   Avg score (100e): 25.90   actor gain: -0.32   critic loss: 0.40   steps: 3453
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 21:55:18

Episode: 3454   score: 26.25   Avg score (100e): 25.91   actor gain: -0.32   critic loss: 0.40   steps: 3454


training loop:   6% |##                               | ETA:  32 days, 21:49:13

Episode: 3455   score: 26.26   Avg score (100e): 25.91   actor gain: -0.32   critic loss: 0.40   steps: 3455


training loop:   6% |##                               | ETA:  32 days, 21:43:18

Episode: 3456   score: 26.27   Avg score (100e): 25.92   actor gain: -0.32   critic loss: 0.40   steps: 3456


training loop:   6% |##                               | ETA:  32 days, 21:36:41

Episode: 3457   score: 26.27   Avg score (100e): 25.93   actor gain: -0.32   critic loss: 0.40   steps: 3457


training loop:   6% |##                               | ETA:  32 days, 21:29:57

Episode: 3458   score: 26.28   Avg score (100e): 25.94   actor gain: -0.32   critic loss: 0.40   steps: 3458
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 21:23:23

Episode: 3459   score: 26.29   Avg score (100e): 25.94   actor gain: -0.32   critic loss: 0.40   steps: 3459


training loop:   6% |##                               | ETA:  32 days, 21:16:38

Episode: 3460   score: 26.29   Avg score (100e): 25.95   actor gain: -0.32   critic loss: 0.40   steps: 3460


training loop:   6% |##                               | ETA:  32 days, 21:10:18

Episode: 3461   score: 26.30   Avg score (100e): 25.96   actor gain: -0.32   critic loss: 0.40   steps: 3461
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 21:03:56

Episode: 3462   score: 26.31   Avg score (100e): 25.96   actor gain: -0.32   critic loss: 0.40   steps: 3462


training loop:   6% |##                               | ETA:  32 days, 20:58:55

Episode: 3463   score: 26.32   Avg score (100e): 25.97   actor gain: -0.32   critic loss: 0.40   steps: 3463


training loop:   6% |##                               | ETA:  32 days, 20:53:03

Episode: 3464   score: 26.33   Avg score (100e): 25.98   actor gain: -0.32   critic loss: 0.40   steps: 3464


training loop:   6% |##                               | ETA:  32 days, 20:46:39

Episode: 3465   score: 26.33   Avg score (100e): 25.98   actor gain: -0.33   critic loss: 0.40   steps: 3465


training loop:   6% |##                               | ETA:  32 days, 20:40:27

Episode: 3466   score: 26.34   Avg score (100e): 25.99   actor gain: -0.33   critic loss: 0.40   steps: 3466
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 20:34:13

Episode: 3467   score: 26.34   Avg score (100e): 26.00   actor gain: -0.33   critic loss: 0.40   steps: 3467


training loop:   6% |##                               | ETA:  32 days, 20:27:43

Episode: 3468   score: 26.35   Avg score (100e): 26.00   actor gain: -0.33   critic loss: 0.40   steps: 3468
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 20:21:29

Episode: 3469   score: 26.36   Avg score (100e): 26.01   actor gain: -0.33   critic loss: 0.40   steps: 3469


training loop:   6% |##                               | ETA:  32 days, 20:17:00

Episode: 3470   score: 26.37   Avg score (100e): 26.02   actor gain: -0.33   critic loss: 0.40   steps: 3470


training loop:   6% |##                               | ETA:  32 days, 20:11:45

Episode: 3471   score: 26.37   Avg score (100e): 26.03   actor gain: -0.33   critic loss: 0.40   steps: 3471
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 20:05:18

Episode: 3472   score: 26.38   Avg score (100e): 26.03   actor gain: -0.33   critic loss: 0.40   steps: 3472


training loop:   6% |##                               | ETA:  32 days, 20:00:20

Episode: 3473   score: 26.38   Avg score (100e): 26.04   actor gain: -0.33   critic loss: 0.39   steps: 3473


training loop:   6% |##                               | ETA:  32 days, 19:54:07

Episode: 3474   score: 26.39   Avg score (100e): 26.05   actor gain: -0.32   critic loss: 0.39   steps: 3474
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:48:12

Episode: 3475   score: 26.40   Avg score (100e): 26.05   actor gain: -0.32   critic loss: 0.39   steps: 3475
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:43:08

Episode: 3476   score: 26.41   Avg score (100e): 26.06   actor gain: -0.32   critic loss: 0.39   steps: 3476
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:38:40

Episode: 3477   score: 26.42   Avg score (100e): 26.07   actor gain: -0.32   critic loss: 0.39   steps: 3477
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:33:31

Episode: 3478   score: 26.43   Avg score (100e): 26.07   actor gain: -0.32   critic loss: 0.39   steps: 3478
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:27:49

Episode: 3479   score: 26.43   Avg score (100e): 26.08   actor gain: -0.32   critic loss: 0.39   steps: 3479


training loop:   6% |##                               | ETA:  32 days, 19:21:53

Episode: 3480   score: 26.44   Avg score (100e): 26.09   actor gain: -0.32   critic loss: 0.39   steps: 3480


training loop:   6% |##                               | ETA:  32 days, 19:15:36

Episode: 3481   score: 26.45   Avg score (100e): 26.10   actor gain: -0.32   critic loss: 0.39   steps: 3481


training loop:   6% |##                               | ETA:  32 days, 19:10:00

Episode: 3482   score: 26.45   Avg score (100e): 26.10   actor gain: -0.32   critic loss: 0.40   steps: 3482
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:05:24

Episode: 3483   score: 26.46   Avg score (100e): 26.11   actor gain: -0.32   critic loss: 0.40   steps: 3483
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 19:00:28

Episode: 3484   score: 26.47   Avg score (100e): 26.12   actor gain: -0.32   critic loss: 0.40   steps: 3484


training loop:   6% |##                               | ETA:  32 days, 18:54:54

Episode: 3485   score: 26.48   Avg score (100e): 26.12   actor gain: -0.32   critic loss: 0.40   steps: 3485


training loop:   6% |##                               | ETA:  32 days, 18:48:50

Episode: 3486   score: 26.49   Avg score (100e): 26.13   actor gain: -0.32   critic loss: 0.40   steps: 3486
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 18:42:57

Episode: 3487   score: 26.50   Avg score (100e): 26.14   actor gain: -0.32   critic loss: 0.40   steps: 3487
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 18:36:55

Episode: 3488   score: 26.50   Avg score (100e): 26.15   actor gain: -0.32   critic loss: 0.40   steps: 3488


training loop:   6% |##                               | ETA:  32 days, 18:30:52

Episode: 3489   score: 26.51   Avg score (100e): 26.15   actor gain: -0.32   critic loss: 0.40   steps: 3489


training loop:   6% |##                               | ETA:  32 days, 18:25:40

Episode: 3490   score: 26.52   Avg score (100e): 26.16   actor gain: -0.32   critic loss: 0.40   steps: 3490
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 18:19:44

Episode: 3491   score: 26.53   Avg score (100e): 26.17   actor gain: -0.32   critic loss: 0.40   steps: 3491
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 18:13:14

Episode: 3492   score: 26.54   Avg score (100e): 26.17   actor gain: -0.32   critic loss: 0.40   steps: 3492


training loop:   6% |##                               | ETA:  32 days, 18:06:51

Episode: 3493   score: 26.55   Avg score (100e): 26.18   actor gain: -0.32   critic loss: 0.40   steps: 3493
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 18:00:08

Episode: 3494   score: 26.56   Avg score (100e): 26.19   actor gain: -0.32   critic loss: 0.40   steps: 3494


training loop:   6% |##                               | ETA:  32 days, 17:54:05

Episode: 3495   score: 26.56   Avg score (100e): 26.20   actor gain: -0.32   critic loss: 0.40   steps: 3495


training loop:   6% |##                               | ETA:  32 days, 17:47:40

Episode: 3496   score: 26.58   Avg score (100e): 26.20   actor gain: -0.32   critic loss: 0.40   steps: 3496


training loop:   6% |##                               | ETA:  32 days, 17:42:34

Episode: 3497   score: 26.58   Avg score (100e): 26.21   actor gain: -0.32   critic loss: 0.40   steps: 3497


training loop:   6% |##                               | ETA:  32 days, 17:36:03

Episode: 3498   score: 26.58   Avg score (100e): 26.22   actor gain: -0.32   critic loss: 0.40   steps: 3498


training loop:   6% |##                               | ETA:  32 days, 17:30:14

Episode: 3499   score: 26.59   Avg score (100e): 26.23   actor gain: -0.32   critic loss: 0.40   steps: 3499
np.all(done) is true! miracle!


training loop:   6% |##                               | ETA:  32 days, 17:24:31

Episode: 3500   score: 26.60   Avg score (100e): 26.23   actor gain: -0.32   critic loss: 0.40   steps: 3500
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:19:44

Episode: 3501   score: 26.60   Avg score (100e): 26.24   actor gain: -0.32   critic loss: 0.40   steps: 3501
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:18:08

Episode: 3502   score: 26.62   Avg score (100e): 26.25   actor gain: -0.32   critic loss: 0.40   steps: 3502


training loop:   7% |##                               | ETA:  32 days, 17:18:00

Episode: 3503   score: 26.62   Avg score (100e): 26.25   actor gain: -0.32   critic loss: 0.40   steps: 3503


training loop:   7% |##                               | ETA:  32 days, 17:18:28

Episode: 3504   score: 26.63   Avg score (100e): 26.26   actor gain: -0.33   critic loss: 0.40   steps: 3504
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:19:27

Episode: 3505   score: 26.64   Avg score (100e): 26.27   actor gain: -0.32   critic loss: 0.40   steps: 3505


training loop:   7% |##                               | ETA:  32 days, 17:21:50

Episode: 3506   score: 26.64   Avg score (100e): 26.28   actor gain: -0.33   critic loss: 0.40   steps: 3506


training loop:   7% |##                               | ETA:  32 days, 17:22:46

Episode: 3507   score: 26.66   Avg score (100e): 26.28   actor gain: -0.33   critic loss: 0.41   steps: 3507


training loop:   7% |##                               | ETA:  32 days, 17:22:25

Episode: 3508   score: 26.66   Avg score (100e): 26.29   actor gain: -0.33   critic loss: 0.40   steps: 3508
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:21:58

Episode: 3509   score: 26.67   Avg score (100e): 26.30   actor gain: -0.33   critic loss: 0.40   steps: 3509


training loop:   7% |##                               | ETA:  32 days, 17:22:28

Episode: 3510   score: 26.68   Avg score (100e): 26.30   actor gain: -0.33   critic loss: 0.41   steps: 3510


training loop:   7% |##                               | ETA:  32 days, 17:22:30

Episode: 3511   score: 26.68   Avg score (100e): 26.31   actor gain: -0.33   critic loss: 0.41   steps: 3511


training loop:   7% |##                               | ETA:  32 days, 17:22:38

Episode: 3512   score: 26.69   Avg score (100e): 26.32   actor gain: -0.33   critic loss: 0.41   steps: 3512


training loop:   7% |##                               | ETA:  32 days, 17:22:09

Episode: 3513   score: 26.69   Avg score (100e): 26.33   actor gain: -0.33   critic loss: 0.41   steps: 3513
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:22:36

Episode: 3514   score: 26.70   Avg score (100e): 26.33   actor gain: -0.33   critic loss: 0.40   steps: 3514


training loop:   7% |##                               | ETA:  32 days, 17:22:56

Episode: 3515   score: 26.70   Avg score (100e): 26.34   actor gain: -0.33   critic loss: 0.40   steps: 3515


training loop:   7% |##                               | ETA:  32 days, 17:22:14

Episode: 3516   score: 26.71   Avg score (100e): 26.35   actor gain: -0.33   critic loss: 0.40   steps: 3516
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:22:50

Episode: 3517   score: 26.72   Avg score (100e): 26.36   actor gain: -0.33   critic loss: 0.40   steps: 3517


training loop:   7% |##                               | ETA:  32 days, 17:24:47

Episode: 3518   score: 26.73   Avg score (100e): 26.36   actor gain: -0.33   critic loss: 0.40   steps: 3518


training loop:   7% |##                               | ETA:  32 days, 17:26:07

Episode: 3519   score: 26.74   Avg score (100e): 26.37   actor gain: -0.33   critic loss: 0.40   steps: 3519


training loop:   7% |##                               | ETA:  32 days, 17:25:54

Episode: 3520   score: 26.74   Avg score (100e): 26.38   actor gain: -0.33   critic loss: 0.40   steps: 3520


training loop:   7% |##                               | ETA:  32 days, 17:25:36

Episode: 3521   score: 26.75   Avg score (100e): 26.38   actor gain: -0.33   critic loss: 0.40   steps: 3521


training loop:   7% |##                               | ETA:  32 days, 17:24:55

Episode: 3522   score: 26.75   Avg score (100e): 26.39   actor gain: -0.33   critic loss: 0.40   steps: 3522


training loop:   7% |##                               | ETA:  32 days, 17:26:23

Episode: 3523   score: 26.75   Avg score (100e): 26.40   actor gain: -0.33   critic loss: 0.40   steps: 3523
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:24:58

Episode: 3524   score: 26.76   Avg score (100e): 26.41   actor gain: -0.33   critic loss: 0.40   steps: 3524


training loop:   7% |##                               | ETA:  32 days, 17:23:57

Episode: 3525   score: 26.76   Avg score (100e): 26.41   actor gain: -0.33   critic loss: 0.40   steps: 3525


training loop:   7% |##                               | ETA:  32 days, 17:23:30

Episode: 3526   score: 26.77   Avg score (100e): 26.42   actor gain: -0.33   critic loss: 0.40   steps: 3526
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:23:41

Episode: 3527   score: 26.78   Avg score (100e): 26.43   actor gain: -0.33   critic loss: 0.40   steps: 3527
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:23:02

Episode: 3528   score: 26.79   Avg score (100e): 26.43   actor gain: -0.33   critic loss: 0.40   steps: 3528
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:22:07

Episode: 3529   score: 26.81   Avg score (100e): 26.44   actor gain: -0.32   critic loss: 0.40   steps: 3529
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:20:34

Episode: 3530   score: 26.81   Avg score (100e): 26.45   actor gain: -0.32   critic loss: 0.40   steps: 3530


training loop:   7% |##                               | ETA:  32 days, 17:19:10

Episode: 3531   score: 26.82   Avg score (100e): 26.46   actor gain: -0.32   critic loss: 0.40   steps: 3531


training loop:   7% |##                               | ETA:  32 days, 17:18:58

Episode: 3532   score: 26.83   Avg score (100e): 26.46   actor gain: -0.32   critic loss: 0.40   steps: 3532


training loop:   7% |##                               | ETA:  32 days, 17:18:05

Episode: 3533   score: 26.83   Avg score (100e): 26.47   actor gain: -0.32   critic loss: 0.40   steps: 3533


training loop:   7% |##                               | ETA:  32 days, 17:17:04

Episode: 3534   score: 26.83   Avg score (100e): 26.48   actor gain: -0.32   critic loss: 0.40   steps: 3534


training loop:   7% |##                               | ETA:  32 days, 17:16:41

Episode: 3535   score: 26.84   Avg score (100e): 26.49   actor gain: -0.32   critic loss: 0.40   steps: 3535


training loop:   7% |##                               | ETA:  32 days, 17:21:10

Episode: 3536   score: 26.85   Avg score (100e): 26.49   actor gain: -0.32   critic loss: 0.40   steps: 3536


training loop:   7% |##                               | ETA:  32 days, 17:22:23

Episode: 3537   score: 26.86   Avg score (100e): 26.50   actor gain: -0.32   critic loss: 0.40   steps: 3537
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:23:50

Episode: 3538   score: 26.86   Avg score (100e): 26.51   actor gain: -0.32   critic loss: 0.40   steps: 3538


training loop:   7% |##                               | ETA:  32 days, 17:23:29

Episode: 3539   score: 26.87   Avg score (100e): 26.52   actor gain: -0.32   critic loss: 0.40   steps: 3539


training loop:   7% |##                               | ETA:  32 days, 17:27:47

Episode: 3540   score: 26.88   Avg score (100e): 26.52   actor gain: -0.33   critic loss: 0.40   steps: 3540


training loop:   7% |##                               | ETA:  32 days, 17:31:50

Episode: 3541   score: 26.89   Avg score (100e): 26.53   actor gain: -0.33   critic loss: 0.40   steps: 3541


training loop:   7% |##                               | ETA:  32 days, 17:36:39

Episode: 3542   score: 26.90   Avg score (100e): 26.54   actor gain: -0.33   critic loss: 0.40   steps: 3542


training loop:   7% |##                               | ETA:  32 days, 17:40:27

Episode: 3543   score: 26.90   Avg score (100e): 26.54   actor gain: -0.33   critic loss: 0.40   steps: 3543


training loop:   7% |##                               | ETA:  32 days, 17:44:11

Episode: 3544   score: 26.91   Avg score (100e): 26.55   actor gain: -0.32   critic loss: 0.40   steps: 3544


training loop:   7% |##                               | ETA:  32 days, 17:46:04

Episode: 3545   score: 26.92   Avg score (100e): 26.56   actor gain: -0.32   critic loss: 0.40   steps: 3545
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:47:33

Episode: 3546   score: 26.93   Avg score (100e): 26.57   actor gain: -0.32   critic loss: 0.40   steps: 3546
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:47:18

Episode: 3547   score: 26.93   Avg score (100e): 26.57   actor gain: -0.32   critic loss: 0.40   steps: 3547


training loop:   7% |##                               | ETA:  32 days, 17:49:26

Episode: 3548   score: 26.94   Avg score (100e): 26.58   actor gain: -0.32   critic loss: 0.40   steps: 3548


training loop:   7% |##                               | ETA:  32 days, 17:50:27

Episode: 3549   score: 26.94   Avg score (100e): 26.59   actor gain: -0.32   critic loss: 0.40   steps: 3549


training loop:   7% |##                               | ETA:  32 days, 17:51:22

Episode: 3550   score: 26.95   Avg score (100e): 26.59   actor gain: -0.32   critic loss: 0.40   steps: 3550
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:51:47

Episode: 3551   score: 26.96   Avg score (100e): 26.60   actor gain: -0.32   critic loss: 0.40   steps: 3551


training loop:   7% |##                               | ETA:  32 days, 17:53:16

Episode: 3552   score: 26.97   Avg score (100e): 26.61   actor gain: -0.32   critic loss: 0.40   steps: 3552


training loop:   7% |##                               | ETA:  32 days, 17:53:08

Episode: 3553   score: 26.97   Avg score (100e): 26.62   actor gain: -0.32   critic loss: 0.40   steps: 3553


training loop:   7% |##                               | ETA:  32 days, 17:55:01

Episode: 3554   score: 26.98   Avg score (100e): 26.62   actor gain: -0.32   critic loss: 0.40   steps: 3554


training loop:   7% |##                               | ETA:  32 days, 17:54:41

Episode: 3555   score: 26.98   Avg score (100e): 26.63   actor gain: -0.32   critic loss: 0.40   steps: 3555


training loop:   7% |##                               | ETA:  32 days, 17:54:22

Episode: 3556   score: 26.99   Avg score (100e): 26.64   actor gain: -0.32   critic loss: 0.40   steps: 3556
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:54:07

Episode: 3557   score: 26.99   Avg score (100e): 26.65   actor gain: -0.32   critic loss: 0.40   steps: 3557


training loop:   7% |##                               | ETA:  32 days, 17:54:21

Episode: 3558   score: 27.00   Avg score (100e): 26.65   actor gain: -0.32   critic loss: 0.40   steps: 3558
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:54:27

Episode: 3559   score: 27.01   Avg score (100e): 26.66   actor gain: -0.32   critic loss: 0.40   steps: 3559


training loop:   7% |##                               | ETA:  32 days, 17:54:35

Episode: 3560   score: 27.01   Avg score (100e): 26.67   actor gain: -0.32   critic loss: 0.40   steps: 3560


training loop:   7% |##                               | ETA:  32 days, 17:55:19

Episode: 3561   score: 27.02   Avg score (100e): 26.67   actor gain: -0.32   critic loss: 0.40   steps: 3561


training loop:   7% |##                               | ETA:  32 days, 17:57:54

Episode: 3562   score: 27.03   Avg score (100e): 26.68   actor gain: -0.32   critic loss: 0.40   steps: 3562
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 17:58:20

Episode: 3563   score: 27.03   Avg score (100e): 26.69   actor gain: -0.32   critic loss: 0.40   steps: 3563


training loop:   7% |##                               | ETA:  32 days, 17:59:05

Episode: 3564   score: 27.04   Avg score (100e): 26.70   actor gain: -0.32   critic loss: 0.40   steps: 3564


training loop:   7% |##                               | ETA:  32 days, 17:59:13

Episode: 3565   score: 27.04   Avg score (100e): 26.70   actor gain: -0.32   critic loss: 0.40   steps: 3565


training loop:   7% |##                               | ETA:  32 days, 17:59:44

Episode: 3566   score: 27.05   Avg score (100e): 26.71   actor gain: -0.32   critic loss: 0.40   steps: 3566


training loop:   7% |##                               | ETA:  32 days, 18:00:35

Episode: 3567   score: 27.05   Avg score (100e): 26.72   actor gain: -0.32   critic loss: 0.40   steps: 3567


training loop:   7% |##                               | ETA:  32 days, 18:00:38

Episode: 3568   score: 27.06   Avg score (100e): 26.72   actor gain: -0.32   critic loss: 0.40   steps: 3568


training loop:   7% |##                               | ETA:  32 days, 18:00:25

Episode: 3569   score: 27.07   Avg score (100e): 26.73   actor gain: -0.32   critic loss: 0.40   steps: 3569


training loop:   7% |##                               | ETA:  32 days, 18:03:55

Episode: 3570   score: 27.07   Avg score (100e): 26.74   actor gain: -0.32   critic loss: 0.40   steps: 3570
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:05:40

Episode: 3571   score: 27.08   Avg score (100e): 26.74   actor gain: -0.32   critic loss: 0.40   steps: 3571


training loop:   7% |##                               | ETA:  32 days, 18:06:04

Episode: 3572   score: 27.08   Avg score (100e): 26.75   actor gain: -0.32   critic loss: 0.40   steps: 3572


training loop:   7% |##                               | ETA:  32 days, 18:05:41

Episode: 3573   score: 27.08   Avg score (100e): 26.76   actor gain: -0.32   critic loss: 0.40   steps: 3573


training loop:   7% |##                               | ETA:  32 days, 18:06:35

Episode: 3574   score: 27.09   Avg score (100e): 26.77   actor gain: -0.32   critic loss: 0.40   steps: 3574


training loop:   7% |##                               | ETA:  32 days, 18:09:07

Episode: 3575   score: 27.10   Avg score (100e): 26.77   actor gain: -0.32   critic loss: 0.40   steps: 3575
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:18:20

Episode: 3576   score: 27.10   Avg score (100e): 26.78   actor gain: -0.32   critic loss: 0.40   steps: 3576


training loop:   7% |##                               | ETA:  32 days, 18:21:53

Episode: 3577   score: 27.11   Avg score (100e): 26.79   actor gain: -0.32   critic loss: 0.40   steps: 3577


training loop:   7% |##                               | ETA:  32 days, 18:26:14

Episode: 3578   score: 27.11   Avg score (100e): 26.79   actor gain: -0.32   critic loss: 0.40   steps: 3578
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:29:55

Episode: 3579   score: 27.12   Avg score (100e): 26.80   actor gain: -0.32   critic loss: 0.40   steps: 3579


training loop:   7% |##                               | ETA:  32 days, 18:33:10

Episode: 3580   score: 27.12   Avg score (100e): 26.81   actor gain: -0.32   critic loss: 0.40   steps: 3580
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:33:40

Episode: 3581   score: 27.14   Avg score (100e): 26.81   actor gain: -0.32   critic loss: 0.40   steps: 3581


training loop:   7% |##                               | ETA:  32 days, 18:34:32

Episode: 3582   score: 27.14   Avg score (100e): 26.82   actor gain: -0.32   critic loss: 0.40   steps: 3582


training loop:   7% |##                               | ETA:  32 days, 18:34:31

Episode: 3583   score: 27.15   Avg score (100e): 26.83   actor gain: -0.32   critic loss: 0.40   steps: 3583


training loop:   7% |##                               | ETA:  32 days, 18:35:52

Episode: 3584   score: 27.15   Avg score (100e): 26.83   actor gain: -0.32   critic loss: 0.40   steps: 3584


training loop:   7% |##                               | ETA:  32 days, 18:36:27

Episode: 3585   score: 27.15   Avg score (100e): 26.84   actor gain: -0.32   critic loss: 0.40   steps: 3585


training loop:   7% |##                               | ETA:  32 days, 18:36:23

Episode: 3586   score: 27.16   Avg score (100e): 26.85   actor gain: -0.32   critic loss: 0.40   steps: 3586


training loop:   7% |##                               | ETA:  32 days, 18:36:23

Episode: 3587   score: 27.17   Avg score (100e): 26.85   actor gain: -0.32   critic loss: 0.40   steps: 3587
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:36:53

Episode: 3588   score: 27.16   Avg score (100e): 26.86   actor gain: -0.32   critic loss: 0.40   steps: 3588
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:36:39

Episode: 3589   score: 27.17   Avg score (100e): 26.87   actor gain: -0.32   critic loss: 0.40   steps: 3589
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:37:58

Episode: 3590   score: 27.18   Avg score (100e): 26.87   actor gain: -0.32   critic loss: 0.40   steps: 3590


training loop:   7% |##                               | ETA:  32 days, 18:39:00

Episode: 3591   score: 27.19   Avg score (100e): 26.88   actor gain: -0.32   critic loss: 0.40   steps: 3591
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:39:14

Episode: 3592   score: 27.19   Avg score (100e): 26.89   actor gain: -0.32   critic loss: 0.40   steps: 3592
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:38:54

Episode: 3593   score: 27.19   Avg score (100e): 26.89   actor gain: -0.32   critic loss: 0.40   steps: 3593


training loop:   7% |##                               | ETA:  32 days, 18:38:56

Episode: 3594   score: 27.20   Avg score (100e): 26.90   actor gain: -0.32   critic loss: 0.40   steps: 3594


training loop:   7% |##                               | ETA:  32 days, 18:38:58

Episode: 3595   score: 27.20   Avg score (100e): 26.91   actor gain: -0.32   critic loss: 0.40   steps: 3595


training loop:   7% |##                               | ETA:  32 days, 18:39:28

Episode: 3596   score: 27.21   Avg score (100e): 26.91   actor gain: -0.33   critic loss: 0.40   steps: 3596


training loop:   7% |##                               | ETA:  32 days, 18:39:05

Episode: 3597   score: 27.22   Avg score (100e): 26.92   actor gain: -0.32   critic loss: 0.40   steps: 3597


training loop:   7% |##                               | ETA:  32 days, 18:39:01

Episode: 3598   score: 27.23   Avg score (100e): 26.93   actor gain: -0.32   critic loss: 0.39   steps: 3598
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:38:29

Episode: 3599   score: 27.24   Avg score (100e): 26.93   actor gain: -0.32   critic loss: 0.40   steps: 3599
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:39:19

Episode: 3600   score: 27.24   Avg score (100e): 26.94   actor gain: -0.32   critic loss: 0.39   steps: 3600


training loop:   7% |##                               | ETA:  32 days, 18:39:15

Episode: 3601   score: 27.25   Avg score (100e): 26.95   actor gain: -0.32   critic loss: 0.40   steps: 3601
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:38:50

Episode: 3602   score: 27.26   Avg score (100e): 26.95   actor gain: -0.33   critic loss: 0.40   steps: 3602
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:41:43

Episode: 3603   score: 27.26   Avg score (100e): 26.96   actor gain: -0.32   critic loss: 0.40   steps: 3603
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:43:01

Episode: 3604   score: 27.26   Avg score (100e): 26.96   actor gain: -0.32   critic loss: 0.40   steps: 3604
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:43:46

Episode: 3605   score: 27.27   Avg score (100e): 26.97   actor gain: -0.33   critic loss: 0.40   steps: 3605


training loop:   7% |##                               | ETA:  32 days, 18:44:16

Episode: 3606   score: 27.27   Avg score (100e): 26.98   actor gain: -0.32   critic loss: 0.40   steps: 3606
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:45:02

Episode: 3607   score: 27.28   Avg score (100e): 26.98   actor gain: -0.33   critic loss: 0.40   steps: 3607


training loop:   7% |##                               | ETA:  32 days, 18:47:08

Episode: 3608   score: 27.29   Avg score (100e): 26.99   actor gain: -0.32   critic loss: 0.40   steps: 3608


training loop:   7% |##                               | ETA:  32 days, 18:47:45

Episode: 3609   score: 27.30   Avg score (100e): 27.00   actor gain: -0.32   critic loss: 0.40   steps: 3609


training loop:   7% |##                               | ETA:  32 days, 18:48:00

Episode: 3610   score: 27.30   Avg score (100e): 27.00   actor gain: -0.32   critic loss: 0.40   steps: 3610


training loop:   7% |##                               | ETA:  32 days, 18:48:09

Episode: 3611   score: 27.30   Avg score (100e): 27.01   actor gain: -0.33   critic loss: 0.40   steps: 3611


training loop:   7% |##                               | ETA:  32 days, 18:48:41

Episode: 3612   score: 27.31   Avg score (100e): 27.01   actor gain: -0.33   critic loss: 0.40   steps: 3612
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:01

Episode: 3613   score: 27.32   Avg score (100e): 27.02   actor gain: -0.32   critic loss: 0.40   steps: 3613


training loop:   7% |##                               | ETA:  32 days, 18:48:03

Episode: 3614   score: 27.32   Avg score (100e): 27.03   actor gain: -0.32   critic loss: 0.40   steps: 3614


training loop:   7% |##                               | ETA:  32 days, 18:47:27

Episode: 3615   score: 27.32   Avg score (100e): 27.03   actor gain: -0.32   critic loss: 0.40   steps: 3615


training loop:   7% |##                               | ETA:  32 days, 18:48:04

Episode: 3616   score: 27.34   Avg score (100e): 27.04   actor gain: -0.32   critic loss: 0.40   steps: 3616


training loop:   7% |##                               | ETA:  32 days, 18:47:56

Episode: 3617   score: 27.34   Avg score (100e): 27.05   actor gain: -0.32   critic loss: 0.40   steps: 3617
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:24

Episode: 3618   score: 27.35   Avg score (100e): 27.05   actor gain: -0.32   critic loss: 0.40   steps: 3618
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:15

Episode: 3619   score: 27.36   Avg score (100e): 27.06   actor gain: -0.32   critic loss: 0.40   steps: 3619


training loop:   7% |##                               | ETA:  32 days, 18:48:11

Episode: 3620   score: 27.36   Avg score (100e): 27.06   actor gain: -0.32   critic loss: 0.40   steps: 3620
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:13

Episode: 3621   score: 27.37   Avg score (100e): 27.07   actor gain: -0.32   critic loss: 0.40   steps: 3621


training loop:   7% |##                               | ETA:  32 days, 18:48:21

Episode: 3622   score: 27.37   Avg score (100e): 27.08   actor gain: -0.32   critic loss: 0.40   steps: 3622
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:17

Episode: 3623   score: 27.38   Avg score (100e): 27.08   actor gain: -0.32   critic loss: 0.40   steps: 3623
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:35

Episode: 3624   score: 27.39   Avg score (100e): 27.09   actor gain: -0.32   critic loss: 0.40   steps: 3624


training loop:   7% |##                               | ETA:  32 days, 18:48:22

Episode: 3625   score: 27.39   Avg score (100e): 27.10   actor gain: -0.32   critic loss: 0.40   steps: 3625
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:34

Episode: 3626   score: 27.40   Avg score (100e): 27.10   actor gain: -0.32   critic loss: 0.40   steps: 3626


training loop:   7% |##                               | ETA:  32 days, 18:47:28

Episode: 3627   score: 27.40   Avg score (100e): 27.11   actor gain: -0.32   critic loss: 0.40   steps: 3627


training loop:   7% |##                               | ETA:  32 days, 18:47:27

Episode: 3628   score: 27.40   Avg score (100e): 27.11   actor gain: -0.32   critic loss: 0.40   steps: 3628


training loop:   7% |##                               | ETA:  32 days, 18:47:46

Episode: 3629   score: 27.41   Avg score (100e): 27.12   actor gain: -0.32   critic loss: 0.40   steps: 3629


training loop:   7% |##                               | ETA:  32 days, 18:47:14

Episode: 3630   score: 27.42   Avg score (100e): 27.13   actor gain: -0.32   critic loss: 0.40   steps: 3630


training loop:   7% |##                               | ETA:  32 days, 18:47:04

Episode: 3631   score: 27.43   Avg score (100e): 27.13   actor gain: -0.32   critic loss: 0.40   steps: 3631


training loop:   7% |##                               | ETA:  32 days, 18:47:42

Episode: 3632   score: 27.43   Avg score (100e): 27.14   actor gain: -0.32   critic loss: 0.40   steps: 3632


training loop:   7% |##                               | ETA:  32 days, 18:47:40

Episode: 3633   score: 27.44   Avg score (100e): 27.14   actor gain: -0.32   critic loss: 0.40   steps: 3633


training loop:   7% |##                               | ETA:  32 days, 18:47:48

Episode: 3634   score: 27.44   Avg score (100e): 27.15   actor gain: -0.32   critic loss: 0.40   steps: 3634


training loop:   7% |##                               | ETA:  32 days, 18:47:17

Episode: 3635   score: 27.45   Avg score (100e): 27.16   actor gain: -0.32   critic loss: 0.40   steps: 3635


training loop:   7% |##                               | ETA:  32 days, 18:49:37

Episode: 3636   score: 27.46   Avg score (100e): 27.16   actor gain: -0.32   critic loss: 0.40   steps: 3636
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:49:20

Episode: 3637   score: 27.46   Avg score (100e): 27.17   actor gain: -0.32   critic loss: 0.40   steps: 3637
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:49:20

Episode: 3638   score: 27.47   Avg score (100e): 27.17   actor gain: -0.32   critic loss: 0.40   steps: 3638


training loop:   7% |##                               | ETA:  32 days, 18:49:29

Episode: 3639   score: 27.47   Avg score (100e): 27.18   actor gain: -0.32   critic loss: 0.40   steps: 3639
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:50:07

Episode: 3640   score: 27.48   Avg score (100e): 27.19   actor gain: -0.32   critic loss: 0.40   steps: 3640


training loop:   7% |##                               | ETA:  32 days, 18:50:07

Episode: 3641   score: 27.49   Avg score (100e): 27.19   actor gain: -0.32   critic loss: 0.40   steps: 3641
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:49:20

Episode: 3642   score: 27.49   Avg score (100e): 27.20   actor gain: -0.32   critic loss: 0.40   steps: 3642


training loop:   7% |##                               | ETA:  32 days, 18:49:49

Episode: 3643   score: 27.50   Avg score (100e): 27.20   actor gain: -0.32   critic loss: 0.40   steps: 3643


training loop:   7% |##                               | ETA:  32 days, 18:49:29

Episode: 3644   score: 27.50   Avg score (100e): 27.21   actor gain: -0.32   critic loss: 0.40   steps: 3644


training loop:   7% |##                               | ETA:  32 days, 18:49:41

Episode: 3645   score: 27.50   Avg score (100e): 27.22   actor gain: -0.32   critic loss: 0.40   steps: 3645
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:45

Episode: 3646   score: 27.51   Avg score (100e): 27.22   actor gain: -0.32   critic loss: 0.40   steps: 3646


training loop:   7% |##                               | ETA:  32 days, 18:48:37

Episode: 3647   score: 27.52   Avg score (100e): 27.23   actor gain: -0.32   critic loss: 0.40   steps: 3647
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:45

Episode: 3648   score: 27.52   Avg score (100e): 27.23   actor gain: -0.32   critic loss: 0.40   steps: 3648
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:45

Episode: 3649   score: 27.53   Avg score (100e): 27.24   actor gain: -0.32   critic loss: 0.40   steps: 3649
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:06

Episode: 3650   score: 27.53   Avg score (100e): 27.25   actor gain: -0.32   critic loss: 0.40   steps: 3650
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:10

Episode: 3651   score: 27.53   Avg score (100e): 27.25   actor gain: -0.32   critic loss: 0.40   steps: 3651


training loop:   7% |##                               | ETA:  32 days, 18:46:16

Episode: 3652   score: 27.54   Avg score (100e): 27.26   actor gain: -0.33   critic loss: 0.39   steps: 3652


training loop:   7% |##                               | ETA:  32 days, 18:46:17

Episode: 3653   score: 27.54   Avg score (100e): 27.26   actor gain: -0.33   critic loss: 0.39   steps: 3653


training loop:   7% |##                               | ETA:  32 days, 18:45:57

Episode: 3654   score: 27.55   Avg score (100e): 27.27   actor gain: -0.33   critic loss: 0.39   steps: 3654


training loop:   7% |##                               | ETA:  32 days, 18:45:52

Episode: 3655   score: 27.56   Avg score (100e): 27.27   actor gain: -0.33   critic loss: 0.39   steps: 3655


training loop:   7% |##                               | ETA:  32 days, 18:45:42

Episode: 3656   score: 27.56   Avg score (100e): 27.28   actor gain: -0.33   critic loss: 0.39   steps: 3656
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:45:45

Episode: 3657   score: 27.57   Avg score (100e): 27.29   actor gain: -0.33   critic loss: 0.39   steps: 3657
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:45:13

Episode: 3658   score: 27.57   Avg score (100e): 27.29   actor gain: -0.33   critic loss: 0.39   steps: 3658


training loop:   7% |##                               | ETA:  32 days, 18:46:26

Episode: 3659   score: 27.58   Avg score (100e): 27.30   actor gain: -0.33   critic loss: 0.39   steps: 3659


training loop:   7% |##                               | ETA:  32 days, 18:47:24

Episode: 3660   score: 27.59   Avg score (100e): 27.30   actor gain: -0.33   critic loss: 0.39   steps: 3660
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:47

Episode: 3661   score: 27.60   Avg score (100e): 27.31   actor gain: -0.33   critic loss: 0.39   steps: 3661
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:47:37

Episode: 3662   score: 27.61   Avg score (100e): 27.31   actor gain: -0.33   critic loss: 0.39   steps: 3662


training loop:   7% |##                               | ETA:  32 days, 18:47:51

Episode: 3663   score: 27.62   Avg score (100e): 27.32   actor gain: -0.33   critic loss: 0.39   steps: 3663


training loop:   7% |##                               | ETA:  32 days, 18:47:45

Episode: 3664   score: 27.62   Avg score (100e): 27.33   actor gain: -0.33   critic loss: 0.39   steps: 3664
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:11

Episode: 3665   score: 27.63   Avg score (100e): 27.33   actor gain: -0.33   critic loss: 0.39   steps: 3665
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:15

Episode: 3666   score: 27.63   Avg score (100e): 27.34   actor gain: -0.33   critic loss: 0.40   steps: 3666
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:48:43

Episode: 3667   score: 27.64   Avg score (100e): 27.34   actor gain: -0.33   critic loss: 0.40   steps: 3667


training loop:   7% |##                               | ETA:  32 days, 18:51:14

Episode: 3668   score: 27.64   Avg score (100e): 27.35   actor gain: -0.33   critic loss: 0.40   steps: 3668
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:51:56

Episode: 3669   score: 27.65   Avg score (100e): 27.36   actor gain: -0.33   critic loss: 0.40   steps: 3669


training loop:   7% |##                               | ETA:  32 days, 18:51:54

Episode: 3670   score: 27.65   Avg score (100e): 27.36   actor gain: -0.33   critic loss: 0.40   steps: 3670


training loop:   7% |##                               | ETA:  32 days, 18:52:23

Episode: 3671   score: 27.66   Avg score (100e): 27.37   actor gain: -0.33   critic loss: 0.40   steps: 3671
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:51:55

Episode: 3672   score: 27.66   Avg score (100e): 27.37   actor gain: -0.33   critic loss: 0.40   steps: 3672


training loop:   7% |##                               | ETA:  32 days, 18:52:43

Episode: 3673   score: 27.67   Avg score (100e): 27.38   actor gain: -0.33   critic loss: 0.40   steps: 3673


training loop:   7% |##                               | ETA:  32 days, 18:53:21

Episode: 3674   score: 27.68   Avg score (100e): 27.38   actor gain: -0.33   critic loss: 0.40   steps: 3674


training loop:   7% |##                               | ETA:  32 days, 18:53:44

Episode: 3675   score: 27.68   Avg score (100e): 27.39   actor gain: -0.33   critic loss: 0.40   steps: 3675


training loop:   7% |##                               | ETA:  32 days, 18:53:43

Episode: 3676   score: 27.69   Avg score (100e): 27.40   actor gain: -0.33   critic loss: 0.40   steps: 3676


training loop:   7% |##                               | ETA:  32 days, 18:54:12

Episode: 3677   score: 27.70   Avg score (100e): 27.40   actor gain: -0.33   critic loss: 0.40   steps: 3677
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:53:44

Episode: 3678   score: 27.70   Avg score (100e): 27.41   actor gain: -0.33   critic loss: 0.40   steps: 3678


training loop:   7% |##                               | ETA:  32 days, 18:53:09

Episode: 3679   score: 27.70   Avg score (100e): 27.41   actor gain: -0.33   critic loss: 0.40   steps: 3679
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:52:28

Episode: 3680   score: 27.70   Avg score (100e): 27.42   actor gain: -0.33   critic loss: 0.40   steps: 3680


training loop:   7% |##                               | ETA:  32 days, 18:52:55

Episode: 3681   score: 27.71   Avg score (100e): 27.43   actor gain: -0.33   critic loss: 0.40   steps: 3681
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:52:33

Episode: 3682   score: 27.71   Avg score (100e): 27.43   actor gain: -0.33   critic loss: 0.40   steps: 3682


training loop:   7% |##                               | ETA:  32 days, 18:52:17

Episode: 3683   score: 27.72   Avg score (100e): 27.44   actor gain: -0.33   critic loss: 0.40   steps: 3683


training loop:   7% |##                               | ETA:  32 days, 18:51:57

Episode: 3684   score: 27.72   Avg score (100e): 27.44   actor gain: -0.33   critic loss: 0.40   steps: 3684


training loop:   7% |##                               | ETA:  32 days, 18:52:07

Episode: 3685   score: 27.73   Avg score (100e): 27.45   actor gain: -0.33   critic loss: 0.40   steps: 3685
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:51:46

Episode: 3686   score: 27.73   Avg score (100e): 27.45   actor gain: -0.33   critic loss: 0.40   steps: 3686


training loop:   7% |##                               | ETA:  32 days, 18:51:04

Episode: 3687   score: 27.74   Avg score (100e): 27.46   actor gain: -0.33   critic loss: 0.40   steps: 3687
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:50:23

Episode: 3688   score: 27.74   Avg score (100e): 27.47   actor gain: -0.33   critic loss: 0.40   steps: 3688


training loop:   7% |##                               | ETA:  32 days, 18:50:19

Episode: 3689   score: 27.75   Avg score (100e): 27.47   actor gain: -0.33   critic loss: 0.40   steps: 3689
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:50:34

Episode: 3690   score: 27.76   Avg score (100e): 27.48   actor gain: -0.33   critic loss: 0.40   steps: 3690


training loop:   7% |##                               | ETA:  32 days, 18:50:38

Episode: 3691   score: 27.77   Avg score (100e): 27.48   actor gain: -0.33   critic loss: 0.40   steps: 3691


training loop:   7% |##                               | ETA:  32 days, 18:50:09

Episode: 3692   score: 27.77   Avg score (100e): 27.49   actor gain: -0.33   critic loss: 0.40   steps: 3692


training loop:   7% |##                               | ETA:  32 days, 18:49:50

Episode: 3693   score: 27.78   Avg score (100e): 27.50   actor gain: -0.33   critic loss: 0.40   steps: 3693


training loop:   7% |##                               | ETA:  32 days, 18:50:20

Episode: 3694   score: 27.79   Avg score (100e): 27.50   actor gain: -0.33   critic loss: 0.40   steps: 3694


training loop:   7% |##                               | ETA:  32 days, 18:50:33

Episode: 3695   score: 27.79   Avg score (100e): 27.51   actor gain: -0.33   critic loss: 0.40   steps: 3695


training loop:   7% |##                               | ETA:  32 days, 18:50:05

Episode: 3696   score: 27.80   Avg score (100e): 27.51   actor gain: -0.33   critic loss: 0.40   steps: 3696


training loop:   7% |##                               | ETA:  32 days, 18:49:55

Episode: 3697   score: 27.80   Avg score (100e): 27.52   actor gain: -0.33   critic loss: 0.40   steps: 3697


training loop:   7% |##                               | ETA:  32 days, 18:50:48

Episode: 3698   score: 27.81   Avg score (100e): 27.52   actor gain: -0.32   critic loss: 0.40   steps: 3698
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:50:47

Episode: 3699   score: 27.82   Avg score (100e): 27.53   actor gain: -0.32   critic loss: 0.40   steps: 3699


training loop:   7% |##                               | ETA:  32 days, 18:50:39

Episode: 3700   score: 27.83   Avg score (100e): 27.54   actor gain: -0.32   critic loss: 0.40   steps: 3700


training loop:   7% |##                               | ETA:  32 days, 18:52:56

Episode: 3701   score: 27.83   Avg score (100e): 27.54   actor gain: -0.32   critic loss: 0.40   steps: 3701
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:53:22

Episode: 3702   score: 27.83   Avg score (100e): 27.55   actor gain: -0.32   critic loss: 0.40   steps: 3702


training loop:   7% |##                               | ETA:  32 days, 18:54:15

Episode: 3703   score: 27.84   Avg score (100e): 27.55   actor gain: -0.32   critic loss: 0.40   steps: 3703
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:53:47

Episode: 3704   score: 27.84   Avg score (100e): 27.56   actor gain: -0.32   critic loss: 0.40   steps: 3704
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:54:50

Episode: 3705   score: 27.85   Avg score (100e): 27.57   actor gain: -0.32   critic loss: 0.40   steps: 3705


training loop:   7% |##                               | ETA:  32 days, 18:55:39

Episode: 3706   score: 27.86   Avg score (100e): 27.57   actor gain: -0.32   critic loss: 0.40   steps: 3706


training loop:   7% |##                               | ETA:  32 days, 18:55:54

Episode: 3707   score: 27.86   Avg score (100e): 27.58   actor gain: -0.32   critic loss: 0.40   steps: 3707


training loop:   7% |##                               | ETA:  32 days, 18:56:06

Episode: 3708   score: 27.86   Avg score (100e): 27.58   actor gain: -0.32   critic loss: 0.40   steps: 3708
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:56:17

Episode: 3709   score: 27.87   Avg score (100e): 27.59   actor gain: -0.32   critic loss: 0.40   steps: 3709


training loop:   7% |##                               | ETA:  32 days, 18:56:20

Episode: 3710   score: 27.87   Avg score (100e): 27.59   actor gain: -0.32   critic loss: 0.41   steps: 3710


training loop:   7% |##                               | ETA:  32 days, 18:56:24

Episode: 3711   score: 27.88   Avg score (100e): 27.60   actor gain: -0.32   critic loss: 0.40   steps: 3711


training loop:   7% |##                               | ETA:  32 days, 18:56:26

Episode: 3712   score: 27.88   Avg score (100e): 27.61   actor gain: -0.32   critic loss: 0.40   steps: 3712


training loop:   7% |##                               | ETA:  32 days, 18:57:50

Episode: 3713   score: 27.89   Avg score (100e): 27.61   actor gain: -0.32   critic loss: 0.40   steps: 3713


training loop:   7% |##                               | ETA:  32 days, 18:57:58

Episode: 3714   score: 27.89   Avg score (100e): 27.62   actor gain: -0.32   critic loss: 0.40   steps: 3714
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:58:03

Episode: 3715   score: 27.90   Avg score (100e): 27.62   actor gain: -0.32   critic loss: 0.40   steps: 3715


training loop:   7% |##                               | ETA:  32 days, 18:57:52

Episode: 3716   score: 27.91   Avg score (100e): 27.63   actor gain: -0.32   critic loss: 0.40   steps: 3716


training loop:   7% |##                               | ETA:  32 days, 18:58:23

Episode: 3717   score: 27.91   Avg score (100e): 27.63   actor gain: -0.32   critic loss: 0.40   steps: 3717
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:58:24

Episode: 3718   score: 27.92   Avg score (100e): 27.64   actor gain: -0.32   critic loss: 0.40   steps: 3718


training loop:   7% |##                               | ETA:  32 days, 18:58:45

Episode: 3719   score: 27.92   Avg score (100e): 27.65   actor gain: -0.32   critic loss: 0.40   steps: 3719


training loop:   7% |##                               | ETA:  32 days, 18:58:30

Episode: 3720   score: 27.93   Avg score (100e): 27.65   actor gain: -0.32   critic loss: 0.40   steps: 3720
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:59:12

Episode: 3721   score: 27.94   Avg score (100e): 27.66   actor gain: -0.32   critic loss: 0.40   steps: 3721


training loop:   7% |##                               | ETA:  32 days, 18:59:46

Episode: 3722   score: 27.94   Avg score (100e): 27.66   actor gain: -0.32   critic loss: 0.40   steps: 3722
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 18:59:48

Episode: 3723   score: 27.94   Avg score (100e): 27.67   actor gain: -0.32   critic loss: 0.40   steps: 3723


training loop:   7% |##                               | ETA:  32 days, 18:59:51

Episode: 3724   score: 27.95   Avg score (100e): 27.67   actor gain: -0.32   critic loss: 0.40   steps: 3724
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:00:26

Episode: 3725   score: 27.95   Avg score (100e): 27.68   actor gain: -0.32   critic loss: 0.40   steps: 3725
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:00:54

Episode: 3726   score: 27.96   Avg score (100e): 27.68   actor gain: -0.32   critic loss: 0.40   steps: 3726


training loop:   7% |##                               | ETA:  32 days, 19:01:18

Episode: 3727   score: 27.96   Avg score (100e): 27.69   actor gain: -0.32   critic loss: 0.40   steps: 3727
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:00:52

Episode: 3728   score: 27.97   Avg score (100e): 27.70   actor gain: -0.32   critic loss: 0.40   steps: 3728


training loop:   7% |##                               | ETA:  32 days, 19:01:51

Episode: 3729   score: 27.97   Avg score (100e): 27.70   actor gain: -0.33   critic loss: 0.40   steps: 3729


training loop:   7% |##                               | ETA:  32 days, 19:02:25

Episode: 3730   score: 27.98   Avg score (100e): 27.71   actor gain: -0.32   critic loss: 0.40   steps: 3730


training loop:   7% |##                               | ETA:  32 days, 19:03:24

Episode: 3731   score: 27.98   Avg score (100e): 27.71   actor gain: -0.33   critic loss: 0.40   steps: 3731
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:04:04

Episode: 3732   score: 27.99   Avg score (100e): 27.72   actor gain: -0.33   critic loss: 0.40   steps: 3732


training loop:   7% |##                               | ETA:  32 days, 19:03:35

Episode: 3733   score: 27.99   Avg score (100e): 27.72   actor gain: -0.33   critic loss: 0.40   steps: 3733
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:06:40

Episode: 3734   score: 28.00   Avg score (100e): 27.73   actor gain: -0.33   critic loss: 0.40   steps: 3734


training loop:   7% |##                               | ETA:  32 days, 19:06:47

Episode: 3735   score: 28.00   Avg score (100e): 27.73   actor gain: -0.33   critic loss: 0.40   steps: 3735


training loop:   7% |##                               | ETA:  32 days, 19:07:16

Episode: 3736   score: 28.01   Avg score (100e): 27.74   actor gain: -0.33   critic loss: 0.40   steps: 3736


training loop:   7% |##                               | ETA:  32 days, 19:07:58

Episode: 3737   score: 28.01   Avg score (100e): 27.75   actor gain: -0.33   critic loss: 0.40   steps: 3737


training loop:   7% |##                               | ETA:  32 days, 19:08:13

Episode: 3738   score: 28.03   Avg score (100e): 27.75   actor gain: -0.33   critic loss: 0.40   steps: 3738


training loop:   7% |##                               | ETA:  32 days, 19:08:31

Episode: 3739   score: 28.03   Avg score (100e): 27.76   actor gain: -0.33   critic loss: 0.40   steps: 3739
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:08:47

Episode: 3740   score: 28.04   Avg score (100e): 27.76   actor gain: -0.33   critic loss: 0.40   steps: 3740


training loop:   7% |##                               | ETA:  32 days, 19:09:11

Episode: 3741   score: 28.04   Avg score (100e): 27.77   actor gain: -0.33   critic loss: 0.40   steps: 3741


training loop:   7% |##                               | ETA:  32 days, 19:09:32

Episode: 3742   score: 28.05   Avg score (100e): 27.77   actor gain: -0.33   critic loss: 0.40   steps: 3742
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:09:40

Episode: 3743   score: 28.05   Avg score (100e): 27.78   actor gain: -0.33   critic loss: 0.40   steps: 3743
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:09:26

Episode: 3744   score: 28.06   Avg score (100e): 27.79   actor gain: -0.33   critic loss: 0.40   steps: 3744
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:09:22

Episode: 3745   score: 28.07   Avg score (100e): 27.79   actor gain: -0.33   critic loss: 0.40   steps: 3745
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:09:00

Episode: 3746   score: 28.09   Avg score (100e): 27.80   actor gain: -0.33   critic loss: 0.40   steps: 3746


training loop:   7% |##                               | ETA:  32 days, 19:08:45

Episode: 3747   score: 28.09   Avg score (100e): 27.80   actor gain: -0.33   critic loss: 0.40   steps: 3747
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:09:05

Episode: 3748   score: 28.10   Avg score (100e): 27.81   actor gain: -0.33   critic loss: 0.40   steps: 3748
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:12:38

Episode: 3749   score: 28.09   Avg score (100e): 27.81   actor gain: -0.33   critic loss: 0.40   steps: 3749


training loop:   7% |##                               | ETA:  32 days, 19:12:52

Episode: 3750   score: 28.10   Avg score (100e): 27.82   actor gain: -0.33   critic loss: 0.40   steps: 3750


training loop:   7% |##                               | ETA:  32 days, 19:13:10

Episode: 3751   score: 28.11   Avg score (100e): 27.83   actor gain: -0.33   critic loss: 0.40   steps: 3751


training loop:   7% |##                               | ETA:  32 days, 19:12:58

Episode: 3752   score: 28.12   Avg score (100e): 27.83   actor gain: -0.33   critic loss: 0.40   steps: 3752
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:13:38

Episode: 3753   score: 28.12   Avg score (100e): 27.84   actor gain: -0.33   critic loss: 0.40   steps: 3753


training loop:   7% |##                               | ETA:  32 days, 19:14:00

Episode: 3754   score: 28.13   Avg score (100e): 27.84   actor gain: -0.33   critic loss: 0.40   steps: 3754
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:14:44

Episode: 3755   score: 28.14   Avg score (100e): 27.85   actor gain: -0.33   critic loss: 0.40   steps: 3755


training loop:   7% |##                               | ETA:  32 days, 19:15:12

Episode: 3756   score: 28.13   Avg score (100e): 27.85   actor gain: -0.33   critic loss: 0.40   steps: 3756


training loop:   7% |##                               | ETA:  32 days, 19:17:11

Episode: 3757   score: 28.14   Avg score (100e): 27.86   actor gain: -0.33   critic loss: 0.40   steps: 3757


training loop:   7% |##                               | ETA:  32 days, 19:18:02

Episode: 3758   score: 28.14   Avg score (100e): 27.87   actor gain: -0.33   critic loss: 0.40   steps: 3758


training loop:   7% |##                               | ETA:  32 days, 19:18:29

Episode: 3759   score: 28.15   Avg score (100e): 27.87   actor gain: -0.33   critic loss: 0.40   steps: 3759


training loop:   7% |##                               | ETA:  32 days, 19:18:22

Episode: 3760   score: 28.16   Avg score (100e): 27.88   actor gain: -0.33   critic loss: 0.40   steps: 3760


training loop:   7% |##                               | ETA:  32 days, 19:19:29

Episode: 3761   score: 28.17   Avg score (100e): 27.88   actor gain: -0.33   critic loss: 0.40   steps: 3761
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:19:38

Episode: 3762   score: 28.18   Avg score (100e): 27.89   actor gain: -0.33   critic loss: 0.40   steps: 3762


training loop:   7% |##                               | ETA:  32 days, 19:19:58

Episode: 3763   score: 28.18   Avg score (100e): 27.89   actor gain: -0.33   critic loss: 0.41   steps: 3763


training loop:   7% |##                               | ETA:  32 days, 19:20:55

Episode: 3764   score: 28.18   Avg score (100e): 27.90   actor gain: -0.33   critic loss: 0.41   steps: 3764


training loop:   7% |##                               | ETA:  32 days, 19:23:40

Episode: 3765   score: 28.19   Avg score (100e): 27.90   actor gain: -0.33   critic loss: 0.41   steps: 3765


training loop:   7% |##                               | ETA:  32 days, 19:28:47

Episode: 3766   score: 28.20   Avg score (100e): 27.91   actor gain: -0.32   critic loss: 0.41   steps: 3766


training loop:   7% |##                               | ETA:  32 days, 19:29:44

Episode: 3767   score: 28.20   Avg score (100e): 27.92   actor gain: -0.32   critic loss: 0.41   steps: 3767


training loop:   7% |##                               | ETA:  32 days, 19:31:08

Episode: 3768   score: 28.20   Avg score (100e): 27.92   actor gain: -0.32   critic loss: 0.41   steps: 3768


training loop:   7% |##                               | ETA:  32 days, 19:31:30

Episode: 3769   score: 28.21   Avg score (100e): 27.93   actor gain: -0.32   critic loss: 0.41   steps: 3769
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:32:45

Episode: 3770   score: 28.22   Avg score (100e): 27.93   actor gain: -0.32   critic loss: 0.41   steps: 3770
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:33:13

Episode: 3771   score: 28.22   Avg score (100e): 27.94   actor gain: -0.32   critic loss: 0.41   steps: 3771
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:34:32

Episode: 3772   score: 28.23   Avg score (100e): 27.94   actor gain: -0.32   critic loss: 0.41   steps: 3772


training loop:   7% |##                               | ETA:  32 days, 19:36:35

Episode: 3773   score: 28.23   Avg score (100e): 27.95   actor gain: -0.32   critic loss: 0.41   steps: 3773


training loop:   7% |##                               | ETA:  32 days, 19:37:53

Episode: 3774   score: 28.23   Avg score (100e): 27.96   actor gain: -0.32   critic loss: 0.41   steps: 3774


training loop:   7% |##                               | ETA:  32 days, 19:38:25

Episode: 3775   score: 28.24   Avg score (100e): 27.96   actor gain: -0.32   critic loss: 0.41   steps: 3775
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:38:53

Episode: 3776   score: 28.24   Avg score (100e): 27.97   actor gain: -0.32   critic loss: 0.41   steps: 3776
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:39:54

Episode: 3777   score: 28.25   Avg score (100e): 27.97   actor gain: -0.32   critic loss: 0.41   steps: 3777


training loop:   7% |##                               | ETA:  32 days, 19:40:14

Episode: 3778   score: 28.25   Avg score (100e): 27.98   actor gain: -0.32   critic loss: 0.41   steps: 3778
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:42:08

Episode: 3779   score: 28.26   Avg score (100e): 27.98   actor gain: -0.32   critic loss: 0.41   steps: 3779
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:44:31

Episode: 3780   score: 28.26   Avg score (100e): 27.99   actor gain: -0.32   critic loss: 0.41   steps: 3780


training loop:   7% |##                               | ETA:  32 days, 19:46:04

Episode: 3781   score: 28.26   Avg score (100e): 27.99   actor gain: -0.32   critic loss: 0.41   steps: 3781


training loop:   7% |##                               | ETA:  32 days, 19:46:53

Episode: 3782   score: 28.27   Avg score (100e): 28.00   actor gain: -0.32   critic loss: 0.41   steps: 3782


training loop:   7% |##                               | ETA:  32 days, 19:48:01

Episode: 3783   score: 28.28   Avg score (100e): 28.01   actor gain: -0.32   critic loss: 0.41   steps: 3783


training loop:   7% |##                               | ETA:  32 days, 19:48:52

Episode: 3784   score: 28.29   Avg score (100e): 28.01   actor gain: -0.32   critic loss: 0.41   steps: 3784


training loop:   7% |##                               | ETA:  32 days, 19:49:03

Episode: 3785   score: 28.30   Avg score (100e): 28.02   actor gain: -0.32   critic loss: 0.40   steps: 3785
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:49:00

Episode: 3786   score: 28.30   Avg score (100e): 28.02   actor gain: -0.32   critic loss: 0.40   steps: 3786


training loop:   7% |##                               | ETA:  32 days, 19:49:45

Episode: 3787   score: 28.31   Avg score (100e): 28.03   actor gain: -0.32   critic loss: 0.40   steps: 3787


training loop:   7% |##                               | ETA:  32 days, 19:50:56

Episode: 3788   score: 28.32   Avg score (100e): 28.03   actor gain: -0.32   critic loss: 0.40   steps: 3788
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:50:50

Episode: 3789   score: 28.32   Avg score (100e): 28.04   actor gain: -0.32   critic loss: 0.40   steps: 3789


training loop:   7% |##                               | ETA:  32 days, 19:52:29

Episode: 3790   score: 28.33   Avg score (100e): 28.05   actor gain: -0.32   critic loss: 0.40   steps: 3790


training loop:   7% |##                               | ETA:  32 days, 19:53:42

Episode: 3791   score: 28.34   Avg score (100e): 28.05   actor gain: -0.32   critic loss: 0.40   steps: 3791


training loop:   7% |##                               | ETA:  32 days, 19:54:12

Episode: 3792   score: 28.35   Avg score (100e): 28.06   actor gain: -0.32   critic loss: 0.40   steps: 3792


training loop:   7% |##                               | ETA:  32 days, 19:54:21

Episode: 3793   score: 28.35   Avg score (100e): 28.06   actor gain: -0.32   critic loss: 0.40   steps: 3793


training loop:   7% |##                               | ETA:  32 days, 19:54:37

Episode: 3794   score: 28.35   Avg score (100e): 28.07   actor gain: -0.32   critic loss: 0.40   steps: 3794


training loop:   7% |##                               | ETA:  32 days, 19:55:51

Episode: 3795   score: 28.36   Avg score (100e): 28.07   actor gain: -0.32   critic loss: 0.40   steps: 3795
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:56:08

Episode: 3796   score: 28.37   Avg score (100e): 28.08   actor gain: -0.32   critic loss: 0.40   steps: 3796
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:55:52

Episode: 3797   score: 28.37   Avg score (100e): 28.09   actor gain: -0.32   critic loss: 0.40   steps: 3797
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 19:59:30

Episode: 3798   score: 28.38   Avg score (100e): 28.09   actor gain: -0.32   critic loss: 0.40   steps: 3798


training loop:   7% |##                               | ETA:  32 days, 20:01:04

Episode: 3799   score: 28.38   Avg score (100e): 28.10   actor gain: -0.32   critic loss: 0.40   steps: 3799


training loop:   7% |##                               | ETA:  32 days, 20:01:41

Episode: 3800   score: 28.39   Avg score (100e): 28.10   actor gain: -0.32   critic loss: 0.40   steps: 3800


training loop:   7% |##                               | ETA:  32 days, 20:02:12

Episode: 3801   score: 28.40   Avg score (100e): 28.11   actor gain: -0.32   critic loss: 0.40   steps: 3801


training loop:   7% |##                               | ETA:  32 days, 20:02:14

Episode: 3802   score: 28.40   Avg score (100e): 28.11   actor gain: -0.32   critic loss: 0.40   steps: 3802
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:03:21

Episode: 3803   score: 28.41   Avg score (100e): 28.12   actor gain: -0.32   critic loss: 0.40   steps: 3803
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:03:10

Episode: 3804   score: 28.42   Avg score (100e): 28.13   actor gain: -0.32   critic loss: 0.40   steps: 3804


training loop:   7% |##                               | ETA:  32 days, 20:04:07

Episode: 3805   score: 28.42   Avg score (100e): 28.13   actor gain: -0.32   critic loss: 0.40   steps: 3805


training loop:   7% |##                               | ETA:  32 days, 20:05:14

Episode: 3806   score: 28.42   Avg score (100e): 28.14   actor gain: -0.32   critic loss: 0.40   steps: 3806
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:05:21

Episode: 3807   score: 28.42   Avg score (100e): 28.14   actor gain: -0.32   critic loss: 0.40   steps: 3807
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:05:07

Episode: 3808   score: 28.43   Avg score (100e): 28.15   actor gain: -0.32   critic loss: 0.40   steps: 3808
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:04:33

Episode: 3809   score: 28.43   Avg score (100e): 28.15   actor gain: -0.32   critic loss: 0.40   steps: 3809
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:04:57

Episode: 3810   score: 28.44   Avg score (100e): 28.16   actor gain: -0.32   critic loss: 0.40   steps: 3810
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:05:01

Episode: 3811   score: 28.45   Avg score (100e): 28.16   actor gain: -0.32   critic loss: 0.40   steps: 3811


training loop:   7% |##                               | ETA:  32 days, 20:04:49

Episode: 3812   score: 28.46   Avg score (100e): 28.17   actor gain: -0.32   critic loss: 0.40   steps: 3812


training loop:   7% |##                               | ETA:  32 days, 20:04:38

Episode: 3813   score: 28.46   Avg score (100e): 28.18   actor gain: -0.32   critic loss: 0.40   steps: 3813


training loop:   7% |##                               | ETA:  32 days, 20:06:25

Episode: 3814   score: 28.47   Avg score (100e): 28.18   actor gain: -0.32   critic loss: 0.40   steps: 3814


training loop:   7% |##                               | ETA:  32 days, 20:07:02

Episode: 3815   score: 28.48   Avg score (100e): 28.19   actor gain: -0.32   critic loss: 0.40   steps: 3815


training loop:   7% |##                               | ETA:  32 days, 20:08:02

Episode: 3816   score: 28.48   Avg score (100e): 28.19   actor gain: -0.32   critic loss: 0.40   steps: 3816


training loop:   7% |##                               | ETA:  32 days, 20:07:54

Episode: 3817   score: 28.48   Avg score (100e): 28.20   actor gain: -0.32   critic loss: 0.40   steps: 3817
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:09

Episode: 3818   score: 28.48   Avg score (100e): 28.20   actor gain: -0.32   critic loss: 0.40   steps: 3818


training loop:   7% |##                               | ETA:  32 days, 20:11:28

Episode: 3819   score: 28.48   Avg score (100e): 28.21   actor gain: -0.32   critic loss: 0.40   steps: 3819
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:12:42

Episode: 3820   score: 28.49   Avg score (100e): 28.22   actor gain: -0.32   critic loss: 0.40   steps: 3820


training loop:   7% |##                               | ETA:  32 days, 20:13:57

Episode: 3821   score: 28.50   Avg score (100e): 28.22   actor gain: -0.32   critic loss: 0.40   steps: 3821


training loop:   7% |##                               | ETA:  32 days, 20:15:54

Episode: 3822   score: 28.50   Avg score (100e): 28.23   actor gain: -0.32   critic loss: 0.40   steps: 3822


training loop:   7% |##                               | ETA:  32 days, 20:16:41

Episode: 3823   score: 28.51   Avg score (100e): 28.23   actor gain: -0.32   critic loss: 0.40   steps: 3823


training loop:   7% |##                               | ETA:  32 days, 20:17:19

Episode: 3824   score: 28.51   Avg score (100e): 28.24   actor gain: -0.32   critic loss: 0.40   steps: 3824
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:17:21

Episode: 3825   score: 28.51   Avg score (100e): 28.24   actor gain: -0.32   critic loss: 0.40   steps: 3825
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:18:15

Episode: 3826   score: 28.52   Avg score (100e): 28.25   actor gain: -0.33   critic loss: 0.40   steps: 3826


training loop:   7% |##                               | ETA:  32 days, 20:18:52

Episode: 3827   score: 28.53   Avg score (100e): 28.26   actor gain: -0.32   critic loss: 0.40   steps: 3827
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:24:52

Episode: 3828   score: 28.54   Avg score (100e): 28.26   actor gain: -0.32   critic loss: 0.40   steps: 3828
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:27:18

Episode: 3829   score: 28.54   Avg score (100e): 28.27   actor gain: -0.32   critic loss: 0.40   steps: 3829


training loop:   7% |##                               | ETA:  32 days, 20:28:02

Episode: 3830   score: 28.55   Avg score (100e): 28.27   actor gain: -0.32   critic loss: 0.40   steps: 3830


training loop:   7% |##                               | ETA:  32 days, 20:30:59

Episode: 3831   score: 28.55   Avg score (100e): 28.28   actor gain: -0.32   critic loss: 0.40   steps: 3831


training loop:   7% |##                               | ETA:  32 days, 20:32:39

Episode: 3832   score: 28.56   Avg score (100e): 28.28   actor gain: -0.32   critic loss: 0.40   steps: 3832


training loop:   7% |##                               | ETA:  32 days, 20:33:57

Episode: 3833   score: 28.56   Avg score (100e): 28.29   actor gain: -0.32   critic loss: 0.40   steps: 3833


training loop:   7% |##                               | ETA:  32 days, 20:33:16

Episode: 3834   score: 28.56   Avg score (100e): 28.30   actor gain: -0.32   critic loss: 0.40   steps: 3834
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:30:24

Episode: 3835   score: 28.57   Avg score (100e): 28.30   actor gain: -0.32   critic loss: 0.40   steps: 3835
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:27:56

Episode: 3836   score: 28.58   Avg score (100e): 28.31   actor gain: -0.32   critic loss: 0.40   steps: 3836
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:25:24

Episode: 3837   score: 28.58   Avg score (100e): 28.31   actor gain: -0.32   critic loss: 0.40   steps: 3837
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:22:28

Episode: 3838   score: 28.59   Avg score (100e): 28.32   actor gain: -0.32   critic loss: 0.40   steps: 3838
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:22:26

Episode: 3839   score: 28.59   Avg score (100e): 28.32   actor gain: -0.32   critic loss: 0.40   steps: 3839
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:22:00

Episode: 3840   score: 28.60   Avg score (100e): 28.33   actor gain: -0.32   critic loss: 0.40   steps: 3840


training loop:   7% |##                               | ETA:  32 days, 20:21:48

Episode: 3841   score: 28.60   Avg score (100e): 28.33   actor gain: -0.32   critic loss: 0.40   steps: 3841


training loop:   7% |##                               | ETA:  32 days, 20:22:21

Episode: 3842   score: 28.60   Avg score (100e): 28.34   actor gain: -0.32   critic loss: 0.40   steps: 3842


training loop:   7% |##                               | ETA:  32 days, 20:22:30

Episode: 3843   score: 28.61   Avg score (100e): 28.35   actor gain: -0.32   critic loss: 0.40   steps: 3843


training loop:   7% |##                               | ETA:  32 days, 20:19:22

Episode: 3844   score: 28.61   Avg score (100e): 28.35   actor gain: -0.32   critic loss: 0.40   steps: 3844
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:17:15

Episode: 3845   score: 28.62   Avg score (100e): 28.36   actor gain: -0.32   critic loss: 0.40   steps: 3845


training loop:   7% |##                               | ETA:  32 days, 20:15:47

Episode: 3846   score: 28.62   Avg score (100e): 28.36   actor gain: -0.32   critic loss: 0.40   steps: 3846


training loop:   7% |##                               | ETA:  32 days, 20:17:01

Episode: 3847   score: 28.63   Avg score (100e): 28.37   actor gain: -0.32   critic loss: 0.40   steps: 3847


training loop:   7% |##                               | ETA:  32 days, 20:17:00

Episode: 3848   score: 28.63   Avg score (100e): 28.37   actor gain: -0.32   critic loss: 0.40   steps: 3848
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:16:43

Episode: 3849   score: 28.64   Avg score (100e): 28.38   actor gain: -0.32   critic loss: 0.40   steps: 3849
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:16:44

Episode: 3850   score: 28.65   Avg score (100e): 28.38   actor gain: -0.32   critic loss: 0.40   steps: 3850


training loop:   7% |##                               | ETA:  32 days, 20:15:41

Episode: 3851   score: 28.65   Avg score (100e): 28.39   actor gain: -0.32   critic loss: 0.40   steps: 3851


training loop:   7% |##                               | ETA:  32 days, 20:14:09

Episode: 3852   score: 28.67   Avg score (100e): 28.39   actor gain: -0.32   critic loss: 0.40   steps: 3852


training loop:   7% |##                               | ETA:  32 days, 20:11:40

Episode: 3853   score: 28.67   Avg score (100e): 28.40   actor gain: -0.32   critic loss: 0.40   steps: 3853


training loop:   7% |##                               | ETA:  32 days, 20:09:59

Episode: 3854   score: 28.68   Avg score (100e): 28.41   actor gain: -0.32   critic loss: 0.40   steps: 3854


training loop:   7% |##                               | ETA:  32 days, 20:08:05

Episode: 3855   score: 28.69   Avg score (100e): 28.41   actor gain: -0.32   critic loss: 0.40   steps: 3855


training loop:   7% |##                               | ETA:  32 days, 20:08:08

Episode: 3856   score: 28.70   Avg score (100e): 28.42   actor gain: -0.32   critic loss: 0.40   steps: 3856
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:07:34

Episode: 3857   score: 28.70   Avg score (100e): 28.42   actor gain: -0.32   critic loss: 0.40   steps: 3857


training loop:   7% |##                               | ETA:  32 days, 20:10:00

Episode: 3858   score: 28.70   Avg score (100e): 28.43   actor gain: -0.32   critic loss: 0.40   steps: 3858
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:44

Episode: 3859   score: 28.71   Avg score (100e): 28.43   actor gain: -0.32   critic loss: 0.40   steps: 3859


training loop:   7% |##                               | ETA:  32 days, 20:11:40

Episode: 3860   score: 28.71   Avg score (100e): 28.44   actor gain: -0.32   critic loss: 0.40   steps: 3860


training loop:   7% |##                               | ETA:  32 days, 20:11:34

Episode: 3861   score: 28.72   Avg score (100e): 28.44   actor gain: -0.32   critic loss: 0.40   steps: 3861
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:11:47

Episode: 3862   score: 28.72   Avg score (100e): 28.45   actor gain: -0.33   critic loss: 0.40   steps: 3862
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:13:34

Episode: 3863   score: 28.72   Avg score (100e): 28.46   actor gain: -0.33   critic loss: 0.40   steps: 3863
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:13:15

Episode: 3864   score: 28.73   Avg score (100e): 28.46   actor gain: -0.33   critic loss: 0.40   steps: 3864


training loop:   7% |##                               | ETA:  32 days, 20:13:07

Episode: 3865   score: 28.73   Avg score (100e): 28.47   actor gain: -0.33   critic loss: 0.40   steps: 3865
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:12:52

Episode: 3866   score: 28.74   Avg score (100e): 28.47   actor gain: -0.33   critic loss: 0.40   steps: 3866
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:13:25

Episode: 3867   score: 28.74   Avg score (100e): 28.48   actor gain: -0.33   critic loss: 0.40   steps: 3867


training loop:   7% |##                               | ETA:  32 days, 20:13:34

Episode: 3868   score: 28.74   Avg score (100e): 28.48   actor gain: -0.33   critic loss: 0.40   steps: 3868


training loop:   7% |##                               | ETA:  32 days, 20:13:25

Episode: 3869   score: 28.75   Avg score (100e): 28.49   actor gain: -0.33   critic loss: 0.40   steps: 3869


training loop:   7% |##                               | ETA:  32 days, 20:12:59

Episode: 3870   score: 28.76   Avg score (100e): 28.49   actor gain: -0.33   critic loss: 0.40   steps: 3870
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:12:35

Episode: 3871   score: 28.77   Avg score (100e): 28.50   actor gain: -0.33   critic loss: 0.40   steps: 3871


training loop:   7% |##                               | ETA:  32 days, 20:12:31

Episode: 3872   score: 28.77   Avg score (100e): 28.50   actor gain: -0.33   critic loss: 0.40   steps: 3872
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:11:40

Episode: 3873   score: 28.78   Avg score (100e): 28.51   actor gain: -0.33   critic loss: 0.40   steps: 3873
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:11:06

Episode: 3874   score: 28.78   Avg score (100e): 28.52   actor gain: -0.33   critic loss: 0.40   steps: 3874


training loop:   7% |##                               | ETA:  32 days, 20:11:35

Episode: 3875   score: 28.79   Avg score (100e): 28.52   actor gain: -0.33   critic loss: 0.40   steps: 3875


training loop:   7% |##                               | ETA:  32 days, 20:11:18

Episode: 3876   score: 28.79   Avg score (100e): 28.53   actor gain: -0.33   critic loss: 0.40   steps: 3876


training loop:   7% |##                               | ETA:  32 days, 20:10:53

Episode: 3877   score: 28.80   Avg score (100e): 28.53   actor gain: -0.33   critic loss: 0.40   steps: 3877


training loop:   7% |##                               | ETA:  32 days, 20:10:08

Episode: 3878   score: 28.80   Avg score (100e): 28.54   actor gain: -0.33   critic loss: 0.40   steps: 3878
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:42

Episode: 3879   score: 28.81   Avg score (100e): 28.54   actor gain: -0.33   critic loss: 0.40   steps: 3879
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:20

Episode: 3880   score: 28.81   Avg score (100e): 28.55   actor gain: -0.33   critic loss: 0.40   steps: 3880


training loop:   7% |##                               | ETA:  32 days, 20:09:50

Episode: 3881   score: 28.83   Avg score (100e): 28.55   actor gain: -0.33   critic loss: 0.40   steps: 3881
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:40

Episode: 3882   score: 28.83   Avg score (100e): 28.56   actor gain: -0.33   critic loss: 0.40   steps: 3882


training loop:   7% |##                               | ETA:  32 days, 20:09:15

Episode: 3883   score: 28.84   Avg score (100e): 28.56   actor gain: -0.33   critic loss: 0.40   steps: 3883
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:44

Episode: 3884   score: 28.84   Avg score (100e): 28.57   actor gain: -0.33   critic loss: 0.40   steps: 3884


training loop:   7% |##                               | ETA:  32 days, 20:08:23

Episode: 3885   score: 28.85   Avg score (100e): 28.58   actor gain: -0.33   critic loss: 0.40   steps: 3885
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:07:48

Episode: 3886   score: 28.85   Avg score (100e): 28.58   actor gain: -0.33   critic loss: 0.40   steps: 3886
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:07:31

Episode: 3887   score: 28.86   Avg score (100e): 28.59   actor gain: -0.32   critic loss: 0.40   steps: 3887


training loop:   7% |##                               | ETA:  32 days, 20:07:47

Episode: 3888   score: 28.87   Avg score (100e): 28.59   actor gain: -0.32   critic loss: 0.40   steps: 3888
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:07:48

Episode: 3889   score: 28.87   Avg score (100e): 28.60   actor gain: -0.32   critic loss: 0.40   steps: 3889


training loop:   7% |##                               | ETA:  32 days, 20:07:21

Episode: 3890   score: 28.88   Avg score (100e): 28.60   actor gain: -0.32   critic loss: 0.40   steps: 3890
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:05

Episode: 3891   score: 28.89   Avg score (100e): 28.61   actor gain: -0.32   critic loss: 0.40   steps: 3891
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:10

Episode: 3892   score: 28.89   Avg score (100e): 28.61   actor gain: -0.32   critic loss: 0.40   steps: 3892
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:08:16

Episode: 3893   score: 28.89   Avg score (100e): 28.62   actor gain: -0.32   critic loss: 0.40   steps: 3893


training loop:   7% |##                               | ETA:  32 days, 20:08:19

Episode: 3894   score: 28.90   Avg score (100e): 28.63   actor gain: -0.32   critic loss: 0.40   steps: 3894
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:07:58

Episode: 3895   score: 28.91   Avg score (100e): 28.63   actor gain: -0.32   critic loss: 0.40   steps: 3895


training loop:   7% |##                               | ETA:  32 days, 20:09:55

Episode: 3896   score: 28.91   Avg score (100e): 28.64   actor gain: -0.32   critic loss: 0.40   steps: 3896
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:04

Episode: 3897   score: 28.91   Avg score (100e): 28.64   actor gain: -0.32   critic loss: 0.40   steps: 3897
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:09:56

Episode: 3898   score: 28.91   Avg score (100e): 28.65   actor gain: -0.32   critic loss: 0.40   steps: 3898
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:01

Episode: 3899   score: 28.92   Avg score (100e): 28.65   actor gain: -0.32   critic loss: 0.40   steps: 3899


training loop:   7% |##                               | ETA:  32 days, 20:10:20

Episode: 3900   score: 28.93   Avg score (100e): 28.66   actor gain: -0.32   critic loss: 0.40   steps: 3900
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:02

Episode: 3901   score: 28.93   Avg score (100e): 28.66   actor gain: -0.32   critic loss: 0.40   steps: 3901


training loop:   7% |##                               | ETA:  32 days, 20:09:58

Episode: 3902   score: 28.93   Avg score (100e): 28.67   actor gain: -0.32   critic loss: 0.40   steps: 3902


training loop:   7% |##                               | ETA:  32 days, 20:10:21

Episode: 3903   score: 28.94   Avg score (100e): 28.67   actor gain: -0.32   critic loss: 0.40   steps: 3903


training loop:   7% |##                               | ETA:  32 days, 20:10:56

Episode: 3904   score: 28.95   Avg score (100e): 28.68   actor gain: -0.32   critic loss: 0.40   steps: 3904


training loop:   7% |##                               | ETA:  32 days, 20:12:35

Episode: 3905   score: 28.95   Avg score (100e): 28.68   actor gain: -0.32   critic loss: 0.40   steps: 3905
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:11:37

Episode: 3906   score: 28.95   Avg score (100e): 28.69   actor gain: -0.32   critic loss: 0.40   steps: 3906


training loop:   7% |##                               | ETA:  32 days, 20:11:40

Episode: 3907   score: 28.95   Avg score (100e): 28.69   actor gain: -0.32   critic loss: 0.40   steps: 3907


training loop:   7% |##                               | ETA:  32 days, 20:11:34

Episode: 3908   score: 28.96   Avg score (100e): 28.70   actor gain: -0.32   critic loss: 0.40   steps: 3908


training loop:   7% |##                               | ETA:  32 days, 20:11:02

Episode: 3909   score: 28.97   Avg score (100e): 28.71   actor gain: -0.32   critic loss: 0.40   steps: 3909


training loop:   7% |##                               | ETA:  32 days, 20:10:45

Episode: 3910   score: 28.97   Avg score (100e): 28.71   actor gain: -0.32   critic loss: 0.40   steps: 3910


training loop:   7% |##                               | ETA:  32 days, 20:10:44

Episode: 3911   score: 28.97   Avg score (100e): 28.72   actor gain: -0.33   critic loss: 0.40   steps: 3911


training loop:   7% |##                               | ETA:  32 days, 20:11:00

Episode: 3912   score: 28.98   Avg score (100e): 28.72   actor gain: -0.33   critic loss: 0.40   steps: 3912


training loop:   7% |##                               | ETA:  32 days, 20:10:44

Episode: 3913   score: 28.98   Avg score (100e): 28.73   actor gain: -0.33   critic loss: 0.40   steps: 3913


training loop:   7% |##                               | ETA:  32 days, 20:11:25

Episode: 3914   score: 28.99   Avg score (100e): 28.73   actor gain: -0.33   critic loss: 0.40   steps: 3914


training loop:   7% |##                               | ETA:  32 days, 20:11:53

Episode: 3915   score: 28.98   Avg score (100e): 28.74   actor gain: -0.33   critic loss: 0.40   steps: 3915
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:11:38

Episode: 3916   score: 28.99   Avg score (100e): 28.74   actor gain: -0.33   critic loss: 0.40   steps: 3916


training loop:   7% |##                               | ETA:  32 days, 20:11:47

Episode: 3917   score: 28.99   Avg score (100e): 28.75   actor gain: -0.33   critic loss: 0.40   steps: 3917
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:10:52

Episode: 3918   score: 29.00   Avg score (100e): 28.75   actor gain: -0.33   critic loss: 0.40   steps: 3918
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:12:00

Episode: 3919   score: 29.00   Avg score (100e): 28.76   actor gain: -0.33   critic loss: 0.40   steps: 3919


training loop:   7% |##                               | ETA:  32 days, 20:12:45

Episode: 3920   score: 29.00   Avg score (100e): 28.76   actor gain: -0.33   critic loss: 0.40   steps: 3920


training loop:   7% |##                               | ETA:  32 days, 20:14:17

Episode: 3921   score: 29.00   Avg score (100e): 28.77   actor gain: -0.33   critic loss: 0.40   steps: 3921


training loop:   7% |##                               | ETA:  32 days, 20:16:05

Episode: 3922   score: 29.00   Avg score (100e): 28.77   actor gain: -0.33   critic loss: 0.40   steps: 3922


training loop:   7% |##                               | ETA:  32 days, 20:17:04

Episode: 3923   score: 29.00   Avg score (100e): 28.78   actor gain: -0.33   critic loss: 0.40   steps: 3923
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:16:59

Episode: 3924   score: 29.00   Avg score (100e): 28.78   actor gain: -0.33   critic loss: 0.40   steps: 3924


training loop:   7% |##                               | ETA:  32 days, 20:17:06

Episode: 3925   score: 29.01   Avg score (100e): 28.79   actor gain: -0.33   critic loss: 0.40   steps: 3925


training loop:   7% |##                               | ETA:  32 days, 20:17:08

Episode: 3926   score: 29.01   Avg score (100e): 28.79   actor gain: -0.33   critic loss: 0.40   steps: 3926


training loop:   7% |##                               | ETA:  32 days, 20:18:01

Episode: 3927   score: 29.02   Avg score (100e): 28.80   actor gain: -0.33   critic loss: 0.40   steps: 3927


training loop:   7% |##                               | ETA:  32 days, 20:20:37

Episode: 3928   score: 29.02   Avg score (100e): 28.80   actor gain: -0.33   critic loss: 0.40   steps: 3928
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:20:27

Episode: 3929   score: 29.03   Avg score (100e): 28.81   actor gain: -0.33   critic loss: 0.40   steps: 3929


training loop:   7% |##                               | ETA:  32 days, 20:21:00

Episode: 3930   score: 29.03   Avg score (100e): 28.81   actor gain: -0.33   critic loss: 0.40   steps: 3930


training loop:   7% |##                               | ETA:  32 days, 20:21:36

Episode: 3931   score: 29.03   Avg score (100e): 28.82   actor gain: -0.33   critic loss: 0.40   steps: 3931


training loop:   7% |##                               | ETA:  32 days, 20:21:42

Episode: 3932   score: 29.04   Avg score (100e): 28.82   actor gain: -0.33   critic loss: 0.40   steps: 3932


training loop:   7% |##                               | ETA:  32 days, 20:21:55

Episode: 3933   score: 29.04   Avg score (100e): 28.83   actor gain: -0.33   critic loss: 0.40   steps: 3933


training loop:   7% |##                               | ETA:  32 days, 20:22:12

Episode: 3934   score: 29.04   Avg score (100e): 28.83   actor gain: -0.33   critic loss: 0.40   steps: 3934


training loop:   7% |##                               | ETA:  32 days, 20:22:22

Episode: 3935   score: 29.04   Avg score (100e): 28.84   actor gain: -0.33   critic loss: 0.40   steps: 3935
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:21:55

Episode: 3936   score: 29.04   Avg score (100e): 28.84   actor gain: -0.33   critic loss: 0.40   steps: 3936
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:21:56

Episode: 3937   score: 29.04   Avg score (100e): 28.84   actor gain: -0.33   critic loss: 0.40   steps: 3937


training loop:   7% |##                               | ETA:  32 days, 20:22:40

Episode: 3938   score: 29.05   Avg score (100e): 28.85   actor gain: -0.33   critic loss: 0.40   steps: 3938


training loop:   7% |##                               | ETA:  32 days, 20:22:19

Episode: 3939   score: 29.05   Avg score (100e): 28.85   actor gain: -0.33   critic loss: 0.40   steps: 3939
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:21:43

Episode: 3940   score: 29.05   Avg score (100e): 28.86   actor gain: -0.33   critic loss: 0.40   steps: 3940


training loop:   7% |##                               | ETA:  32 days, 20:22:49

Episode: 3941   score: 29.05   Avg score (100e): 28.86   actor gain: -0.33   critic loss: 0.40   steps: 3941


training loop:   7% |##                               | ETA:  32 days, 20:23:10

Episode: 3942   score: 29.05   Avg score (100e): 28.87   actor gain: -0.33   critic loss: 0.40   steps: 3942
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:23:08

Episode: 3943   score: 29.05   Avg score (100e): 28.87   actor gain: -0.33   critic loss: 0.40   steps: 3943
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:22:57

Episode: 3944   score: 29.05   Avg score (100e): 28.88   actor gain: -0.33   critic loss: 0.40   steps: 3944
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:22:45

Episode: 3945   score: 29.05   Avg score (100e): 28.88   actor gain: -0.33   critic loss: 0.40   steps: 3945


training loop:   7% |##                               | ETA:  32 days, 20:22:50

Episode: 3946   score: 29.05   Avg score (100e): 28.88   actor gain: -0.33   critic loss: 0.40   steps: 3946


training loop:   7% |##                               | ETA:  32 days, 20:23:21

Episode: 3947   score: 29.06   Avg score (100e): 28.89   actor gain: -0.33   critic loss: 0.40   steps: 3947
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:23:13

Episode: 3948   score: 29.07   Avg score (100e): 28.89   actor gain: -0.33   critic loss: 0.40   steps: 3948


training loop:   7% |##                               | ETA:  32 days, 20:23:47

Episode: 3949   score: 29.07   Avg score (100e): 28.90   actor gain: -0.33   critic loss: 0.39   steps: 3949


training loop:   7% |##                               | ETA:  32 days, 20:24:18

Episode: 3950   score: 29.07   Avg score (100e): 28.90   actor gain: -0.32   critic loss: 0.39   steps: 3950


training loop:   7% |##                               | ETA:  32 days, 20:25:42

Episode: 3951   score: 29.06   Avg score (100e): 28.91   actor gain: -0.32   critic loss: 0.39   steps: 3951
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:30:55

Episode: 3952   score: 29.06   Avg score (100e): 28.91   actor gain: -0.32   critic loss: 0.39   steps: 3952


training loop:   7% |##                               | ETA:  32 days, 20:32:51

Episode: 3953   score: 29.07   Avg score (100e): 28.91   actor gain: -0.32   critic loss: 0.39   steps: 3953
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:40:41

Episode: 3954   score: 29.08   Avg score (100e): 28.92   actor gain: -0.32   critic loss: 0.39   steps: 3954


training loop:   7% |##                               | ETA:  32 days, 20:41:52

Episode: 3955   score: 29.08   Avg score (100e): 28.92   actor gain: -0.32   critic loss: 0.39   steps: 3955
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:45:26

Episode: 3956   score: 29.08   Avg score (100e): 28.93   actor gain: -0.32   critic loss: 0.39   steps: 3956


training loop:   7% |##                               | ETA:  32 days, 20:46:34

Episode: 3957   score: 29.09   Avg score (100e): 28.93   actor gain: -0.32   critic loss: 0.39   steps: 3957


training loop:   7% |##                               | ETA:  32 days, 20:47:48

Episode: 3958   score: 29.09   Avg score (100e): 28.93   actor gain: -0.32   critic loss: 0.39   steps: 3958


training loop:   7% |##                               | ETA:  32 days, 20:48:37

Episode: 3959   score: 29.09   Avg score (100e): 28.94   actor gain: -0.32   critic loss: 0.39   steps: 3959
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:48:21

Episode: 3960   score: 29.09   Avg score (100e): 28.94   actor gain: -0.32   critic loss: 0.39   steps: 3960


training loop:   7% |##                               | ETA:  32 days, 20:50:06

Episode: 3961   score: 29.10   Avg score (100e): 28.95   actor gain: -0.32   critic loss: 0.39   steps: 3961
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:49:56

Episode: 3962   score: 29.10   Avg score (100e): 28.95   actor gain: -0.32   critic loss: 0.39   steps: 3962
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:50:52

Episode: 3963   score: 29.11   Avg score (100e): 28.95   actor gain: -0.32   critic loss: 0.39   steps: 3963


training loop:   7% |##                               | ETA:  32 days, 20:51:07

Episode: 3964   score: 29.11   Avg score (100e): 28.96   actor gain: -0.32   critic loss: 0.40   steps: 3964


training loop:   7% |##                               | ETA:  32 days, 20:51:40

Episode: 3965   score: 29.11   Avg score (100e): 28.96   actor gain: -0.32   critic loss: 0.40   steps: 3965


training loop:   7% |##                               | ETA:  32 days, 20:54:01

Episode: 3966   score: 29.11   Avg score (100e): 28.96   actor gain: -0.32   critic loss: 0.39   steps: 3966


training loop:   7% |##                               | ETA:  32 days, 20:54:29

Episode: 3967   score: 29.11   Avg score (100e): 28.97   actor gain: -0.32   critic loss: 0.40   steps: 3967
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:54:29

Episode: 3968   score: 29.11   Avg score (100e): 28.97   actor gain: -0.32   critic loss: 0.40   steps: 3968
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:53:49

Episode: 3969   score: 29.12   Avg score (100e): 28.98   actor gain: -0.32   critic loss: 0.40   steps: 3969
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:53:53

Episode: 3970   score: 29.12   Avg score (100e): 28.98   actor gain: -0.32   critic loss: 0.40   steps: 3970


training loop:   7% |##                               | ETA:  32 days, 20:53:16

Episode: 3971   score: 29.12   Avg score (100e): 28.98   actor gain: -0.32   critic loss: 0.40   steps: 3971
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:52:40

Episode: 3972   score: 29.13   Avg score (100e): 28.99   actor gain: -0.32   critic loss: 0.40   steps: 3972
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:52:25

Episode: 3973   score: 29.12   Avg score (100e): 28.99   actor gain: -0.32   critic loss: 0.40   steps: 3973


training loop:   7% |##                               | ETA:  32 days, 20:52:10

Episode: 3974   score: 29.13   Avg score (100e): 28.99   actor gain: -0.32   critic loss: 0.40   steps: 3974


training loop:   7% |##                               | ETA:  32 days, 20:51:17

Episode: 3975   score: 29.13   Avg score (100e): 29.00   actor gain: -0.32   critic loss: 0.40   steps: 3975


training loop:   7% |##                               | ETA:  32 days, 20:50:25

Episode: 3976   score: 29.13   Avg score (100e): 29.00   actor gain: -0.32   critic loss: 0.40   steps: 3976
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:49:09

Episode: 3977   score: 29.13   Avg score (100e): 29.00   actor gain: -0.32   critic loss: 0.40   steps: 3977
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:49:11

Episode: 3978   score: 29.13   Avg score (100e): 29.01   actor gain: -0.32   critic loss: 0.40   steps: 3978


training loop:   7% |##                               | ETA:  32 days, 20:48:57

Episode: 3979   score: 29.13   Avg score (100e): 29.01   actor gain: -0.32   critic loss: 0.40   steps: 3979


training loop:   7% |##                               | ETA:  32 days, 20:48:07

Episode: 3980   score: 29.13   Avg score (100e): 29.01   actor gain: -0.32   critic loss: 0.40   steps: 3980
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:47:05

Episode: 3981   score: 29.14   Avg score (100e): 29.02   actor gain: -0.32   critic loss: 0.40   steps: 3981


training loop:   7% |##                               | ETA:  32 days, 20:46:48

Episode: 3982   score: 29.14   Avg score (100e): 29.02   actor gain: -0.33   critic loss: 0.40   steps: 3982
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:46:26

Episode: 3983   score: 29.14   Avg score (100e): 29.02   actor gain: -0.33   critic loss: 0.40   steps: 3983


training loop:   7% |##                               | ETA:  32 days, 20:45:37

Episode: 3984   score: 29.15   Avg score (100e): 29.03   actor gain: -0.33   critic loss: 0.40   steps: 3984


training loop:   7% |##                               | ETA:  32 days, 20:44:45

Episode: 3985   score: 29.15   Avg score (100e): 29.03   actor gain: -0.32   critic loss: 0.40   steps: 3985


training loop:   7% |##                               | ETA:  32 days, 20:44:32

Episode: 3986   score: 29.15   Avg score (100e): 29.03   actor gain: -0.32   critic loss: 0.40   steps: 3986
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:43:46

Episode: 3987   score: 29.16   Avg score (100e): 29.03   actor gain: -0.32   critic loss: 0.40   steps: 3987
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:44:43

Episode: 3988   score: 29.16   Avg score (100e): 29.04   actor gain: -0.32   critic loss: 0.40   steps: 3988
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:46:10

Episode: 3989   score: 29.17   Avg score (100e): 29.04   actor gain: -0.32   critic loss: 0.40   steps: 3989
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:45:50

Episode: 3990   score: 29.17   Avg score (100e): 29.04   actor gain: -0.32   critic loss: 0.40   steps: 3990


training loop:   7% |##                               | ETA:  32 days, 20:46:35

Episode: 3991   score: 29.18   Avg score (100e): 29.05   actor gain: -0.32   critic loss: 0.40   steps: 3991


training loop:   7% |##                               | ETA:  32 days, 20:46:49

Episode: 3992   score: 29.17   Avg score (100e): 29.05   actor gain: -0.32   critic loss: 0.40   steps: 3992


training loop:   7% |##                               | ETA:  32 days, 20:46:58

Episode: 3993   score: 29.17   Avg score (100e): 29.05   actor gain: -0.32   critic loss: 0.40   steps: 3993
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:48:15

Episode: 3994   score: 29.17   Avg score (100e): 29.05   actor gain: -0.32   critic loss: 0.40   steps: 3994
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:47:55

Episode: 3995   score: 29.18   Avg score (100e): 29.06   actor gain: -0.32   critic loss: 0.40   steps: 3995


training loop:   7% |##                               | ETA:  32 days, 20:47:48

Episode: 3996   score: 29.18   Avg score (100e): 29.06   actor gain: -0.32   critic loss: 0.40   steps: 3996


training loop:   7% |##                               | ETA:  32 days, 20:47:39

Episode: 3997   score: 29.19   Avg score (100e): 29.06   actor gain: -0.32   critic loss: 0.40   steps: 3997


training loop:   7% |##                               | ETA:  32 days, 20:48:50

Episode: 3998   score: 29.19   Avg score (100e): 29.07   actor gain: -0.32   critic loss: 0.40   steps: 3998


training loop:   7% |##                               | ETA:  32 days, 20:48:23

Episode: 3999   score: 29.20   Avg score (100e): 29.07   actor gain: -0.32   critic loss: 0.40   steps: 3999
np.all(done) is true! miracle!


training loop:   7% |##                               | ETA:  32 days, 20:48:51

Episode: 4000   score: 29.20   Avg score (100e): 29.07   actor gain: -0.32   critic loss: 0.40   steps: 4000


training loop:   8% |##                               | ETA:  32 days, 20:48:46

Episode: 4001   score: 29.20   Avg score (100e): 29.07   actor gain: -0.32   critic loss: 0.40   steps: 4001
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:47:50

Episode: 4002   score: 29.21   Avg score (100e): 29.08   actor gain: -0.32   critic loss: 0.40   steps: 4002


training loop:   8% |##                               | ETA:  32 days, 20:48:13

Episode: 4003   score: 29.21   Avg score (100e): 29.08   actor gain: -0.32   critic loss: 0.40   steps: 4003


training loop:   8% |##                               | ETA:  32 days, 20:47:43

Episode: 4004   score: 29.21   Avg score (100e): 29.08   actor gain: -0.32   critic loss: 0.40   steps: 4004
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:47:09

Episode: 4005   score: 29.21   Avg score (100e): 29.08   actor gain: -0.32   critic loss: 0.40   steps: 4005


training loop:   8% |##                               | ETA:  32 days, 20:46:25

Episode: 4006   score: 29.22   Avg score (100e): 29.09   actor gain: -0.32   critic loss: 0.40   steps: 4006
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:45:24

Episode: 4007   score: 29.22   Avg score (100e): 29.09   actor gain: -0.32   critic loss: 0.40   steps: 4007


training loop:   8% |##                               | ETA:  32 days, 20:45:01

Episode: 4008   score: 29.22   Avg score (100e): 29.09   actor gain: -0.32   critic loss: 0.40   steps: 4008
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:44:11

Episode: 4009   score: 29.23   Avg score (100e): 29.09   actor gain: -0.32   critic loss: 0.40   steps: 4009


training loop:   8% |##                               | ETA:  32 days, 20:44:05

Episode: 4010   score: 29.24   Avg score (100e): 29.10   actor gain: -0.32   critic loss: 0.40   steps: 4010


training loop:   8% |##                               | ETA:  32 days, 20:45:34

Episode: 4011   score: 29.24   Avg score (100e): 29.10   actor gain: -0.32   critic loss: 0.40   steps: 4011
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:46:11

Episode: 4012   score: 29.25   Avg score (100e): 29.10   actor gain: -0.32   critic loss: 0.40   steps: 4012


training loop:   8% |##                               | ETA:  32 days, 20:45:53

Episode: 4013   score: 29.25   Avg score (100e): 29.11   actor gain: -0.33   critic loss: 0.40   steps: 4013
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:44:24

Episode: 4014   score: 29.25   Avg score (100e): 29.11   actor gain: -0.32   critic loss: 0.40   steps: 4014


training loop:   8% |##                               | ETA:  32 days, 20:44:04

Episode: 4015   score: 29.26   Avg score (100e): 29.11   actor gain: -0.32   critic loss: 0.40   steps: 4015
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:42:48

Episode: 4016   score: 29.26   Avg score (100e): 29.11   actor gain: -0.33   critic loss: 0.40   steps: 4016
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:41:57

Episode: 4017   score: 29.26   Avg score (100e): 29.12   actor gain: -0.32   critic loss: 0.40   steps: 4017
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:41:22

Episode: 4018   score: 29.26   Avg score (100e): 29.12   actor gain: -0.33   critic loss: 0.40   steps: 4018


training loop:   8% |##                               | ETA:  32 days, 20:40:42

Episode: 4019   score: 29.27   Avg score (100e): 29.12   actor gain: -0.32   critic loss: 0.40   steps: 4019
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:39:41

Episode: 4020   score: 29.27   Avg score (100e): 29.12   actor gain: -0.32   critic loss: 0.40   steps: 4020
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:38:17

Episode: 4021   score: 29.27   Avg score (100e): 29.13   actor gain: -0.33   critic loss: 0.40   steps: 4021
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:37:13

Episode: 4022   score: 29.28   Avg score (100e): 29.13   actor gain: -0.32   critic loss: 0.40   steps: 4022


training loop:   8% |##                               | ETA:  32 days, 20:36:53

Episode: 4023   score: 29.29   Avg score (100e): 29.13   actor gain: -0.32   critic loss: 0.40   steps: 4023
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:35:30

Episode: 4024   score: 29.28   Avg score (100e): 29.14   actor gain: -0.32   critic loss: 0.40   steps: 4024
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:34:29

Episode: 4025   score: 29.28   Avg score (100e): 29.14   actor gain: -0.33   critic loss: 0.40   steps: 4025
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:35:00

Episode: 4026   score: 29.29   Avg score (100e): 29.14   actor gain: -0.32   critic loss: 0.40   steps: 4026


training loop:   8% |##                               | ETA:  32 days, 20:34:17

Episode: 4027   score: 29.29   Avg score (100e): 29.14   actor gain: -0.32   critic loss: 0.40   steps: 4027


training loop:   8% |##                               | ETA:  32 days, 20:34:27

Episode: 4028   score: 29.29   Avg score (100e): 29.15   actor gain: -0.32   critic loss: 0.40   steps: 4028
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:33:39

Episode: 4029   score: 29.30   Avg score (100e): 29.15   actor gain: -0.32   critic loss: 0.40   steps: 4029


training loop:   8% |##                               | ETA:  32 days, 20:33:08

Episode: 4030   score: 29.30   Avg score (100e): 29.15   actor gain: -0.32   critic loss: 0.40   steps: 4030


training loop:   8% |##                               | ETA:  32 days, 20:32:12

Episode: 4031   score: 29.31   Avg score (100e): 29.15   actor gain: -0.32   critic loss: 0.40   steps: 4031


training loop:   8% |##                               | ETA:  32 days, 20:31:18

Episode: 4032   score: 29.32   Avg score (100e): 29.16   actor gain: -0.32   critic loss: 0.40   steps: 4032


training loop:   8% |##                               | ETA:  32 days, 20:31:32

Episode: 4033   score: 29.33   Avg score (100e): 29.16   actor gain: -0.32   critic loss: 0.40   steps: 4033


training loop:   8% |##                               | ETA:  32 days, 20:34:21

Episode: 4034   score: 29.33   Avg score (100e): 29.16   actor gain: -0.32   critic loss: 0.40   steps: 4034


training loop:   8% |##                               | ETA:  32 days, 20:34:15

Episode: 4035   score: 29.34   Avg score (100e): 29.17   actor gain: -0.32   critic loss: 0.40   steps: 4035
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:33:44

Episode: 4036   score: 29.34   Avg score (100e): 29.17   actor gain: -0.32   critic loss: 0.40   steps: 4036


training loop:   8% |##                               | ETA:  32 days, 20:33:21

Episode: 4037   score: 29.34   Avg score (100e): 29.17   actor gain: -0.32   critic loss: 0.40   steps: 4037


training loop:   8% |##                               | ETA:  32 days, 20:33:17

Episode: 4038   score: 29.34   Avg score (100e): 29.18   actor gain: -0.32   critic loss: 0.40   steps: 4038


training loop:   8% |##                               | ETA:  32 days, 20:32:35

Episode: 4039   score: 29.34   Avg score (100e): 29.18   actor gain: -0.32   critic loss: 0.40   steps: 4039
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:31:38

Episode: 4040   score: 29.35   Avg score (100e): 29.18   actor gain: -0.32   critic loss: 0.40   steps: 4040


training loop:   8% |##                               | ETA:  32 days, 20:31:18

Episode: 4041   score: 29.36   Avg score (100e): 29.18   actor gain: -0.32   critic loss: 0.40   steps: 4041
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:30:24

Episode: 4042   score: 29.36   Avg score (100e): 29.19   actor gain: -0.32   critic loss: 0.40   steps: 4042
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:30:05

Episode: 4043   score: 29.36   Avg score (100e): 29.19   actor gain: -0.32   critic loss: 0.40   steps: 4043
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:29:29

Episode: 4044   score: 29.37   Avg score (100e): 29.19   actor gain: -0.32   critic loss: 0.40   steps: 4044


training loop:   8% |##                               | ETA:  32 days, 20:28:44

Episode: 4045   score: 29.37   Avg score (100e): 29.20   actor gain: -0.33   critic loss: 0.40   steps: 4045


training loop:   8% |##                               | ETA:  32 days, 20:28:10

Episode: 4046   score: 29.37   Avg score (100e): 29.20   actor gain: -0.32   critic loss: 0.40   steps: 4046
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:27:15

Episode: 4047   score: 29.37   Avg score (100e): 29.20   actor gain: -0.32   critic loss: 0.40   steps: 4047


training loop:   8% |##                               | ETA:  32 days, 20:28:30

Episode: 4048   score: 29.37   Avg score (100e): 29.21   actor gain: -0.32   critic loss: 0.40   steps: 4048
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:29:27

Episode: 4049   score: 29.38   Avg score (100e): 29.21   actor gain: -0.32   critic loss: 0.40   steps: 4049


training loop:   8% |##                               | ETA:  32 days, 20:29:00

Episode: 4050   score: 29.39   Avg score (100e): 29.21   actor gain: -0.32   critic loss: 0.40   steps: 4050
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:28:19

Episode: 4051   score: 29.39   Avg score (100e): 29.22   actor gain: -0.32   critic loss: 0.40   steps: 4051
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:27:39

Episode: 4052   score: 29.39   Avg score (100e): 29.22   actor gain: -0.32   critic loss: 0.40   steps: 4052


training loop:   8% |##                               | ETA:  32 days, 20:27:42

Episode: 4053   score: 29.40   Avg score (100e): 29.22   actor gain: -0.32   critic loss: 0.40   steps: 4053
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:27:47

Episode: 4054   score: 29.40   Avg score (100e): 29.23   actor gain: -0.32   critic loss: 0.40   steps: 4054


training loop:   8% |##                               | ETA:  32 days, 20:28:02

Episode: 4055   score: 29.40   Avg score (100e): 29.23   actor gain: -0.32   critic loss: 0.40   steps: 4055


training loop:   8% |##                               | ETA:  32 days, 20:28:06

Episode: 4056   score: 29.40   Avg score (100e): 29.23   actor gain: -0.32   critic loss: 0.40   steps: 4056


training loop:   8% |##                               | ETA:  32 days, 20:28:05

Episode: 4057   score: 29.41   Avg score (100e): 29.23   actor gain: -0.32   critic loss: 0.40   steps: 4057


training loop:   8% |##                               | ETA:  32 days, 20:28:05

Episode: 4058   score: 29.41   Avg score (100e): 29.24   actor gain: -0.32   critic loss: 0.40   steps: 4058
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:29:35

Episode: 4059   score: 29.42   Avg score (100e): 29.24   actor gain: -0.32   critic loss: 0.40   steps: 4059
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:29:42

Episode: 4060   score: 29.43   Avg score (100e): 29.24   actor gain: -0.32   critic loss: 0.40   steps: 4060


training loop:   8% |##                               | ETA:  32 days, 20:30:48

Episode: 4061   score: 29.43   Avg score (100e): 29.25   actor gain: -0.32   critic loss: 0.40   steps: 4061


training loop:   8% |##                               | ETA:  32 days, 20:30:41

Episode: 4062   score: 29.43   Avg score (100e): 29.25   actor gain: -0.32   critic loss: 0.40   steps: 4062


training loop:   8% |##                               | ETA:  32 days, 20:31:42

Episode: 4063   score: 29.43   Avg score (100e): 29.25   actor gain: -0.32   critic loss: 0.40   steps: 4063


training loop:   8% |##                               | ETA:  32 days, 20:31:52

Episode: 4064   score: 29.44   Avg score (100e): 29.26   actor gain: -0.32   critic loss: 0.40   steps: 4064


training loop:   8% |##                               | ETA:  32 days, 20:32:17

Episode: 4065   score: 29.44   Avg score (100e): 29.26   actor gain: -0.32   critic loss: 0.40   steps: 4065


training loop:   8% |##                               | ETA:  32 days, 20:32:06

Episode: 4066   score: 29.44   Avg score (100e): 29.26   actor gain: -0.32   critic loss: 0.40   steps: 4066
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:31:21

Episode: 4067   score: 29.44   Avg score (100e): 29.27   actor gain: -0.32   critic loss: 0.40   steps: 4067


training loop:   8% |##                               | ETA:  32 days, 20:31:18

Episode: 4068   score: 29.45   Avg score (100e): 29.27   actor gain: -0.32   critic loss: 0.40   steps: 4068
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:30:22

Episode: 4069   score: 29.45   Avg score (100e): 29.27   actor gain: -0.32   critic loss: 0.40   steps: 4069
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:29:37

Episode: 4070   score: 29.45   Avg score (100e): 29.28   actor gain: -0.32   critic loss: 0.40   steps: 4070


training loop:   8% |##                               | ETA:  32 days, 20:28:48

Episode: 4071   score: 29.45   Avg score (100e): 29.28   actor gain: -0.32   critic loss: 0.40   steps: 4071


training loop:   8% |##                               | ETA:  32 days, 20:28:06

Episode: 4072   score: 29.46   Avg score (100e): 29.28   actor gain: -0.32   critic loss: 0.40   steps: 4072


training loop:   8% |##                               | ETA:  32 days, 20:28:03

Episode: 4073   score: 29.46   Avg score (100e): 29.29   actor gain: -0.32   critic loss: 0.40   steps: 4073
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:27:09

Episode: 4074   score: 29.46   Avg score (100e): 29.29   actor gain: -0.32   critic loss: 0.40   steps: 4074


training loop:   8% |##                               | ETA:  32 days, 20:26:29

Episode: 4075   score: 29.46   Avg score (100e): 29.29   actor gain: -0.32   critic loss: 0.40   steps: 4075


training loop:   8% |##                               | ETA:  32 days, 20:26:19

Episode: 4076   score: 29.46   Avg score (100e): 29.30   actor gain: -0.32   critic loss: 0.40   steps: 4076
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:25:23

Episode: 4077   score: 29.47   Avg score (100e): 29.30   actor gain: -0.32   critic loss: 0.40   steps: 4077
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:25:33

Episode: 4078   score: 29.47   Avg score (100e): 29.30   actor gain: -0.32   critic loss: 0.40   steps: 4078


training loop:   8% |##                               | ETA:  32 days, 20:25:38

Episode: 4079   score: 29.46   Avg score (100e): 29.31   actor gain: -0.32   critic loss: 0.40   steps: 4079
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:25:00

Episode: 4080   score: 29.47   Avg score (100e): 29.31   actor gain: -0.32   critic loss: 0.40   steps: 4080
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:24:40

Episode: 4081   score: 29.48   Avg score (100e): 29.31   actor gain: -0.32   critic loss: 0.40   steps: 4081
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:23:47

Episode: 4082   score: 29.48   Avg score (100e): 29.32   actor gain: -0.32   critic loss: 0.40   steps: 4082


training loop:   8% |##                               | ETA:  32 days, 20:23:49

Episode: 4083   score: 29.48   Avg score (100e): 29.32   actor gain: -0.32   critic loss: 0.39   steps: 4083


training loop:   8% |##                               | ETA:  32 days, 20:24:21

Episode: 4084   score: 29.49   Avg score (100e): 29.32   actor gain: -0.32   critic loss: 0.39   steps: 4084


training loop:   8% |##                               | ETA:  32 days, 20:23:53

Episode: 4085   score: 29.50   Avg score (100e): 29.33   actor gain: -0.32   critic loss: 0.39   steps: 4085


training loop:   8% |##                               | ETA:  32 days, 20:23:02

Episode: 4086   score: 29.51   Avg score (100e): 29.33   actor gain: -0.32   critic loss: 0.39   steps: 4086


training loop:   8% |##                               | ETA:  32 days, 20:21:59

Episode: 4087   score: 29.51   Avg score (100e): 29.33   actor gain: -0.33   critic loss: 0.39   steps: 4087


training loop:   8% |##                               | ETA:  32 days, 20:22:29

Episode: 4088   score: 29.51   Avg score (100e): 29.34   actor gain: -0.33   critic loss: 0.40   steps: 4088
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:21:55

Episode: 4089   score: 29.51   Avg score (100e): 29.34   actor gain: -0.33   critic loss: 0.40   steps: 4089


training loop:   8% |##                               | ETA:  32 days, 20:21:34

Episode: 4090   score: 29.52   Avg score (100e): 29.35   actor gain: -0.33   critic loss: 0.40   steps: 4090


training loop:   8% |##                               | ETA:  32 days, 20:23:23

Episode: 4091   score: 29.52   Avg score (100e): 29.35   actor gain: -0.33   critic loss: 0.40   steps: 4091


training loop:   8% |##                               | ETA:  32 days, 20:23:07

Episode: 4092   score: 29.53   Avg score (100e): 29.35   actor gain: -0.33   critic loss: 0.40   steps: 4092
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:24:13

Episode: 4093   score: 29.53   Avg score (100e): 29.36   actor gain: -0.33   critic loss: 0.40   steps: 4093
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:23:50

Episode: 4094   score: 29.54   Avg score (100e): 29.36   actor gain: -0.33   critic loss: 0.40   steps: 4094
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:23:04

Episode: 4095   score: 29.54   Avg score (100e): 29.36   actor gain: -0.33   critic loss: 0.40   steps: 4095


training loop:   8% |##                               | ETA:  32 days, 20:22:48

Episode: 4096   score: 29.54   Avg score (100e): 29.37   actor gain: -0.33   critic loss: 0.40   steps: 4096
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:22:06

Episode: 4097   score: 29.53   Avg score (100e): 29.37   actor gain: -0.33   critic loss: 0.40   steps: 4097


training loop:   8% |##                               | ETA:  32 days, 20:22:14

Episode: 4098   score: 29.54   Avg score (100e): 29.37   actor gain: -0.33   critic loss: 0.40   steps: 4098
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:21:31

Episode: 4099   score: 29.54   Avg score (100e): 29.38   actor gain: -0.33   critic loss: 0.40   steps: 4099


training loop:   8% |##                               | ETA:  32 days, 20:21:16

Episode: 4100   score: 29.54   Avg score (100e): 29.38   actor gain: -0.33   critic loss: 0.40   steps: 4100


training loop:   8% |##                               | ETA:  32 days, 20:20:49

Episode: 4101   score: 29.54   Avg score (100e): 29.38   actor gain: -0.33   critic loss: 0.40   steps: 4101
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:19:57

Episode: 4102   score: 29.54   Avg score (100e): 29.39   actor gain: -0.33   critic loss: 0.40   steps: 4102


training loop:   8% |##                               | ETA:  32 days, 20:20:12

Episode: 4103   score: 29.55   Avg score (100e): 29.39   actor gain: -0.33   critic loss: 0.40   steps: 4103
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:19:34

Episode: 4104   score: 29.55   Avg score (100e): 29.39   actor gain: -0.33   critic loss: 0.40   steps: 4104


training loop:   8% |##                               | ETA:  32 days, 20:19:18

Episode: 4105   score: 29.55   Avg score (100e): 29.40   actor gain: -0.33   critic loss: 0.40   steps: 4105
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:18:28

Episode: 4106   score: 29.55   Avg score (100e): 29.40   actor gain: -0.33   critic loss: 0.40   steps: 4106


training loop:   8% |##                               | ETA:  32 days, 20:17:42

Episode: 4107   score: 29.55   Avg score (100e): 29.40   actor gain: -0.33   critic loss: 0.40   steps: 4107


training loop:   8% |##                               | ETA:  32 days, 20:17:51

Episode: 4108   score: 29.55   Avg score (100e): 29.41   actor gain: -0.33   critic loss: 0.40   steps: 4108


training loop:   8% |##                               | ETA:  32 days, 20:17:24

Episode: 4109   score: 29.56   Avg score (100e): 29.41   actor gain: -0.33   critic loss: 0.40   steps: 4109
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:16:31

Episode: 4110   score: 29.57   Avg score (100e): 29.41   actor gain: -0.33   critic loss: 0.40   steps: 4110


training loop:   8% |##                               | ETA:  32 days, 20:15:52

Episode: 4111   score: 29.57   Avg score (100e): 29.42   actor gain: -0.33   critic loss: 0.40   steps: 4111


training loop:   8% |##                               | ETA:  32 days, 20:15:23

Episode: 4112   score: 29.57   Avg score (100e): 29.42   actor gain: -0.33   critic loss: 0.40   steps: 4112
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:15:09

Episode: 4113   score: 29.58   Avg score (100e): 29.42   actor gain: -0.33   critic loss: 0.40   steps: 4113


training loop:   8% |##                               | ETA:  32 days, 20:14:26

Episode: 4114   score: 29.58   Avg score (100e): 29.43   actor gain: -0.33   critic loss: 0.40   steps: 4114
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:13:47

Episode: 4115   score: 29.58   Avg score (100e): 29.43   actor gain: -0.33   critic loss: 0.40   steps: 4115


training loop:   8% |##                               | ETA:  32 days, 20:13:20

Episode: 4116   score: 29.59   Avg score (100e): 29.43   actor gain: -0.33   critic loss: 0.40   steps: 4116


training loop:   8% |##                               | ETA:  32 days, 20:12:23

Episode: 4117   score: 29.59   Avg score (100e): 29.44   actor gain: -0.33   critic loss: 0.40   steps: 4117


training loop:   8% |##                               | ETA:  32 days, 20:12:49

Episode: 4118   score: 29.59   Avg score (100e): 29.44   actor gain: -0.33   critic loss: 0.40   steps: 4118


training loop:   8% |##                               | ETA:  32 days, 20:12:42

Episode: 4119   score: 29.59   Avg score (100e): 29.44   actor gain: -0.33   critic loss: 0.40   steps: 4119


training loop:   8% |##                               | ETA:  32 days, 20:12:31

Episode: 4120   score: 29.59   Avg score (100e): 29.45   actor gain: -0.33   critic loss: 0.40   steps: 4120


training loop:   8% |##                               | ETA:  32 days, 20:11:49

Episode: 4121   score: 29.59   Avg score (100e): 29.45   actor gain: -0.33   critic loss: 0.40   steps: 4121
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:11:13

Episode: 4122   score: 29.60   Avg score (100e): 29.45   actor gain: -0.33   critic loss: 0.40   steps: 4122


training loop:   8% |##                               | ETA:  32 days, 20:11:16

Episode: 4123   score: 29.60   Avg score (100e): 29.46   actor gain: -0.32   critic loss: 0.40   steps: 4123
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:13:05

Episode: 4124   score: 29.61   Avg score (100e): 29.46   actor gain: -0.32   critic loss: 0.40   steps: 4124


training loop:   8% |##                               | ETA:  32 days, 20:13:15

Episode: 4125   score: 29.61   Avg score (100e): 29.46   actor gain: -0.32   critic loss: 0.40   steps: 4125


training loop:   8% |##                               | ETA:  32 days, 20:13:14

Episode: 4126   score: 29.61   Avg score (100e): 29.47   actor gain: -0.32   critic loss: 0.40   steps: 4126


training loop:   8% |##                               | ETA:  32 days, 20:12:57

Episode: 4127   score: 29.61   Avg score (100e): 29.47   actor gain: -0.32   critic loss: 0.40   steps: 4127


training loop:   8% |##                               | ETA:  32 days, 20:13:21

Episode: 4128   score: 29.61   Avg score (100e): 29.47   actor gain: -0.32   critic loss: 0.40   steps: 4128
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:13:12

Episode: 4129   score: 29.62   Avg score (100e): 29.48   actor gain: -0.32   critic loss: 0.40   steps: 4129


training loop:   8% |##                               | ETA:  32 days, 20:12:59

Episode: 4130   score: 29.62   Avg score (100e): 29.48   actor gain: -0.32   critic loss: 0.40   steps: 4130
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:12:03

Episode: 4131   score: 29.62   Avg score (100e): 29.48   actor gain: -0.33   critic loss: 0.40   steps: 4131


training loop:   8% |##                               | ETA:  32 days, 20:11:28

Episode: 4132   score: 29.63   Avg score (100e): 29.48   actor gain: -0.32   critic loss: 0.40   steps: 4132
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:11:42

Episode: 4133   score: 29.64   Avg score (100e): 29.49   actor gain: -0.32   critic loss: 0.40   steps: 4133


training loop:   8% |##                               | ETA:  32 days, 20:11:18

Episode: 4134   score: 29.63   Avg score (100e): 29.49   actor gain: -0.32   critic loss: 0.40   steps: 4134


training loop:   8% |##                               | ETA:  32 days, 20:11:02

Episode: 4135   score: 29.64   Avg score (100e): 29.49   actor gain: -0.32   critic loss: 0.40   steps: 4135
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:10:46

Episode: 4136   score: 29.64   Avg score (100e): 29.50   actor gain: -0.32   critic loss: 0.40   steps: 4136


training loop:   8% |##                               | ETA:  32 days, 20:10:04

Episode: 4137   score: 29.64   Avg score (100e): 29.50   actor gain: -0.32   critic loss: 0.40   steps: 4137


training loop:   8% |##                               | ETA:  32 days, 20:10:21

Episode: 4138   score: 29.64   Avg score (100e): 29.50   actor gain: -0.32   critic loss: 0.40   steps: 4138


training loop:   8% |##                               | ETA:  32 days, 20:09:48

Episode: 4139   score: 29.65   Avg score (100e): 29.51   actor gain: -0.32   critic loss: 0.40   steps: 4139


training loop:   8% |##                               | ETA:  32 days, 20:09:33

Episode: 4140   score: 29.66   Avg score (100e): 29.51   actor gain: -0.32   critic loss: 0.40   steps: 4140


training loop:   8% |##                               | ETA:  32 days, 20:09:00

Episode: 4141   score: 29.66   Avg score (100e): 29.51   actor gain: -0.32   critic loss: 0.40   steps: 4141


training loop:   8% |##                               | ETA:  32 days, 20:08:18

Episode: 4142   score: 29.66   Avg score (100e): 29.52   actor gain: -0.32   critic loss: 0.40   steps: 4142


training loop:   8% |##                               | ETA:  32 days, 20:08:32

Episode: 4143   score: 29.66   Avg score (100e): 29.52   actor gain: -0.32   critic loss: 0.40   steps: 4143


training loop:   8% |##                               | ETA:  32 days, 20:08:27

Episode: 4144   score: 29.66   Avg score (100e): 29.52   actor gain: -0.32   critic loss: 0.40   steps: 4144


training loop:   8% |##                               | ETA:  32 days, 20:08:19

Episode: 4145   score: 29.67   Avg score (100e): 29.52   actor gain: -0.32   critic loss: 0.40   steps: 4145
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:07:23

Episode: 4146   score: 29.67   Avg score (100e): 29.53   actor gain: -0.32   critic loss: 0.40   steps: 4146


training loop:   8% |##                               | ETA:  32 days, 20:07:17

Episode: 4147   score: 29.67   Avg score (100e): 29.53   actor gain: -0.32   critic loss: 0.40   steps: 4147


training loop:   8% |##                               | ETA:  32 days, 20:07:32

Episode: 4148   score: 29.67   Avg score (100e): 29.53   actor gain: -0.32   critic loss: 0.40   steps: 4148


training loop:   8% |##                               | ETA:  32 days, 20:07:43

Episode: 4149   score: 29.68   Avg score (100e): 29.54   actor gain: -0.32   critic loss: 0.40   steps: 4149


training loop:   8% |##                               | ETA:  32 days, 20:09:21

Episode: 4150   score: 29.67   Avg score (100e): 29.54   actor gain: -0.33   critic loss: 0.40   steps: 4150
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:09:10

Episode: 4151   score: 29.68   Avg score (100e): 29.54   actor gain: -0.32   critic loss: 0.40   steps: 4151


training loop:   8% |##                               | ETA:  32 days, 20:09:12

Episode: 4152   score: 29.68   Avg score (100e): 29.54   actor gain: -0.32   critic loss: 0.40   steps: 4152
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:11:21

Episode: 4153   score: 29.68   Avg score (100e): 29.55   actor gain: -0.32   critic loss: 0.40   steps: 4153


training loop:   8% |##                               | ETA:  32 days, 20:11:14

Episode: 4154   score: 29.68   Avg score (100e): 29.55   actor gain: -0.32   critic loss: 0.40   steps: 4154


training loop:   8% |##                               | ETA:  32 days, 20:11:43

Episode: 4155   score: 29.68   Avg score (100e): 29.55   actor gain: -0.32   critic loss: 0.40   steps: 4155


training loop:   8% |##                               | ETA:  32 days, 20:15:33

Episode: 4156   score: 29.68   Avg score (100e): 29.56   actor gain: -0.32   critic loss: 0.40   steps: 4156


training loop:   8% |##                               | ETA:  32 days, 20:23:35

Episode: 4157   score: 29.68   Avg score (100e): 29.56   actor gain: -0.32   critic loss: 0.40   steps: 4157
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:30:36

Episode: 4158   score: 29.69   Avg score (100e): 29.56   actor gain: -0.32   critic loss: 0.40   steps: 4158


training loop:   8% |##                               | ETA:  32 days, 20:32:31

Episode: 4159   score: 29.69   Avg score (100e): 29.56   actor gain: -0.32   critic loss: 0.40   steps: 4159


training loop:   8% |##                               | ETA:  32 days, 20:33:05

Episode: 4160   score: 29.70   Avg score (100e): 29.57   actor gain: -0.32   critic loss: 0.40   steps: 4160
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:34:20

Episode: 4161   score: 29.71   Avg score (100e): 29.57   actor gain: -0.32   critic loss: 0.40   steps: 4161
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:34:14

Episode: 4162   score: 29.71   Avg score (100e): 29.57   actor gain: -0.32   critic loss: 0.40   steps: 4162
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:34:44

Episode: 4163   score: 29.71   Avg score (100e): 29.57   actor gain: -0.32   critic loss: 0.40   steps: 4163


training loop:   8% |##                               | ETA:  32 days, 20:34:44

Episode: 4164   score: 29.71   Avg score (100e): 29.58   actor gain: -0.32   critic loss: 0.40   steps: 4164
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:35:30

Episode: 4165   score: 29.72   Avg score (100e): 29.58   actor gain: -0.32   critic loss: 0.40   steps: 4165


training loop:   8% |##                               | ETA:  32 days, 20:35:41

Episode: 4166   score: 29.71   Avg score (100e): 29.58   actor gain: -0.32   critic loss: 0.40   steps: 4166
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:36:55

Episode: 4167   score: 29.72   Avg score (100e): 29.59   actor gain: -0.32   critic loss: 0.40   steps: 4167


training loop:   8% |##                               | ETA:  32 days, 20:40:57

Episode: 4168   score: 29.71   Avg score (100e): 29.59   actor gain: -0.32   critic loss: 0.40   steps: 4168


training loop:   8% |##                               | ETA:  32 days, 20:43:34

Episode: 4169   score: 29.71   Avg score (100e): 29.59   actor gain: -0.32   critic loss: 0.40   steps: 4169


training loop:   8% |##                               | ETA:  32 days, 20:47:02

Episode: 4170   score: 29.71   Avg score (100e): 29.59   actor gain: -0.32   critic loss: 0.40   steps: 4170
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:47:12

Episode: 4171   score: 29.71   Avg score (100e): 29.60   actor gain: -0.32   critic loss: 0.40   steps: 4171
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:48:12

Episode: 4172   score: 29.72   Avg score (100e): 29.60   actor gain: -0.32   critic loss: 0.40   steps: 4172


training loop:   8% |##                               | ETA:  32 days, 20:50:34

Episode: 4173   score: 29.72   Avg score (100e): 29.60   actor gain: -0.32   critic loss: 0.40   steps: 4173
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:50:51

Episode: 4174   score: 29.73   Avg score (100e): 29.60   actor gain: -0.32   critic loss: 0.40   steps: 4174


training loop:   8% |##                               | ETA:  32 days, 20:52:19

Episode: 4175   score: 29.73   Avg score (100e): 29.61   actor gain: -0.32   critic loss: 0.40   steps: 4175


training loop:   8% |##                               | ETA:  32 days, 20:52:07

Episode: 4176   score: 29.73   Avg score (100e): 29.61   actor gain: -0.32   critic loss: 0.40   steps: 4176
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:52:23

Episode: 4177   score: 29.73   Avg score (100e): 29.61   actor gain: -0.32   critic loss: 0.40   steps: 4177


training loop:   8% |##                               | ETA:  32 days, 20:52:51

Episode: 4178   score: 29.72   Avg score (100e): 29.61   actor gain: -0.32   critic loss: 0.40   steps: 4178
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:53:17

Episode: 4179   score: 29.73   Avg score (100e): 29.62   actor gain: -0.32   critic loss: 0.40   steps: 4179


training loop:   8% |##                               | ETA:  32 days, 20:53:46

Episode: 4180   score: 29.72   Avg score (100e): 29.62   actor gain: -0.32   critic loss: 0.40   steps: 4180


training loop:   8% |##                               | ETA:  32 days, 20:58:23

Episode: 4181   score: 29.72   Avg score (100e): 29.62   actor gain: -0.32   critic loss: 0.40   steps: 4181


training loop:   8% |##                               | ETA:  32 days, 20:59:48

Episode: 4182   score: 29.72   Avg score (100e): 29.62   actor gain: -0.32   critic loss: 0.40   steps: 4182


training loop:   8% |##                               | ETA:  32 days, 21:01:50

Episode: 4183   score: 29.72   Avg score (100e): 29.63   actor gain: -0.32   critic loss: 0.40   steps: 4183


training loop:   8% |##                               | ETA:  32 days, 21:04:24

Episode: 4184   score: 29.72   Avg score (100e): 29.63   actor gain: -0.32   critic loss: 0.40   steps: 4184


training loop:   8% |##                               | ETA:  32 days, 21:04:56

Episode: 4185   score: 29.72   Avg score (100e): 29.63   actor gain: -0.33   critic loss: 0.40   steps: 4185


training loop:   8% |##                               | ETA:  32 days, 21:05:51

Episode: 4186   score: 29.72   Avg score (100e): 29.63   actor gain: -0.33   critic loss: 0.40   steps: 4186


training loop:   8% |##                               | ETA:  32 days, 21:06:30

Episode: 4187   score: 29.72   Avg score (100e): 29.64   actor gain: -0.33   critic loss: 0.40   steps: 4187


training loop:   8% |##                               | ETA:  32 days, 21:09:26

Episode: 4188   score: 29.72   Avg score (100e): 29.64   actor gain: -0.33   critic loss: 0.40   steps: 4188
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:11:43

Episode: 4189   score: 29.73   Avg score (100e): 29.64   actor gain: -0.33   critic loss: 0.40   steps: 4189
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:12:59

Episode: 4190   score: 29.73   Avg score (100e): 29.64   actor gain: -0.33   critic loss: 0.40   steps: 4190


training loop:   8% |##                               | ETA:  32 days, 21:15:11

Episode: 4191   score: 29.74   Avg score (100e): 29.64   actor gain: -0.33   critic loss: 0.40   steps: 4191


training loop:   8% |##                               | ETA:  32 days, 21:19:26

Episode: 4192   score: 29.74   Avg score (100e): 29.65   actor gain: -0.33   critic loss: 0.40   steps: 4192


training loop:   8% |##                               | ETA:  32 days, 21:22:56

Episode: 4193   score: 29.74   Avg score (100e): 29.65   actor gain: -0.33   critic loss: 0.40   steps: 4193


training loop:   8% |##                               | ETA:  32 days, 21:24:41

Episode: 4194   score: 29.75   Avg score (100e): 29.65   actor gain: -0.33   critic loss: 0.40   steps: 4194
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:26:24

Episode: 4195   score: 29.75   Avg score (100e): 29.65   actor gain: -0.33   critic loss: 0.40   steps: 4195


training loop:   8% |##                               | ETA:  32 days, 21:28:12

Episode: 4196   score: 29.75   Avg score (100e): 29.65   actor gain: -0.33   critic loss: 0.40   steps: 4196


training loop:   8% |##                               | ETA:  32 days, 21:30:44

Episode: 4197   score: 29.76   Avg score (100e): 29.66   actor gain: -0.33   critic loss: 0.40   steps: 4197


training loop:   8% |##                               | ETA:  32 days, 21:30:33

Episode: 4198   score: 29.76   Avg score (100e): 29.66   actor gain: -0.33   critic loss: 0.40   steps: 4198


training loop:   8% |##                               | ETA:  32 days, 21:29:45

Episode: 4199   score: 29.76   Avg score (100e): 29.66   actor gain: -0.33   critic loss: 0.40   steps: 4199


training loop:   8% |##                               | ETA:  32 days, 21:27:14

Episode: 4200   score: 29.77   Avg score (100e): 29.66   actor gain: -0.33   critic loss: 0.40   steps: 4200


training loop:   8% |##                               | ETA:  32 days, 21:24:40

Episode: 4201   score: 29.77   Avg score (100e): 29.67   actor gain: -0.33   critic loss: 0.40   steps: 4201
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:22:04

Episode: 4202   score: 29.78   Avg score (100e): 29.67   actor gain: -0.33   critic loss: 0.40   steps: 4202


training loop:   8% |##                               | ETA:  32 days, 21:20:03

Episode: 4203   score: 29.78   Avg score (100e): 29.67   actor gain: -0.33   critic loss: 0.40   steps: 4203


training loop:   8% |##                               | ETA:  32 days, 21:18:08

Episode: 4204   score: 29.79   Avg score (100e): 29.67   actor gain: -0.33   critic loss: 0.40   steps: 4204
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:14:49

Episode: 4205   score: 29.80   Avg score (100e): 29.68   actor gain: -0.32   critic loss: 0.40   steps: 4205


training loop:   8% |##                               | ETA:  32 days, 21:14:09

Episode: 4206   score: 29.80   Avg score (100e): 29.68   actor gain: -0.32   critic loss: 0.40   steps: 4206
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:12:52

Episode: 4207   score: 29.81   Avg score (100e): 29.68   actor gain: -0.32   critic loss: 0.40   steps: 4207


training loop:   8% |##                               | ETA:  32 days, 21:13:00

Episode: 4208   score: 29.81   Avg score (100e): 29.68   actor gain: -0.32   critic loss: 0.40   steps: 4208


training loop:   8% |##                               | ETA:  32 days, 21:12:16

Episode: 4209   score: 29.81   Avg score (100e): 29.69   actor gain: -0.32   critic loss: 0.40   steps: 4209


training loop:   8% |##                               | ETA:  32 days, 21:11:15

Episode: 4210   score: 29.82   Avg score (100e): 29.69   actor gain: -0.32   critic loss: 0.40   steps: 4210


training loop:   8% |##                               | ETA:  32 days, 21:08:35

Episode: 4211   score: 29.83   Avg score (100e): 29.69   actor gain: -0.32   critic loss: 0.40   steps: 4211
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:07:14

Episode: 4212   score: 29.83   Avg score (100e): 29.69   actor gain: -0.32   critic loss: 0.40   steps: 4212


training loop:   8% |##                               | ETA:  32 days, 21:05:27

Episode: 4213   score: 29.83   Avg score (100e): 29.70   actor gain: -0.32   critic loss: 0.40   steps: 4213
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:04:27

Episode: 4214   score: 29.84   Avg score (100e): 29.70   actor gain: -0.32   critic loss: 0.40   steps: 4214
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:01:12

Episode: 4215   score: 29.84   Avg score (100e): 29.70   actor gain: -0.32   critic loss: 0.40   steps: 4215


training loop:   8% |##                               | ETA:  32 days, 20:58:35

Episode: 4216   score: 29.84   Avg score (100e): 29.70   actor gain: -0.32   critic loss: 0.40   steps: 4216


training loop:   8% |##                               | ETA:  32 days, 20:55:00

Episode: 4217   score: 29.84   Avg score (100e): 29.71   actor gain: -0.32   critic loss: 0.40   steps: 4217


training loop:   8% |##                               | ETA:  32 days, 20:55:17

Episode: 4218   score: 29.84   Avg score (100e): 29.71   actor gain: -0.32   critic loss: 0.40   steps: 4218
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:55:35

Episode: 4219   score: 29.84   Avg score (100e): 29.71   actor gain: -0.32   critic loss: 0.40   steps: 4219
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:55:30

Episode: 4220   score: 29.85   Avg score (100e): 29.71   actor gain: -0.32   critic loss: 0.40   steps: 4220


training loop:   8% |##                               | ETA:  32 days, 20:59:27

Episode: 4221   score: 29.85   Avg score (100e): 29.72   actor gain: -0.32   critic loss: 0.39   steps: 4221


training loop:   8% |##                               | ETA:  32 days, 20:59:51

Episode: 4222   score: 29.86   Avg score (100e): 29.72   actor gain: -0.32   critic loss: 0.40   steps: 4222
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:00:15

Episode: 4223   score: 29.87   Avg score (100e): 29.72   actor gain: -0.32   critic loss: 0.39   steps: 4223


training loop:   8% |##                               | ETA:  32 days, 21:01:16

Episode: 4224   score: 29.86   Avg score (100e): 29.72   actor gain: -0.32   critic loss: 0.39   steps: 4224


training loop:   8% |##                               | ETA:  32 days, 21:02:28

Episode: 4225   score: 29.86   Avg score (100e): 29.73   actor gain: -0.32   critic loss: 0.39   steps: 4225
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:02:15

Episode: 4226   score: 29.87   Avg score (100e): 29.73   actor gain: -0.32   critic loss: 0.39   steps: 4226
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:02:11

Episode: 4227   score: 29.87   Avg score (100e): 29.73   actor gain: -0.32   critic loss: 0.39   steps: 4227


training loop:   8% |##                               | ETA:  32 days, 21:01:44

Episode: 4228   score: 29.88   Avg score (100e): 29.73   actor gain: -0.32   critic loss: 0.39   steps: 4228
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:00:26

Episode: 4229   score: 29.88   Avg score (100e): 29.74   actor gain: -0.32   critic loss: 0.39   steps: 4229


training loop:   8% |##                               | ETA:  32 days, 20:59:28

Episode: 4230   score: 29.89   Avg score (100e): 29.74   actor gain: -0.32   critic loss: 0.39   steps: 4230


training loop:   8% |##                               | ETA:  32 days, 20:58:57

Episode: 4231   score: 29.90   Avg score (100e): 29.74   actor gain: -0.32   critic loss: 0.39   steps: 4231


training loop:   8% |##                               | ETA:  32 days, 20:58:08

Episode: 4232   score: 29.90   Avg score (100e): 29.75   actor gain: -0.32   critic loss: 0.39   steps: 4232
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:56:46

Episode: 4233   score: 29.91   Avg score (100e): 29.75   actor gain: -0.32   critic loss: 0.39   steps: 4233


training loop:   8% |##                               | ETA:  32 days, 20:55:22

Episode: 4234   score: 29.91   Avg score (100e): 29.75   actor gain: -0.32   critic loss: 0.39   steps: 4234


training loop:   8% |##                               | ETA:  32 days, 20:54:08

Episode: 4235   score: 29.91   Avg score (100e): 29.75   actor gain: -0.32   critic loss: 0.39   steps: 4235


training loop:   8% |##                               | ETA:  32 days, 20:53:15

Episode: 4236   score: 29.92   Avg score (100e): 29.76   actor gain: -0.32   critic loss: 0.39   steps: 4236


training loop:   8% |##                               | ETA:  32 days, 20:52:29

Episode: 4237   score: 29.92   Avg score (100e): 29.76   actor gain: -0.32   critic loss: 0.40   steps: 4237
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:51:00

Episode: 4238   score: 29.92   Avg score (100e): 29.76   actor gain: -0.32   critic loss: 0.40   steps: 4238


training loop:   8% |##                               | ETA:  32 days, 20:50:39

Episode: 4239   score: 29.93   Avg score (100e): 29.76   actor gain: -0.32   critic loss: 0.40   steps: 4239
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:48:44

Episode: 4240   score: 29.93   Avg score (100e): 29.77   actor gain: -0.32   critic loss: 0.40   steps: 4240
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:51:28

Episode: 4241   score: 29.93   Avg score (100e): 29.77   actor gain: -0.32   critic loss: 0.40   steps: 4241


training loop:   8% |##                               | ETA:  32 days, 20:54:18

Episode: 4242   score: 29.94   Avg score (100e): 29.77   actor gain: -0.32   critic loss: 0.40   steps: 4242


training loop:   8% |##                               | ETA:  32 days, 20:53:43

Episode: 4243   score: 29.95   Avg score (100e): 29.78   actor gain: -0.32   critic loss: 0.40   steps: 4243


training loop:   8% |##                               | ETA:  32 days, 20:55:48

Episode: 4244   score: 29.95   Avg score (100e): 29.78   actor gain: -0.32   critic loss: 0.40   steps: 4244
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:54:43

Episode: 4245   score: 29.95   Avg score (100e): 29.78   actor gain: -0.32   critic loss: 0.40   steps: 4245


training loop:   8% |##                               | ETA:  32 days, 20:56:46

Episode: 4246   score: 29.96   Avg score (100e): 29.78   actor gain: -0.32   critic loss: 0.40   steps: 4246
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:56:20

Episode: 4247   score: 29.97   Avg score (100e): 29.79   actor gain: -0.32   critic loss: 0.40   steps: 4247


training loop:   8% |##                               | ETA:  32 days, 20:55:32

Episode: 4248   score: 29.97   Avg score (100e): 29.79   actor gain: -0.32   critic loss: 0.40   steps: 4248


training loop:   8% |##                               | ETA:  32 days, 20:54:04

Episode: 4249   score: 29.97   Avg score (100e): 29.79   actor gain: -0.32   critic loss: 0.40   steps: 4249


training loop:   8% |##                               | ETA:  32 days, 20:54:02

Episode: 4250   score: 29.97   Avg score (100e): 29.80   actor gain: -0.32   critic loss: 0.40   steps: 4250


training loop:   8% |##                               | ETA:  32 days, 20:52:41

Episode: 4251   score: 29.98   Avg score (100e): 29.80   actor gain: -0.32   critic loss: 0.40   steps: 4251


training loop:   8% |##                               | ETA:  32 days, 20:52:27

Episode: 4252   score: 29.98   Avg score (100e): 29.80   actor gain: -0.32   critic loss: 0.40   steps: 4252


training loop:   8% |##                               | ETA:  32 days, 20:53:52

Episode: 4253   score: 29.98   Avg score (100e): 29.81   actor gain: -0.32   critic loss: 0.40   steps: 4253
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:57:56

Episode: 4254   score: 29.99   Avg score (100e): 29.81   actor gain: -0.32   critic loss: 0.40   steps: 4254
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:59:27

Episode: 4255   score: 29.99   Avg score (100e): 29.81   actor gain: -0.32   critic loss: 0.40   steps: 4255
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:01:25

Episode: 4256   score: 30.00   Avg score (100e): 29.81   actor gain: -0.32   critic loss: 0.40   steps: 4256


training loop:   8% |##                               | ETA:  32 days, 21:01:51

Episode: 4257   score: 29.99   Avg score (100e): 29.82   actor gain: -0.32   critic loss: 0.40   steps: 4257


training loop:   8% |##                               | ETA:  32 days, 21:01:29

Episode: 4258   score: 30.00   Avg score (100e): 29.82   actor gain: -0.32   critic loss: 0.40   steps: 4258
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:01:10

Episode: 4259   score: 30.00   Avg score (100e): 29.82   actor gain: -0.32   critic loss: 0.40   steps: 4259


training loop:   8% |##                               | ETA:  32 days, 20:59:52

Episode: 4260   score: 30.00   Avg score (100e): 29.83   actor gain: -0.32   critic loss: 0.40   steps: 4260


training loop:   8% |##                               | ETA:  32 days, 20:59:02

Episode: 4261   score: 30.01   Avg score (100e): 29.83   actor gain: -0.32   critic loss: 0.40   steps: 4261
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:58:51

Episode: 4262   score: 30.01   Avg score (100e): 29.83   actor gain: -0.32   critic loss: 0.40   steps: 4262


training loop:   8% |##                               | ETA:  32 days, 20:58:30

Episode: 4263   score: 30.02   Avg score (100e): 29.84   actor gain: -0.32   critic loss: 0.40   steps: 4263


training loop:   8% |##                               | ETA:  32 days, 20:57:21

Episode: 4264   score: 30.02   Avg score (100e): 29.84   actor gain: -0.32   critic loss: 0.40   steps: 4264
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:55:52

Episode: 4265   score: 30.03   Avg score (100e): 29.84   actor gain: -0.32   critic loss: 0.40   steps: 4265


training loop:   8% |##                               | ETA:  32 days, 20:54:42

Episode: 4266   score: 30.03   Avg score (100e): 29.85   actor gain: -0.32   critic loss: 0.40   steps: 4266
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:54:26

Episode: 4267   score: 30.04   Avg score (100e): 29.85   actor gain: -0.32   critic loss: 0.40   steps: 4267
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:53:40

Episode: 4268   score: 30.05   Avg score (100e): 29.85   actor gain: -0.32   critic loss: 0.40   steps: 4268
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:56:42

Episode: 4269   score: 30.05   Avg score (100e): 29.86   actor gain: -0.32   critic loss: 0.40   steps: 4269


training loop:   8% |##                               | ETA:  32 days, 20:57:45

Episode: 4270   score: 30.06   Avg score (100e): 29.86   actor gain: -0.32   critic loss: 0.40   steps: 4270


training loop:   8% |##                               | ETA:  32 days, 20:57:15

Episode: 4271   score: 30.06   Avg score (100e): 29.86   actor gain: -0.32   critic loss: 0.40   steps: 4271
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:56:02

Episode: 4272   score: 30.07   Avg score (100e): 29.87   actor gain: -0.33   critic loss: 0.40   steps: 4272


training loop:   8% |##                               | ETA:  32 days, 20:55:16

Episode: 4273   score: 30.08   Avg score (100e): 29.87   actor gain: -0.33   critic loss: 0.40   steps: 4273
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:54:12

Episode: 4274   score: 30.08   Avg score (100e): 29.87   actor gain: -0.33   critic loss: 0.40   steps: 4274


training loop:   8% |##                               | ETA:  32 days, 20:53:23

Episode: 4275   score: 30.09   Avg score (100e): 29.88   actor gain: -0.33   critic loss: 0.40   steps: 4275


training loop:   8% |##                               | ETA:  32 days, 20:53:18

Episode: 4276   score: 30.10   Avg score (100e): 29.88   actor gain: -0.32   critic loss: 0.40   steps: 4276


training loop:   8% |##                               | ETA:  32 days, 20:52:35

Episode: 4277   score: 30.10   Avg score (100e): 29.88   actor gain: -0.32   critic loss: 0.40   steps: 4277
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:51:40

Episode: 4278   score: 30.11   Avg score (100e): 29.89   actor gain: -0.32   critic loss: 0.40   steps: 4278


training loop:   8% |##                               | ETA:  32 days, 20:51:13

Episode: 4279   score: 30.12   Avg score (100e): 29.89   actor gain: -0.32   critic loss: 0.40   steps: 4279
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:49:26

Episode: 4280   score: 30.12   Avg score (100e): 29.90   actor gain: -0.32   critic loss: 0.40   steps: 4280


training loop:   8% |##                               | ETA:  32 days, 20:48:45

Episode: 4281   score: 30.12   Avg score (100e): 29.90   actor gain: -0.32   critic loss: 0.40   steps: 4281


training loop:   8% |##                               | ETA:  32 days, 20:47:47

Episode: 4282   score: 30.13   Avg score (100e): 29.90   actor gain: -0.32   critic loss: 0.40   steps: 4282
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:48:56

Episode: 4283   score: 30.14   Avg score (100e): 29.91   actor gain: -0.32   critic loss: 0.40   steps: 4283
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:55:42

Episode: 4284   score: 30.14   Avg score (100e): 29.91   actor gain: -0.32   critic loss: 0.40   steps: 4284


training loop:   8% |##                               | ETA:  32 days, 20:56:19

Episode: 4285   score: 30.14   Avg score (100e): 29.92   actor gain: -0.32   critic loss: 0.40   steps: 4285
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:58:51

Episode: 4286   score: 30.14   Avg score (100e): 29.92   actor gain: -0.32   critic loss: 0.40   steps: 4286


training loop:   8% |##                               | ETA:  32 days, 20:58:54

Episode: 4287   score: 30.15   Avg score (100e): 29.93   actor gain: -0.32   critic loss: 0.40   steps: 4287


training loop:   8% |##                               | ETA:  32 days, 20:59:38

Episode: 4288   score: 30.15   Avg score (100e): 29.93   actor gain: -0.32   critic loss: 0.40   steps: 4288


training loop:   8% |##                               | ETA:  32 days, 21:00:27

Episode: 4289   score: 30.15   Avg score (100e): 29.93   actor gain: -0.32   critic loss: 0.40   steps: 4289


training loop:   8% |##                               | ETA:  32 days, 21:00:15

Episode: 4290   score: 30.16   Avg score (100e): 29.94   actor gain: -0.32   critic loss: 0.40   steps: 4290


training loop:   8% |##                               | ETA:  32 days, 21:00:38

Episode: 4291   score: 30.17   Avg score (100e): 29.94   actor gain: -0.32   critic loss: 0.40   steps: 4291
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:59:55

Episode: 4292   score: 30.17   Avg score (100e): 29.95   actor gain: -0.33   critic loss: 0.40   steps: 4292


training loop:   8% |##                               | ETA:  32 days, 21:00:09

Episode: 4293   score: 30.18   Avg score (100e): 29.95   actor gain: -0.32   critic loss: 0.40   steps: 4293


training loop:   8% |##                               | ETA:  32 days, 20:59:29

Episode: 4294   score: 30.19   Avg score (100e): 29.96   actor gain: -0.32   critic loss: 0.40   steps: 4294


training loop:   8% |##                               | ETA:  32 days, 20:59:52

Episode: 4295   score: 30.19   Avg score (100e): 29.96   actor gain: -0.32   critic loss: 0.40   steps: 4295


training loop:   8% |##                               | ETA:  32 days, 20:59:14

Episode: 4296   score: 30.19   Avg score (100e): 29.96   actor gain: -0.32   critic loss: 0.40   steps: 4296


training loop:   8% |##                               | ETA:  32 days, 20:58:37

Episode: 4297   score: 30.20   Avg score (100e): 29.97   actor gain: -0.32   critic loss: 0.40   steps: 4297


training loop:   8% |##                               | ETA:  32 days, 20:58:24

Episode: 4298   score: 30.20   Avg score (100e): 29.97   actor gain: -0.32   critic loss: 0.40   steps: 4298


training loop:   8% |##                               | ETA:  32 days, 20:57:51

Episode: 4299   score: 30.21   Avg score (100e): 29.98   actor gain: -0.32   critic loss: 0.40   steps: 4299
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 20:57:55

Episode: 4300   score: 30.22   Avg score (100e): 29.98   actor gain: -0.32   critic loss: 0.40   steps: 4300
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:05:53

Episode: 4301   score: 30.23   Avg score (100e): 29.99   actor gain: -0.32   critic loss: 0.40   steps: 4301
np.all(done) is true! miracle!


training loop:   8% |##                               | ETA:  32 days, 21:08:55

Episode: 4302   score: 30.23   Avg score (100e): 29.99   actor gain: -0.32   critic loss: 0.40   steps: 4302


training loop:   8% |##                               | ETA:  32 days, 21:14:24

Episode: 4303   score: 30.24   Avg score (100e): 30.00   actor gain: -0.32   critic loss: 0.40   steps: 4303


training loop:   8% |##                               | ETA:  32 days, 21:15:17

Episode: 4304   score: 30.24   Avg score (100e): 30.00   actor gain: -0.32   critic loss: 0.40   steps: 4304


In [None]:
saveTrainedModel(agent, model_dir + model_name)

In [None]:
# plot the scores
import matplotlib.pyplot as plt
%matplotlib inline

fig, ax = plt.subplots()
ax.plot(np.arange(len(mean_rewards)), mean_rewards)   # one point per logged episode
ax.set_ylabel('Score')
ax.set_xlabel('Episode #')
plt.show()
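
The training log above reports a running `Avg score (100e)` next to each episode score. As a minimal sketch of how such a trailing average could be recomputed from a list of per-episode scores and overlaid on the plot, the helpers below use only NumPy. The names `moving_average` and `first_episode_reaching` are hypothetical and not part of the notebook, and the target of 30.0 is an assumption read off the point where the log stops, not a value confirmed by the code.

```python
import numpy as np

def moving_average(values, window=100):
    """Trailing mean over the most recent `window` entries (fewer at the start)."""
    values = np.asarray(values, dtype=float)
    csum = np.cumsum(np.insert(values, 0, 0.0))   # csum[k] = sum of the first k values
    idx = np.arange(1, len(values) + 1)           # exclusive end index of each window
    start = np.maximum(idx - window, 0)           # inclusive start index of each window
    return (csum[idx] - csum[start]) / (idx - start)

def first_episode_reaching(values, target, window=100):
    """1-based episode at which the trailing mean first reaches `target`, else None."""
    avg = moving_average(values, window)
    hits = np.flatnonzero(avg >= target)
    return int(hits[0]) + 1 if hits.size else None
```

With the notebook's `mean_rewards` list, `moving_average(mean_rewards)` could be passed to a second `plot` call to draw the smoothed curve, and `first_episode_reaching(mean_rewards, 30.0)` would report when the assumed target was first met.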

In [None]:
scores = np.zeros(num_agents)                # accumulated score (one entry per agent)
for _ in range(10):
    agent.step(train_mode=False)             # evaluation only: train_mode=False disables training
    episode_reward = agent.running_rewards   # per-agent rewards reported by the agent
    scores += episode_reward                 # accumulate the score (for each agent)
print('Total score over 10 evaluation runs (averaged over agents): {}'.format(np.mean(scores)))

In [None]:
env.close()