How to use the model in inference mode #95

MarcoMeter · 2019-03-28T18:38:35Z

Hello,

How does one load the model for execution only (not training) to watch the trained behavior?
This is related to #71

mgbellemare · 2019-03-29T22:13:13Z

If you load the agent but run it in eval mode, I think that will do what you want (combined with #71).

MarcoMeter · 2019-04-01T04:40:45Z

How is the eval mode used? Setting the training steps to 0 causes a division by zero exception.

mgbellemare · 2019-04-06T23:26:26Z

That sounds like a bug. We're probably just missing a check so that the train phase is skipped when training_steps = 0. We already do this for the eval phase (when eval_steps = 0).
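
The missing check described here could look roughly like the sketch below. It is not Dopamine's actual runner code: `run_iteration`, `run_phase`, and `average_return` are hypothetical names chosen for illustration, and the division inside `average_return` is only a guess at where a zero count could cause the exception. Each phase is skipped when its step budget is zero, so no per-phase average is computed for a phase that never ran.

```python
def average_return(total_return, num_episodes):
    """Average return per episode; only valid when num_episodes > 0."""
    return total_return / num_episodes


def run_iteration(training_steps, evaluation_steps, run_phase):
    """Run the train and eval phases, skipping any phase with zero steps.

    run_phase(name, steps) is a hypothetical callback that returns
    (total_return, num_episodes) for the given phase.
    """
    stats = {}
    for name, steps in (('train', training_steps), ('eval', evaluation_steps)):
        if steps <= 0:
            # Skipped phase: nothing ran, so there is nothing to average.
            continue
        total_return, num_episodes = run_phase(name, steps)
        stats[name] = average_return(total_return, num_episodes)
    return stats
```

With this guard, calling `run_iteration(0, eval_steps, run_phase)` returns statistics for the eval phase only, which is the evaluation-only behavior asked about.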

MarcoMeter · 2019-04-07T10:35:19Z

For the eval mode, are all checkpoint files necessary?
The checkpoints that store states can get really large.

mgbellemare · 2019-04-07T14:05:41Z

You should only need the last checkpoint -- the code keeps a few (3) around, in case the most recent checkpoint is corrupted.

MarcoMeter · 2019-04-08T04:21:39Z

I mean these kinds of checkpoint files:

add_count_ckpt.19.gz
ckpt.19
invalid_range_ckpt.19.gz
sentinel_checkpoint_complete.19
$store$_action_ckpt.19.gz
$store$_observation_ckpt.19.gz
$store$_reward_ckpt.19.gz
sum_tree_ckpt.19.gz
tf_ckpt-19.data-00000-of-00001
tf_ckpt-19.index
tf_ckpt-19.meta

Are all of them relevant for the eval mode? Especially the $store$_observation_ckpt file can comprise many gigabytes.

psc-g · 2019-04-08T10:24:00Z

The tf_ckpt* files are for restoring the network parameters, so you definitely want those. The other files are for restoring the replay buffer to the state it was at when the checkpoint was saved. If you're only doing inference for evaluation (i.e. not sampling batches from the replay buffer to learn from) then you don't need those files.
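
To slim down a checkpoint directory for evaluation, one option is to copy everything except the replay buffer files into a fresh directory. The sketch below assumes, based only on the listing above, that the replay buffer files are exactly the ones ending in `.gz`; `copy_inference_files` is a hypothetical helper, not part of the library. It keeps `ckpt.*` and the sentinel file alongside `tf_ckpt*`, since whether the loader also needs those is not settled in this thread.

```python
import os
import shutil


def copy_inference_files(src_dir, dst_dir):
    """Copy every checkpoint file except the gzipped replay buffer files.

    Assumption: the replay buffer files are the ones ending in '.gz', as in
    the listing above. Everything else (tf_ckpt-*, ckpt.*, sentinel) is kept.
    Returns the sorted list of copied file names.
    """
    os.makedirs(dst_dir, exist_ok=True)
    copied = []
    for name in sorted(os.listdir(src_dir)):
        src_path = os.path.join(src_dir, name)
        if not os.path.isfile(src_path) or name.endswith('.gz'):
            continue
        shutil.copy2(src_path, os.path.join(dst_dir, name))
        copied.append(name)
    return copied
```

Copying into a new directory rather than deleting in place leaves the original checkpoint intact in case training needs to be resumed later.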


mgbellemare closed this as completed Mar 29, 2019

callenshaw mentioned this issue May 1, 2019

What is the correct directory structure for submitting agent? Unity-Technologies/obstacle-tower-challenge#36

Closed
