Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use the model in inference mode #95

Closed
MarcoMeter opened this issue Mar 28, 2019 · 7 comments
Closed

How to use the model in inference mode #95

MarcoMeter opened this issue Mar 28, 2019 · 7 comments

Comments

@MarcoMeter
Copy link

Hello,

how does one load the model for execution only (not training) to watch the trained behavior?
This is related to #71

@mgbellemare
Copy link
Collaborator

If you load the agent but run it in eval mode, I think that will do what you want (combined to #71).

@MarcoMeter
Copy link
Author

How is the eval mode used? Setting the training steps to 0 causes a division by zero exception.

@mgbellemare
Copy link
Collaborator

That sounds like a bug -- I think probably just missing a check that if training_steps = 0, we shouldn't run the train phase. We already do this for the eval phase (when eval_steps = 0).

@MarcoMeter
Copy link
Author

For the eval mode, are all checkpoints files necessary?
Like the checkpoints that store states can get really large.

@mgbellemare
Copy link
Collaborator

You should only need the last checkpoint -- the code keeps a few (3) around, in case the most recent checkpoint is corrupted.

@MarcoMeter
Copy link
Author

I mean these kind of checkpoint files:

add_count_ckpt.19.gz
ckpt.19
invalid_range_ckpt.19.gz
sentinel_checkpoint_complete.19
$store$_action_ckpt.19.gz
$store$_observation_ckpt.19.gz
$store$_reward_ckpt.19.gz
sum_tree_ckpt.19.gz
tf_ckpt-19.data-00000-of-00001
tf_ckpt-19.index
tf_ckpt-19.meta

Are all relevant for the eval mode? Especially the $store$_observation_ckpt can comprise many gigabytes.

@psc-g
Copy link
Collaborator

psc-g commented Apr 8, 2019 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants