This repository has been archived by the owner on Dec 29, 2022. It is now read-only.

loading from checkpoint #51

Open
AdrianLsk opened this issue Mar 16, 2017 · 0 comments
@AdrianLsk

Hi, can you please clarify how to use the saved model files if I want to load the model from a checkpoint?

Here is my code:

# mock input
mock_input = np.ones(input_shape)

# build model
accuracy, cost, inference_input, label_tensor, inferences, train_op = \
    build_network(input_shape, learning_rate, specs, pt.Phase.test)

# set gpu and config options
gpu_options = \
    tf.GPUOptions(allow_growth=True, per_process_gpu_memory_fraction=.9)
config = \
    tf.ConfigProto(allow_soft_placement=True, gpu_options=gpu_options,
                   log_device_placement=True)

# load from checkpoint
model_ckpt = './models/first/-3036.data-00000-of-00001'
runner = pt.train.Runner(initial_checkpoint=model_ckpt)

with tf.Session(config=config), tf.device('/gpu:2'):
    predictions = runner.run_model(
        op_list=[inferences], num_steps=1,
        feed_vars=(inference_input,), print_every=0,
        feed_data=[(mock_input,)])

Inspecting the three saved files:

  • the checkpoint is neither file.data-00000-of-00001 nor file.meta, since both yield a DataLossError
  • so file.index must be the correct one, but it gives NotFoundError: Tensor name "conv3d_1/bias/Adam_1" not found in checkpoint files ...

I thought all variables were saved during training. What am I doing wrong?
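For reference, TensorFlow checkpoints are addressed by a *prefix*, not by any of the individual shard files: `prefix.data-00000-of-00001`, `prefix.index`, and `prefix.meta` together form one checkpoint, and restore calls expect the bare prefix (e.g. `./models/first/-3036`). A minimal sketch of this, using `tf.compat.v1` so it also runs under TF2 (the variable name and paths here are illustrative, not from the issue's model):

```python
import os
import tempfile

import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

ckpt_dir = tempfile.mkdtemp()
v = tf.get_variable("v", initializer=tf.constant(42.0))
saver = tf.train.Saver()

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # save() returns the checkpoint *prefix*; on disk it writes
    # model-3036.data-00000-of-00001, model-3036.index, model-3036.meta
    prefix = saver.save(sess, os.path.join(ckpt_dir, "model"), global_step=3036)

print(prefix)                    # .../model-3036 — note: no file extension
print(sorted(os.listdir(ckpt_dir)))

# Restoring: pass the prefix, never one of the shard files.
with tf.Session() as sess:
    saver.restore(sess, prefix)
    print(sess.run(v))           # 42.0
```

Passing the `.data-00000-of-00001` or `.meta` file directly is what typically produces the DataLossError seen above, since the reader tries to parse the shard as a whole checkpoint.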
