This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
In Caffe, one can save the model weights at any time, for example at iteration 32768: just press 'Ctrl + C', the training process stops, and the model weights are saved automatically.
In MXNet, however, I can't find a comparable way to save weights at an arbitrary point. It seems that MXNet can only save weights at whole-epoch boundaries, such as epoch 1, 2, 3, ...
This is a problem because, on large-scale datasets, a single epoch can take several days to train. So if something goes wrong, a lot of time is wasted.
Are there any solutions for this? Please describe them in detail.
Thanks.
Try adding the following code after the batch_end_callback call in 'base_module.py'. It fetches the current parameters and fires the epoch-end callbacks once per batch, so a checkpoint callback can run mid-epoch:

    # Grab the current parameters and invoke the epoch-end callbacks
    # after every batch, passing the batch index instead of the epoch.
    arg_params, aux_params = self.get_params()
    if epoch_end_callback is not None:
        for callback in _as_list(epoch_end_callback):
            callback(nbatch, self.symbol, arg_params, aux_params)
When defining 'mx.callback.do_checkpoint', you can pass 'period' to control how many iterations elapse between saved checkpoints.
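The period logic above can be sketched in plain Python. This is a minimal stand-in, not MXNet's actual implementation: `save_checkpoint` below is a placeholder for whatever actually writes the weights (in MXNet that would be 'mx.model.save_checkpoint'), and with the patch above the callback is invoked once per batch, so 'period' counts batches rather than epochs.

```python
def make_periodic_checkpointer(save_checkpoint, period):
    """Return a callback that saves every `period` invocations.

    `save_checkpoint` is a stand-in for the real weight-saving
    routine; `period` mirrors the `period` argument of
    mx.callback.do_checkpoint.
    """
    def _callback(nbatch, params):
        # Batches are counted from 0, so saves happen at batch
        # period-1, 2*period-1, ... (i.e. every `period` batches).
        if (nbatch + 1) % period == 0:
            save_checkpoint(nbatch, params)
    return _callback

saved = []
cb = make_periodic_checkpointer(lambda n, p: saved.append(n), period=4)
for nbatch in range(10):
    cb(nbatch, params={})
# saved == [3, 7]
```

With the one-batch patch to base_module.py applied, passing such a callback as epoch_end_callback would give checkpoints every `period` batches instead of every `period` epochs.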
@solin319 Thank you, I will give it a try. Do you also know of a specific way to just press 'Ctrl + C' so that the training process stops and the model weights are saved automatically?
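The Ctrl + C behavior asked about here can be approximated in plain Python with a SIGINT handler that defers the save to a safe point in the training loop. This is a hedged sketch, not an MXNet feature: `save` is a placeholder for the real weight-saving routine (e.g. 'mx.model.save_checkpoint'), and the loop body stands in for the forward/backward/update step.

```python
import signal

class GracefulSaver:
    """Catch Ctrl + C (SIGINT) and save weights at the next safe point.

    Saving directly inside the signal handler would risk writing
    half-updated weights mid-batch, so the handler only sets a flag.
    """
    def __init__(self, save):
        self.save = save          # stand-in for the real checkpoint writer
        self.stop = False
        signal.signal(signal.SIGINT, self._handle)

    def _handle(self, signum, frame):
        self.stop = True          # defer the save to the loop boundary

    def maybe_stop(self, nbatch, params):
        # Called once per batch, after the update has completed.
        if self.stop:
            self.save(nbatch, params)
        return self.stop

saved = []
saver = GracefulSaver(lambda n, p: saved.append(n))
for nbatch in range(100):
    # ... forward/backward/update would run here ...
    if nbatch == 5:
        signal.raise_signal(signal.SIGINT)  # simulate pressing Ctrl + C
    if saver.maybe_stop(nbatch, params={}):
        break

signal.signal(signal.SIGINT, signal.SIG_DFL)  # restore default handler
# saved == [5]
```

In a real training script the simulated `raise_signal` call would of course be absent; pressing Ctrl + C in the terminal delivers the same SIGINT, and the loop saves once and exits cleanly at the end of the current batch.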
Hi @Yochengliu, I would like to follow up on this topic and find someone who can possibly help you. Have you found any way to meet your needs? @nswamy, can you label this as a 'feature request'?