How to fine-tune the existing weights on new data? #50

Closed
asiddhant opened this issue Jun 14, 2018 · 4 comments

Comments

@asiddhant

I converted the hdf5 file back into a ckpt file (using the custom_getter method in bilm/model.py) and tried to use it with the architecture in bilm/training.py, but the loaded weights give very bad perplexity on held-out data when I run run_test.py. Are the architectures in bilm/model.py and bilm/training.py compatible? If I am doing something wrong, would it be possible for you to share the ckpt file corresponding to the given hdf5 file?

Thanks
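
For reference, a minimal sketch of the hdf5-to-ckpt conversion described above, assuming the training graph has already been built with bilm/training.py and that you have worked out the variable-name-to-hdf5-path mapping used by the custom getter in bilm/model.py (the `name_map` argument below stands in for that mapping; it is not part of the repo's API):

```python
import h5py
import numpy as np
import tensorflow as tf

def convert_hdf5_to_ckpt(weight_file, ckpt_path, name_map):
    """Copy arrays from the ELMo hdf5 weight file into the matching
    variables of an already-built bilm/training.py graph, then save a ckpt.

    name_map maps each TF variable name to its hdf5 dataset path; that
    mapping mirrors the custom getter in bilm/model.py and is assumed
    to be supplied by the caller.
    """
    with h5py.File(weight_file, 'r') as fin, tf.Session() as sess:
        # Start from random init so variables absent from the hdf5 file
        # (e.g. the softmax) still have values.
        sess.run(tf.global_variables_initializer())
        for var in tf.trainable_variables():
            if var.op.name not in name_map:
                continue
            value = np.asarray(fin[name_map[var.op.name]])
            sess.run(var.assign(value))
        tf.train.Saver().save(sess, ckpt_path)
```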

@matt-peters
Contributor

matt-peters commented Jun 14, 2018

The architectures are the same. However, the hdf5 file doesn't contain the softmax weights as they aren't needed to compute the ELMo representations, and they significantly increase the size of the file. I'll make the checkpoint file for the pre-trained model with the softmax weights available shortly.
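
A quick way to confirm this is to list the contents of the weight file; a minimal sketch, assuming h5py is installed (the filename is a placeholder):

```python
import h5py

# Print every group/dataset path in the released weight file; the softmax
# parameters do not appear among them.
with h5py.File('elmo_weights.hdf5', 'r') as fin:
    fin.visit(print)
```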

@asiddhant
Author

Thanks a lot, I missed that part. It would still be great to have the ckpt file, but since the vocab will be new anyway, the pretrained softmax would not be required for fine-tuning either. So I am closing this issue.
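
That fine-tuning setup (new vocab, fresh softmax) amounts to restoring everything except the softmax variables from the checkpoint. A minimal sketch; the `lm/softmax` scope name is an assumption about the naming in bilm/training.py, so check it against the variables in your graph:

```python
import tensorflow as tf

# Restore all variables except the softmax, which is re-initialized for
# the new vocabulary and trained from scratch.
restore_vars = [v for v in tf.global_variables()
                if not v.op.name.startswith('lm/softmax')]
saver = tf.train.Saver(restore_vars)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())      # fresh init, incl. softmax
    saver.restore(sess, '/path/to/pretrained.ckpt')  # overwrite the rest
    # ... continue training on the new data from here
```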

asiddhant reopened this Jul 18, 2018
@asiddhant
Author

Hi, would it be possible for you to share the checkpoint file as well?

@matt-peters
Contributor
