
[FIX] ADR model size #14

Closed
ruohoruotsi opened this issue Apr 27, 2019 · 3 comments
Labels: bug (Something isn't working), enhancement (New feature or request)


@ruohoruotsi (Member)

The ADR model is too big.

  • The training script emits a ~200MB file, but most users cannot reasonably download 200MB, come on!
  • What accounts for the large size? My suspicion is that PyTorch is saving other data along with the weights/biases. Investigate and optimize the model size so that we can either store it locally (within GitHub's limits) or at least make it an easier download.
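One quick way to test the suspicion is to compare the serialized size of the full checkpoint against the model weights alone. This is a minimal sketch using a toy `nn.Linear` stand-in for the ADR model (the real model and checkpoint layout will differ):

```python
import io

import torch
import torch.nn as nn

# Toy stand-in for the ADR model (hypothetical; the real model is larger).
model = nn.Linear(4, 2)
optimizer = torch.optim.Adam(model.parameters())

# One training step so Adam accumulates its per-parameter state
# (exp_avg and exp_avg_sq), roughly tripling the tensor count.
loss = model(torch.randn(8, 4)).sum()
loss.backward()
optimizer.step()

def serialized_size(obj):
    """Return the number of bytes torch.save would write for obj."""
    buf = io.BytesIO()
    torch.save(obj, buf)
    return buf.tell()

full_bytes = serialized_size({
    "model_state_dict": model.state_dict(),
    "optimizer_state_dict": optimizer.state_dict(),
})
weights_bytes = serialized_size(model.state_dict())
print(f"full checkpoint: {full_bytes} bytes, weights only: {weights_bytes} bytes")
```

If the gap between the two numbers is large, the optimizer state is the likely culprit.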

Help wanted:

@ruohoruotsi added the bug (Something isn't working) and enhancement (New feature or request) labels on Apr 27, 2019
@ruohoruotsi ruohoruotsi self-assigned this Apr 27, 2019
@ruohoruotsi (Member, Author)

Learning more about the PyTorch model structure, we see that the optimizer's state_dict is also saved, since it contains state and parameters that are updated as the model trains. This matters for checkpoint models used to resume training, but NOT for the final model, so we can drop that part before release.
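Dropping the optimizer state amounts to loading the checkpoint and re-saving only the model weights. A minimal sketch, assuming the training script saved a dict with `"model_state_dict"` and `"optimizer_state_dict"` keys (the key names and file paths here are hypothetical; adjust them to match the actual checkpoint):

```python
import os
import tempfile

import torch
import torch.nn as nn

# Toy model and optimizer standing in for the real ADR training setup.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

tmpdir = tempfile.mkdtemp()
ckpt_path = os.path.join(tmpdir, "checkpoint.pt")   # full training checkpoint
slim_path = os.path.join(tmpdir, "model_slim.pt")   # weights-only release file

# Full checkpoint as the training script might emit it.
torch.save({"model_state_dict": model.state_dict(),
            "optimizer_state_dict": optimizer.state_dict()}, ckpt_path)

# Strip the optimizer state: re-save only the model weights.
ckpt = torch.load(ckpt_path, map_location="cpu")
torch.save(ckpt["model_state_dict"], slim_path)

# The slim file is all that inference needs.
model.load_state_dict(torch.load(slim_path, map_location="cpu"))
model.eval()
```

The slim file loads directly with `load_state_dict`, so downstream inference code does not change.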

If you trained your model using Adam, you need to save the optimizer state 
dict as well and reload that. Also, if you used any learning rate decay, you need 
to reload the state of the scheduler because it gets reset if you don’t, and you 
may end up with a higher learning rate that will make the solution state oscillate. 
Finally, if you have any dropout or batch norm in your model architecture, and 
you saved your model after a test loop (in which case model.eval() was called), 
make sure to call model.train() before the training loop.
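The resume-training pattern described above can be sketched as follows; the model, optimizer, and scheduler here are placeholders, not the actual ADR training setup:

```python
import os
import tempfile

import torch
import torch.nn as nn

model = nn.Linear(4, 2)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)

ckpt_path = os.path.join(tempfile.mkdtemp(), "checkpoint.pt")

# Save everything needed to resume: model, optimizer, and scheduler state.
torch.save({
    "model_state_dict": model.state_dict(),
    "optimizer_state_dict": optimizer.state_dict(),
    "scheduler_state_dict": scheduler.state_dict(),
}, ckpt_path)

# Resume: restore all three states, then switch back to train mode
# in case model.eval() was called during an earlier test loop.
ckpt = torch.load(ckpt_path, map_location="cpu")
model.load_state_dict(ckpt["model_state_dict"])
optimizer.load_state_dict(ckpt["optimizer_state_dict"])
scheduler.load_state_dict(ckpt["scheduler_state_dict"])
model.train()
```

Restoring the scheduler state keeps the learning-rate decay where it left off instead of resetting it, which is exactly the oscillation hazard mentioned above.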

@ruohoruotsi (Member, Author)

Fixed in 5781370. The new model is 67MB, which is still bigger than the recommended 50MB, but at least it still pushes. I should keep track of the history and other sources of bloat in this repo.

@ruohoruotsi (Member, Author)

An alternative if you need large files on the web: host them outside the GitHub repo itself, somewhere still easily curl-able. It's a bit fiddly, though:

https://medium.freecodecamp.org/how-to-transfer-large-files-to-google-colab-and-remote-jupyter-notebooks-26ca252892fa
