Model size reduction is sometimes very necessary for faster loading of model and faster predictions.
https://medium.com/google-cloud/optimizing-tensorflow-models-for-serving-959080e9ddbf
https://medium.com/tensorflow/tensorflow-model-optimization-toolkit-pruning-api-42cac9157a6a
Piyush Pathak