In #10 we observed that many of the weights in the embedding layer are never updated during training and so retain their initialized values. From what we can tell, this mostly affects weights initialized with a negative value. If we randomly initialize the embedding layer, weights that were never updated during training contribute little to prediction at test time, since their values are just random noise. We may be able to exploit this to make our saved models smaller.
After training, we could check which weights still match their initialized values and set them to zero. When saving the embedding weight matrix, we then only need to store the non-zero values and their positions in the matrix.