This repository has been archived by the owner on Nov 22, 2022. It is now read-only.
Add dense feature normalization to Char-LSTM TorchScript model. #986
Summary:
In D16357113 we added dense feature normalization to the `DocModel` architecture. Since `ByteTokensDocumentModel` extends the `DocModel` class and uses the `FloatListTensorizer` class, feature normalization was already supported at training time once that diff landed. However, it was not supported at production time, because normalization was not performed in the `torchscriptify` forward function. This is a problem: the model can be trained on normalized data but is then unable to normalize fresh data at inference time, producing unusual results.

This diff adds normalization to the `torchscriptify` forward function so that it is applied at inference time for Char-LSTM models. As with the `DocModel`, if `normalize` is set to `false` in the config, the normalization method simply becomes the identity function. This is useful because we don't need to add any extra control flow directly in the forward function.

Reviewed By: snisarg
Differential Revision: D17358479
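To illustrate the pattern described above, here is a minimal, hedged sketch of a TorchScript-compatible normalizer whose behavior collapses to the identity when normalization is disabled in the config. The class and parameter names (`VectorNormalizer`, `do_normalization`) are illustrative assumptions, not PyText's actual API:

```python
import torch
from typing import List


class VectorNormalizer(torch.nn.Module):
    """Sketch: mean/std normalization of dense features that survives
    torch.jit.script, so the same logic runs at training and inference time.
    When do_normalization is False, forward() is the identity, keeping any
    extra control flow out of the calling model's forward function."""

    def __init__(
        self,
        mean: List[float],
        std: List[float],
        do_normalization: bool = True,
    ):
        super().__init__()
        # Buffers travel with the module through scripting and serialization.
        self.register_buffer("mean", torch.tensor(mean))
        self.register_buffer("std", torch.tensor(std))
        self.do_normalization = do_normalization

    def forward(self, dense: torch.Tensor) -> torch.Tensor:
        if not self.do_normalization:
            # Config had normalize=false: act as the identity function.
            return dense
        return (dense - self.mean) / self.std


# Scripting the module bakes the normalization into the exported model,
# so fresh data is normalized at inference time as well.
norm = torch.jit.script(VectorNormalizer(mean=[0.0, 1.0], std=[1.0, 2.0]))
out = norm(torch.tensor([[1.0, 3.0]]))
```

Because `do_normalization` is a plain module attribute, TorchScript specializes the branch at script time; the exported Char-LSTM model would call this unconditionally, matching the identity-function approach described in the summary.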