GH-387: modify FlairEmbeddings to handle large texts #444

alanakbik · 2019-02-01T18:44:17Z

closes #387

This PR fixes the retrieval of embeddings from a character LM for long sequences. The idea is to chop a long sequence into chunks and push each chunk through the LM, always carrying the last hidden state of one chunk over as the initial hidden state of the next. This lowers memory requirements (shorter sequences are processed at once) but increases runtime (more calls to the RNN).

In detail:

LanguageModel.get_representation() and FlairEmbeddings now have a chars_per_chunk parameter that defaults to 512. Lowering this parameter reduces memory use but increases runtime.
LanguageModelTrainer can now shuffle sentences in each split.
Removed the deprecated DocumentMeanEmbeddings, as well as most mentions of the deprecated CharLMEmbeddings.
Removed slow unit tests.
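
The chunking idea can be illustrated with a minimal, dependency-free sketch. This is not the code in this PR: `toy_rnn` is a hypothetical scalar recurrence standing in for the character LM, and `chunked_states` is an illustrative helper name. Only the `chars_per_chunk` name and its default of 512 come from the description above. The sketch shows that carrying the hidden state across chunk boundaries gives the same per-character states as a single pass over the whole text.

```python
import math
from typing import List, Tuple


def toy_rnn(chars: str, hidden: float = 0.0) -> Tuple[List[float], float]:
    """Hypothetical stand-in for a character LM: one scalar state per character."""
    states: List[float] = []
    for ch in chars:
        # Toy recurrence: the new state depends on the previous state and the input char.
        hidden = math.tanh(0.5 * hidden + (ord(ch) % 32) / 32.0)
        states.append(hidden)
    return states, hidden


def chunked_states(text: str, chars_per_chunk: int = 512) -> List[float]:
    """Run toy_rnn chunk by chunk, passing the last hidden state into the next chunk."""
    if chars_per_chunk < 1:
        raise ValueError("chars_per_chunk must be positive")
    hidden = 0.0  # initial hidden state for the first chunk
    all_states: List[float] = []
    for start in range(0, len(text), chars_per_chunk):
        chunk = text[start:start + chars_per_chunk]
        # The final state of this chunk becomes the initial state of the next one.
        states, hidden = toy_rnn(chunk, hidden)
        all_states.extend(states)
    return all_states


if __name__ == "__main__":
    text = "a long document " * 100
    full, _ = toy_rnn(text)
    print(chunked_states(text, chars_per_chunk=64) == full)
```

In this toy the chunked and unchunked results match exactly because the recurrence is purely sequential. With a real batched RNN the saving comes from holding activations for only one chunk at a time, at the cost of one forward call per chunk.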

aakbik added 3 commits February 1, 2019 19:43

GH-387: modify FlairEmbeddings to handle large texts

7f46b2c

GH-387: comment out slow tests

c0f12bc

GH-387: rename parameter for consistency

1b735f3

alanakbik merged commit 1876f1f into master Feb 2, 2019

alanakbik deleted the GH-387-long-flair-embeddings branch February 6, 2019 16:56
