Does the WordSeq extractor support ngrams? #4

zachmayer · 2018-02-02T19:29:02Z

E.g. if I want sequences of integers, with ngrams appended to the end?

anttttti · 2018-02-03T06:35:00Z

Not currently. This can be added as a feature easily.
You can add something like this as the last line of your text normalization function:
text += " " + bigrams(text)
This does almost the same thing, but because it's applied as text normalization, some things won't be available, such as spelling correction and pruning tokens by frequency. You can get the exact behavior with a more complicated setup, but this should be added as a feature to make it easy to use.
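For illustration, here is a minimal sketch of what that could look like. The thread does not define `bigrams`, so the helper below is hypothetical, as is the `normalize_text` function around it. It assumes whitespace tokenization and joins each adjacent pair with an underscore so that a bigram survives later tokenization as a single token.

```python
def bigrams(text, sep="_"):
    """Hypothetical helper: return the bigrams of `text` as one string.

    Tokens are split on whitespace; each adjacent pair is joined with
    `sep` so that the pair stays a single token downstream.
    """
    tokens = text.split()
    return " ".join(a + sep + b for a, b in zip(tokens, tokens[1:]))


def normalize_text(text):
    """Placeholder normalization function (assumed, not from the library)."""
    text = text.lower()
    # Last step: append the bigrams after the original unigram sequence.
    pairs = bigrams(text)
    if pairs:
        text += " " + pairs
    return text
```

A function like `normalize_text` would then be passed wherever your setup accepts a text normalization callable. Because the bigrams are plain text by the time the extractor sees them, they are treated like any other token.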
