Stop using cached components in the _analyze API #19827

nik9000 · 2016-08-05T14:11:44Z

We'd like to simplify AnalysisService and remove most of its members (#19814), maybe reducing it to just a Map<String, Analyzer>. To do that, we should remove calls to tokenizer, charFilter, and tokenFilter from the _analyze API, instead rebuilding these analyzers on the fly. This will make some calls to the _analyze API slower but it'll reduce the per index heap overhead.

The text was updated successfully, but these errors were encountered:

Stop calling tokenizer/tokenFilters/chaFilter method of IndexService Add some getAnalysisProvider methods Change SynonymTokenFilterFactory constructor Closes elastic#19827

Add javadoc some methods Closes elastic#19827

…ping Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitve. We fixed this in elastic#19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes elastic#19828

…ping (#20627) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828

nik9000 added blocker :Search/Analysis How text is split into tokens v5.0.0-beta1 labels Aug 5, 2016

This was referenced Aug 5, 2016

Simplify AnalysisService #19828

Closed

Do we want to let plugins make pre-built analysis components in 5.0? #19814

Closed

johtani mentioned this issue Aug 10, 2016

Stop using cached component in _analyze API #19929

Merged

johtani added a commit to johtani/elasticsearch that referenced this issue Aug 12, 2016

Stop using cached component in _analyze API

2cde3b0

Add javadoc some methods Closes elastic#19827

johtani closed this as completed in #19929 Aug 12, 2016

clintongormley added the v5.0.0-beta1 label Sep 14, 2016

s1monw mentioned this issue Sep 22, 2016

Remove AnalysisService and reduce it to a simple name to analyzer mapping #20627

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stop using cached components in the _analyze API #19827

Stop using cached components in the _analyze API #19827

nik9000 commented Aug 5, 2016

Stop using cached components in the _analyze API #19827

Stop using cached components in the _analyze API #19827

Comments

nik9000 commented Aug 5, 2016