Make field data changes immediately taken into account and add the ability to disallow field data loading. #4432

jpountz · 2013-12-12T14:41:12Z

This commit changes field data configuration updates so that they are
immediately taken into account when loading new segments. Field data
configuration is now cached separately from the field data cache, so it is
possible to clear the field data configuration from IndexFieldDataService
while the cache stays around. The next time Elasticsearch reloads the field
data configuration, it checks whether a cache entry already exists and reuses
it if so.
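
A minimal sketch of the separation described above, for illustration only. It is not the code from this commit: the class FieldDataConfigSketch, its nested types, and the method names are hypothetical, and the format strings are placeholder values. It only models the idea that dropping a field's configuration leaves its cache entry in place for reuse.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class FieldDataConfigSketch {

    /** Hypothetical per-field cache holding loaded data keyed by segment name. */
    static final class FieldCache {
        final Map<String, String> segments = new ConcurrentHashMap<>();
    }

    /** Hypothetical resolved configuration: a format plus the cache it loads into. */
    static final class FieldConfig {
        final String format;
        final FieldCache cache;

        FieldConfig(String format, FieldCache cache) {
            this.format = format;
            this.cache = cache;
        }
    }

    // Two separate maps: dropping a config entry does not touch the cache.
    private final Map<String, FieldConfig> configs = new ConcurrentHashMap<>();
    private final Map<String, FieldCache> caches = new ConcurrentHashMap<>();

    /** Resolves the config for a field, reusing an existing cache entry if there is one. */
    FieldConfig configFor(String field, String formatFromMapping) {
        return configs.computeIfAbsent(field, f ->
                new FieldConfig(formatFromMapping,
                        caches.computeIfAbsent(f, k -> new FieldCache())));
    }

    /** Called on a mapping update: forget the config only, keep the loaded data. */
    void onMappingUpdate(String field) {
        configs.remove(field);
    }

    public static void main(String[] args) {
        FieldDataConfigSketch service = new FieldDataConfigSketch();
        FieldConfig before = service.configFor("title", "paged_bytes");
        before.cache.segments.put("segment_0", "loaded");

        // The mapping changes: only the configuration is dropped.
        service.onMappingUpdate("title");
        FieldConfig after = service.configFor("title", "fst");

        System.out.println(after.format);                 // the new format is picked up
        System.out.println(after.cache == before.cache);  // the cache entry is reused
    }
}
```

Keeping the two maps apart is what lets a configuration change take effect on the next load without throwing away data that is already in memory.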

To disable field data loading, all that is required is to change the field
data format to "none" (supported by all field data types) using the update
mapping API. Elasticsearch will then refuse to load field data on any new
segment, but field data that was already loaded for previous segments
remains available. You therefore need to clear the field data cache in order
to reclaim memory right away; otherwise memory is reclaimed more slowly, as
segments get merged.
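
The behaviour described above can be modelled with a small sketch. This is an illustration under stated assumptions, not Elasticsearch code: DisabledFormatSketch and its methods are hypothetical, "paged_bytes" is only a placeholder starting format, and the exception type is an arbitrary choice.

```java
import java.util.HashMap;
import java.util.Map;

public class DisabledFormatSketch {

    // Placeholder starting format; any value other than "none" allows loading here.
    private String format = "paged_bytes";

    // Hypothetical cache of loaded field data, keyed by segment name.
    private final Map<String, String> loaded = new HashMap<>();

    /** Models a mapping update that changes the field data format. */
    void setFormat(String newFormat) {
        this.format = newFormat;
    }

    /** Returns cached data if present; refuses to load a new segment when disabled. */
    String load(String segment) {
        String cached = loaded.get(segment);
        if (cached != null) {
            return cached; // previously loaded segments remain available
        }
        if ("none".equals(format)) {
            throw new IllegalStateException("field data loading is disabled for this field");
        }
        String data = "fielddata(" + segment + ")";
        loaded.put(segment, data);
        return data;
    }

    /** Models clearing the field data cache to reclaim memory immediately. */
    void clearCache() {
        loaded.clear();
    }

    int loadedSegments() {
        return loaded.size();
    }

    public static void main(String[] args) {
        DisabledFormatSketch field = new DisabledFormatSketch();
        field.load("segment_0");
        field.setFormat("none");

        System.out.println(field.load("segment_0")); // still served from the cache
        try {
            field.load("segment_1");
        } catch (IllegalStateException e) {
            System.out.println("refused: " + e.getMessage());
        }

        field.clearCache();
        System.out.println(field.loadedSegments()); // nothing is held after clearing
    }
}
```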

Close #4430
Close #4431

s1monw · 2013-12-12T17:06:42Z

src/main/java/org/elasticsearch/index/fielddata/IndexFieldDataService.java

-                .put(Tuple.tuple("string", "doc_values"), new DocValuesIndexFieldData.Builder())
+                .put(Tuple.tuple("string", DOC_VALUES_FORMAT), new DocValuesIndexFieldData.Builder())
+                .put(Tuple.tuple("string", NONE_FORMAT), new NoneIndexFieldData.Builder())
+                .put(Tuple.tuple("string", "no"), new DocValuesIndexFieldData.Builder())


what is no?

a typo :) will fix

s1monw · 2013-12-12T17:09:50Z

I left some small comments, other than that LGTM - I am tempted to propose to default tokenized fields to this!

kimchy · 2013-12-12T21:51:26Z

I am not sure I like calling it none or no, you explained it with the word disable, so maybe just call it disable(d)?

jpountz · 2013-12-12T22:46:04Z

I am tempted to propose to default tokenized fields to this!

I like the idea, given that problems typically arise when field data is loaded by mistake on fields that are used for full-text search. However, I think it may be a problem for the out-of-the-box experience, given that fields are tokenized by default. It's not the first time that I'm a bit torn between usability and production use when deciding on default values. Maybe we could take hints from users about what they are doing with Elasticsearch in order to pick default values accordingly (just thinking out loud)...

I am not sure I like calling it none or no, you explained it with the word disable, so maybe just call it disable(d)?

To tell the truth, I didn't like it either; I was just out of inspiration. disabled sounds much better!

jpountz · 2013-12-13T09:04:50Z

@s1monw @kimchy I just pushed a new commit based on your comments.

kimchy · 2013-12-13T09:07:03Z

docs/reference/index-modules/fielddata.asciidoc

+field data type that prevents field data from being loaded into memory and
+will cause all requests that would need field data for this field to return
+an error.
+


Since the circuit breaker is going to happen, I think this doc can be misleading: the circuit breaker will kick in and not allow the field to be loaded (and hopefully, in the future, blacklist a field using this feature, ha! how things connect!). It is obviously still very much worthwhile to explicitly disable certain fields from loading into field data.

I just pushed a new commit to reword this part of the documentation. Does it look better?

s1monw · 2013-12-13T11:11:37Z

LGTM

kimchy · 2013-12-16T13:21:28Z

LGTM

jpountz added 2 commits December 12, 2013 15:36

Documentation

99746ed

s1monw reviewed Dec 12, 2013
Review round 1.

53a13cd

kimchy reviewed Dec 13, 2013
Reword the explanation for the disabled format.

994712b

jpountz closed this Dec 16, 2013

jpountz deleted the feature/fielddata_live_update branch December 16, 2013 14:50
