Add ignore_above option for ICUCollationKeywordFieldMapper #40413

clement-tourriere · 2019-03-25T14:17:54Z

ICUCollationKeywordFieldMapper is not derived from KeywordFieldMapper, but from FieldMapper, so it doesn't have the ignore_above configuration.

The use case is to add a sort field for a specific language while ignoring long values.
For the moment, the only solution is to perform this check in the client and separate the sort field from the keyword field.

I can provide a PR for this feature.

elasticmachine · 2019-03-25T14:33:20Z

Pinging @elastic/es-search

jtibshirani · 2019-04-01T21:09:49Z

@clement-tourriere to check my understanding of your use case: would you expect that ignore_above be applied to the input value, before collation is run?

clement-tourriere · 2019-04-02T16:37:10Z

In fact, my use case is to configure a max length on a text field indexed with a sort "sub field" (https://www.elastic.co/guide/en/elasticsearch/reference/current/multi-fields.html). The goal is to ignore long values (considering that they are not sortable) and, in the worst case, to protect against the Lucene byte-ref limit (https://www.elastic.co/guide/en/elasticsearch/reference/current/ignore-above.html).

It might be more logical to apply the ignore_above check after collation rather than to the input value.
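
To illustrate the byte limit mentioned in this comment: Lucene rejects indexed terms longer than 32766 bytes, while ignore_above counts characters, and the ignore_above documentation suggests 32766 / 4 = 8191 as a conservative character limit for UTF-8 text. The sketch below is only an illustration of that arithmetic; the class and method names are hypothetical and not part of Elasticsearch. It also covers plain UTF-8 terms only: a collation key has its own binary encoding, so its length need not match the UTF-8 length of the input.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not Elasticsearch code: compares character counts
// with UTF-8 byte lengths to show why a character-based limit must be
// chosen conservatively.
final class TermLengthSketch {
    // Lucene's maximum indexed term length, in bytes.
    static final int MAX_TERM_BYTES = 32766;

    // Number of bytes the value occupies once encoded as UTF-8.
    static int utf8Length(String value) {
        return value.getBytes(StandardCharsets.UTF_8).length;
    }

    // Conservative character limit, assuming up to 4 UTF-8 bytes per character.
    static int conservativeIgnoreAbove() {
        return MAX_TERM_BYTES / 4;
    }

    public static void main(String[] args) {
        String ascii = "abc";               // 3 chars, 3 bytes
        String accented = "\u00e9";         // 1 char, 2 bytes
        String cjk = "\u65e5\u672c";        // 2 chars, 6 bytes
        System.out.println(ascii.length() + " chars, " + utf8Length(ascii) + " bytes");
        System.out.println(accented.length() + " chars, " + utf8Length(accented) + " bytes");
        System.out.println(cjk.length() + " chars, " + utf8Length(cjk) + " bytes");
        System.out.println("conservative ignore_above: " + conservativeIgnoreAbove());
    }
}
```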

jtibshirani · 2019-04-08T21:07:45Z

Thanks for the additional details. I could also see an argument for applying ignore_above before collation -- this matches the behavior for keyword fields, where we check ignore_above against the input before any normalizer is applied. I've asked other team members for their thoughts on this issue as well.

jpountz · 2019-04-16T13:26:02Z

+1 to support ignore_above like keyword and make it behave similarly, i.e., check the length of the original string rather than the collation key. @clement-tourriere you suggested the opposite but I suspect you would be fine with this behavior, wouldn't you?

clement-tourriere · 2019-04-16T16:08:28Z

Yes, it would be absolutely fine for our use case to check the length before collation ;)

jtibshirani · 2019-04-16T16:51:57Z

Great, now that we are clear on that I will take a look at your PR.
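
As a rough illustration of the behavior agreed on in this thread (compare the length of the original string against ignore_above first, and only then compute the collation key), here is a minimal sketch. It is not the code from the PR: it uses the JDK's `java.text.Collator` as a stand-in for the ICU collator, and the class name, method name, and parameters are hypothetical.

```java
import java.text.Collator;
import java.util.Locale;

// Hypothetical sketch, not the actual ICUCollationKeywordFieldMapper code.
// java.text.Collator stands in for the ICU collator used by the plugin.
final class IgnoreAboveSketch {

    // Returns the binary sort key for the value, or null when the value is
    // skipped. The length check runs on the raw input, before collation.
    static byte[] collatedKeyOrNull(String value, int ignoreAbove, Collator collator) {
        if (value == null || value.length() > ignoreAbove) {
            return null; // too long (or missing): nothing is indexed for sorting
        }
        return collator.getCollationKey(value).toByteArray();
    }

    public static void main(String[] args) {
        Collator french = Collator.getInstance(Locale.FRENCH);
        byte[] kept = collatedKeyOrNull("c\u00f4te", 10, french);
        byte[] skipped = collatedKeyOrNull("a value longer than the limit", 10, french);
        System.out.println("kept: " + (kept != null));
        System.out.println("skipped: " + (skipped == null));
    }
}
```

Checking the input rather than the collation key keeps the limit predictable for users, since the length of a collation key depends on the collator's rules and strength rather than directly on the number of characters typed.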

Add the possibility to use ignore_above parameter in ICUCollationKeywordFieldMapper. Close #40413

cbuescher added the >enhancement and :Search/Analysis labels Mar 25, 2019

clement-tourriere mentioned this issue Mar 25, 2019

Add ignore_above in ICUCollationKeywordFieldMapper #40414

Merged

jtibshirani added the team-discuss label Apr 8, 2019

jtibshirani removed the team-discuss label Apr 17, 2019

jtibshirani closed this as completed in #40414 Apr 19, 2019

jtibshirani pushed a commit that referenced this issue Apr 19, 2019

Add ignore_above in ICUCollationKeywordFieldMapper (#40414)

8fe7568

Add the possibility to use ignore_above parameter in ICUCollationKeywordFieldMapper. Close #40413

jtibshirani pushed a commit that referenced this issue Apr 19, 2019

Add ignore_above in ICUCollationKeywordFieldMapper (#40414)

c80f86e

Add the possibility to use ignore_above parameter in ICUCollationKeywordFieldMapper. Close #40413

clement-tourriere pushed a commit to opendatasoft/elasticsearch that referenced this issue May 10, 2019

Add ignore_above in ICUCollationKeywordFieldMapper. Close elastic#40413

4868f4d

clement-tourriere pushed a commit to opendatasoft/elasticsearch that referenced this issue May 21, 2019

Add ignore_above in ICUCollationKeywordFieldMapper. Close elastic#40413

0690016

clement-tourriere pushed a commit to opendatasoft/elasticsearch that referenced this issue May 21, 2019

Add ignore_above in ICUCollationKeywordFieldMapper. Close elastic#40413

a0849b6

gurkankaymak pushed a commit to gurkankaymak/elasticsearch that referenced this issue May 27, 2019

Add ignore_above in ICUCollationKeywordFieldMapper (elastic#40414)

d1f5eb5

Add the possibility to use ignore_above parameter in ICUCollationKeywordFieldMapper. Close elastic#40413

clement-tourriere pushed a commit to opendatasoft/elasticsearch that referenced this issue Jul 19, 2019

Add ignore_above in ICUCollationKeywordFieldMapper. Close elastic#40413

93ed5c5

clement-tourriere pushed a commit to opendatasoft/elasticsearch that referenced this issue Aug 5, 2019

Add ignore_above in ICUCollationKeywordFieldMapper. Close elastic#40413

fd8de43
