romseygeek
Contributor

This commit adds an adjust_offsets parameter to the word_delimiter_graph token filter, defaulting
to true. Most of the time you want the sub-tokens emitted by this filter to have offsets that are
adjusted to their real position in the token stream. However, some token filters (e.g. trim) can
change the length or starting position of a token without changing its offset attributes, and this
can lead to word_delimiter_graph emitting illegal offsets. Setting adjust_offsets to false in these
cases allows indexing to succeed again.
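
For reference, here is a rough sketch of index settings that use the new parameter, written as a small Python helper that builds the request body. The names `my_word_delimiter`, `my_analyzer` and `build_index_settings` are hypothetical and chosen only for illustration. The keyword tokenizer followed by trim is just one assumed chain in which a token's text can change without its offsets changing.

```python
import json


def build_index_settings(adjust_offsets: bool = False) -> dict:
    """Build an index-settings body with a word_delimiter_graph filter.

    The filter and analyzer names are hypothetical; only the
    `adjust_offsets` parameter comes from this PR.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # Hypothetical filter name; adjust_offsets defaults to true.
                    "my_word_delimiter": {
                        "type": "word_delimiter_graph",
                        "adjust_offsets": adjust_offsets,
                    }
                },
                "analyzer": {
                    # Hypothetical analyzer: trim runs before the delimiter
                    # filter, so token text may no longer match its offsets.
                    "my_analyzer": {
                        "type": "custom",
                        "tokenizer": "keyword",
                        "filter": ["trim", "my_word_delimiter"],
                    }
                },
            }
        }
    }


if __name__ == "__main__":
    # Print the JSON body that could be sent when creating an index.
    print(json.dumps(build_index_settings(), indent=2))
```

With adjust_offsets left at its default of true, the same chain is the kind of setup that could hit the illegal-offsets error described above.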

Fixes #34741, #33710

@romseygeek romseygeek self-assigned this Dec 17, 2018
@elasticmachine
Collaborator

Pinging @elastic/es-search

@romseygeek romseygeek requested a review from jpountz December 17, 2018 13:13