New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
5.0.2 plugin analysis-stempel Array index out of range: -2 #21911
Comments
It would be really helpful if you could try and identify exactly which of your documents is causing this issue. |
I get this error for random documents, each time is a different content :/ sample: [2016-12-01T14:04:23,782][DEBUG][o.e.a.b.TransportShardBulkAction] [es5] [666.pb1][0] failed to execute bulk item (index) index {[666.pb1][document][224], source[{"author":{"id":580,"name":"Waldemar G"},"indexes":[{"id":19,"name":"K |
@mikemccand does this look like a bug in Lucene? |
This problem occurs only with BULK operations. Single update, index requests work fine. Another exception is StringIndexOutOfBoundsException:
|
The problem still occurs on: Elasticsearch version: 5.1.2 |
I'll try to get to the bottom of this. Can you share your full analysis chain? From the stack traces I see at least shingle and synonym filter after the stempel stemmer, but what tokenizers/filters come before it? |
OK, nevermind on the full analysis chain @Mod3st, I see the issue... There is a thread safety bug in the TokenFilterFactory: https://github.com/elastic/elasticsearch/blob/master/plugins/analysis-stempel/src/main/java/org/elasticsearch/index/analysis/pl/PolishStemTokenFilterFactory.java#L48 That |
I understand that in the previous version of ES every thread was creating a new instance of PolishStemTokenFilterFactory and now each thread uses the same object? (the problem doesn't occur on ES 2) |
Hmm, @Mod3st I still see the bug in ES 2.4.x's latest sources (the same |
I understand, but I don't get any exceptions in ES 2.4.3 and I wonder why. |
Elasticsearch version: 5.0.2
Plugins installed: analysis-stempel 5.0,2
JVM version: 1.8.0_111
OS version: Ubuntu 14.04.1 LTS
I have problem with indexing data when analysis-stempel is set as fields analyser.
While indexing documents at random receives an error
Array index out of range: -1
Array index out of range: -2
Trace from log:
The text was updated successfully, but these errors were encountered: