You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
since Elasticsearch 6.0.0, startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.
That means startOffsets should be always greater than before. Although Chinese word token can be divided into servial token due to ambiguity like "中国人" can be tokenized "中国"+"人" & "中国人"
Origin Error:
{
"error": {
"root_cause": [
{
"type": "remote_transport_exception",
"reason": "[4YKGDw9][192.168.0.21:9305][indices:data/write/update[s]]"
}
],
"type": "illegal_argument_exception",
"reason": "startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards startOffset=0,endOffset=3,lastStartOffset=1 for field 'description'"
},
"status": 400
}
since Elasticsearch 6.0.0, startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.
That means startOffsets should be always greater than before. Although Chinese word token can be divided into servial token due to ambiguity like "中国人" can be tokenized "中国"+"人" & "中国人"
Origin Error:
One of solution metioned below:
hankcs/hanlp-lucene-plugin#27 (comment)
hankcs/hanlp-lucene-plugin@eebea90
And origin issue reference:
sing1ee#9
The text was updated successfully, but these errors were encountered: