Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SOLR indexer: use field type text for content field #545

Closed
jnioche opened this Issue Mar 13, 2018 · 0 comments

Comments

Projects
None yet
1 participant
@jnioche
Copy link
Member

jnioche commented Mar 13, 2018

Otherwise you end up with no tokenisation and messages like

org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/docs: Exception writing document id xxxxx to the index; possible analysis error: Document contains at least one immense term in field="content" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped. Please correct the analyzer to not produce such terms. The prefix of the first immense term is: '[67, 108, 105, 99, 107, 32, 72, 69, 82, 69, 32, 116, 111, 32, 99, 104, 101, 99, 107, 32, 111, 117, 116, 32, 116, 104, 101, 32, 66, 69]...', original message: bytes can be at most 32766 in length; got 34731. Perhaps the document has an indexed string field (solr.StrField) which is too large

@jnioche jnioche added bug SOLR labels Mar 13, 2018

@jnioche jnioche added this to the 1.8 milestone Mar 13, 2018

@jnioche jnioche closed this in f6b3b7b Mar 13, 2018

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.