Tika server returns status: 500 #11

Closed
Quyrean opened this issue May 22, 2018 · 2 comments
Quyrean commented May 22, 2018

I built the appliance from the latest sources and exported it to a virtual machine. When I run etl-file on the server, I get an error:

etl-file -v -f /media/sf_searchme/test.txt

2018-05-22 19:29:34,321 [MainThread  ] [WARNI]  Tika server returned status: 500
Exception while data enrichment of file:///searchme/test.txt with plugin enhance_extract_text_tika_server: 'content'
Exception while data enrichment of file:///searchme/test.txt with plugin enhance_pdf_ocr: 'content_type_ss'
Exception while data enrichment of file:///searchme/test.txt with plugin enhance_contenttype_group: 'content_type_ss'
Exception while data enrichment of file:///searchme/test.txt with plugin enhance_rdf_annotations_by_http_request: HTTP Error 500: Internal Server Error
Exception while data enrichment of file:///searchme/test.txt with plugin enhance_rdf: 'content_type_ss'

I know the Tika server is running because I get a response when I send a request to it directly:

curl -T /media/sf_searchme/test.txt http://localhost:9998/tika

I changed the permissions on the Tika jar (the file itself, not the symlink) to 777 and restarted the service.

I also tried running curl as the tika user:

connect to tika as tika user

Here is the full log file:

etl-file.log

Any ideas what is wrong or how to debug this? Thanks.

Mandalka (Collaborator) commented

If you built Open Semantic Search, and therefore also the Tika package, from scratch, you are hitting a bug in Tika 1.18: opensemanticsearch/tika-server.deb#7

Until that is fixed, either use Tika 1.17 instead, or go to the config UI and select only one OCR dictionary.

Please reopen if this does not solve it.
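
To check whether the OCR language setting is what triggers the 500, the direct curl request can be repeated with and without several OCR languages. The following is only a minimal sketch, not the ETL's own code: the helper name `put_to_tika` is made up for illustration, and it assumes that the OCR languages reach Tika through the `X-Tika-OCRLanguage` request header joined with `+`. That may not match what the ETL plugin actually sends.

```python
import urllib.error
import urllib.request


def put_to_tika(data, url="http://localhost:9998/tika", ocr_languages=None, timeout=60):
    """PUT raw bytes to a Tika server and return (status_code, body_text).

    put_to_tika is a hypothetical helper for debugging, not part of the ETL.
    """
    headers = {
        "Accept": "text/plain",
        # Set explicitly: urllib would otherwise add a form-encoded content type.
        "Content-Type": "application/octet-stream",
    }
    if ocr_languages:
        # Assumption: OCR languages are passed as one header value, e.g. "eng+deu".
        headers["X-Tika-OCRLanguage"] = "+".join(ocr_languages)

    request = urllib.request.Request(url, data=data, headers=headers, method="PUT")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as error:
        # Non-2xx answers (such as the 500 above) are raised as exceptions;
        # return the status and body so the server's error text is visible.
        return error.code, error.read().decode("utf-8", errors="replace")
    # A refused connection raises urllib.error.URLError, which is left to
    # propagate: it means the server is not reachable at all.
```

Calling it once with `ocr_languages=None`, once with a single language such as `["eng"]`, and once with two such as `["eng", "deu"]` (example language codes, not taken from the log) on the bytes of the test file shows which setting, if any, produces the 500. If only the multi-language call fails, that would be consistent with the workaround of selecting a single OCR dictionary.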

Quyrean commented May 23, 2018

Yes, that helped get me past that error, thanks.
