TIKA-2931#281

Merged
tballison merged 5 commits into apache:master from epugh:TIKA-2931 on Sep 4, 2019
@epugh (Contributor) commented Aug 29, 2019

Here is a PR that sends informational output to System.err instead of System.out, so you get clean JSON back when using tika-app on the command line.

There are some extra commits that come from merging my tika project, and I'm not sure how to remove them. Maybe just a cherry-pick?
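
To illustrate the idea behind the change (this is not the code from the PR), here is a minimal sketch of keeping status messages on System.err so that System.out carries only the JSON. The class `CleanStdoutSketch`, the helpers `emit` and `captureData`, and the sample JSON are all made up for this example.

```java
import java.io.ByteArrayOutputStream;
import java.io.PrintStream;

public class CleanStdoutSketch {

    // Hypothetical helper, not Tika code: the status line goes to the
    // diagnostics stream, the JSON payload goes to the data stream.
    static void emit(PrintStream data, PrintStream diagnostics, String json) {
        diagnostics.println("INFO: extraction finished");
        data.println(json);
    }

    // Runs emit() against in-memory streams and returns only what
    // landed on the data stream, with the trailing newline trimmed.
    static String captureData(String json) {
        ByteArrayOutputStream data = new ByteArrayOutputStream();
        ByteArrayOutputStream diagnostics = new ByteArrayOutputStream();
        emit(new PrintStream(data, true), new PrintStream(diagnostics, true), json);
        return data.toString().trim();
    }

    public static void main(String[] args) {
        // The sample JSON is an assumption for illustration only.
        emit(System.out, System.err, "{\"Content-Type\":\"application/pdf\"}");
    }
}
```

Running this as `java CleanStdoutSketch 2>/dev/null` leaves only the JSON line on stdout, which is what a downstream JSON parser needs.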

@tballison changed the title from "Tika 2931" to "TIKA-2931" on Aug 29, 2019
@tballison (Contributor) commented

I'm good with this (once squashed, which I can do at merge time).

I suspect this code was added before we had logging. I wonder if we should use LOG.info() instead, and change the unit test to check for the existence/size of the output file.
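
A rough sketch of that alternative, assuming nothing about the actual Tika classes or test: the message goes through a logger rather than a print statement, and the check looks at the output file on disk instead of at captured console text. `java.util.logging` is used here only to keep the example dependency-free. `OutputFileCheckSketch`, `writeOutput`, `existsAndNonEmpty`, and `demo` are hypothetical names.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.logging.Logger;

public class OutputFileCheckSketch {
    private static final Logger LOG =
            Logger.getLogger(OutputFileCheckSketch.class.getName());

    // Hypothetical stand-in for a step that produces an output file.
    static Path writeOutput(Path dir, String name, String content) throws IOException {
        Path out = dir.resolve(name);
        Files.write(out, content.getBytes(StandardCharsets.UTF_8));
        // Logged instead of printed, so stdout stays untouched.
        LOG.info("Wrote " + out);
        return out;
    }

    // The check suggested above: the file exists and has a non-zero size.
    static boolean existsAndNonEmpty(Path p) throws IOException {
        return Files.isRegularFile(p) && Files.size(p) > 0;
    }

    // Writes a small file into a temp directory and verifies it.
    static boolean demo() {
        try {
            Path dir = Files.createTempDirectory("sketch");
            Path out = writeOutput(dir, "result.txt", "hello");
            return existsAndNonEmpty(out);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

With the default `java.util.logging` configuration the log line is written to System.err, so this approach also keeps stdout clean.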

@tballison merged commit 056304f into apache:master on Sep 4, 2019
@tballison pushed a commit that referenced this pull request on Sep 4, 2019:
* need to lowercase the output file to match the format passed to tesseract cmd line

* need to lowercase the output file to match the format passed to tesseract cmd line

* pull original code from apache/tika

* Send informational output to System.err, not System.out so commandline json response is clean