Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Tika OpenNLP Language Detection support to Tika Text Detection component. #1592

Open
hhuangMITRE opened this issue Sep 12, 2022 · 0 comments
Assignees
Labels
enhancement feature java Pull requests that update Java code
Milestone

Comments

@hhuangMITRE
Copy link
Contributor

hhuangMITRE commented Sep 12, 2022

Apache Tika supports newer language detection options including OpenNLP:

Overall OpenNLP appears to be an improvement over Tika's original (depreciated) LanguageDetector and OptimaizeLangDetector. As a result we should add Tika's OpenNLPDetector as a supported option moving forward for the Tika Text Detection component.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement feature java Pull requests that update Java code
Projects
Status: In Progress
OpenMPF: Development
  
In Progress
Development

No branches or pull requests

2 participants