mimaraslan/ocr-apache-tika-demo-project

Apache Tika OCR Demo Java Project

Advantages:

  • Open source.
  • Extracts text from plain text, PDF, JPEG/JPG, HTML, XML, and Excel documents.
  • Reads metadata from plain text, PDF, JPEG/JPG, HTML, XML, Excel, MP3, ODP, MP4, JAR, and other file types.
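
The text extraction and metadata features listed above can be illustrated with a minimal sketch built on the `Tika` facade class. This is not code taken from this project: the class name `TikaDemo`, the helper `extractText`, and the inline HTML sample are hypothetical, and the sketch assumes that `tika-core` plus a Tika parsers package are on the classpath.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

import org.apache.tika.Tika;
import org.apache.tika.metadata.Metadata;

// Hypothetical demo class, not part of this repository.
public class TikaDemo {

    // Extracts plain text from the given bytes and fills `metadata`
    // with whatever properties the detected parser reports.
    static String extractText(byte[] content, Metadata metadata) throws Exception {
        Tika tika = new Tika();
        try (InputStream in = new ByteArrayInputStream(content)) {
            return tika.parseToString(in, metadata);
        }
    }

    public static void main(String[] args) throws Exception {
        // Inline HTML sample (an assumption made so the sketch needs no input file).
        byte[] html = ("<html><head><title>Demo</title></head>"
                + "<body><p>Hello Tika</p></body></html>")
                .getBytes(StandardCharsets.UTF_8);

        Metadata metadata = new Metadata();
        String text = extractText(html, metadata);

        System.out.println("Text: " + text.trim());

        // Print every metadata property that was detected.
        for (String name : metadata.names()) {
            System.out.println(name + " = " + metadata.get(name));
        }
    }
}
```

The same `parseToString` call accepts a stream opened on a PDF, image, or Office file. Which metadata keys appear depends on the file type and on the parsers available at runtime, and text extraction from images additionally requires a working OCR engine.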

Disadvantages:

  • Turkish characters are not handled reliably.
  • Russian (Cyrillic) characters are not handled reliably.
