Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
-
Updated
Apr 14, 2024 - Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
🚴♂️⛷Data Lake, Performance tuning for text extraction from a huge amount of files.
Extracting information from PDF files.
Add a description, image, and links to the tika-python topic page so that developers can more easily learn about it.
To associate your repository with the tika-python topic, visit your repo's landing page and select "manage topics."