forked from xrowgmbh/eztika
A wrapper script for the standalone Tika toolkit that allows conversion to plain text and indexing of a large variety of binary file types like MsWord, MsOffice, PDF, Excel, ODF, .... Copy from http://svn.projects.ez.no/eztika
License
ezpublishlegacy/eztika
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
License for all but the tika.jar file: GNU GPL 2.0 tika.jar is licensed with the ASF License (Apache) Installation: See, INSTALL.txt Description: eZ Tika is an extension that enables a handler for converting multiple binary file formats to plain text as used by the search engine (if you enabled those attributes as searcheable) Currently, most common office formats are enabled (see also binaryfile.ini.append.php): [application/pdf] [application/msword] [application/vnd.ms-excel] [application/vnd.ms-powerpoint] [application/vnd.visio] [application/vnd.ms-outlook] [application/xml] [application/rtf] [application/vnd.oasis.opendocument.text] [application/vnd.oasis.opendocument.presentation] [application/vnd.oasis.opendocument.spreadsheet] [application/vnd.oasis.opendocument.formula] [application/zip] [application/vnd.openxmlformats-officedocument.wordprocessingml.document] [application/vnd.openxmlformats-officedocument.spreadsheetml.sheet] [application/vnd.openxmlformats-officedocument.presentationml.presentation] [application/octet-stream]
About
A wrapper script for the standalone Tika toolkit that allows conversion to plain text and indexing of a large variety of binary file types like MsWord, MsOffice, PDF, Excel, ODF, .... Copy from http://svn.projects.ez.no/eztika
Resources
License
Stars
Watchers
Forks
Packages 0
No packages published
Languages
- PHP 96.1%
- Shell 3.9%