Text extraction
Clojure
Latest commit 0cf4ee1 Jan 15, 2014 @marcoy Upgrade dep
src/ebtoolkits
test/ebtoolkits/test
.gitignore Implement Content protocol for various types Jan 10, 2014
README.md Add mime-type Jan 11, 2014
project.clj Upgrade dep Jan 15, 2014


Ebtoolkits

This project extends String, File, URL, URI, and InputStream with functions for extracting text content and detecting MIME types using Apache Tika.
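
As a rough illustration of what "extending" these types can look like in Clojure, here is a minimal sketch built on `extend-protocol`. The protocol name `TextSource` and the function `text-of` are hypothetical and are not part of this project's API. The sketch uses `slurp` as a stand-in for Apache Tika, so it only handles plain text, and it assumes a String argument is a file path.

```clojure
(import '(java.io File InputStream)
        '(java.net URI URL))

;; Hypothetical protocol for illustration only; not the project's own protocol.
(defprotocol TextSource
  (text-of [source] "Return the contents of source as a string."))

;; One implementation per type. slurp stands in for Tika here,
;; so only plain-text sources are read correctly.
(extend-protocol TextSource
  InputStream
  (text-of [in] (slurp in))
  File
  (text-of [f] (slurp f))
  URL
  (text-of [u] (slurp u))
  URI
  (text-of [u] (slurp u))
  String
  ;; Assumption: a String is treated as a path to a file.
  (text-of [path] (text-of (File. path))))
```

With this in place, the same function name works on every extended type, which is the calling style the Usage section shows.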

Usage

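; Refer the functions and import File so the calls below resolve unqualified
(require '[ebtoolkits.core :refer [content content-stream content-observable mime-type]])
(import 'java.io.File)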

; Get the content as a string
(content (File. "filename"))

; Get the content as an InputStream
(content-stream (File. "filename"))

; Get the content as an Observable from RxJava
(content-observable (File. "filename"))

; MIME type from a java.io.File
(mime-type (File. "filename"))

; MIME type from a String
(mime-type "path-to-file")
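
A stream returned by a call such as `content-stream` should normally be closed once it has been consumed. The sketch below shows one way to do that for any `InputStream`. The helper name `first-lines` is hypothetical, and a `ByteArrayInputStream` stands in for a real content stream so that the example runs without the library.

```clojure
(require '[clojure.java.io :as io])
(import 'java.io.ByteArrayInputStream)

;; Hypothetical helper: read up to n lines from an InputStream, then close it.
;; with-open closes the reader, which also closes the wrapped stream.
(defn first-lines [in n]
  (with-open [rdr (io/reader in)]
    ;; doall forces the lazy seq before the reader is closed
    (doall (take n (line-seq rdr)))))

;; Stand-in for a stream of extracted text
(def sample (ByteArrayInputStream. (.getBytes "line one\nline two\nline three" "UTF-8")))

(first-lines sample 2)
```

Realizing the lines inside `with-open` matters because `line-seq` is lazy; reading from it after the reader is closed would fail.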

License

Copyright © 2014 Marco Yuen <marcoy@gmail.com>

Distributed under the Eclipse Public License, the same as Clojure.