URL normalizer, article content extraction, summary generator (SOMEDAY)

hearst

URL normalizer, article content extraction, summary generator. Because the world needed another one.

The Most Recent Release

hearst has not been fully released to Clojars yet; it is not ready for production use.

With Leiningen, add it to the dependencies in project.clj:

[hearst "0.1.1-SNAPSHOT"]
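
For context, the dependency vector goes inside the `:dependencies` key of `project.clj`. The fragment below is only a sketch: the project name `my-app`, its version, and the Clojure version are placeholders, not anything this README prescribes. Because the snapshot is not on Clojars, the dependency will presumably resolve only if hearst is already in your local Maven repository (for example, after cloning this repository and running `lein install`).

```clojure
;; project.clj
;; Assumption: my-app, its version, and the Clojure version are placeholders.
(defproject my-app "0.1.0-SNAPSHOT"
  :dependencies [[org.clojure/clojure "1.6.0"]
                 [hearst "0.1.1-SNAPSHOT"]])
```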

Usage

URL normalization

user=> (use 'hearst.url-cleanup)
user=> (normalize-url "http://example.com/%7Ejane?q=Search&ugly=%c2%b1&utm_source=example.com&utm_medium=whoknows")
"http://example.com/~jane?q=Search&ugly=%C2%B1"

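The example above shows three visible effects: the escape `%7E` is decoded to `~`, the hex digits in `%c2%b1` are uppercased, and the `utm_*` query parameters are dropped. The following is a minimal, standalone sketch of those three steps. It is not hearst's implementation: the namespace and the names `normalize-escapes`, `drop-tracking-params`, and `normalize-sketch` are hypothetical. It also assumes a URL without a fragment, and it treats only parameters starting with `utm_` as tracking parameters.

```clojure
(ns example.normalize-sketch
  (:require [clojure.string :as str]))

;; Characters that RFC 3986 calls "unreserved"; escapes of these can be decoded safely.
(def unreserved
  (set "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789-._~"))

(defn normalize-escapes
  "Decode percent-escapes of unreserved characters; uppercase the hex digits of all others."
  [s]
  (str/replace s #"%([0-9a-fA-F]{2})"
               (fn [[_ hex]]
                 (let [c (char (Integer/parseInt hex 16))]
                   (if (unreserved c)
                     (str c)
                     (str "%" (str/upper-case hex)))))))

(defn drop-tracking-params
  "Remove query parameters whose names start with utm_ (an assumption of this sketch)."
  [query]
  (->> (str/split query #"&")
       (remove #(re-find #"^utm_" %))
       (str/join "&")))

(defn normalize-sketch
  "Apply both steps to a URL string. Assumes the URL has no fragment."
  [url]
  (let [[base query] (str/split url #"\?" 2)
        base  (normalize-escapes base)
        query (some-> query drop-tracking-params normalize-escapes)]
    (if (str/blank? query)
      base
      (str base "?" query))))
```

Called on the URL from the example above, `normalize-sketch` returns the same string that `normalize-url` is shown returning. Whether hearst applies further rules (scheme and host case, default ports, parameter ordering) is not covered here.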
License

Copyright © 2014 Matt Gauger

Distributed under the Eclipse Public License, either version 1.0 or (at your option) any later version.