Blook

Blook is a tool for turning blogs into ebooks. Give blook a blog's URL and it will create an .epub from the content.

Install

Clone this repo and install the prerequisites:

$ git clone https://github.com/kchodorow/blook.git
$ pip install beautifulsoup4 ebooklib

Usage

Pass the blog's main URL as the argument, e.g.:

$ python blook.py http://avc.com/

If you only want the latest N entries (for example, the blog is very large and you don't need every entry), pass a limit:

$ python blook.py --limit 73 avc.com

This downloads the 73 most recent entries.

What if a blog isn't parsed correctly?

If you find a blog that blook does not parse correctly, please file an issue.

Alternatively, please feel free to add a new blog format:

  • Add a file to filters/.
  • Extend base.BaseEntry and base.BaseListing with the appropriate code (look at siat.py for examples).
  • Add your filter to the filter list.
  • Add some tests to filters/your_filter_test.py following the filters/nhl_test.py model.
  • Make sure your test passes by running python -m filters.your_filter_test.
  • Submit a pull request!
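
The steps above could look roughly like the sketch below, which puts a filter and its test in one file for brevity. Everything in it is an assumption made for illustration: the stand-in BaseEntry and BaseListing classes, the method names get_title and get_entry_urls, and the HTML shapes being matched are all hypothetical, so check base.py and siat.py for the interface blook really expects. It uses the standard library's html.parser to stay self-contained; a real filter would more likely use BeautifulSoup, since it is already a prerequisite.

```python
import unittest
from html.parser import HTMLParser


# Stand-ins for base.BaseEntry and base.BaseListing. These are NOT blook's
# real classes; in an actual filter you would import them from base instead.
class BaseEntry:
    def __init__(self, html):
        self.html = html


class BaseListing:
    def __init__(self, html):
        self.html = html


class _TitleParser(HTMLParser):
    """Collects the text of the first <h1> element."""

    def __init__(self):
        super().__init__()
        self._in_h1 = False
        self.title = None

    def handle_starttag(self, tag, attrs):
        # Only the first <h1> counts; later ones are ignored.
        if tag == "h1" and self.title is None:
            self._in_h1 = True
            self.title = ""

    def handle_endtag(self, tag):
        if tag == "h1":
            self._in_h1 = False

    def handle_data(self, data):
        if self._in_h1:
            self.title += data


class _LinkParser(HTMLParser):
    """Collects href values from <a class="entry"> elements."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("class") == "entry" and "href" in attrs:
            self.links.append(attrs["href"])


class ExampleEntry(BaseEntry):
    def get_title(self):  # hypothetical method name
        parser = _TitleParser()
        parser.feed(self.html)
        return (parser.title or "").strip()


class ExampleListing(BaseListing):
    def get_entry_urls(self):  # hypothetical method name
        parser = _LinkParser()
        parser.feed(self.html)
        return parser.links


class ExampleFilterTest(unittest.TestCase):
    def test_entry_title(self):
        entry = ExampleEntry("<html><h1> Hello </h1><p>Body</p></html>")
        self.assertEqual(entry.get_title(), "Hello")

    def test_listing_urls(self):
        html = '<a class="entry" href="/a">A</a><a href="/x">X</a>'
        self.assertEqual(ExampleListing(html).get_entry_urls(), ["/a"])
```

The sketch can be run with python -m unittest. To run it as python -m filters.your_filter_test, as the steps describe, the test file would also need the usual unittest.main() guard at the bottom.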
