Fetch texts from Project Gutenberg, then extract and format them.
Download the gutenberg.py script (or clone the entire repo).
Use the script as follows:
The cleaned version of the text is then printed to standard output.
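To give a rough idea of what "cleaning" can involve, here is a minimal sketch that strips the Project Gutenberg header and footer from a raw text. It is not the code in gutenberg.py. The function name `strip_boilerplate` and the marker patterns are assumptions made for illustration, and real Gutenberg files vary in how these marker lines are worded.

```python
import re

# Assumed marker patterns (hypothetical, for illustration only).
# Many Gutenberg files delimit the body with lines such as
# "*** START OF THIS PROJECT GUTENBERG EBOOK ... ***", but wording varies.
START_RE = re.compile(
    r"^\*\*\*\s*START OF (?:THIS|THE) PROJECT GUTENBERG EBOOK.*$",
    re.IGNORECASE | re.MULTILINE,
)
END_RE = re.compile(
    r"^\*\*\*\s*END OF (?:THIS|THE) PROJECT GUTENBERG EBOOK.*$",
    re.IGNORECASE | re.MULTILINE,
)


def strip_boilerplate(raw: str) -> str:
    """Return the text between the START and END marker lines.

    If a marker is missing, fall back to the start or end of the input,
    so text without markers is returned unchanged (apart from trimming).
    """
    start = START_RE.search(raw)
    begin = start.end() if start else 0
    end = END_RE.search(raw, begin)
    finish = end.start() if end else len(raw)
    return raw[begin:finish].strip()


sample = (
    "Some licence preamble\n"
    "*** START OF THIS PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
    "\n"
    "Body text.\n"
    "\n"
    "*** END OF THIS PROJECT GUTENBERG EBOOK EXAMPLE ***\n"
    "Some licence footer\n"
)
print(strip_boilerplate(sample))
```

Falling back to the full input when no markers are found keeps the sketch safe to run on files that do not follow this convention.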
Running the tests:
Note that we have test data in
Copyright and License
Copyright 2005-2012 Open Knowledge Foundation. All material is licensed under the MIT license: