An exercise in NLP with NLTK based on George Orwell's *Politics and the English Language*
Switch branches/tags
Nothing to show
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.gitignore Add a test for foreign words. Mar 14, 2015
README.md Initial commit Feb 17, 2014
orwell.py Add a test for foreign words. Mar 14, 2015
requirements.txt Add unit tests and requirements. Mar 14, 2015
test.py Add unit tests and requirements. Mar 14, 2015

README.md

In Politics and the English Language George Orwell sets out a few simple rules for writing clear, conscise english. Orwell himself doesn't always follow his rules, but they're helpful to keep in mind. This is a tool designed to alert an author to violations of these rules so the author can choose whether or not to change the text in question.

  1. Never use a metaphor, simile, or other figure of speech which you are used to seeing in print.
  2. Never use a long word where a short one will do.
  3. If it is possible to cut a word out, always cut it out.
  4. Never use the passive where you can use the active.
  5. Never use a foreign phrase, a scientific word, or a jargon word if you can think of an everyday English equivalent.
  6. Break any of these rules sooner than say anything outright barbarous.

I've wanted to play with NLP and Python's NLTK for quite a while now, and this seemed like a reasonable exercise in the relatively easy and the complex (I'm not sure how number 1 will work, for example). That's what this amounts to: an exercise for me to learn NLP and NLTK. Use at your own risk.