whamnospellgood

A Python script that totals up the spelling mistakes in the top stories on 13wham.com.

The script connects to http://m.13wham.com/display/1438 (the "top stories" page at the time this script was written) and pulls down the links to the top stories.
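As a rough illustration of the link-gathering step, here is a minimal sketch using only the standard library. It is not the code in teststories.py: the `LinkCollector` class and `collect_links` function are hypothetical names, the story paths in the usage example are made up, and the sketch simply collects every `a` tag's `href`, since this README does not say how story links are told apart from other links on the page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            url = urljoin(self.base_url, href)
            # Keep the first occurrence of each link, in page order.
            if url not in self.links:
                self.links.append(url)


def collect_links(html, base_url):
    """Return the unique absolute links found in an HTML string."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

For example, `collect_links(page_html, "http://m.13wham.com/display/1438")` would return absolute URLs for every link in `page_html`; a real run would still need to filter that list down to story links.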

It then pulls the story text out of the p tags within the div tag with class="StoryBlock".
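The extraction step described above could look roughly like the following sketch. It uses the standard library's `html.parser` as an assumption, since this README does not name the HTML library the script relies on; `StoryBlockParser` and `extract_story_text` are hypothetical names.

```python
from html.parser import HTMLParser


class StoryBlockParser(HTMLParser):
    """Collect text from <p> tags nested inside <div class="StoryBlock">."""

    def __init__(self):
        super().__init__()
        self.div_depth = 0   # > 0 while inside the StoryBlock div
        self.in_p = False    # True while inside a <p> within that div
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            classes = (dict(attrs).get("class") or "").split()
            # Track nested divs so we know when the StoryBlock div closes.
            if self.div_depth > 0 or "StoryBlock" in classes:
                self.div_depth += 1
        elif tag == "p" and self.div_depth > 0:
            self.in_p = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "div" and self.div_depth > 0:
            self.div_depth -= 1
            if self.div_depth == 0:
                self.in_p = False
        elif tag == "p":
            self.in_p = False

    def handle_data(self, data):
        if self.in_p:
            self.paragraphs[-1] += data


def extract_story_text(html):
    """Return the story paragraphs joined by newlines."""
    parser = StoryBlockParser()
    parser.feed(html)
    return "\n".join(p.strip() for p in parser.paragraphs if p.strip())
```

Text inside inline tags such as `b` or `a` within a paragraph is kept, while paragraphs outside the `StoryBlock` div are ignored.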

It then uses nltk to tokenize everything, removes all tokens that are 1, 2, or 3 characters long, and uses enchant to spell check the remaining tokens that contain only letters.
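The filtering and counting logic can be sketched as below. To keep the example runnable without extra installs, it swaps in a regular-expression tokenizer for nltk and takes the spell checker as a plain callable instead of calling enchant directly; both are stand-ins, and `tokenize` and `count_misspellings` are hypothetical names rather than functions from teststories.py.

```python
import re


def tokenize(text):
    """Split text into word and punctuation tokens (stand-in for nltk)."""
    return re.findall(r"\w+|[^\w\s]", text)


def count_misspellings(text, is_correct):
    """Count tokens that fail the spell check.

    is_correct is any callable taking a word and returning True when
    the word is spelled correctly.
    """
    tokens = tokenize(text)
    # Drop tokens of 1, 2, or 3 characters and anything that is not all letters.
    candidates = [t for t in tokens if len(t) > 3 and t.isalpha()]
    misspelled = [t for t in candidates if not is_correct(t)]
    return len(misspelled), misspelled
```

With nltk and PyEnchant installed, `nltk.word_tokenize` could replace `tokenize`, and `enchant.Dict("en_US").check` could be passed as `is_correct`. The `en_US` dictionary is an assumption; this README does not say which dictionary the script uses.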

I wrote this because my wife and I kept noticing the high number of spelling errors in the stories, and I wanted a hard number.

