LICENSE
README.md
homophone_dictionary.py
namelist
requirements.txt
wordlist


# Homophones with wildly different spellings

 
I've been working on search implementation lately, and we needed wordlists for testing that contain words which are phonetically encoded the same way but spelled very differently.
 
Here are a couple of such wordlists that I generated, and the Python script I wrote to generate them, in case you ever need such a thing.
 
Each line in these wordlists is a set of words that have identical phonetic encodings (comparing only the first Double Metaphone encoding) but very different spellings (that is, they share no more than a quarter of their trigrams).
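
The selection rule above can be sketched roughly as follows. This is a minimal illustration, not the code in `homophone_dictionary.py`: the function names are hypothetical, the phonetic encoder is passed in as a parameter (`encode`), and "share no more than a quarter of their trigrams" is assumed here to mean a Jaccard overlap of character trigram sets of at most 0.25, checked for every pair in a group.

```python
from collections import defaultdict
from itertools import combinations


def trigrams(word):
    """Return the set of character trigrams in a lowercased word."""
    w = word.lower()
    return {w[i:i + 3] for i in range(len(w) - 2)}


def trigram_overlap(a, b):
    """Jaccard overlap of two words' trigram sets (assumed measure)."""
    ta, tb = trigrams(a), trigrams(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def homophone_sets(words, encode, max_overlap=0.25):
    """Group words by phonetic code; keep groups whose spellings all differ.

    `encode` is any callable mapping a word to its phonetic code, for
    example a wrapper returning the first Double Metaphone encoding.
    """
    by_code = defaultdict(list)
    for word in words:
        by_code[encode(word)].append(word)

    result = []
    for group in by_code.values():
        if len(group) < 2:
            continue  # a single word has nothing to be a homophone of
        # Assumption: every pair in the group must be below the threshold.
        if all(trigram_overlap(a, b) <= max_overlap
               for a, b in combinations(group, 2)):
            result.append(sorted(group))
    return result
```

With a real Double Metaphone encoder supplied as `encode`, each returned list would correspond to one line of a wordlist. How the actual script treats groups in which only some pairs are too similar is not stated here, so the all-pairs check is just one plausible reading.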
 
## Requirements