Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all Indian languages and English. Provides intra-Indic string comparison.
libindic/soundex

LibIndic Soundex


Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. LibIndic's soundex module implements the Soundex algorithm for English as well as a modified version of the algorithm for Indian languages.

Details on how Soundex is implemented can be found at Santhosh's blog
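For readers new to the idea, here is a minimal sketch of the classic American Soundex for English names. It is not LibIndic's implementation (the module's codes, as the Usage section shows, have a different shape); the function name `classic_soundex` and the `length` parameter are made up for this illustration.

```python
# Classic American Soundex, for illustration only.
# This is NOT the LibIndic implementation; names here are hypothetical.

# Map each consonant to its Soundex digit.
CODES = {}
for letters, digit in (
    ("BFPV", "1"),
    ("CGJKQSXZ", "2"),
    ("DT", "3"),
    ("L", "4"),
    ("MN", "5"),
    ("R", "6"),
):
    for ch in letters:
        CODES[ch] = digit


def classic_soundex(name, length=4):
    """Return the classic Soundex code of an English name."""
    # Keep only the ASCII letters A-Z.
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""

    result = letters[0]                 # the first letter is kept as-is
    prev = CODES.get(letters[0], "")    # digit of the previous coded letter

    for ch in letters[1:]:
        if ch in "HW":
            # H and W are skipped and do not separate equal digits.
            continue
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        # Vowels (empty code) reset prev, so a repeated digit
        # after a vowel is emitted again.
        prev = code

    # Pad with zeros or truncate to the requested length.
    return (result + "0" * length)[:length]
```

Names that sound alike in English map to the same code, for example `classic_soundex("Robert")` and `classic_soundex("Rupert")` both give `R163`.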

Installation

  1. Clone the repository: git clone https://github.com/libindic/soundex.git
  2. Change to the cloned directory: cd soundex
  3. Run setup.py to create an installable source distribution: python setup.py sdist
  4. Install it using pip: pip install dist/libindic-soundex*.tar.gz

Usage

>>> from libindic.soundex import Soundex
>>> instance = Soundex()
>>> instance.soundex(u"കൃത്രിമം")
u'\u0d15PKPBN00'
>>> instance.compare(u"വിദ്യാർഥി", u"വിദ്യാർദി")
1
>>> instance.compare(u"മോര്", u"മുതിര")
2
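One possible use of the codes is to bucket words that share a phonetic code. The sketch below assumes nothing about LibIndic beyond the `soundex` method shown above; `group_by_code` is a hypothetical helper that accepts any encoding function, so it can be tried without the library installed.

```python
from collections import defaultdict


def group_by_code(words, encode):
    """Group words by the code that encode(word) returns.

    `encode` is any callable mapping a word to a hashable code,
    for example the `soundex` method of a Soundex instance.
    """
    groups = defaultdict(list)
    for word in words:
        groups[encode(word)].append(word)
    return dict(groups)


# With LibIndic installed, the encoder could be passed like this
# (left commented out so the block runs without the library):
#
#   from libindic.soundex import Soundex
#   groups = group_by_code(list_of_words, Soundex().soundex)
```

Taking the encoder as a parameter keeps the grouping logic independent of any particular phonetic algorithm.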