Locality-sensitive hashing algorithm for text similarity comparisons


This is an implementation of the Nilsimsa algorithm; see http://en.wikipedia.org/wiki/Nilsimsa_Hash

An earlier version of this library was a Python port of nilsimsa.pl (by way of a Ruby port), which was GPL-licensed. The current reimplementation includes an explanation of how these hashes work and is MIT/X11 licensed.

"A nilsimsa code is something like a hash, but unlike hashes, a small change in the message results in a small change in the nilsimsa code. Such a function is called a locality-sensitive hash." Quoted from: http://ixazon.dynip.com/~cmeclax/nilsimsa.html