Minhash LSH in Golang
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
cmd/minhash-lsh-all-pair
.gitignore
.travis.yml
LICENSE
README.md
lsh.go
lsh_benchmark_test.go update signature interface Dec 24, 2017
lsh_test.go
minhash.go
minhash_test.go

README.md

Minhash LSH in Golang

Build Status GoDoc

Documentation

Install: go get github.com/ekzhu/minhash-lsh

Run Benchmark

Set file format

  1. One set per line

  2. Each set, all items are separated by whitespaces

  3. If the parameter firstItemIsID is set to true, the first itme is the unique ID of the set.

  4. The rest of the items with the following format: <value>____<frequency>

    • value is an unique element of the set
    • frequency is an integer count of the occurance of value
    • ____ (4 underscores) is the separator

All Pair Benchmark

minhash-lsh-all-pair -input <set file name>