Add approximate LSH approach for MinHash

Currently, our MinHash implementation falls back to an LSH scheme for approximate MinHashing. This reduces data replication from _n_ to _b_ (where _n_ is the number of elements and _b_ is the number of buckets). However, more efficient approximate LSH schemes can reduce replication further. We should add a method such as multi-probe LSH:

```
Lv, Qin, et al. "Multi-probe LSH: efficient indexing for high-dimensional similarity search." Proceedings of the 33rd international conference on Very large data bases. VLDB Endowment, 2007.
```
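
For context, here is a minimal sketch of where the factor-of-_b_ replication comes from in a banded MinHash LSH index. It is not this project's implementation: the helper names (`minhash_signature`, `band_keys`, `build_index`, `candidates`), the seeded BLAKE2 hashing, and the default parameters are all assumptions made for illustration, and _b_ is treated here as the number of bands per signature.

```python
import hashlib
from collections import defaultdict


def minhash_signature(items, num_hashes):
    """Return one min-hash value per seeded hash function (hypothetical helper).

    `items` must be a non-empty iterable of hashable, printable elements.
    """
    signature = []
    for seed in range(num_hashes):
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(f"{seed}:{x}".encode(), digest_size=8).digest(),
                "big",
            )
            for x in items
        ))
    return signature


def band_keys(signature, num_bands):
    """Split a signature into `num_bands` bucket keys, one per band."""
    rows = len(signature) // num_bands
    return [
        (band, tuple(signature[band * rows:(band + 1) * rows]))
        for band in range(num_bands)
    ]


def build_index(sets, num_hashes=32, num_bands=8):
    """Insert every set into one bucket per band, i.e. `num_bands` copies each."""
    index = defaultdict(set)
    for name, items in sets.items():
        for key in band_keys(minhash_signature(items, num_hashes), num_bands):
            index[key].add(name)
    return index


def candidates(index, items, num_hashes=32, num_bands=8):
    """Return every indexed name that shares at least one band bucket with `items`."""
    found = set()
    for key in band_keys(minhash_signature(items, num_hashes), num_bands):
        found |= index.get(key, set())
    return found
```

In this sketch every indexed set is written once per band, so the total number of stored entries is the number of sets times `num_bands`. Multi-probe LSH, as described in the paper cited above, instead probes several likely buckets per table at query time so that fewer tables need to be stored. How that probing would best be adapted to MinHash is left open here.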

