DocumentID

DocumentID is an ASP.NET Framework Web API that identifies similar documents by fingerprinting them with MinHash.
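
For readers unfamiliar with MinHash, here is a minimal sketch of the general technique in Python. It is not DocumentID's actual C# implementation. The function names (`shingles`, `make_hash_params`, `minhash_signature`, `estimate_similarity`), the 3-word shingle size, the 128 hash functions, and the use of CRC32 as the base hash are all assumptions chosen for illustration. The hash family `(a * x + b) mod p` is the standard universal hashing construction, which may or may not match the one used in this project.

```python
import random
import re
import zlib

# Mersenne prime, larger than any 32-bit CRC value (illustrative choice).
PRIME = (1 << 61) - 1


def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def make_hash_params(num_hashes=128, seed=42):
    """Draw (a, b) pairs for the universal hash family h(x) = (a*x + b) mod PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
            for _ in range(num_hashes)]


def minhash_signature(shingle_set, params):
    """Fingerprint a shingle set: keep the minimum value under each hash function."""
    if not shingle_set:
        raise ValueError("cannot fingerprint an empty document")
    # CRC32 gives a deterministic integer per shingle (str hash() is salted per process).
    base = [zlib.crc32(s.encode("utf-8")) for s in shingle_set]
    return [min((a * x + b) % PRIME for x in base) for a, b in params]


def estimate_similarity(sig_a, sig_b):
    """Fraction of matching signature slots, an estimate of Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    params = make_hash_params()
    doc_a = "the quick brown fox jumps over the lazy dog near the river bank"
    doc_b = "the quick brown fox jumps over the lazy dog near the river bend"
    sig_a = minhash_signature(shingles(doc_a), params)
    sig_b = minhash_signature(shingles(doc_b), params)
    print(estimate_similarity(sig_a, sig_b))
```

Two documents must be fingerprinted with the same `params` for their signatures to be comparable. Using more hash functions narrows the estimate's error at the cost of a longer signature.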

TODO List

  • Tweak the universal hashing function for more reliable results
  • Implement Locality Sensitive Hashing
  • Add more unit tests
  • Dockerise it