sourmash core functionality implemented in Rust
sourmash is a command-line tool and Python library for computing MinHash sketches from DNA sequences, comparing them to each other, and plotting the results. This allows you to estimate sequence similarity between even very large data sets quickly and accurately. The core data structure is implemented in C++.
There is a PR in sourmash to replace the C++ core with this implementation (tests passing, yay!).
This project is licensed under a BSD 3-Clause License.