Skip to content

rjzak/bwmd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Test Lint Crates.io Version

Burrows Wheeler Markov Distance (BWMD)

This crate provides a simple Rust implementation of the Burrows-Wheeler Markov Distance (BWMD) algorithm by Edward Raff.

Please see the readme in the original code for more details. As for this crate, it has a few functions:

  • vectorize(): creates a vector from binary data
  • distance(): calculates the distance between arrays of binary data.

A sparse vector implementation is also provided since the output vector is large (65,536). The sparse vector can convert to/from a dense vector, encode/decode from base64, and with the serde feature is able to be directly serialized and deserialized. For a very large file, the sparse vector may end up being larger than the original. Measure and test with your data.

About

Rust version of the Burrows Wheeler Markov Distance https://github.com/EdwardRaff/pyBWMD

Topics

Resources

License

Stars

Watchers

Forks

Contributors

Languages