This repository provides benchmark tools for comparing the performance of data structures for string matching.
Directory rust provides benchmarks for the following libraries:
- crawdad::Trie
- crawdad::MpTrie
- yada::DoubleArray
- fst::Map
- daachorse::DoubleArrayAhoCorasick
- daachorse::CharwiseDoubleArrayAhoCorasick
- std::collections::BTreeMap
- std::collections::HashMap
- hashbrown::HashMap
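To give a rough idea of what an exact-match comparison between two of these structures looks like, here is a minimal sketch that uses only the standard library. It is not the code in this repository: the function names (build_maps, time_lookups, compare) are hypothetical, and timing with std::time::Instant is much noisier than a proper benchmark harness.

```rust
use std::collections::{BTreeMap, HashMap};
use std::time::{Duration, Instant};

/// Builds both maps from the same key list, using each key's index as its value.
/// (Hypothetical helper for illustration only.)
fn build_maps(keys: &[&str]) -> (BTreeMap<String, usize>, HashMap<String, usize>) {
    let btree: BTreeMap<String, usize> = keys
        .iter()
        .enumerate()
        .map(|(i, k)| (k.to_string(), i))
        .collect();
    let hash: HashMap<String, usize> = keys
        .iter()
        .enumerate()
        .map(|(i, k)| (k.to_string(), i))
        .collect();
    (btree, hash)
}

/// Runs `lookup` on every query and returns the number of hits and the elapsed wall-clock time.
fn time_lookups<F>(queries: &[&str], lookup: F) -> (usize, Duration)
where
    F: Fn(&str) -> bool,
{
    let start = Instant::now();
    let mut hits = 0;
    for &q in queries {
        if lookup(q) {
            hits += 1;
        }
    }
    (hits, start.elapsed())
}

/// Looks up every query in both maps, prints the timings, and returns the hit counts
/// as (BTreeMap hits, HashMap hits).
fn compare(keys: &[&str], queries: &[&str]) -> (usize, usize) {
    let (btree, hash) = build_maps(keys);
    let (btree_hits, btree_time) = time_lookups(queries, |q| btree.contains_key(q));
    let (hash_hits, hash_time) = time_lookups(queries, |q| hash.contains_key(q));
    println!("BTreeMap: {} hits in {:?}", btree_hits, btree_time);
    println!("HashMap:  {} hits in {:?}", hash_hits, hash_time);
    (btree_hits, hash_hits)
}
```

Calling compare(&keys, &queries) from a main function with your own key and query lists prints one timing line per map. Both maps must report the same number of hits, which is a useful sanity check before trusting any timing.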
After changing your current directory to rust, you can measure time performance and memory usage on your own datasets with the following command.
$ cargo run --release --bin measure -- -k ../data/unidic/unidic -t ../data/wagahaiwa_nekodearu.txt
Alternatively, you can measure search time more accurately using criterion.rs with the following command.
$ cargo bench
Directory python provides benchmarks for the following libraries:
After changing your current directory to python, you can measure search time using pytest with the following command.
$ nox
This software is developed by LegalForce, Inc., but is not an officially supported LegalForce product.
Licensed under either of
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)
at your option.
The datasets contained here are copied from third-party repositories. Follow the license terms of each software package.