CHANGELOG.md

0.6.0 (2021-11-11)

Highlights

  • Full feature parity with Java version
  • ~15% analysis speed improvement over 0.6.0-rc1

Rust

  • Added dictionary build functionality
  • Added an option to perform analysis without sentence splitting
    • Use it with --split-sentences=no

Python

  • Added bindings for dictionary build (undocumented and not supported as API).
  • sudachipy build and sudachipy ubuild work again
    • Reported build times and dictionary part sizes may differ from those of the original SudachiPy

0.6.0-rc1 (2021-10-26)

Highlights

  • First release of Sudachi.rs
  • SudachiPy compatible Python bindings
  • ~30x speed improvement over original SudachiPy
  • Dictionary build mode will be done before 0.6.0 final (See #13)

Rust

  • Analysis: feature parity with the Python and Java versions
  • Dictionary build is not supported in rc1
  • ~2x faster than Java version (with sentence splitting)
  • No public API at the moment. Contact us if you want to use the Rust version directly: internals will change significantly and names are not finalized.

Python

Known Issues

  • List of deprecated SudachiPy APIs:
    • MorphemeList.empty(dict: Dictionary)
      • This also needs a dictionary as an argument.
    • Morpheme.split(mode: SplitMode)
    • Morpheme.get_word_info()
    • Most instance attributes are not exported, e.g. Dictionary.grammar and Dictionary.lexicon.
  • Dictionary build is not supported: sudachipy build and sudachipy ubuild will not work. Until the feature is implemented (#13), please use 0.5.3 in another virtual environment.