Skip to content

Release v5.0.0

Compare
Choose a tag to compare
@hiroshi-matsuda-rit hiroshi-matsuda-rit released this 25 Aug 03:24
· 199 commits to master since this release
79f27f8

ginza-5.0.0

  • 2021-08-26, Demantoid
  • Important changes
    • Upgrade spaCy to v3
      • Release transformer-based ja-ginza-electra model
      • Improve UPOS accuracy of the standard ja-ginza model by adding morphologizer to the tail of spaCy pipleline
    • Need to insrtall analysis model along with ginza package
      • High accuracy model (>=16GB memory needed)
        • pip install -U ginza ja-ginza-electra
      • Speed oriented model
        • pip install -U ginza ja-ginza
    • Change component names of CompoundSplitter and BunsetuRecognizer to compound_splitter and bunsetu_recognizer respectively
    • Also see spaCy v3 Backwards Incompatibilities
  • Improvements
    • Add command line options
      • -n
        • Force using SudachiPy's normalized_form as Token.lemma_
      • -m (ja_ginza|ja_ginza_electra)
        • Select model package
    • Revise ENE category name
      • Degital_Game to Digital_Game