Skip to content

Latest commit

 

History

History
33 lines (24 loc) · 1.99 KB

CHANGELOG.md

File metadata and controls

33 lines (24 loc) · 1.99 KB

Changelog

Version 1.3.0 - New feature - 2023-04

  • ✨ Added Python 3.8 and 3.9 support

Version 1.2.2 - Bugfix release - 2022-07

  • Fix sudachipy version not being compatible with Python 3.6 anymore

Version 1.2.1 - Bugfix release - 2021-06

  • ✨ Improved Japanese stopwords
  • 🐛 Add explicit UI for languages which do not support lemmatization

Version 1.2.0 - New feature and bugfix release - 2021-04

  • ✨ Added Python 3.7 and Japanese support
  • 🐛 Fixed silent failure when tokenizing long text (> 1 million characters)
  • 💄 Improved recipe interface loading time, enhanced logging and column descriptions

Version 1.1.1 - Bugfix release - 2020-12

  • 🐛 Fixed macedonian support
  • 💚 Added integration tests

Version 1.1.0 - New feature release - 2020-12

  • ✨ Text cleaning recipe to tokenize, filter and lemmatize text data in 58 languages
  • ✅ Stopwords peer-reviewed by native speakers
  • 💄 Enhancements to the UX, the logging and the tokenization library

Version 1.0.1 - Bugfix release - 2020-09

Version 1.0.0 - Initial release - 2020-09

  • 🌎 🌍 🌏  Recipe to detect dominant languages among 114 languages
  • 🧐 Recipe to identify and correct misspellings in 36 languages