Version 1.3.0 - New feature - 2023-04
- ✨ Added Python 3.8 and 3.9 support
Version 1.2.2 - Bugfix release - 2022-07
- Fix sudachipy version not being compatible with Python 3.6 anymore
Version 1.2.1 - Bugfix release - 2021-06
- ✨ Improved Japanese stopwords
- 🐛 Add explicit UI for languages which do not support lemmatization
Version 1.2.0 - New feature and bugfix release - 2021-04
- ✨ Added Python 3.7 and Japanese support
- 🐛 Fixed silent failure when tokenizing long text (> 1 million characters)
- 💄 Improved recipe interface loading time, enhanced logging and column descriptions
Version 1.1.1 - Bugfix release - 2020-12
- 🐛 Fixed macedonian support
- 💚 Added integration tests
Version 1.1.0 - New feature release - 2020-12
- ✨ Text cleaning recipe to tokenize, filter and lemmatize text data in 58 languages
- ✅ Stopwords peer-reviewed by native speakers
- 💄 Enhancements to the UX, the logging and the tokenization library
Version 1.0.1 - Bugfix release - 2020-09
- 👹 Removed Japanese support because of installation issues with the User Isolation Framework
Version 1.0.0 - Initial release - 2020-09
- 🌎 🌍 🌏 Recipe to detect dominant languages among 114 languages
- 🧐 Recipe to identify and correct misspellings in 36 languages