Skip to content

Unitxt 1.9.0

Compare
Choose a tag to compare
@elronbandel elronbandel released this 20 May 12:20
· 84 commits to main since this release

What's Changed

The most important things are:

  • Addition of LLM as a Judge Metrics and Tasks for both evaluating LLMs as judge and using them for evaluation of other tasks. Read more in the LLM as a Judge Tutorial
  • Addition of RAG response generation tasks and datasets as part of an effort to add comprhensive RAG evaluation to unitxt.
  • Renaming FormTask to Task for simplicity
  • Major improvments to documentation and tutorials

Breaking Changes 🚨

  • Ensure consistent evaluation of CI across implementations [Might change previous results] by @dafnapension in #844
  • Fix default format so it will be the same as formats.empty in catalog. Impacts runs that did not specify a format by @yoavkatz in #848
  • LoadJson operator moved from unit.processors to unitxt.struct_data_operators
  • Fixed YesNoTemplate and Diverse LabelSampler, to support binary task typing. YesNoTemplate now expect class field to contain a string and not a list of of strings with one elements by @yoavkatz in #836

Bug Fixes

New Features

New Assets

Documentation

New Contributors

Full Changelog: 1.8.1...1.9.0