
Release/2.2.0#1029

Merged
ArshaanNazir merged 86 commits into main from release/2.2.0 on May 15, 2024


@chakravarthik27
Collaborator

📢 Highlights

John Snow Labs is excited to announce the release of LangTest 2.2.0! This update introduces powerful new features and enhancements to elevate your language model testing experience and deliver even greater insights.

  • 🏆 Model Ranking & Leaderboard: LangTest introduces a comprehensive model ranking system. Use harness.get_leaderboard() to rank models based on various test metrics and retain previous rankings for historical comparison.

  • 🔍 Few-Shot Model Evaluation: Optimize and evaluate your models using few-shot prompt techniques. This feature lets you assess model performance with only a handful of examples, giving insight into model capabilities when data is limited.

  • 📊 Evaluating NER in LLMs: This release extends support for Named Entity Recognition (NER) tasks to Large Language Models (LLMs). Evaluate and benchmark LLMs on their NER performance with ease.

  • 🚀 Enhanced Data Augmentation: The new DataAugmenter module allows for streamlined and harness-free data augmentation, making it simpler to enhance your datasets and improve model robustness.

  • 🎯 Multi-Dataset Prompts: LangTest now offers optimized prompt handling for multiple datasets. Users can add a custom prompt for each dataset, which makes multi-dataset testing easier to integrate and more efficient.
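
To make the ranking idea in the first highlight concrete, here is a minimal pure-Python sketch of sorting models by a chosen metric. It does not call LangTest and is not how `harness.get_leaderboard()` is implemented. The helper `rank_models`, the model names, and the scores are all hypothetical; only the `rank_by` name is borrowed from the commit list in this PR.

```python
from typing import Dict, List, Tuple


def rank_models(
    scores: Dict[str, Dict[str, float]], rank_by: str
) -> List[Tuple[int, str, float]]:
    """Return (rank, model, score) tuples, best score first.

    Hypothetical helper for illustration only; not part of LangTest.
    """
    # Fail early if any model lacks the metric we are asked to rank by.
    missing = [model for model, metrics in scores.items() if rank_by not in metrics]
    if missing:
        raise KeyError(f"metric {rank_by!r} missing for: {missing}")

    # Higher score is treated as better; ties keep their input order.
    ordered = sorted(
        scores.items(), key=lambda item: item[1][rank_by], reverse=True
    )
    return [
        (rank, model, metrics[rank_by])
        for rank, (model, metrics) in enumerate(ordered, start=1)
    ]


# Made-up scores for three hypothetical models.
scores = {
    "model_a": {"robustness": 0.81, "accuracy": 0.90},
    "model_b": {"robustness": 0.88, "accuracy": 0.85},
    "model_c": {"robustness": 0.75, "accuracy": 0.93},
}

for rank, model, score in rank_models(scores, rank_by="robustness"):
    print(rank, model, score)
```

Changing `rank_by` to `"accuracy"` reorders the same models by the other metric, which is the kind of comparison a leaderboard is meant to support.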

…rompt-handling-for-different-datasets

User prompt handling for multi-dataset testing
…-augmentation-allow-access-without-harness-testing
chakravarthik27 and others added 26 commits May 11, 2024 13:13
…-importing-of-edited-testcases-into-harness

Refactor: Improved the `import_edited_testcases()` functionality in Harness.
…pt-techniques

Implementation of prompt techniques
…-benchmark-report

Fix: Summary class to update summary dataframe and handle file path
…allow-access-without-harness-testing

Refactor: Improve Code Organization and Readability
…-benchmark-report

Improved: `rank_by` argument added to `harness.get_leaderboard()`
@chakravarthik27 chakravarthik27 self-assigned this May 15, 2024
@ArshaanNazir ArshaanNazir merged commit f875632 into main May 15, 2024
