Skip to content

John Snow Labs NLP Test 1.4.0: Enhancing Support for Toxicity test and new QA benchmark datasets (NarrativeQA, TruthfulQA, QuAC, HellaSwag, MMLU and OpenbookQA)

Choose a tag to compare

@ArshaanNazir ArshaanNazir released this 06 Jun 14:17
· 3860 commits to main since this release
8cc420c

John Snow Labs NLP Test 1.4.0: Enhancing Support for Toxicity test and new QA benchmark datasets (NarrativeQA, TruthfulQA, QuAC, HellaSwag, MMLU and OpenbookQA)


πŸ“’ Overview

NLP Test 1.4.0 πŸš€ comes with brand new features, including: new capabilities for testing Large Language Models for toxicity and support for new QA benchmark datasets (NarrativeQA, TruthfulQA, QuAC, HellaSwag, MMLU and OpenbookQA) for robustness, representation, fairness and accuracy tests. It also includes addition of some new robustness tests and many other enhancements and bug fixes!

A big thank you to our early-stage community for their contributions, feedback, questions, and feature requests πŸŽ‰

Make sure to give the project a star right here ⭐


πŸ”₯ New Features & Enhancements

  • Adding support for NarrativeQA dataset #487
  • Adding support for toxicity task #488
  • Adding support for TruthfulQA dataset #477
  • Adding support for new dyslexia swap test for robustness testing #474
  • Adding support for new slangificator test for robustness testing #463
  • Adding support for new abbreviation test for robustness testing #471
  • Adding support for OpenBookQA dataset #479
  • Adding support for MMLU dataset #481
  • Adding support for hellaswag dataset #486
  • Adding new tutorial notebooks #497

❓ How to Use

Get started now! πŸ‘‡

pip install nlptest

Create your test harness in 3 lines of code πŸ§ͺ

# Set OpenAI API keys
os.environ['OPENAI_API_KEY'] = ''

# Import and create a Harness object
from nlptest import Harness
h = Harness(task='toxicity', model='text-davinci-002', hub='openai', data='toxicity-test-tiny')

# Generate test cases, run them and view a report
h.generate().run().report()

πŸ“– Documentation


❀️ Community support

  • Slack For live discussion with the NLP Test community, join the #nlptest channel
  • GitHub For bug reports, feature requests, and contributions
  • Discussions To engage with other community members, share ideas, and show off how you use NLP Test!

We would love to have you join the mission πŸ‘‰ open an issue, a PR, or give us some feedback on features you'd like to see! πŸ™Œ


♻️ Changelog

What's Changed

New Contributors

Full Changelog: v1.3.0...v1.4.0