Skip to content

John Snow Labs NLP Test 1.1.0: Announcing Support for Testing LLMs

Choose a tag to compare

@luca-martial luca-martial released this 02 May 15:09
· 4451 commits to main since this release
932d8e7

πŸ“’ Overview

NLP Test 1.1.0 πŸš€ comes with brand new features, including: new capabilities for testing Large Language Models on Question Answering tasks, with support for testing OpenAI-based LLMs and support for robustness tests on the BoolQ and Natural Questions datasets!

A big thank you to our early-stage community for their contributions, feedback, questions, and feature requests πŸŽ‰

Make sure to give the project a star right here ⭐


πŸ”₯ New Features & Enhancements

  • Support for testing OpenAI LLMs on Question Answering #361
  • Support for BoolQ and Natural Questions datasets #361
  • Improved layout for configuring tests #361
  • Improved warning and error messaging #361

πŸ› Bug Fixes

  • Fixed overlapping and mis-formatted country names in dictionaries #347

❓ How to Use

Get started now! πŸ‘‡

pip install nlptest

Create your test harness in 3 lines of code πŸ§ͺ

# Set OpenAI API keys
os.environ['OPENAI_API_KEY'] = ''

# Import and create a Harness object
from nlptest import Harness
h = Harness(task='question-answering', model='gpt-3.5-turbo', hub='openai', data='BoolQ-test', config='config.yml')

# Generate test cases, run them and view a report
h.generate().run().report()

πŸ“– Documentation


❀️ Community support

  • Slack For live discussion with the NLP Test community, join the #nlptest channel
  • GitHub For bug reports, feature requests, and contributions
  • Discussions To engage with other community members, share ideas, and show off how you use NLP Test!

We would love to have you join the mission πŸ‘‰ open an issue, a PR, or give us some feedback on features you'd like to see! πŸ™Œ


♻️ Changelog

What's Changed

Full Changelog: v1.0.2...v1.1.0