Skip to content

John Snow Labs NLP Test 1.2.0: Announcing Support for Cohere, AI21, Azure OpenAI and Hugging Face Inference API

Compare
Choose a tag to compare
@ArshaanNazir ArshaanNazir released this 13 May 10:59
· 3575 commits to main since this release
0698507

πŸ“’ Overview

NLP Test 1.2.0 πŸš€ comes with brand new features, including: support for testing Cohere, AI21, Hugging Face Inference API and Azure-OpenAI LLMs for robustness, bias, accuracy and representation tests on the BoolQ and Natural Questions datasets, and many other enhancements and bug fixes!

A big thank you to our early-stage community for their contributions, feedback, questions, and feature requests πŸŽ‰

Make sure to give the project a star right here ⭐


πŸ”₯ New Features & Enhancements

  • Adding support for 4 new LLM APIs for Question Answering task #388
  • Adding support for bias tests for testing LLMs on Question Answering #404
  • Adding support for representation tests for testing LLMs on Question Answering #405
  • Adding support for accuracy tests for testing LLMs on Question Answering #394
  • Adding new robustness test called number_to_word #377

πŸ› Bug Fixes

  • Fixed bias tests to enable multi-token name replacements #400
  • Fixed issue in ethnicity/religion-names #393
  • Fixed issue in default HF text classification model #402

❓ How to Use

Get started now! πŸ‘‡

pip install nlptest

Create your test harness in 3 lines of code πŸ§ͺ

# Set OpenAI API keys
os.environ['OPENAI_API_KEY'] = ''

# Import and create a Harness object
from nlptest import Harness
h = Harness(task='question-answering', model='gpt-3.5-turbo', hub='openai', data='BoolQ-test', config='config.yml')

# Generate test cases, run them and view a report
h.generate().run().report()

πŸ“– Documentation


❀️ Community support

  • Slack For live discussion with the NLP Test community, join the #nlptest channel
  • GitHub For bug reports, feature requests, and contributions
  • Discussions To engage with other community members, share ideas, and show off how you use NLP Test!

We would love to have you join the mission πŸ‘‰ open an issue, a PR, or give us some feedback on features you'd like to see! πŸ™Œ


♻️ Changelog

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.2.0