Skip to content

John Snow Labs - NLP Test 1.0.0: An open-source library for delivering safe & effective models into production!

Compare
Choose a tag to compare
@luca-martial luca-martial released this 03 Apr 20:22
· 4207 commits to main since this release
0abe698

πŸ“’ Overview

We are very excited to release John Snow Labs' latest library: NLP Test! πŸš€ This is our first major step towards building responsible AI.

NLP Test is an open-source library for testing NLP models and datasets from all major NLP libraries in a few lines of code. πŸ§ͺ The library has 1 goal: delivering safe & effective models into production. 🎯

Make sure to give the project a star right here ⭐


πŸ”₯ Features

  • Generate & run over 50 test types in a few lines of code πŸ’»
  • Test all aspects of model quality: robustness, bias, representation, fairness and accuracy
  • Automatically augment training data based on test results πŸ’ͺ
  • Support for popular NLP libraries: Spark NLP, Hugging Face Transformers & spaCy
  • Support for popular NLP tasks: Named Entity Recognition and Text Classification πŸŽ‰

❓ How to Use

Get started now! πŸ‘‡

pip install nlptest

Create your test harness in 3 lines of code πŸ§ͺ

# Import and create a Harness object
from nlptest import Harness
h = Harness(task='ner', model='dslim/bert-base-NER', hub='transformers')

# Generate test cases, run them and view a report
h.generate().run().report()

πŸ“– Documentation


❀️ Community support

  • Slack For live discussion with the NLP Test community, join the #nlptest channel
  • GitHub For bug reports, feature requests, and contributions
  • Discussions To engage with other community members, share ideas, and show off how you use NLP Test!

We would love to have you join the mission πŸ‘‰ open an issue, a PR, or give us some feedback on features you'd like to see! πŸ™Œ


πŸš€ Mission

While there is a lot of talk about the need to train AI models that are safe, robust, and fair - few tools have been made available to data scientists to meet these goals. As a result, the front line of NLP models in production systems reflects a sorry state of affairs.

We propose here an early stage open-source community project that aims to fill this gap, and would love for you to join us on this mission. We aim to build on the foundation laid by previous research such as Ribeiro et al. (2020), Song et al. (2020), Parrish et al. (2021), van Aken et al. (2021) and many others.

John Snow Labs has a full development team allocated to the project and is committed to improving the library for years, as we do with other open-source libraries. Expect frequent releases with new test types, tasks, languages, and platforms to be added regularly. We look forward to working together to make safe, reliable, and responsible NLP an everyday reality.