John Snow Labs LangTest 1.3.0: Clinical Test Support, Model & Dataset Uploading to HF, Prompt-Injection Exploration, Codebase Refinement, Enhanced Unit Test Coverage and Support for custom column names for CSV loading.
π’ Overview
LangTest 1.3.0 π by John Snow Labs is here with an array of advancements: We've amped up our support for Clinical-Tests, made it simpler to upload models and augmented datasets to HF, and ventured into the domain of Prompt-Injection tests. Streamlined codebase, bolstered unit test coverage, added support for custom column names in harness for CSVs and polished contribution protocols with bug fixes!
A big thank you to our early-stage community for their contributions, feedback, questions, and feature requests π
Make sure to give the project a star right here β
π₯ New Features & Enhancements
- Adding support for clinical-tests #707
- Adding support for prompt-injection test #708
- Updated Harness format #706
- Adding support for model/dataset upload to HF #713
- Adding contribution guidelines #701
- Improving Unittest coverage #700
- Adding support for custom column names in harness for csv #650
π Bug Fixes
- Fix fairness scores #709
β How to Use
Get started now! π
pip install "langtest[langchain,openai,transformers]"
import os
os.environ["OPENAI_API_KEY"] = <ADD OPEN-AI-KEY>
Create your test harness in 3 lines of code π§ͺ
# Import and create a Harness object
from langtest import Harness
harness = Harness(task="clinical-tests",model={"model": "text-davinci-003", "hub": "openai"},data = {"data_source": "Gastroenterology-files"})
# Generate test cases, run them and view a report
h.generate().run().report()
π Documentation
β€οΈ Community support
- Slack For live discussion with the LangTest community, join the
#langtestchannel - GitHub For bug reports, feature requests, and contributions
- Discussions To engage with other community members, share ideas, and show off how you use LangTest!
We would love to have you join the mission π open an issue, a PR, or give us some feedback on features you'd like to see! π
β»οΈ Changelog
What's Changed
- Improve unit test coverage by @Prikshit7766 in #700
- Docs/Added Contribution Guidelines by @RakshitKhajuria in #701
- Feature/clinical tests by @ArshaanNazir in #707
- fix fairness scores by @alytarik in #709
- pytest/Representation Classes by @RakshitKhajuria in #705
- Feature/explore prompt injection tests by @chakravarthik27 in #708
- Refacto/Updated format of Harness by @Prikshit7766 in #706
- Fix/support more ner hf formats by @alytarik in #712
- Chore/clinical tests nb-website updates by @ArshaanNazir in #714
- Upload model/dataset to hf by @RakshitKhajuria in #713
- Support for custom column names in harness for csv by @Prikshit7766 in #650
- Feature/llm unit tests by @ArshaanNazir in #716
- Update Website/Nbs by @ArshaanNazir in #715
- Release/1.3.0 by @ArshaanNazir in #717
Full Changelog: 1.2.0...1.3.0