v5.0.0 — Modular Runtime, HITL Approval & Evaluation Scoring

Latest

aiqualitylab released this 03 May 11:05

· 109 commits to main since this release

v5.0.0

38922ba

Splits the runtime into focused modules (qa_config, qa_runtime, qa_workflow), adds a --approve flag for human-in-the-loop test review before save, and introduces HTML replay utilities for failure investigation. Evaluation now includes ROUGE/similarity scoring in the NLP baseline and an overall quality score in Ragas.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v5.0.0 — Modular Runtime, HITL Approval & Evaluation Scoring

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!