fix: test_hallucination_detection missing qualitative marker and tight tolerance #809

## Problem

`test_hallucination_detection` in `test/stdlib/components/intrinsic/test_rag.py:152` has two issues:

1. **Missing `@pytest.mark.qualitative`**: every other LLM-output-quality test in the file carries this marker, but this one does not, so it runs in fast test loops that should exclude it.

2. **Tolerance too tight**: the test asserts `pytest.approx(r, abs=3e-2)` on a generative model score. The observed drift of 0.036 exceeds that 0.03 bound and causes spurious failures (reported in #804).

## Fix

- Add `@pytest.mark.qualitative` decorator
- Widen tolerance from `abs=3e-2` to `abs=5e-2`
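
A minimal sketch of the two changes together, assuming the test compares a single float score against a reference value. The real test body is not shown in this issue, so `EXPECTED_SCORE`, `OBSERVED_SCORE`, `within_tolerance`, and the `_sketch` test name are hypothetical stand-ins. Only the decorator and the two `abs` values come from the issue.

```python
import pytest

# Hypothetical scores for illustration; the real values come from the test's
# own data, which this issue does not show.
EXPECTED_SCORE = 0.50
OBSERVED_SCORE = EXPECTED_SCORE + 0.036  # drift size reported in #804


def within_tolerance(observed: float, expected: float, abs_tol: float) -> bool:
    """Return True when observed matches expected under an absolute tolerance."""
    return observed == pytest.approx(expected, abs=abs_tol)


@pytest.mark.qualitative  # same marker the other LLM-output-quality tests use
def test_hallucination_detection_sketch():
    # The old bound rejects the reported drift; the widened bound accepts it.
    assert not within_tolerance(OBSERVED_SCORE, EXPECTED_SCORE, 3e-2)
    assert within_tolerance(OBSERVED_SCORE, EXPECTED_SCORE, 5e-2)
```

When only `abs` is passed, `pytest.approx` ignores its relative tolerance, so the check reduces to `|observed - expected| <= abs`.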

## Related

- #384 — same class of issue (qualitative assertion too strict)
- #735 / #692 — semantic assertion infrastructure
- Reported in #804 (comment by @planetf1)
