Add requires_review flag to detect incomplete LLM extraction#349

Open
Arijit429 wants to merge 6 commits into fireform-core:main from Arijit429:add-requires-review-flag

Conversation


Arijit429 commented Mar 25, 2026

Closes #450
Closes #40

Overview

This PR enhances the reliability and safety of the FireForm extraction pipeline by introducing a validation layer that detects incomplete or low-confidence LLM extraction results.

In the current workflow, form submissions proceed even when the language model fails to extract critical fields (e.g., returning placeholder values like "-1" or missing entries). This change introduces a structured mechanism to flag such cases for human review.

Key Changes

Extraction Validation Utility

  • Added a reusable validation helper (src/utils/validation.py) to evaluate extracted JSON data against required template fields.
  • The validator checks for:
    • Missing keys
    • Empty string values
    • Placeholder extraction outputs such as "-1"
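A minimal sketch of what such a helper could look like. The module path `src/utils/validation.py` comes from the PR; the function names, signatures, and the exact set of placeholder values below are hypothetical illustrations, not the actual implementation:

```python
from typing import Any

# Placeholder outputs the LLM may emit when it cannot extract a field
# (the PR mentions "-1"; the rest of this set is an assumption).
_PLACEHOLDERS = {"", "-1"}


def validate_extraction(data: dict[str, Any], required_fields: list[str]) -> bool:
    """Return True when every required field is present and usable."""
    for field in required_fields:
        if field not in data:  # missing key
            return False
        value = data[field]
        # empty strings and placeholder outputs count as failures
        if isinstance(value, str) and value.strip() in _PLACEHOLDERS:
            return False
    return True


def needs_review(data: dict[str, Any], required_fields: list[str]) -> bool:
    """Inverse convenience wrapper: True means a human should look."""
    return not validate_extraction(data, required_fields)
```

A call like `needs_review({"name": "-1"}, ["name"])` would return `True`, flagging the submission.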

requires_review Flag Integration

  • Extended the extraction pipeline to compute a requires_review boolean after LLM-based extraction.
  • Updated FileManipulator.fill_form() to return both:
    • Generated PDF path
    • Review requirement status
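The extended `fill_form()` contract could be sketched as follows. `FileManipulator.fill_form` is named by the PR; the parameter names, the placeholder-detection logic, and the dummy PDF path are assumptions made to keep the example self-contained:

```python
class FileManipulator:
    """Sketch of the extended return contract, not the real class."""

    def fill_form(
        self, extracted: dict, required_fields: list[str]
    ) -> tuple[str, bool]:
        # Stand-in for the real PDF-generation step (hypothetical path).
        pdf_path = "/tmp/filled_form.pdf"
        # Compute the review flag after extraction: any missing key,
        # empty string, or "-1" placeholder triggers human review.
        requires_review = any(
            field not in extracted
            or (
                isinstance(extracted[field], str)
                and extracted[field].strip() in ("", "-1")
            )
            for field in required_fields
        )
        return pdf_path, requires_review
```

Callers then unpack both values, e.g. `path, flag = manipulator.fill_form(data, fields)`.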

Controller and API Flow Updates

  • Propagated the validation flag through the Controller layer.
  • Updated /forms/fill route to persist the review state alongside form submission data.
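How the flag might flow through the controller layer, sketched as a plain function (the real route is `/forms/fill` per the PR; the function name, dict-based submission record, and persistence comment are illustrative assumptions):

```python
def handle_fill_request(extracted: dict, required_fields: list[str]) -> dict:
    """Hypothetical controller sketch: computes the review flag and
    carries it alongside the generated PDF path into the record that
    the /forms/fill route would persist."""
    requires_review = any(
        field not in extracted
        or (
            isinstance(extracted[field], str)
            and extracted[field].strip() in ("", "-1")
        )
        for field in required_fields
    )
    submission = {
        "pdf_path": "/tmp/filled_form.pdf",  # stand-in for the real output
        "requires_review": requires_review,
    }
    # In the real route this record would be saved with the submission
    # data before the response is returned to the API consumer.
    return submission
```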

Database Model and API Contract Extension

  • Added requires_review column to the FormSubmission model.
  • Extended FormFillResponse schema to expose review status to API consumers.
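The response-schema side of the contract can be sketched with a dataclass. `FormFillResponse` and the `requires_review` column name come from the PR; the field set, the default, and the ORM layer hinted at in the comment are assumptions:

```python
from dataclasses import dataclass, asdict


@dataclass
class FormFillResponse:
    """Hypothetical shape of the extended API response.

    The matching FormSubmission change would add a boolean column,
    e.g. (assuming a SQL backend):
        ALTER TABLE form_submissions
            ADD COLUMN requires_review BOOLEAN NOT NULL DEFAULT FALSE;
    """

    pdf_path: str
    requires_review: bool = False  # new field exposed to API consumers
```

Defaulting to `False` keeps the change non-breaking: existing consumers that ignore the field see the same payload shape plus one extra key.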

Motivation

LLM-driven extraction is inherently probabilistic and may produce incomplete outputs in real-world emergency scenarios. By surfacing a review indicator, this change supports safer operational workflows and enables future human-in-the-loop validation interfaces.

Impact

  • Non-breaking enhancement to submission pipeline.
  • Improves transparency and reliability of automated form generation.
  • Establishes groundwork for advanced validation, confidence scoring, and UI review flows.

Future Work

  • Field-level confidence metrics
  • Structured extraction schema enforcement
  • Frontend review dashboards for flagged submissions
  • Retry or fallback extraction strategies
