feat(audit): --quality and --ocr-text vision-based checks by gfargo · Pull Request #82 · gfargo/localpress

gfargo · 2026-05-11T20:54:09Z

Closes #81. See issue. (--inconsistent deferred — embedding-based, separate design.)

Two new audit checks powered by Ollama vision: - --quality flags blurry / low-contrast / poorly-composed images - --ocr-text flags images that visually contain a supplied text string Both run per-item Ollama calls (~10s each), so neither is included in the default "run all checks" behavior — must be opted into. If Ollama isn't reachable, the check is skipped with a warning rather than failing the whole audit. --inconsistent (cross-library style outliers) is deferred; it requires embedding-based clustering and is a separate design conversation. - New AuditFinding types: 'quality' and 'ocr-match' - New helpers detectQualityIssues / detectOcrMatches using generateCaption with strict YES/NO prompts - audit MCP tool gains `quality` (bool) and `ocrText` (string) fields - JSON output `summary` extended with `quality` + `ocrMatch` counts - Text output grouping extended with new sections Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

gfargo merged commit 2eed1ea into main May 11, 2026
4 checks passed

gfargo deleted the feat/vision-audit branch May 11, 2026 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(audit): --quality and --ocr-text vision-based checks#82

feat(audit): --quality and --ocr-text vision-based checks#82
gfargo merged 1 commit into
mainfrom
feat/vision-audit

gfargo commented May 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gfargo commented May 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant