
docs: PDF content quality pipeline — architecture + diagrams#239

Merged: mrviduus merged 1 commit into main from docs/pdf-quality-architecture on May 23, 2026

@mrviduus
Owner

Summary

Documents the feat-0007 pipeline in docs/01-architecture/README.md with
two ASCII diagrams (matching the existing TTS section style):

  • Synchronous ingest — PdfPig → processor chain → analyzer scoring →
    BookQualityJob queued.
  • Asynchronous cleanup — systemd poller → Phase 1 validate → Phase 2
    structure fixes → Phase 3 content cleanup (gated by score, verified by
    preservation gate) → pair log → ratchet feedback into the processor chain.
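The Phase 3 step described above (run only when the score calls for it, then accept the result only if a preservation gate passes) can be sketched roughly as follows. This is an illustrative sketch, not the project's code: the function names, the 0-to-1 score scale, the assumption that cleanup runs when the score is *below* the threshold, and the word-overlap preservation metric are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CleanupResult:
    text: str      # text to keep (the original if cleanup was skipped or rejected)
    applied: bool  # True only if the cleaned text was accepted
    reason: str


def word_preservation(original: str, cleaned: str) -> float:
    """Fraction of the original's distinct words that survive in the cleaned text.

    Hypothetical metric; the real preservation gate may measure something else.
    """
    original_words = set(original.split())
    if not original_words:
        return 1.0
    return len(original_words & set(cleaned.split())) / len(original_words)


def run_content_cleanup(
    text: str,
    score: float,
    threshold: float,
    cleaner: Callable[[str], str],
    min_preserved: float = 0.95,  # assumed value, not taken from the PR
) -> CleanupResult:
    # Score gate (assumed direction): text that already scores well is left alone.
    if score >= threshold:
        return CleanupResult(text, False, "score at or above threshold")

    candidate = cleaner(text)

    # Preservation gate: discard the cleanup if it dropped too much content.
    if word_preservation(text, candidate) < min_preserved:
        return CleanupResult(text, False, "preservation gate rejected")

    return CleanupResult(candidate, True, "cleanup accepted")
```

With this shape, a cleaner that only normalizes whitespace passes the gate, while one that drops most of the text is rejected and the original is kept.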

Also adds a components table, the configuration knobs (CONTENT_CLEANUP_ENABLED,
CONTENT_QUALITY_THRESHOLD, CLEANUP_TIMEOUT, and the admin auto-queue setting),
and a brief mention in the root README Features section that links back to the new section.
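As a rough illustration of how the three named knobs might be consumed, here is a minimal sketch that reads them from a mapping such as the process environment. Only the knob names come from this PR. That they are environment variables, their types, the default values, and the unit of CLEANUP_TIMEOUT (seconds) are assumptions made for the example.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class CleanupConfig:
    enabled: bool
    quality_threshold: float
    timeout_seconds: float


def load_cleanup_config(env: Mapping[str, str] = os.environ) -> CleanupConfig:
    """Parse the cleanup knobs; every default below is a placeholder, not the project's."""
    raw_enabled = env.get("CONTENT_CLEANUP_ENABLED", "false")
    enabled = raw_enabled.strip().lower() in {"1", "true", "yes"}

    quality_threshold = float(env.get("CONTENT_QUALITY_THRESHOLD", "0.7"))
    timeout_seconds = float(env.get("CLEANUP_TIMEOUT", "300"))

    if timeout_seconds <= 0:
        raise ValueError("CLEANUP_TIMEOUT must be positive")

    return CleanupConfig(enabled, quality_threshold, timeout_seconds)
```

Passing a plain dict instead of `os.environ` keeps the parser easy to exercise in tests without touching the real environment.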

Changes

  • docs/01-architecture/README.md — new "PDF Content Quality Pipeline"
    section between TTS and See Also.
  • README.md — bullet under Library linking to the architecture section.

Docs-only.

🤖 Generated with Claude Code

Adds a section to docs/01-architecture covering the feat-0007 pipeline:
synchronous ingest path (PdfPig + processor chain + analyzer scoring) and
asynchronous cleanup path (poller → Phase 1-2-3 → preservation gate → pair
log). Components table, configuration knobs, links back to the feature doc.

Brief mention in the root README Features section linking to the new
architecture section.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@mrviduus mrviduus merged commit 1536230 into main May 23, 2026
5 checks passed
@mrviduus mrviduus deleted the docs/pdf-quality-architecture branch May 23, 2026 00:39