-
Notifications
You must be signed in to change notification settings - Fork 1
Home
swayerloren edited this page Apr 13, 2026
·
2 revisions
PDF TO ESX AGENT is a Windows-focused desktop app and Python pipeline that converts insurance estimate PDFs into structured ESX/XML export artifacts.
It is built for contractors, public adjusters, estimators, restoration and roofing teams, and developers who need a local, inspectable path from messy claim documents to structured data.
- accepts one or more insurance estimate PDFs
- detects text-based pages versus scan-heavy pages
- uses local OCR fallback when native PDF text is weak
- extracts estimate metadata, totals, and line items
- normalizes the result into a canonical estimate model
- writes
*.esx,*.esx.xml, and*.canonical.json
- real and runnable today
- local-first, with no cloud parsing dependency
- Windows-focused
- parser coverage is still heuristic and layout-sensitive
- the
.esxoutput is a deterministic ESX-style package, not a proprietary nativeXACTDOC.ZIPXMLwriter
| If you want to... | Read this |
|---|---|
| get the app running quickly | Getting Started |
| understand the pipeline in plain language | How PDF to ESX Works |
| see a simple user flow | Example Workflow |
| troubleshoot common problems | Common Issues |
| contribute code or docs | Contributing |
| get short answers to common questions | FAQ |
- Repository README
- Main CONTRIBUTING guide
- Deep docs index in
docs_repo - Architecture overview
- Debugging guide
- Issues
- Releases
This wiki is the fast front door:
- quick overview for first-time visitors
- practical guidance for non-developer users
- short contributor orientation
- links into the deeper documentation in the main repository
The full technical source of truth stays in docs_repo/.
Main repository | Deep docs in docs_repo | Issues
docs_repo/ in the main repository is the authoritative deep technical source of truth.