Skip to content
swayerloren edited this page Apr 13, 2026 · 2 revisions

PDF TO ESX AGENT

PDF TO ESX AGENT is a Windows-focused desktop app and Python pipeline that converts insurance estimate PDFs into structured ESX/XML export artifacts.

It is built for contractors, public adjusters, estimators, restoration and roofing teams, and developers who need a local, inspectable path from messy claim documents to structured data.

What It Does

  • accepts one or more insurance estimate PDFs
  • detects text-based pages versus scan-heavy pages
  • uses local OCR fallback when native PDF text is weak
  • extracts estimate metadata, totals, and line items
  • normalizes the result into a canonical estimate model
  • writes *.esx, *.esx.xml, and *.canonical.json

Project Reality

  • real and runnable today
  • local-first, with no cloud parsing dependency
  • Windows-focused
  • parser coverage is still heuristic and layout-sensitive
  • the .esx output is a deterministic ESX-style package, not a proprietary native XACTDOC.ZIPXML writer

Start Here

If you want to... Read this
get the app running quickly Getting Started
understand the pipeline in plain language How PDF to ESX Works
see a simple user flow Example Workflow
troubleshoot common problems Common Issues
contribute code or docs Contributing
get short answers to common questions FAQ

Main Repo Links

Why This Wiki Exists

This wiki is the fast front door:

  • quick overview for first-time visitors
  • practical guidance for non-developer users
  • short contributor orientation
  • links into the deeper documentation in the main repository

The full technical source of truth stays in docs_repo/.

Clone this wiki locally