v1.2.2

@ayoub-ibm released this 26 Jan 03:19
· 90 commits to main since this release

Bug Fixes

Graph Converters
  • Preserve component (non-entity) data during graph pruning to avoid dropping nested structures (addresses, totals, etc.) (8552ea5)
  • Tighten error logging and improve node-id collision detection for more reliable graph builds (bafab02)
  • Auto-clean empty output directories on pipeline failure when dump_to_disk is enabled (preserves partial results) (9e4c031)
  • Add a User-Agent header for URL downloads (HEAD + GET) to avoid HTTP 403 responses; add regression tests (77dbd02)
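As an illustration of the User-Agent fix, here is a minimal sketch using only the Python standard library. It is not the project's actual code: the `USER_AGENT` value and the `build_request` and `fetch` helpers are hypothetical names chosen for this example. The idea is that both the HEAD probe and the GET download carry the same explicit header, since some servers answer the default Python client with HTTP 403.

```python
import urllib.request

# Hypothetical value; the real header string used by the project is not shown in these notes.
USER_AGENT = "example-client/0.1"


def build_request(url: str, method: str = "GET") -> urllib.request.Request:
    """Build a request that always carries an explicit User-Agent header."""
    return urllib.request.Request(
        url,
        headers={"User-Agent": USER_AGENT},
        method=method,
    )


def fetch(url: str, timeout: float = 10.0) -> bytes:
    """Probe the URL with HEAD, then download the body with GET."""
    # HEAD first: raises urllib.error.HTTPError early if the server rejects us.
    with urllib.request.urlopen(build_request(url, "HEAD"), timeout=timeout):
        pass
    # GET with the same header, so both calls look identical to the server.
    with urllib.request.urlopen(build_request(url, "GET"), timeout=timeout) as resp:
        return resp.read()
```

Keeping the header in a single helper means a regression test can check the built request without touching the network.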
Extractors & Visualization
  • Make Pydantic schema validators lenient (coerce instead of reject) and log coercions for data-quality tracking (fb1bb37)
  • Render nested node/edge details as formatted JSON (fixes "[object Object]" in the interactive viewer) (014778a)
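The "coerce instead of reject" behaviour can be sketched in plain Python. This is an assumption-laden illustration, not the project's validators: `coerce_amount`, the logger name, and the cleanup rules are all hypothetical. In a Pydantic v2 model, a helper like this could be called from a `field_validator(..., mode="before")`, so that messy input is normalised and logged rather than raising a validation error.

```python
import logging

# Hypothetical logger name, used here only for illustration.
logger = logging.getLogger("lenient_validation")


def coerce_amount(value, field_name="amount"):
    """Coerce a raw value to float where possible; log the coercion instead of raising."""
    # Already numeric (bool is excluded because it is a subclass of int).
    if isinstance(value, (int, float)) and not isinstance(value, bool):
        return float(value)
    if isinstance(value, str):
        # Strip common formatting noise such as thousands separators and a currency sign.
        cleaned = value.replace(",", "").replace("$", "").strip()
        try:
            coerced = float(cleaned)
        except ValueError:
            logger.warning("Could not coerce %s=%r; using None", field_name, value)
            return None
        logger.info("Coerced %s from %r to %r", field_name, value, coerced)
        return coerced
    # Any other type is dropped leniently rather than rejected.
    logger.warning("Unsupported type for %s: %r; using None", field_name, value)
    return None
```

The log records give a simple data-quality trail: counting warnings per field shows which parts of the extraction output are most often malformed.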

Refactoring

BillingDocument Template
  • Add a comprehensive billing/invoice Pydantic extraction template (21a2200)
  • Simplify the BillingDocument schema and prompts to improve extraction consistency and reduce unnecessary nesting (81fdbc9)

Documentation

  • Update the problem statement to reflect recent docling-graph improvements and capabilities (872a3a3)
  • Refine wording and improve styling (e7b7f0a)
  • Update examples and navigation to align with BillingDocument schema references (c254825)

Chores

Dependencies
  • Update aiofiles requirement (afc6d52)
Continuous Integration
  • Restore default semantic-release templates and regenerate changelog (59b2f43)
  • Treat refactor: commits as patch bumps in semantic-release config (f3debd6)