Release v0.3.0 · populationgenomics/groundmark

Breaking changes

Replace full-document alignment with quote-only resolution.

New API

DocumentIndex(pdf_bytes) — extract per-character bounding boxes from a PDF (one-time cost)
doc.resolve(quotes) — resolve verbatim quotes to bounding boxes via Smith-Waterman alignment
from groundmark.convert import convert, Config — PDF-to-Markdown conversion via LLM (requires optional [bedrock]/[anthropic]/etc. extra)

Removed

process() function and ProcessResult — replaced by convert() and DocumentIndex.resolve()
strip(), annotate(), resolve() re-exports from anchorite
PdfplumberAnchorProvider — pdfplumber dependency removed
Visualize module
anchorite dependency — alignment now uses seq-smith directly

Dependencies restructured

Core dependencies (seq-smith, pypdfium2) are always installed. LLM providers are optional extras:

uv add groundmark                          # resolve only
uv add groundmark --extra bedrock          # + Bedrock conversion
uv add groundmark --extra anthropic,bedrock # multiple providers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.3.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Breaking changes

New API

Removed

Dependencies restructured

Uh oh!