Call for parser submissions: DocFailBench v0.1 Combined Public RC #2
Travor278
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
DocFailBench v0.1 Combined Public RC is open for parser submissions.
DocFailBench is a failure-oriented benchmark for PDF-to-Markdown, OCR, and VLM document parsers. It does not only ask whether a page looks broadly similar; it checks small executable facts: table cells, formulas, reading order, captions, page furniture, and optional bbox grounding.
Release links:
Frozen target:
DocFailBench-v0.1-combined-public-rcdata/releases/docfailbench_v0_1_combined_public_rc_cases.jsonCurrent baseline snapshot:
What we are looking for:
A useful parser submission should include parser name/version, exact run command, prediction JSON, evaluation JSON, hardware/runtime metadata, and model/API run date for hosted parsers. The full guide is here:
https://github.com/Travor278/DocFailBench/blob/main/docs/submitting-parser-results.md
Please post general feedback in this discussion. For concrete parser results, use the submission issue or open a PR so the result can be reviewed and added to the leaderboard.
Beta Was this translation helpful? Give feedback.
All reactions