Skip to content

Conversation

@nikos-livathinos
Copy link
Contributor

  • Use the parquet dataset to read DoclingDocuments and generate a dataset in COCO format.
  • Either the ground truth or the predicted document can be used.

…thod that exports DoclingDocument

column from parquet to a COCO dataset. It can operate either on the true_doc or on pred_doc.
Extend the main CLI to support multiple operations.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
@nikos-livathinos nikos-livathinos self-assigned this Aug 15, 2025
@github-actions
Copy link
Contributor

DCO Check Passed

Thanks @nikos-livathinos, all your commits are properly signed off. 🎉

@mergify
Copy link

mergify bot commented Aug 15, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

@nikos-livathinos nikos-livathinos changed the title feat: Extend the DoclingEvalCOCOExporter to generate a COCO dataset a parquet dataset feat: Extend the DoclingEvalCOCOExporter to export a parquet dataset in COCO format Aug 15, 2025
@nikos-livathinos nikos-livathinos merged commit a6811c4 into main Aug 20, 2025
10 checks passed
@nikos-livathinos nikos-livathinos deleted the nli/tellus_layout_evaluation branch August 20, 2025 08:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants