Skip to content

v1.0.0

Choose a tag to compare

@docling-ops docling-ops released this 11 Mar 15:42
· 21 commits to main since this release

Feature

  • Parallelize the evaluation of tables and cache the loading of external predictions (#190) (9d04a56)
  • Regression tests for CVAT to Docling conversion (#193) (8a10188)
  • CVAT box rotation support, structural cleanup (#191) (db068e9)
  • Improvements in user experience: Performance, error handling, logging (#189) (a850784)
  • Visualizer tool and command for datasets (#186) (373f959)
  • Extend the evaluators to support external predictions stored in files (#185) (53dbd95)
  • Convert Docling JSON inputs to image streams in FileDatasetBuilder (#184) (15888fd)
  • Allow subset to split routing in CVAT to HF exporter (#182) (ebb8800)
  • Ingest CVAT assets and filter submissions (#180) (b55b2ea)
  • Runtime optimizations for MultiLabelConfusionMatrix (#175) (5084a4d)
  • Add more fine-grained control in the DoclingEvalCOCOExporter (#149) (8f33420)
  • Remove legacy CvatDatasetBuilder code, use modernized code (#174) (693c224)
  • Introduce the PixelLayoutEvaluator to produce confusion matrices for the multi-label layout analysis (#173) (a79bac5)
  • Review-bundle builder, fixes for GraphCell with merged elements and more (#172) (21341ce)

Fix

  • Correct import path for TableStructureModel (#199) (a7e74a3)
  • Fix the reporting of doc_id, true_md, pred_md in markdown_text_evaluator.py (#196) (3ce7591)
  • PixelLayoutEvaluator: Set all-pixels background in case of a missing prediction and evaluate (#183) (4314091)
  • Fix empty prediction handling in markdown evaluator (#177) (9b6df83)
  • Consistenty and perf improvements (#171) (8fb3a16)

Breaking

  • CvatDatasetBuilder now requires modern CVAT folder structure and uses convert_cvat_folder_to_docling() internally. (693c224)