Pull requests: docling-project/docling
Pull requests list
fix: handles line breaks in pptx table cells to markdown
#1600
opened May 16, 2025 by
georgehgfonseca
feat: add fallback_lang support in TesseractOcrCliModel
ocr
#1593
opened May 15, 2025 by
IoannisMaras
3 tasks
feat: Picture description using context with surrounding text
#1587
opened May 13, 2025 by
rafaeltuelho
•
Draft
1 of 3 tasks
feat: adding new vlm-models support
#1570
opened May 11, 2025 by
PeterStaar-IBM
11 of 12 tasks
Change path to input data in the docs/examples (as tests is ../.. relative to it
#1561
opened May 9, 2025 by
ue71603
fix(pypdfium backend): resolve overlapping text when merging bounding boxes
#1549
opened May 8, 2025 by
pedrolourencoribeiro
3 tasks
feat: add textbox content extraction in msword_backend
#1538
opened May 7, 2025 by
AndrewTsai0406
fix: Fix issue with detecting docx files, and files with upper case extensions
#1528
opened May 6, 2025 by
MoheyEl-DinBadr
docs(enrichment): add enrichments for tables and figures
#1525
opened May 6, 2025 by
Nikhil200030
fix: find paragraphs in elements with images in docx
#1486
opened Apr 28, 2025 by
Manuel030
3 tasks
feat: new HTML backend that handles styled html as well as images
enhancement
New feature or request
html
issue related to html backend
#1411
opened Apr 17, 2025 by
vaaale
3 tasks done
feat(html): add anchor tag support in HTML conversion
#1402
opened Apr 15, 2025 by
ka-weihe
3 tasks done
fix: incorrect force_backend_text behaviour for VLM DocTag pipelines
bug
Something isn't working
#1371
opened Apr 13, 2025 by
krishkoushik
3 tasks
feat: add a new PictureDescription Model to support llama-stack API
#1350
opened Apr 9, 2025 by
rafaeltuelho
•
Draft
1 task
fix: Capturing of pptx images following the docx backend
#1328
opened Apr 8, 2025 by
benichou
feat: Establish confidence estimation for document and pages
enhancement
New feature or request
#1313
opened Apr 7, 2025 by
cau-git
1 of 6 tasks
fix: Fixing Handling of Pictures PowerPoint Backend
#1263
opened Mar 30, 2025 by
benichou
3 tasks done
docs: replace 'poetry shell' with 'poetry env activate' for poetry>=2.0.0
#1247
opened Mar 26, 2025 by
mkrssg
3 tasks done
fix: Unset DPv1 backend on tests (use DPv4 default), re-generate test output [Only for reference, DO NOT MERGE]
pdf parsing
PDF issue related to docling-parse
feat(ocr): tesseract support mis-oriented documents
#1167
opened Mar 14, 2025 by
ClemDoum
3 tasks done