Releases: docling-project/docling
Releases Β· docling-project/docling
v2.37.0
Feature
- Make Page.parsed_page the only source of truth for text cells, add OCR cells to it (#1745) (
7d3302c
) - Support xlsm files (#1520) (
df14022
)
Fix
- Pptx line break and space handling (#1664) (
f28d23c
) - asciidoc: Set default size when missing in image directive (#1769) (
b886e4d
) - Handle NoneType error in MsPowerpointDocumentBackend (#1747) (
7a275c7
) - Prov for merged-elems (#1728) (
6613b9e
) - tesseract: Initialize df_osd to avoid uninitialized variable error (#1718) (
e979750
) - Allow custom torch_dtype in vlm models (#1735) (
f7f3113
) - Improve extraction from textboxes in Word docs (#1701) (
9dbcb3d
) - Add WEBP to the list of image file extensions (#1711) (
a2b83fe
)
Documentation
v2.36.1
v2.36.0
v2.35.0
v2.34.0
v2.33.0
Feature
Fix
- Fix issue with detecting docx files, and files with upper case extensions (#1609) (
f4d9d41
) - Load_from_doctags static usage (#1617) (
0e00a26
) - Incorrect force_backend_text behaviour for VLM DocTag pipelines (#1371) (
f2e9c07
) - pypdfium: Resolve overlapping text when merging bounding boxes (#1549) (
98b5eeb
)
v2.32.0
v2.31.2
v2.31.1
Fix
- Add smoldocling in download utils (#1577) (
127e386
) - HTML: Handle row spans in header rows (#1536) (
776e7ec
) - Mime error in document streams (#1523) (
f1658ed
) - Usage of hashlib for FIPS (#1512) (
7c70573
) - Guard against attribute errors in TesseractOcrModel del (#1494) (
4ab7e9d
) - Enable cuda_use_flash_attention2 for PictureDescriptionVlmModel (#1496) (
cc45396
) - Updated the time-recorder label for reading order (#1490) (
976e92e
) - Incorrect scaling of TableModel bboxes when do_cell_matching is False (#1459) (
94d66a0
)
Documentation
v2.31.0
Feature
Fix
- html: Handle address, details, and summary tags (#1436) (
ed20124
) - Treat overflowing -v flags as DEBUG (#1419) (
8012a3e
) - codecov: Fix codecov argument and yaml file (#1399) (
fa7fc9e
)
Documentation
- Fix wrong output format in example code (#1427) (
c2470ed
) - Add OpenSSF Best Practices badge (#1430) (
64918a8
) - Typo fixes in docling_document.md (#1400) (
995b3b0
) - Updated the [Usage] link in architecture.md (#1416) (
88948b0
) - ocr: Add docs entry for OnnxTR OCR plugin (#1382) (
a7dd59c
) - security: More statements about secure development (#1381) (
293c28c
) - Add testing in the docs (#1379) (
01fbfd5
) - Add Notes for Installing in Intel macOS (#1377) (
a026b4e
)