Releases: docling-project/docling
v2.56.0
Feature
- AutoOCR model selecting the best OCR model available and deprecating the usage of EasyOCR (#2391) (f7244a4)
- Add Tesseract PSM options support (#2411) (f11f8c0)
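The general idea behind automatic engine selection can be sketched as "use the first preferred backend that is actually installed". The snippet below is only an illustration under that assumption, not docling's AutoOCR logic: the helper name `first_available` and the candidate module names in the usage comment are hypothetical.

```python
from importlib.util import find_spec
from typing import Optional, Sequence


def first_available(candidates: Sequence[str]) -> Optional[str]:
    """Return the first top-level module name in `candidates` that can be imported.

    Hypothetical helper for illustration only; it does not reproduce
    docling's actual selection rules.
    """
    for name in candidates:
        try:
            # find_spec returns None when a top-level module is not installed.
            if find_spec(name) is not None:
                return name
        except (ImportError, ValueError):
            # Treat lookup errors as "not available" and keep looking.
            continue
    return None


# Example usage with an assumed preference order (names are illustrative):
# engine = first_available(["rapidocr", "easyocr", "tesserocr"])
```

Checking with `find_spec` avoids importing heavy OCR packages just to find out whether they are present.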
Fix
- asr: Implement robust status check in AsrPipeline (#2442) (db985bb)
- Deal with chartsheets in workbooks (#2433) (cce18b2)
- Skip temporary docx files (#2413) (ee55013)
- AsrPipeline to handle absolute paths and BytesIO streams correctly (#2407) (b5f7fef)
- Enrichment of documents without pages metadata (pptx and xlsx) (#2401) (0610d01)
- Proper heading support in rich tables for HTML backend (#2394) (9705f40)
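For context on the temporary docx fix: Microsoft Word creates an owner (lock) file whose name starts with `~$` next to any document that is open, and these files are not valid documents. A minimal sketch of filtering them out before conversion is shown below. The function name `drop_temp_office_files` is hypothetical, and whether docling applies exactly this check is an assumption.

```python
from pathlib import Path
from typing import Iterable, List


def drop_temp_office_files(paths: Iterable[Path]) -> List[Path]:
    """Keep only paths whose file name does not start with the '~$' lock-file prefix.

    Hypothetical helper for illustration; not taken from the docling code base.
    """
    return [p for p in paths if not p.name.startswith("~$")]


# Example usage on a mixed list of inputs:
inputs = [Path("report.docx"), Path("~$report.docx"), Path("dir/notes.docx")]
print(drop_temp_office_files(inputs))
```

The check uses `Path.name` rather than the full path, so a lock file inside a subdirectory is filtered as well.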
Documentation
v2.55.1
v2.55.0
Feature
- Repetition-based StoppingCriteria for GraniteDocling (#2323) (1e9dc43)
- Rich tables support for HTML backend (#2324) (c803abe)
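The idea of a repetition-based stopping criterion can be illustrated without any model code. The following is a minimal sketch, not the docling or Transformers implementation: the function name `has_repeated_tail` and the parameters `window` and `min_repeats` are hypothetical, chosen only for illustration. It reports whether the last `window` tokens have been repeated back to back at least `min_repeats` times, which is one simple signal a generation loop could use to stop early.

```python
from typing import Sequence


def has_repeated_tail(tokens: Sequence[int], window: int = 4, min_repeats: int = 3) -> bool:
    """Return True if the final `window` tokens occur `min_repeats` times in a row.

    Hypothetical helper for illustration; parameter names and defaults are assumptions.
    """
    if window <= 0 or min_repeats <= 1:
        raise ValueError("window must be positive and min_repeats must be at least 2")
    needed = window * min_repeats
    if len(tokens) < needed:
        return False
    tail = list(tokens[-window:])
    # Walk backwards in window-sized blocks and compare each one with the tail.
    for i in range(1, min_repeats):
        start = len(tokens) - (i + 1) * window
        if list(tokens[start:start + window]) != tail:
            return False
    return True


# Example usage: a generation loop could break once this returns True.
generated = [7, 1, 2, 3, 4, 1, 2, 3, 4, 1, 2, 3, 4]
print(has_repeated_tail(generated))
```

A fixed window only catches loops of exactly that length; a more thorough check would try several window sizes, at the cost of extra work per generated token.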
Fix
- Pin wider range of typer (#2309) (68ae7cc)
- Update Transformers & VLLM inference code, CLI and VLM specs (#2322) (654c70f)
- Support escaped characters in markdown backend (#2304) (9d67bb9)
Documentation
v2.54.0
v2.53.0
v2.52.0
v2.51.0
v2.50.0
v2.49.0
Feature
- [Beta] Extraction with schema (#2138) (9f4bc5b)
- msexcel: Set ContentLayer.INVISIBLE for invisible sheet (#1876) (a283ccf)
Fix
- pypdfium2: Fix OCR bounding box misalignment caused by mismatched rotation metadata (#2039) (4d94e38)
- Translation example (#2166) (9f0286b)
- Extend offline mode for rapidocr fonts (#2155) (9904d14)