Skip to content

Releases: docling-project/docling

v2.56.0

13 Oct 09:19
Compare
Choose a tag to compare

Feature

  • AutoOCR model selecting the best OCR model available and deprecating the usage of EasyOCR (#2391) (f7244a4)
  • Add Tesseract PSM options support (#2411) (f11f8c0)

Fix

  • asr: Implement robust status check in AsrPipeline (#2442) (db985bb)
  • Deal with chartsheets in workbooks (#2433) (cce18b2)
  • Skip temporary docx files (#2413) (ee55013)
  • AsrPipeline to handle absolute paths and BytesIO streams correctly (#2407) (b5f7fef)
  • Enrichment of documents without pages metadata (pptx and xlsx) (#2401) (0610d01)
  • Proper heading support in rich tables for HTML backend (#2394) (9705f40)

Documentation

v2.55.1

03 Oct 10:26
Compare
Choose a tag to compare

Fix

Documentation

v2.55.0

30 Sep 14:50
Compare
Choose a tag to compare

Feature

  • Repetition-based StoppingCriteria for GraniteDocling (#2323) (1e9dc43)
  • Rich tables support for HTML backend (#2324) (c803abe)

Fix

  • Pin wider range of typer (#2309) (68ae7cc)
  • Update Transformers & VLLM inference code, CLI and VLM specs (#2322) (654c70f)
  • Support escaped characters in markdown backend (#2304) (9d67bb9)

Documentation

v2.54.0

22 Sep 15:28
Compare
Choose a tag to compare

Feature

Fix

  • Correct y-axis scaling in draw_table_cells (#2287) (b5628f1)

Documentation

v2.53.0

17 Sep 13:59
Compare
Choose a tag to compare

Feature

Fix

  • Handle empty result from RapidOCR to avoid crash (#2264) (609d902)

Documentation

v2.52.0

11 Sep 16:11
Compare
Choose a tag to compare

Feature

  • Enrichment steps on all convert pipelines (incl docx, html, etc) (#2251) (2c91234)

Fix

  • Add missing features in ThreadedStandardPdfPipeline (#2252) (0700af2)
  • Address deprecation warnings of dependencies (#2237) (c696549)

Documentation

  • Add an example of RAG with OpenSearch (#2238) (f8cc545)
  • Add instructions for using Docling with MCP to README (#2219) (e5cd702)
  • Document VLM support requirement in extraction example (#2231) (55f5f37)

v2.51.0

05 Sep 13:01
Compare
Choose a tag to compare

Feature

  • Updating default parameters to get better performance with docling-parse (#2208) (b49d1ad)
  • Updated the backend for new docling-parse (#2187) (b3d7542)

Documentation

v2.50.0

03 Sep 11:39
Compare
Choose a tag to compare

Feature

Fix

  • html: Access to variable not yet declared (#2171) (293e81b)

v2.49.0

01 Sep 16:39
Compare
Choose a tag to compare

Feature

  • [Beta] Extraction with schema (#2138) (9f4bc5b)
  • msexcel: Set ContentLayer.INVISIBLE for invisible sheet (#1876) (a283ccf)

Fix

  • pypdfium2: Fix OCR bounding box misalignment caused by mismatched rotation metadata (#2039) (4d94e38)
  • Translation example (#2166) (9f0286b)
  • Extend offline mode for rapidocr fonts (#2155) (9904d14)

Documentation

v2.48.0

26 Aug 05:29
Compare
Choose a tag to compare

Feature

Fix

  • html: Preserve code blocks in list items (#2131) (fa3327e)