Skip to content

Releases: docling-project/docling

v2.48.0

26 Aug 05:29
Compare
Choose a tag to compare

Feature

Fix

  • html: Preserve code blocks in list items (#2131) (fa3327e)

v2.47.1

23 Aug 14:11
Compare
Choose a tag to compare

Fix

v2.47.0

22 Aug 14:15
Compare
Choose a tag to compare

Feature

  • CLI: Option to download arbitrary HuggingFace model (#2123) (cdf079d)
  • Batching support for VLMs in transformers backend, add initial VLLM backend (#2094) (3c660c0)
  • html: Support formatting tags in HTML texts (#2111) (94fcc46)

Fix

  • Improve numbered list detection for msword docs (#2100) (3f03709)

Documentation

v2.46.0

20 Aug 15:25
Compare
Choose a tag to compare

Feature

Fix

  • HTML: Parse footer tag as a group in furniture content layer (#2106) (c5f2e2f)

Performance

  • Clean up resources with docling-parse v4, no parsed_page output by default (#2105) (5f57ff2)
  • Speed up function _parse_orientation (#1934) (8820b55)

v2.45.0

18 Aug 10:25
Compare
Choose a tag to compare

Feature

  • Add backend for METS with Google Books profile (#1989) (31087f3)
  • html: Support in-line anchor tags in HTML texts (#1659) (9687297)
  • vlm: Ability to preprocess VLM response (#1907) (5f050f9)

Documentation

v2.44.0

12 Aug 09:51
Compare
Choose a tag to compare

Feature

  • Add convert_string to document-converter (#2069) (b09033c)

Fix

  • html: Parse rawspan and colspan when they include non numerical values (#2048) (ed56f2d)
  • Support new mlx-vlm module (#2001) (0130e3a)
  • Extend error reporting when verbose logging is enabled (#2017) (2eb760d)
  • HTML: Replace non-standard Unicode characters (#2006) (86f7012)

Documentation

v2.43.0

28 Jul 09:45
Compare
Choose a tag to compare

Feature

Fix

  • markdown: Ensure correct parsing of nested lists (#1995) (aec29a7)
  • HTML: Remove an unnecessary print command (#1988) (945721a)

v2.42.2

24 Jul 10:21
Compare
Choose a tag to compare

Fix

  • HTML: Concatenation of child strings in table cells and list items (#1981) (5132f06)
  • docx: Adding plain latex equations to table cells (#1986) (0b83609)
  • Preserve PARTIAL_SUCCESS status when document timeout hits (#1975) (98e2fcf)
  • Multi-page image support (tiff) (#1928) (8d50a59)

Documentation

v2.42.1

22 Jul 16:45
Compare
Choose a tag to compare

Fix

Documentation

v2.42.0

18 Jul 15:35
Compare
Choose a tag to compare

Feature

  • Add option to control empty clusters in layout postprocessing (#1940) (a436be7)

Fix

  • Safe pipeline init, use device_map in transformers models (#1917) (cca05c4)
  • Fix HTML table parser and JATS backend bugs (#1948) (e1e3053)
  • KeyError: 'fPr' when processing latex fractions in DOCX files (#1926) (95e7096)
  • Change granite vision model URL from preview to stable version (#1925) (c5fb353)

Documentation