docling releases - releaseport

No immediate action

v2.115.0 Mixed 3d

service-client + docx + xlsx

Review required

v2.114.0 New feature 6d

Dependencies

Legacy office formats + video pipeline

No immediate action

v2.113.0 Bug fix 12d

DOCX formatting preservation

No immediate action

v2.112.0 New feature 16d

Excel chart rendering + DCLX backend

Review required

v2.111.0 New feature 18d

Dependencies RCE / SSRF

BoxNote + Excel charts + JSONL chunks

No immediate action

v2.110.0 New feature 22d

DCLX export + faster OCR

No immediate action

v2.109.0 New feature 23d

Font override, OCR defaults, Whisper opts, code detect, PDF headings

No immediate action

v2.108.0 New feature 25d

Fast ASR + xlsx EMF/WMF + inline formula

No immediate action

v2.107.0 New feature 1mo

Typed backend load errors

No immediate action

v2.106.0 New feature 1mo

Infer PDF headings

Upgrade now

v2.105.0 Mixed 1mo

Dependencies

CVE fix + APIs + OCR

No immediate action

v2.104.0 New feature 1mo

ConfidenceScores

No immediate action

v2.103.0 Breaking risk 1mo

DoclingDocument removal

No immediate action

v2.102.2 Bug fix 1mo

Presigned URL fix + JATS abstracts

No immediate action

v2.102.1 Maintenance 1mo

Routine maintenance and dependency updates.

No immediate action

v2.102.0 New feature 1mo

Presigned artifact retrieval

Review required

v2.101.0 Bug fix 1mo

Large HTTP payload fix

No immediate action

v2.100.0 New feature 1mo

DocLang backend + EPUB support

No immediate action

v2.99.0 New feature 1mo

S3 max elements + error handling

No immediate action

v2.98.0 Bug fix 1mo

Chart scaling + docx fixes

No immediate action

v2.97.0 Breaking risk 1mo

Parameter rename

No immediate action

v2.96.1 Bug fix 1mo

FFmpeg error + DrawingML text

No immediate action

v2.96.0 Mixed 1mo

PDF backend + JSON fix + docs update

No immediate action

v2.95.0 Bug fix 2mo

Preserve DOCX text on DrawingML images

No immediate action

v2.94.0 New feature 2mo

TikZ rendering + new options + Vision 4.1 + HF

v2.93.0 New feature 2mo

Notable features

Upgraded Granite Vision model to version 4.1 for enhanced table and chart extraction

Full changelog

Feature

vlm: Upgrade Granite Vision model to 4.1 for table + chart extraction (#3382) (24f2d14)

Fix

docx: Fix OMML equation handling and improve type safety (#3381) (e00735d)

v2.92.0 New feature 2mo

Notable features

Multi-lingual support for kserve-triton OCR
Checkbox parsing support for DOCX
Modular docling-slim package

Full changelog

Feature

Extend the kserve-triton OCR model to have multi-lingual support (#3368) (8b67fae)
docx: Add checkbox parsing support (#3349) (c455a65)
Introduce modular docling-slim package (#3285) (ed32c5e)
Add ResponseFormat.DOCLANG and parsing branch in VLM pipeline (#3350) (0f6f8d0)

Fix

pptx: Skip malformed picture shapes instead of aborting conversion (#3372) (7294248)
docx: OMML conversion failures for unsupported limit functions (#3359) (3df80e7)
Make VLLM model_impl configurable (#3358) (a6a37ca)

v2.91.0 Bug fix 3mo

Security fixes

Path traversal prevention in LaTeX macro handlers

Notable features

VML image extraction with v:imagedata elements

Full changelog

Feature

docx: Extract VML images with v:imagedata elements (#3343) (2ddaa3b)

Fix

Strengthen input validation for METS‑GBS processing (#3336) (c1dbac2)
EasyOCR model downloading (#3339) (5e161ac)
vlm: Remove bogus preamble from VLM chat template (#3351) (c190ba2)
html: Refine image URL and size handling (#3348) (cd0cb69)
Fixes to html_backend (#3342) (9813190)
pptx: Assign pptx notes to ContentLayer.NOTES (#3341) (3a3c8f6)
Prevent path traversal in LaTeX macro handlers (#3330) (65ef180)
service: Add explicit usage exceeded exception handling (#3325) (075fa69)

Documentation

uspto: Improve documentation of USPTO XML parser security config (#3338) (09de7f9)

v2.90.0 New feature 3mo

Notable features

GraniteVisionTableStructureModel for VLM-based table extraction

Full changelog

Feature

Implement GraniteVisionTableStructureModel for VLM-based table extraction (#3323) (1569e42)

Fix

latex: Fully unwrap deeply nested formatting macros (#3249) (101233e)
docx: Handle inline formulas in list items (#3304) (c761512)
format: Add MD fallback for .txt files in _guess_from_content (#3311) (3bab6b4)
Strip soft hyphen when joining merged text elements (#3232) (8274892)
pptx: Handle NotImplementedError from shape.shape_type (#3309) (043ed2d)

Documentation

Fix nanonets_ocr2 runtime support matrix (#3317) (8ec14f2)

v2.89.0 New feature 3mo

Notable features

Explicit TikZ environment handling in LaTeX backend
Aligned RapidOCR English assets with 3.8 mobile models
Fixed list state isolation in table cells for DOCX documents

v2.88.0 New feature 3mo

Notable features

Client SDK for docling serve
Support for rapidocr 3.8 mobile model naming

v2.87.0 Mixed 3mo

Notable features

Nanonets OCR2 onboarding
Transformers v5 compatibility for AUTOMODEL_CAUSALLM VLMs
VLM tool-calling API responses support

v2.86.0 New feature 3mo

Notable features

Support for GraniteVision v4
Add signature/stamp html block to DC document
Add PARTIAL_SUCCESS status for VLM pipeline pages

v2.85.0 New feature 3mo

Notable features

Falcon-OCR support
LightOnOCR-2-1B support

Full changelog

Feature

Add support for Falcon-OCR (#3237) (d0e19be)
Add support for LightOnOCR-2-1B (#3213) (f2affd7)

Fix

latex: Expand custom macro parameters (#3223) (77a2505)

v2.84.0 New feature 3mo

Notable features

GLM OCR support
DocumentFigureClassifier v2.5

Full changelog

Feature

GLM OCR (#3146) (a9265d8)
Switch to the latest version of DocumentFigureClassifier model v2.5 (#3171) (d046390)
Remove the deprecation of extraction (#3220) (e9a39e8)

v2.83.0 New feature 3mo

Notable features

Upgrade to transformers v5
OCR model for remote KServe v2 API

v2.82.0 New feature 4mo

Notable features

Implementation of HTML backend with headless browser

v2.81.0 New feature 4mo

Notable features

Route plain-text and Quarto/R Markdown files to the Markdown backend

v2.80.0 New feature 4mo

Notable features

VllmCudaGraphMode

Full changelog

Feature

Add the VllmCudaGraphMode (#3125) (f950679)

v2.79.0 New feature 4mo

Notable features

Add fact metadata and linkbase relationships for XBRL

v2.78.0 New feature 4mo

Notable features

TableFormer v2 support
gRPC transport for KServe v2 API

v2.77.0 New feature 4mo

Notable features

VLM inference time tracking for mlx_model
Configurable ONNX Runtime graph optimization

v2.76.0 New feature 4mo

Notable features

WebVTT export

v2.75.0 New feature 5mo

Notable features

XBRL instance report backend parser
KServe v2 API support

Full changelog

Feature

Create a backend parser for XBRL instance reports (#3017) (334ba6e)
Unified model-family inference engines (including image-classification) and KServe v2 API support (#2979) (0353293)

Fix

Skip ASR segments when length is zero (#2998) (6b824f8)
docx: Guard against None hyperlink address in _get_paragraph_elements (#2367) (#3022) (236216e)

v2.74.0 Security relevant 5mo

Security fixes

XML External Entity and related attack vulnerabilities

Notable features

docling-parse v5 released

v2.73.1 Bug fix 5mo

Minor fixes and improvements.

v2.73.0 New feature 5mo

Notable features

LaTeX document parsing
Inference engines abstraction for object detection
Pluggable VLM runtime with preset configuration

v2.72.0 New feature 5mo

Notable features

Chart extraction models

v2.71.0 New feature 5mo

Notable features

Word document comments extraction
WebVTT and source tracker

v2.70.0 Breaking risk 6mo

Breaking changes

Python 3.9 support removed

v2.69.1 Bug fix 6mo

Fixes off-by-one error in page indexing within vlm_pipeline.

v2.69.0 New feature 6mo

Notable features

Picture classifier v2.0
Classification filters for picture description

v2.68.0 New feature 6mo

Notable features

DeepSeek-OCR support in VLM pipeline

v2.67.0 New feature 6mo

Notable features

XPU device support for Intel GPUs
Enrichment annotations in meta format

Full changelog

Feature

Enrichment annotations in the new meta format (#2859) (aab3ff5)
Add XPU device support for Intel GPUs (#2809) (2b83fdd)
Add option to report timings details (#2772) (cbc6537)

Fix

Lock new deps and update python 3.14 warnings (#2844) (d9295df)
Correct type hint for table_structure_options usage (#2823) (a0530a2)
Transformers models lazy-loaded (#2826) (3ef4525)
Font download by passing font_path to RapidOcr (#2822) (ffafe58)
cli: Add Layout and Table models to --show-external-plugins (#2832) (ed57089)

v2.66.0 New feature 7mo

Notable features

Add preset for using granite-docling via vLLM and other APIs

v2.65.0 New feature 7mo

Notable features

Add YAML output format to CLI

v2.64.1 Bug fix 7mo

Minor fixes and improvements.

v2.64.0 New feature 7mo

Notable features

Add experimental TableCropsLayoutModel
Factory and plugin-capability for Layout and Table models

v2.63.0 New feature 8mo

Notable features

Add save and load for conversion result
Enable GPU for RapidOCR when available

v2.62.0 New feature 8mo

Notable features

Add the Image backend
Layout + VLM model with layout prompt (experimental)

v2.61.2 Bug fix 8mo

Defaults to EasyOCR for Python 3.14 compatibility.

v2.61.1 Bug fix 8mo

Fixes slow table parsing performance in DOCX and HTML formats.

v2.61.0 Bug fix 8mo

Minor fixes and improvements.

Full changelog

Feature

vlm: Track generated tokens and stop reasons for VLM models (#2543) (6a04e27)

Fix

Temporarily pin NuExtract to working revision (#2588) (fa92574)
ocr: Use PSM integer values directly instead of constructor (#2578) (1a5146a)

v2.60.1 Bug fix 8mo

Minor fixes and improvements.

v2.60.0 New feature 8mo

Notable features

Threading in standard pipeline

Full changelog

Feature

Use threading in the standard pipeline and move old behavior to legacy (#2452) (268d027)

Fix

pdf: Threadsafe for pypdfium2 backend (#2527) (a51275d)

Documentation

Update link to Open WebUI docs (#2549) (01577e9)
Update installation options with extras and review FAQ (#2548) (cb10043)
Fix typos (#2546) (741c44f)

v2.59.0 New feature 8mo

Notable features

Python 3.14 support
Added num_tokens attribute for VlmPrediction

v2.58.0 New feature 9mo

Notable features

Password-protected PDF document support
MLX Whisper support for Apple Silicon ASR
Generic options support and HTML image handling modes

v2.57.0 New feature 9mo

Notable features

Process DrawingML objects in DOCX

Full changelog

Feature

docx: Process drawingml objects in docx (#2453) (1682993)

Fix

Use proper page concatenation in VLM pipeline MD/HTML conversion (#2458) (cd7f7ba)

Documentation

Example on PII obfuscation (#2459) (3e6da2c)

v2.56.1 Bug fix 9mo

EasyOCR models no longer download by default.

v2.56.0 Breaking risk 9mo

Notable features

AutoOCR model selecting best available OCR model, deprecating EasyOCR
Tesseract PSM options support

v2.55.1 Bug fix 9mo

Minor fixes and improvements.

v2.55.0 New feature 9mo

Notable features

Rich tables support for HTML backend
Repetition-based StoppingCriteria for GraniteDocling

v2.54.0 New feature 10mo

Notable features

Rich tables support for MSWord backend
New WebVTT file backend parser

v2.53.0 New feature 10mo

Notable features

Granite-docling model for document understanding
Generic extra arguments support for RapidOCR

v2.52.0 New feature 10mo

Notable features

Enrichment steps on all convert pipelines (including DOCX, HTML, etc.)

v2.51.0 New feature 10mo

Improved performance with updated docling-parse backend and optimized default parameters.

v2.50.0 New feature 10mo

Heron layout model is now the default.

v2.49.0 New feature 10mo

Introduces beta schema extraction.

v2.48.0 New feature 11mo

Upgrades RapidOCR to version 3.x.

v2.47.1 Bug fix 11mo

Fixed vllm extra to be available only on Linux x86_64 platforms.

v2.47.0 New feature 11mo

Notable features

VLM batching in transformers backend
VLLM backend
HTML formatting tags support

v2.46.0 New feature 11mo

Notable features

Code formula model

v2.45.0 New feature 11mo

Adds METS backend with Google Books profile support.

v2.44.0 New feature 11mo

Notable features

convert_string method

v2.43.0 Mixed 1y

Introduces threaded PDF pipeline.

v2.42.2 Bug fix 1y

Minor fixes and improvements.

v2.42.1 Bug fix 1y

Minor fixes and improvements.

v2.42.0 Bug fix 1y

Notable features

Option to control empty clusters in layout postprocessing

v2.41.0 New feature 1y

Adds image-text-to-text models and enables layout model configuration.

v2.40.0 New feature 1y

Notable features

Introduce LayoutOptions to control layout postprocessing behaviour
Integrate ListItemMarkerProcessor into document assembly

v2.39.0 Mixed 1y

Adds list modeling with default marker capture.

v2.38.1 Bug fix 1y

Minor fixes and improvements.

v2.38.0 Breaking risk 1y

Notable features

Support audio input
Add formatting & improve inline support
Maximum image size for VLM models

v2.37.0 New feature 1y

Notable features

Support xlsm files
Make Page.parsed_page the only source of truth for text cells

v2.36.1 Bug fix 1y

Minor fixes and improvements.

v2.36.0 New feature 1y

Adds support for new vision language models.

v2.35.0 New feature 1y

Notable features

Add visualization of bbox on page with html export
