PaddleOCR

Developer Productivity

Python Latest v3.7.0 · 1mo ago Security brief →

Features

Intelligent Document Parsing: Convert PDFs and images to LLM‑ready Markdown/JSON using SOTA PaddleOCR-VL-1.6 (0.9B) with 96.3% accuracy on OmniDocBench v1.6.
Structure‑Aware Conversion: PP-StructureV3 provides fine‑grained coordinate data for tables, cells, and text in Markdown/JSON outputs.
Universal Text Recognition: PP-OCRv6 supports 100+ languages (including Chinese, English, Japanese) with +4.6% detection and +5.1% recognition gains over the previous version.

View all 6 releases →

No immediate action

v3.7.0 New feature 1mo

PP-OCRv6 accuracy + speed

Open

No immediate action

v3.6.0 New feature 1mo

PaddleOCR-VL-1.6 release + SDKs

Open

v3.5.0 New feature 3mo

Notable features

Deep integration with Hugging Face ecosystem supporting 20 major models via Transformers as inference backend
Flexible switching among PaddlePaddle static graph, dynamic graph, or Transformers inference engines
Conversion of Word, Excel, PowerPoint documents to Markdown

Full changelog

Deeply integrated with the Hugging Face ecosystem, with 20 major models supporting Transformers as the inference backend. Supports flexible switching of inference engines, including PaddlePaddle static graph, PaddlePaddle dynamic graph, or Transformers.
Supports conversion of common document formats (Word, Excel, Powerpoint) to Markdown.
The PaddleOCR-VL series, PP-StructureV3, and PP-DocTranslation support exporting parsing results to DOCX format, making it convenient to view and edit in Word.
Official browser inference SDK PaddleOCR.js is released, supporting running PP-OCRv5 in the browser.

深度适配 Hugging Face 生态，20 个主要模型支持以 Transformers 作为推理后端。支持灵活切换推理引擎，可选飞桨静态图、飞桨动态图或 Transformers。
支持常见文档格式（Word、Excel、Powerpoint）转 Markdown。
PaddleOCR-VL 系列、PP-StructureV3、PP-DocTranslation 支持将解析结果导出为 DOCX 格式，便于在 Word 中查看和编辑。
发布官方浏览器推理 SDK PaddleOCR.js，支持在浏览器中运行 PP-OCRv5。

Full Changelog: https://github.com/PaddlePaddle/PaddleOCR/compare/v3.4.1...v3.5.0

v3.4.1 New feature 3mo

Notable features

Full changelog

PaddleOCR-VL adds llama-cpp-server backend support.
PaddleOCR-VL adds AMD GPU and Intel Arc GPU hardware support.
Fixed dependency issues in the PaddleOCR-VL images for Huawei NPU and KunlunXin XPU*
For the PaddleOCR-VL Docker Compose service, the default configuration no longer limits the maximum number of pages per request.

Full Changelog: https://github.com/PaddlePaddle/PaddleOCR/compare/v3.4.0...v3.4.1

v3.4.0 New feature 5mo

⚠ Upgrade required

Fixed error when accessing the `/docs` endpoint in the official PaddleOCR‑VL image

Notable features

PaddleOCR-VL-1.5 supports irregular‑shaped bounding box localization and achieves 94.5% accuracy on OmniDocBench v1.5
Adds seal recognition and integrates spotting tasks into PaddleOCR-VL-1.5
PP-StructureV3 gains `format_block_content` and `markdown_ignore_labels` parameters

Full changelog

Release the PaddleOCR-VL-1.5 complex document parsing solution.

PaddleOCR-VL-1.5 is a new iterative version of the PaddleOCR-VL series. Based on comprehensive optimization of the core capabilities of version 1.0, the model achieves 94.5% accuracy on the authoritative document parsing benchmark OmniDocBench v1.5, surpassing top global general-purpose large models and document parsing–specific models.

PaddleOCR-VL-1.5 innovatively supports irregular-shaped bounding box localization of document elements, enabling excellent performance in real-world application scenarios such as scanning, skew, warping, screen-photography, and complex illumination, achieving comprehensive SOTA performance. In addition, the model further integrates seal recognition and spotting tasks, with key metrics continuing to lead mainstream models.

You can use it online on the PaddleOCR official website or call the model API.
Add support for calling MLX-VLM inference services.
PaddleOCR-VL now supports cross-page table merging and multi-level heading reconstruction.
PP-StructureV3 adds support for the format_block_content and markdown_ignore_labels parameters.
Fixed an issue where accessing the /docs endpoint in the official PaddleOCR-VL image would result in an error.

@AmirHosseinOmidi0 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16659
@ZhangX-21 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16745
@AdlerFleurant made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16756
@tianyuzhou668 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16518
@shiyuasuka made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/17041
@1250890838 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16996
@Ihebdhouibi made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/16994
@Ghazi-raad made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/17201
@orbisai0security made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/17289
@danghoangnhan made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/17019
@Luxorion-12 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/17158