This release adds 2 notable features for engineering teams evaluating rollout.
Published 3mo
Developer Productivity
✓ No known CVEs patched
✓ No known CVEs patched in this version
Topics
ai-agent
docling
document-parsing
llm
mcp
ocr
+8 more
opendataloader
pdf
pdf-extraction
pdf-to-json
pdf-to-markdown
python
self-healing
structured-extraction
Summary
AI summaryAdded pdfmux doctor and bench commands
Full changelog
Added
pdfmux doctor— check installed extractors, versions, and API keyspdfmux bench— benchmark all available extractors on a PDF side by side
Fixed
- Suppressed upstream pymupdf4llm "Consider using pymupdf_layout" noise
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About NameetP/pdfmux
PDF extraction router with built-in MCP server. Classifies each page (digital, scanned, tables) and routes to the best backend (PyMuPDF, Docling, OCR, or optional LLM fallback)
Related context
Related tools
Beta — feedback welcome: [email protected]