NameetP/pdfmux

Developer Productivity

Self‑healing PDF extractor that audits each page, re‑extracts failed pages, and supports OCR, table, and LLM fallback backends to produce clean RAG input

Python · Latest v1.8.5 · 1h ago

Features

Per‑page confidence scoring and automatic self‑audit of extraction quality
Routed extraction through multiple rule‑based backends (PyMuPDF, OpenDataLoader, RapidOCR, Docling, Surya, Marker) plus BYOK LLM fallback
Zero‑config CLI and Python API with chunking, schema‑guided structured output, cost estimation and CI‑friendly strict mode
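
The audit-and-re-extract idea in the features above can be illustrated with a minimal sketch. It does not use pdfmux's actual API: `PageResult`, `score_text`, `extract_with_fallback`, the 0.8 threshold, and the scoring heuristic are all hypothetical names and values chosen for illustration, and the backends are plain callables standing in for real extractors.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Hypothetical types for illustration only; none of these names come from pdfmux.
Extractor = Callable[[bytes], str]


@dataclass
class PageResult:
    page: int          # 1-based page number
    text: str          # best text found for the page
    confidence: float  # score of that text, between 0.0 and 1.0
    backend: str       # name of the backend that produced it


def score_text(text: str) -> float:
    """Toy confidence heuristic (an assumption, not pdfmux's scoring):
    the share of alphanumeric or whitespace characters, scaled down
    when the output is shorter than 50 characters."""
    if not text.strip():
        return 0.0
    clean = sum(ch.isalnum() or ch.isspace() for ch in text)
    length_factor = min(len(text) / 50, 1.0)
    return (clean / len(text)) * length_factor


def extract_with_fallback(
    pages: Sequence[bytes],
    backends: Sequence[Tuple[str, Extractor]],
    threshold: float = 0.8,
) -> List[PageResult]:
    """Try backends in order for each page and stop at the first result
    whose score reaches the threshold; otherwise keep the best attempt."""
    results = []
    for number, raw in enumerate(pages, start=1):
        best = PageResult(number, "", 0.0, "none")
        for name, extract in backends:
            text = extract(raw)
            confidence = score_text(text)
            if confidence > best.confidence:
                best = PageResult(number, text, confidence, name)
            if confidence >= threshold:
                break  # page passed the audit, no further fallback needed
        results.append(best)
    return results
```

Ordering the backends from cheapest to most expensive means a costly fallback (OCR or an LLM, in the tool's terms) runs only for pages that fail the audit.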

Recent releases

v1.8.1 · New feature · 1h ago · Verification audit tool · No immediate action
v1.8.4 · Bug fix · 1h ago · Version string fix · No immediate action
v1.8.3 · Bug fix · 1h ago · Dead link fix + timeout termination · No immediate action
v1.7.0 · Breaking risk · 1mo ago · Breaking upgrade: --strict on by default · Config change
v1.6.4 · Breaking risk · 2mo ago · audit command + upsell line · No immediate action

Releases

[Chart: releases per month; x-axis months M, A, M, J, J]

Cadence 0.9 / wk

Last release 0d

Tracked 19

Security

Security score 6.5/10

OpenSSF —

Open CVEs 0

SECURITY.md Active maintainer

Community

GitHub stars 75

Forks 12

Contributors 90d 2

Open issues 6

Open PRs 2

Stars/wk velocity 0.0

About

Stars 75
Forks 12
Languages Python, JavaScript, Shell
Downloads/week 7 (↑550%)
NPM maintainers 1 (single npm maintainer)
Contributors 3

Install & Platforms

Install via pip
Alternative to LlamaParse

Similar tools

PSPDFKit/nutrient-dws-mcp-server

olgasafonova/mediawiki-mcp-server

ndjordjevic/pinrag
