This release includes 2 breaking changes for platform teams planning a safe upgrade.
Published 1mo
RAG & Retrieval
✓ No known CVEs patched
Topics
multi-modal-rag
retrieval-augmented-generation
Summary
AI summary: `DoclingParser` now uses the Docling Python API, which requires `pip install docling`, ignores the `env={...}` kwarg, and changes how artifacts are generated.
Full changelog
What's Changed
⚠️ Behavior changes worth noting
- `DoclingParser` now uses the Docling Python API instead of shelling out to the `docling` CLI. This means:
  - You now need `pip install docling` to use it (the `docling` executable on PATH alone is no longer sufficient).
  - The `env={...}` kwarg on `DoclingParser` parse methods is still accepted for compatibility but is now ignored. Set the relevant environment variables in the parent process or pass `_get_converter` kwargs (`artifacts_path`, `table_mode`, …).
  - `<file_stem>.json` and `<file_stem>.md` artifacts written under `<output_dir>/<file_stem>/docling/` are still produced, but via `export_to_dict()` / `export_to_markdown()` rather than the CLI serializer. The logical content is the same, but the files are not byte-identical.
  - `check_installation()` now tests Python importability rather than probing the CLI on PATH.
- `MineruParser` subprocess calls now run with a default timeout (configurable) and raise `TimeoutError` instead of hanging indefinitely.
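Before upgrading, it may help to confirm that `docling` is importable in the same interpreter that runs RAG-Anything, since an executable on PATH no longer counts. The following is a minimal sketch using only the standard library. The helper `is_importable` and the variable name `EXAMPLE_DOCLING_SETTING` are hypothetical and chosen for illustration; this is not the project's `check_installation()` implementation.

```python
import importlib.util
import os


def is_importable(module_name: str) -> bool:
    """Return True if the current interpreter can locate the top-level module."""
    return importlib.util.find_spec(module_name) is not None


# The env={...} kwarg is now ignored, so set any needed variables in the
# parent process before parsing starts. The variable name is a placeholder,
# not a setting documented by Docling or RAG-Anything.
os.environ.setdefault("EXAMPLE_DOCLING_SETTING", "value")

if not is_importable("docling"):
    print("docling is not importable here; run: pip install docling")
```

Running this check in CI or at service start-up surfaces a missing package before the first document is parsed.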
✨ New features
- feat(parser): add remote URL support for DoclingParser by @bueno12223 in https://github.com/HKUDS/RAG-Anything/pull/195
- feat(omml): add OMML equation extraction utility for DOCX documents (closes #259) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/262
- feat: add MiniMax provider support by @octo-patch in https://github.com/HKUDS/RAG-Anything/pull/264
- feat(examples): make LLM and vision model names configurable via env vars by @zhangzhenfei in https://github.com/HKUDS/RAG-Anything/pull/231
- feat: add Ollama integration example (closes #118) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/238
🛠 Refactor / performance
- refactor(parser): replace Docling CLI subprocess with Python API (closes #222) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/261
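Because the Docling JSON artifacts are no longer byte-identical to the CLI output, tests or caches that compare files by bytes or by hash may start failing after the upgrade. One possible approach is to compare parsed content instead, sketched below with the standard library. The helper `same_logical_json` is hypothetical, and the release notes do not confirm that the parsed structures are exactly equal across versions, so treat this as a starting point.

```python
import json


def same_logical_json(text_a: str, text_b: str) -> bool:
    """Compare two JSON documents by parsed content, ignoring whitespace and key order."""
    return json.loads(text_a) == json.loads(text_b)


# Two serializations of the same data with different formatting and key order.
old = '{"name": "doc", "pages": [1, 2]}'
new = '{\n  "pages": [1, 2],\n  "name": "doc"\n}'

print(old == new)                   # False: the bytes differ
print(same_logical_json(old, new))  # True: the parsed content matches
```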
🐛 Bug fixes
- fix: create doc_status even when LightRAG lacks multimodal insert args (closes #244) by @DeepaliPaspule in https://github.com/HKUDS/RAG-Anything/pull/255
- fix: prevent crashes from uninitialized LightRAG, env-var stripping, and parser cleanup by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/240
- fix: add timeout parameter to MinerU subprocess to prevent indefinite hang (#172) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/254
- fix: pass entity_chunks_storage and relation_chunks_storage to all merge_nodes_and_edges calls (#241) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/250
- fix: handle messages= kwarg in vision_model_func (insert_content_list_example) (#28) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/252
- fix: forward system_prompt parameter in aquery_with_multimodal (#257) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/258
- fix(examples): preserve embedding kwargs with partial by @txhno in https://github.com/HKUDS/RAG-Anything/pull/263
- fix: demote misleading LibreOffice 'not found' warning to debug (closes #230) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/237
- fix: strip `<think>` tags from modal processor fallback responses (closes #159) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/236
- fix: create example log directory correctly by @haosenwang1018 in https://github.com/HKUDS/RAG-Anything/pull/242
- fix(init): remove duplicate `__all__` assignment (#267) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/268
- fix: improve PDF parser handling by @davidangularme in https://github.com/HKUDS/RAG-Anything/pull/243
New Contributors
- @jwchmodx made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/237
- @davidangularme made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/243
- @zhangzhenfei made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/231
- @bueno12223 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/195
- @kuishou68 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/268
- @peterCheng123321 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/250
- @txhno made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/263
- @octo-patch made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/264
- @Abdeltoto made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/261
- @DeepaliPaspule made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/255
Full Changelog: https://github.com/HKUDS/RAG-Anything/compare/v1.2.10...v1.3.0
Breaking Changes
- `DoclingParser` no longer shells out to the `docling` CLI; requires `pip install docling` and ignores the `env={...}` kwarg.
- `MineruParser` subprocess calls now have a default timeout and raise `TimeoutError` instead of hanging indefinitely.
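Callers that previously relied on a parse call blocking until completion may want to handle `TimeoutError` explicitly. The sketch below illustrates the general pattern of a subprocess call with a timeout, using only the standard library. The helper `run_with_timeout` is hypothetical; it does not reproduce the `MineruParser` internals or its actual timeout parameter name.

```python
import subprocess
import sys


def run_with_timeout(cmd: list[str], timeout_s: float) -> subprocess.CompletedProcess:
    """Run a command and convert subprocess.TimeoutExpired into TimeoutError."""
    try:
        return subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
    except subprocess.TimeoutExpired as exc:
        raise TimeoutError(f"command exceeded {timeout_s}s: {cmd}") from exc


# A child process that sleeps longer than the allowed time stands in for a
# stalled parser run.
try:
    run_with_timeout([sys.executable, "-c", "import time; time.sleep(5)"], timeout_s=0.5)
except TimeoutError as err:
    print(f"parse timed out, retry or skip the document: {err}")
```

Catching the exception at the call site lets a batch pipeline log the failure and move on to the next document rather than stalling.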