Skip to content

RAG-Anything

v1.3.0 Breaking

This release includes 2 breaking changes for platform teams planning a safe upgrade.

Published 1mo RAG & Retrieval
✓ No known CVEs patched
Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

multi-modal-rag retrieval-augmented-generation

Summary

AI summary

DoclingParser now uses the Python API, requiring pip install docling, ignoring env={...}, and changing artifact generation.

Full changelog

What's Changed

⚠️ Behavior changes worth noting

  • DoclingParser now uses the Docling Python API instead of shelling out to the docling CLI. This means:
    • You now need pip install docling to use it (the docling executable on PATH alone is no longer sufficient).
    • The env={...} kwarg on DoclingParser parse methods is still accepted for compatibility but is now ignored — set the relevant environment variables in the parent process or pass _get_converter kwargs (artifacts_path, table_mode, …).
    • <file_stem>.json and <file_stem>.md artifacts written under <output_dir>/<file_stem>/docling/ are still produced, but via export_to_dict() / export_to_markdown() rather than the CLI serializer — the logical content is the same but the files are not byte-identical.
    • check_installation() now tests Python importability rather than probing the CLI on PATH.
  • MineruParser subprocess calls now run with a default timeout (configurable) and raise TimeoutError instead of hanging indefinitely.

✨ New features

  • feat(parser): add remote URL support for DoclingParser by @bueno12223 in https://github.com/HKUDS/RAG-Anything/pull/195
  • feat(omml): add OMML equation extraction utility for DOCX documents (closes #259) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/262
  • feat: add MiniMax provider support by @octo-patch in https://github.com/HKUDS/RAG-Anything/pull/264
  • feat(examples): make LLM and vision model names configurable via env vars by @zhangzhenfei in https://github.com/HKUDS/RAG-Anything/pull/231
  • feat: add Ollama integration example (closes #118) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/238

🛠 Refactor / performance

  • refactor(parser): replace Docling CLI subprocess with Python API (closes #222) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/261

🐛 Bug fixes

  • fix: create doc_status even when LightRAG lacks multimodal insert args (closes #244) by @DeepaliPaspule in https://github.com/HKUDS/RAG-Anything/pull/255
  • fix: prevent crashes from uninitialized LightRAG, env-var stripping, and parser cleanup by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/240
  • fix: add timeout parameter to MinerU subprocess to prevent indefinite hang (#172) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/254
  • fix: pass entity_chunks_storage and relation_chunks_storage to all merge_nodes_and_edges calls (#241) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/250
  • fix: handle messages= kwarg in vision_model_func (insert_content_list_example) (#28) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/252
  • fix: forward system_prompt parameter in aquery_with_multimodal (#257) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/258
  • fix(examples): preserve embedding kwargs with partial by @txhno in https://github.com/HKUDS/RAG-Anything/pull/263
  • fix: demote misleading LibreOffice 'not found' warning to debug (closes #230) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/237
  • fix: strip <think> tags from modal processor fallback responses (closes #159) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/236
  • fix: create example log directory correctly by @haosenwang1018 in https://github.com/HKUDS/RAG-Anything/pull/242
  • fix(init): remove duplicate __all__ assignment (#267) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/268
  • fix: improve PDF parser handling by @davidangularme in https://github.com/HKUDS/RAG-Anything/pull/243

New Contributors

  • @jwchmodx made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/237
  • @davidangularme made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/243
  • @zhangzhenfei made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/231
  • @bueno12223 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/195
  • @kuishou68 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/268
  • @peterCheng123321 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/250
  • @txhno made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/263
  • @octo-patch made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/264
  • @Abdeltoto made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/261
  • @DeepaliPaspule made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/255

Full Changelog: https://github.com/HKUDS/RAG-Anything/compare/v1.2.10...v1.3.0

Breaking Changes

  • `DoclingParser` no longer shells out to the `docling` CLI; requires `pip install docling` and ignores the `env={...}` kwarg.
  • `MineruParser` subprocess calls now have a default timeout and raise `TimeoutError` instead of hanging indefinitely.

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track RAG-Anything

Get notified when new releases ship.

Sign up free

About RAG-Anything

"RAG-Anything: All-in-One RAG Framework"

All releases →

Related context

Beta — feedback welcome: [email protected]