Skip to content

RAG-Anything

RAG & Retrieval

"RAG-Anything: All-in-One RAG Framework"

Python Latest v1.3.1 · 13d ago Security brief →

Features

  • Comprehensive all‑in‑one multimodal RAG framework
  • Processes text, images, tables, equations and charts in a single system
  • Eliminates need for multiple specialized tools

Recent releases

View all 4 releases →
No immediate action
v1.3.1 Bug fix

Duplicate detection fix

v1.3.0 Breaking risk
⚠ Upgrade required
  • Install the `docling` Python package (`pip install docling`) before using `DoclingParser`.
  • `env={...}` kwarg on `DoclingParser` parse methods is ignored; set environment variables in the parent process or use `_get_converter` kwargs such as `artifacts_path` and `table_mode`.
Breaking changes
  • `DoclingParser` no longer shells out to the `docling` CLI; requires `pip install docling` and ignores the `env={...}` kwarg.
  • `MineruParser` subprocess calls now have a default timeout and raise `TimeoutError` instead of hanging indefinitely.
Notable features
  • Add remote URL support for DoclingParser
  • OMML equation extraction utility for DOCX documents
  • MiniMax provider support
Full changelog

What's Changed

⚠️ Behavior changes worth noting

  • DoclingParser now uses the Docling Python API instead of shelling out to the docling CLI. This means:
    • You now need pip install docling to use it (the docling executable on PATH alone is no longer sufficient).
    • The env={...} kwarg on DoclingParser parse methods is still accepted for compatibility but is now ignored — set the relevant environment variables in the parent process or pass _get_converter kwargs (artifacts_path, table_mode, …).
    • <file_stem>.json and <file_stem>.md artifacts written under <output_dir>/<file_stem>/docling/ are still produced, but via export_to_dict() / export_to_markdown() rather than the CLI serializer — the logical content is the same but the files are not byte-identical.
    • check_installation() now tests Python importability rather than probing the CLI on PATH.
  • MineruParser subprocess calls now run with a default timeout (configurable) and raise TimeoutError instead of hanging indefinitely.

✨ New features

  • feat(parser): add remote URL support for DoclingParser by @bueno12223 in https://github.com/HKUDS/RAG-Anything/pull/195
  • feat(omml): add OMML equation extraction utility for DOCX documents (closes #259) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/262
  • feat: add MiniMax provider support by @octo-patch in https://github.com/HKUDS/RAG-Anything/pull/264
  • feat(examples): make LLM and vision model names configurable via env vars by @zhangzhenfei in https://github.com/HKUDS/RAG-Anything/pull/231
  • feat: add Ollama integration example (closes #118) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/238

🛠 Refactor / performance

  • refactor(parser): replace Docling CLI subprocess with Python API (closes #222) by @Abdeltoto in https://github.com/HKUDS/RAG-Anything/pull/261

🐛 Bug fixes

  • fix: create doc_status even when LightRAG lacks multimodal insert args (closes #244) by @DeepaliPaspule in https://github.com/HKUDS/RAG-Anything/pull/255
  • fix: prevent crashes from uninitialized LightRAG, env-var stripping, and parser cleanup by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/240
  • fix: add timeout parameter to MinerU subprocess to prevent indefinite hang (#172) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/254
  • fix: pass entity_chunks_storage and relation_chunks_storage to all merge_nodes_and_edges calls (#241) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/250
  • fix: handle messages= kwarg in vision_model_func (insert_content_list_example) (#28) by @peterCheng123321 in https://github.com/HKUDS/RAG-Anything/pull/252
  • fix: forward system_prompt parameter in aquery_with_multimodal (#257) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/258
  • fix(examples): preserve embedding kwargs with partial by @txhno in https://github.com/HKUDS/RAG-Anything/pull/263
  • fix: demote misleading LibreOffice 'not found' warning to debug (closes #230) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/237
  • fix: strip <think> tags from modal processor fallback responses (closes #159) by @jwchmodx in https://github.com/HKUDS/RAG-Anything/pull/236
  • fix: create example log directory correctly by @haosenwang1018 in https://github.com/HKUDS/RAG-Anything/pull/242
  • fix(init): remove duplicate __all__ assignment (#267) by @kuishou68 in https://github.com/HKUDS/RAG-Anything/pull/268
  • fix: improve PDF parser handling by @davidangularme in https://github.com/HKUDS/RAG-Anything/pull/243

New Contributors

  • @jwchmodx made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/237
  • @davidangularme made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/243
  • @zhangzhenfei made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/231
  • @bueno12223 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/195
  • @kuishou68 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/268
  • @peterCheng123321 made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/250
  • @txhno made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/263
  • @octo-patch made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/264
  • @Abdeltoto made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/261
  • @DeepaliPaspule made their first contribution in https://github.com/HKUDS/RAG-Anything/pull/255

Full Changelog: https://github.com/HKUDS/RAG-Anything/compare/v1.2.10...v1.3.0

v1.2.10 Security relevant
Security fixes
  • Path traversal vulnerability
  • Local file read vulnerability
Notable features
  • vLLM backend integration
  • Custom parser plugin system
  • Processing events and callbacks

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

About

Stars
20,860
Forks
2,427
Language
Python

Community & Support

Beta — feedback welcome: [email protected]