This release adds 2 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+14 more
Summary
AI summaryParallel BM25 + Semantic Search runs simultaneously, reducing latency ~50%, and adjacent chunk retrieval expands matched results with surrounding context.
Full changelog
New Features
Parallel BM25 + Semantic Search
Both search engines now run simultaneously in threads. ~50% latency reduction in hybrid mode.
Adjacent Chunk Retrieval
Matched chunks are automatically expanded with surrounding context. When a chunk matches your query, the system fetches the chunks immediately before and after it (from the same document) and merges them into a single expanded result.
- Results include
context_expanded: truewhen adjacent chunks were merged - Content grows from ~650 chars to ~1500 chars per result (more context for the LLM)
- Zero impact on retrieval precision — the matching still happens on the original chunk
Inspired by PrivateGPT's SentenceWindow pattern and Kotaemon's parallel retrieval.
Upgrade
pip install --upgrade knowledge-rag
Full Changelog
https://github.com/lyonzin/knowledge-rag/compare/v3.1.1...v3.2.0
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About lyonzin/knowledge-rag
Local RAG system for Claude Code with hybrid search (BM25 + semantic), cross-encoder reranking, markdown-aware chunking, query expansion, and 12 MCP tools. Runs entirely offline with zero external servers.
Related context
Beta — feedback welcome: [email protected]