This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryContinuation prompts now inherit the prior turn's classification for faster follow‑up handling.
Full changelog
What's new
- Continuation prompt state inheritance — short follow-up prompts (
yes,ok,go ahead,do it,sounds good) instantly reuse the prior turn's classification instead of re-running the full Ollama/API classifier (~1–3s saved per continuation) - Negative downgrades —
no,stop,skip,cancelroute toquery/simple → llm_query - Session-scoped state — persisted at
~/.llm-router/last_route_{session_id}.jsonwith 30-min TTL; parallel sessions never interfere [via context-inherit]method tag visible in routing directive- Hook bumped to v16
All changes since v2.2.0
| Version | Headline |
|---------|----------|
| v2.3.0 | Zero-Friction Activation — shadow/suggest/enforce modes, yearly projection, weekly digest |
| v2.4.0 | Repo-Aware YAML Config — .llm-router.yml, block_providers, model pins, config CLI |
| v2.5.0 | Context-Aware Routing — continuation prompts inherit prior route |
Upgrade
```bash
pip install --upgrade claude-code-llm-router
llm-router install
If using CC plugin:
claude plugin uninstall llm-router && claude plugin install llm-router
```
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]