This release adds 2 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summarySession-end summary shows free‑model savings and the saved‑cost tip now includes both paid and free‑model contributions.
Full changelog
What's new
Added
- Session-end summary shows free-model savings — Ollama and Codex calls are now separated from paid external calls in the stop hook. A new "Free models" section shows per-provider call counts, token volumes, and savings vs Sonnet baseline.
- The combined
💡 Saved ~$X.XXtip now includes both paid routing savings and free-model savings.
Upgrade
pip install --upgrade claude-code-llm-router
llm-router update
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]