This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Summary
AI summary: Policy engine adds org/user/repo precedence, glob-based allow/block lists, per-task cost caps, and an audit trail.
Full changelog
What's new
v3.2 — Policy Engine
- Org/user/repo precedence hierarchy for model-level routing rules
- Glob-based `allow_models`/`block_models` in `~/.llm-router/org-policy.yaml` and `.llm-router.yaml`
- Per-task cost caps in org policy
- Policy audit trail in `routing_decisions.policy_applied`
- New `llm_policy` tool: shows active policy + last 10 audit entries
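
To make the allow/block behaviour concrete, here is a minimal sketch of how glob-based lists could be evaluated across policy layers. It is not the router's actual implementation: the function names, the dict shape, the model names, the rule that a block pattern beats an allow pattern, the "highest precedence first" ordering, and the default-allow fallback are all assumptions made for illustration.

```python
from fnmatch import fnmatchcase


def layer_verdict(model, policy):
    """Return False if blocked, True if allowed, None if the layer has no opinion.

    `policy` is a hypothetical dict with optional `allow_models` and
    `block_models` lists of glob patterns.
    """
    # Assumption: a matching block pattern always wins within a layer.
    if any(fnmatchcase(model, pat) for pat in policy.get("block_models", [])):
        return False
    allow = policy.get("allow_models", [])
    if allow:
        # A non-empty allow list permits only the models it matches.
        return any(fnmatchcase(model, pat) for pat in allow)
    return None


def is_model_allowed(model, layers):
    """Walk layers from highest to lowest precedence (assumed order)."""
    for policy in layers:
        verdict = layer_verdict(model, policy)
        if verdict is not None:
            return verdict
    # Assumption: with no opinion from any layer, the model is allowed.
    return True


# Hypothetical policies; the model names are placeholders.
org = {"block_models": ["gpt-4*"]}
user = {}
repo = {"allow_models": ["gemini-*", "gpt-4o-mini"]}
layers = [org, user, repo]

print(is_model_allowed("gemini-flash", layers))
print(is_model_allowed("gpt-4o-mini", layers))
```

In this sketch the org layer's `gpt-4*` block overrides the repo layer's explicit allow for `gpt-4o-mini`, which is one plausible reading of a precedence hierarchy; check the project documentation for the real ordering.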
v3.3 — Savings Digest
- Period summaries (day/week/month) with spend spike detection (2× 7-day avg alert)
- "What if router was off?" cost simulation
- Slack / Discord / generic webhook auto-detection from URL
- `LLM_ROUTER_WEBHOOK_URL` for a dedicated digest channel
- New `llm_digest` tool with `send=True` for webhook push
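
The spike alert and webhook auto-detection listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's code: the function names are hypothetical, the 7-day window is assumed to exclude the current day, the comparison is assumed to be strictly greater than 2× the average, and detection is assumed to rely only on the well-known Slack (`hooks.slack.com`) and Discord (`discord.com`, `discordapp.com`) webhook hosts.

```python
from statistics import mean
from urllib.parse import urlparse


def is_spend_spike(today, previous_days, factor=2.0):
    """Flag a spike when today's spend exceeds `factor` times the 7-day average.

    `previous_days` holds daily spend values, oldest first, excluding today
    (an assumption about how the window is defined).
    """
    window = previous_days[-7:]
    if not window:
        return False  # no history, so nothing to compare against
    return today > factor * mean(window)


def detect_webhook_kind(url):
    """Classify a webhook URL as 'slack', 'discord', or 'generic' by hostname."""
    host = (urlparse(url).hostname or "").lower()
    if host == "hooks.slack.com":
        return "slack"
    if host in {"discord.com", "discordapp.com"}:
        return "discord"
    return "generic"


print(is_spend_spike(5.0, [1.0] * 7))
print(detect_webhook_kind("https://discord.com/api/webhooks/123/abc"))
```

Matching on the hostname rather than a substring of the full URL avoids misclassifying a generic endpoint whose path happens to contain "slack" or "discord".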
v3.4 — Community Benchmarks
- Per-task-type routing accuracy from `llm_rate` 👍/👎 feedback
- Confidence tiers: ★★★ High / ★★☆ Medium / ★☆☆ Low
- `LLM_ROUTER_COMMUNITY=true` enables local anonymous export
- New `llm_benchmark` tool
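
As a rough illustration of how per-task-type accuracy might be derived from thumbs-up/down feedback, consider the sketch below. The release notes do not say how confidence tiers are computed, so tying them to the number of ratings, the thresholds of 50 and 10, the function names, and the input shape are all assumptions.

```python
from collections import defaultdict


def accuracy_by_task_type(ratings):
    """Aggregate (task_type, thumbs_up) pairs into {task_type: (accuracy, count)}."""
    counts = defaultdict(lambda: [0, 0])  # task_type -> [thumbs_up, total]
    for task_type, thumbs_up in ratings:
        counts[task_type][0] += int(thumbs_up)
        counts[task_type][1] += 1
    return {t: (up / total, total) for t, (up, total) in counts.items()}


def confidence_tier(sample_count, high=50, medium=10):
    """Map a rating count to a tier label; thresholds are hypothetical."""
    if sample_count >= high:
        return "★★★ High"
    if sample_count >= medium:
        return "★★☆ Medium"
    return "★☆☆ Low"


# Hypothetical feedback: True is a thumbs-up, False a thumbs-down.
ratings = [("code", True), ("code", False), ("code", True), ("query", True)]
for task_type, (accuracy, count) in accuracy_by_task_type(ratings).items():
    print(task_type, round(accuracy, 2), confidence_tier(count))
```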
41 total MCP tools (+3 from v3.1.0)
Upgrade
pip install --upgrade claude-code-llm-router && llm-router install
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.