This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Summary
AI summary: Agent-context aware routing reorders the model chain based on the active agent (Claude Code or Codex).
Full changelog
What's new
- Agent-context aware routing: when `llm_select_agent` picks Claude Code or Codex as the active agent, all subsequent routing calls reorder the model chain to put that subscription's models first, maximising already-paid capacity before paid-per-call APIs.
- Priority matrix:

  | Session | Complexity | Chain order |
  |---------|------------|-------------|
  | Codex | simple / moderate | Ollama → Codex → rest → Claude |
  | Codex | complex | Codex → Claude → rest → Ollama |
  | Claude Code | simple / moderate | Ollama → Claude → rest → Codex |
  | Claude Code | complex | Claude → rest → Codex → Ollama |

- New state helpers: `get_active_agent()` / `set_active_agent()` in `state.py`
- 34 new tests covering all combinations and edge cases
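
The priority matrix can be read as a lookup from (active agent, complexity) to a slot order. The following is a minimal sketch of that idea, not the project's actual implementation: only the names `get_active_agent()` and `set_active_agent()` come from the release notes, and their signatures here are assumed. The agent keys (`"codex"`, `"claude_code"`), the `PRIORITY` table layout, the `reorder_chain` function, and the one-name-per-provider chain are all hypothetical.

```python
from typing import Optional

# Assumed stand-in for the helpers in state.py; the real module may
# persist the active agent differently (file, env var, session store).
_active_agent: Optional[str] = None


def set_active_agent(agent: Optional[str]) -> None:
    """Record which agent (e.g. "codex" or "claude_code") is active."""
    global _active_agent
    _active_agent = agent


def get_active_agent() -> Optional[str]:
    """Return the active agent, or None if none has been selected."""
    return _active_agent


# Hypothetical encoding of the priority matrix from the release notes.
# "rest" stands for every provider that is not Ollama, Codex, or Claude.
_CHEAP_FIRST_CODEX = ["ollama", "codex", "rest", "claude"]
_CHEAP_FIRST_CLAUDE = ["ollama", "claude", "rest", "codex"]
PRIORITY = {
    ("codex", "simple"): _CHEAP_FIRST_CODEX,
    ("codex", "moderate"): _CHEAP_FIRST_CODEX,
    ("codex", "complex"): ["codex", "claude", "rest", "ollama"],
    ("claude_code", "simple"): _CHEAP_FIRST_CLAUDE,
    ("claude_code", "moderate"): _CHEAP_FIRST_CLAUDE,
    ("claude_code", "complex"): ["claude", "rest", "codex", "ollama"],
}

_NAMED = {"ollama", "codex", "claude"}


def reorder_chain(chain: list[str], complexity: str) -> list[str]:
    """Reorder a provider chain according to the active agent.

    If no agent is active (or the combination is unknown), the chain is
    returned unchanged.
    """
    order = PRIORITY.get((get_active_agent(), complexity))
    if order is None:
        return list(chain)
    # Providers outside the three named slots keep their relative order.
    rest = [p for p in chain if p not in _NAMED]
    result: list[str] = []
    for slot in order:
        if slot == "rest":
            result.extend(rest)
        elif slot in chain:
            result.append(slot)
    return result


set_active_agent("codex")
chain = ["claude", "openai", "ollama", "codex", "gemini"]
print(reorder_chain(chain, "simple"))
# ['ollama', 'codex', 'openai', 'gemini', 'claude']
print(reorder_chain(chain, "complex"))
# ['codex', 'claude', 'openai', 'gemini', 'ollama']
```

Keeping the matrix as data rather than branching logic makes each of the six rows individually testable, which fits the "all combinations" coverage the release mentions.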
Upgrade
```bash
pip install --upgrade claude-code-llm-router && llm-router install
```
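
After upgrading, it can be useful to confirm which version is actually installed in the current environment. A small sketch using the standard library's `importlib.metadata`; the distribution name is taken from the `pip` command above, and the helper name `installed_version` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "claude-code-llm-router") -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


print(installed_version() or "claude-code-llm-router is not installed here")
```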
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.