Skip to content

ypollak2/llm-router

v2.5.0 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 1mo LLM Frameworks
✓ No known CVEs patched
Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai-routing anthropic claude claude-code cost-optimization gemini
+7 more
litellm llm llm-router mcp-server model-router ollama openai

Summary

AI summary

Continuation prompts now inherit the prior turn's classification for faster follow‑up handling.

Full changelog

What's new

  • Continuation prompt state inheritance — short follow-up prompts (yes, ok, go ahead, do it, sounds good) instantly reuse the prior turn's classification instead of re-running the full Ollama/API classifier (~1–3s saved per continuation)
  • Negative downgradesno, stop, skip, cancel route to query/simple → llm_query
  • Session-scoped state — persisted at ~/.llm-router/last_route_{session_id}.json with 30-min TTL; parallel sessions never interfere
  • [via context-inherit] method tag visible in routing directive
  • Hook bumped to v16

All changes since v2.2.0

| Version | Headline |
|---------|----------|
| v2.3.0 | Zero-Friction Activation — shadow/suggest/enforce modes, yearly projection, weekly digest |
| v2.4.0 | Repo-Aware YAML Config — .llm-router.yml, block_providers, model pins, config CLI |
| v2.5.0 | Context-Aware Routing — continuation prompts inherit prior route |

Upgrade

```bash
pip install --upgrade claude-code-llm-router
llm-router install

If using CC plugin:

claude plugin uninstall llm-router && claude plugin install llm-router
```

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track ypollak2/llm-router

Get notified when new releases ship.

Sign up free

About ypollak2/llm-router

Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.

All releases →

Related context

Earlier breaking changes

  • v9.2.0 Changes auto‑route directive from advisory "DO NOT SKIP" to hard constraint with explicit blocked tools list.
  • v9.2.0 Breaks permanent downgrade of enforcement after first Edit/Write; v13 now requires per‑turn routing.

Beta — feedback welcome: [email protected]