ypollak2/llm-router

v3.2.1 Bugfix

This release fixes issues for SREs watching stability and regressions.

Published 1mo LLM Frameworks
✓ No known CVEs patched

Topics

ai-routing anthropic claude claude-code cost-optimization gemini
litellm llm llm-router mcp-server model-router ollama openai

Summary

AI summary

Fixed session-end hook to include JSONL records in cumulative savings totals immediately.

Full changelog

What's fixed

Session-end hook: cumulative savings now include JSONL records

Codex and Ollama calls that bypass the MCP server are written to `savings_log.jsonl` by the PostToolUse hook. Because of a one-session lag, those records were missing from the cumulative totals in the session summary.

  • `_sync_import_savings_log()` now runs before `_query_cumulative_savings()` at session end
  • `savings_stats` is now unioned into every period total alongside `usage`
  • The "today / this week / this month / all time" rows now reflect the full picture immediately
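The ordering fix can be illustrated with a minimal sketch. It is not the project's actual code: the table columns (`day`, `saved_usd`), the JSONL record fields, and the function signatures below are all assumptions made for illustration. Only the names `savings_stats`, `usage`, and the import-before-query order come from the changelog.

```python
import json
import sqlite3


def sync_import_savings_log(conn, jsonl_lines):
    """Import pending JSONL records into savings_stats (hypothetical schema)."""
    rows = []
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the log
        rec = json.loads(line)
        rows.append((rec["day"], rec["saved_usd"]))
    conn.executemany(
        "INSERT INTO savings_stats (day, saved_usd) VALUES (?, ?)", rows
    )
    conn.commit()
    return len(rows)


def query_cumulative_savings(conn, since_day):
    """Sum savings across both tables for days on or after since_day."""
    row = conn.execute(
        """
        SELECT COALESCE(SUM(saved_usd), 0.0) FROM (
            SELECT saved_usd FROM usage WHERE day >= ?
            UNION ALL
            SELECT saved_usd FROM savings_stats WHERE day >= ?
        )
        """,
        (since_day, since_day),
    ).fetchone()
    return row[0]


def session_end_summary(conn, jsonl_lines, since_day):
    """Import first, then query, so the total has no one-session lag."""
    sync_import_savings_log(conn, jsonl_lines)
    return query_cumulative_savings(conn, since_day)


def make_demo_db():
    """Build an in-memory database with the two assumed tables."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE usage (day TEXT, saved_usd REAL)")
    conn.execute("CREATE TABLE savings_stats (day TEXT, saved_usd REAL)")
    conn.execute("INSERT INTO usage VALUES ('2024-01-02', 1.5)")
    conn.commit()
    return conn
```

The sketch uses `UNION ALL` rather than `UNION` so that two records with identical amounts are both counted. Whether the real query deduplicates is not stated in the changelog.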

Also in this release

  • ROADMAP updated: v2.6, v3.2, v3.3, v3.4 all marked ✅ Complete

Upgrade

```bash
pip install --upgrade claude-code-llm-router && llm-router install
```

About ypollak2/llm-router

Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.

Related context

Earlier breaking changes

  • v9.2.0 Changes auto‑route directive from advisory "DO NOT SKIP" to hard constraint with explicit blocked tools list.
  • v9.2.0 Breaks permanent downgrade of enforcement after first Edit/Write; v13 now requires per‑turn routing.
