ypollak2/llm-router

v5.1.0 Feature

This release adds 5 notable features for engineering teams evaluating rollout.

Published 3mo LLM Frameworks

View tool

✓ No known CVEs patched

Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai-routing anthropic claude claude-code cost-optimization gemini

+7 more

litellm llm llm-router mcp-server model-router ollama openai

Summary

AI summary

Added llm-router budget CLI, Dashboard Budget tab, Helicone and LiteLLM spend integrations, and multi-source aggregation.

Full changelog

What's new

llm-router budget CLI — list, set <provider> <amount>, remove <provider> with persistent storage in ~/.llm-router/budgets.json
Dashboard Budget tab — editable per-provider cap inputs, live Save, Prometheus /metrics endpoint
Helicone integration — routing property headers on every call + optional spend pull (LLM_ROUTER_HELICONE_PULL=true)
LiteLLM BudgetManager integration — read per-provider monthly spend from LiteLLM Proxy SQLite DB
Multi-source spend aggregation — local SQLite + Helicone + LiteLLM spend used concurrently; max ensures pressure is never under-reported
Onboarding nudges — setup wizard and llm_budget tool prompt uncapped providers
35 new tests (828 total)

Upgrade

pip install --upgrade claude-code-llm-router && llm-router install

Quick start with budget caps

llm-router budget set openai 20
llm-router budget set gemini 5
llm-router budget list

View diff on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Share on X Share on Bluesky

Track ypollak2/llm-router

Get notified when new releases ship.

About ypollak2/llm-router

Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.

All releases →

Related context

Related tools

Earlier breaking changes

v9.2.0 Changes auto‑route directive from advisory "DO NOT SKIP" to hard constraint with explicit blocked tools list.
v9.2.0 Breaks permanent downgrade of enforcement after first Edit/Write; v13 now requires per‑turn routing.