This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryTeam Dashboard adds cost‑savings reporting and multi‑channel push notifications.
Full changelog
Team Dashboard + Multi-Channel Push
New Features
Team Dashboard — See your team's LLM cost savings across projects with multi-channel push notifications:
llm-router team report # Print savings dashboard
llm-router team push # Push to Slack/Discord/Telegram
llm-router team setup # Interactive webhook wizard
New MCP Tools
llm_team_report— ASCII savings table with call counts, free-tier %, top modelsllm_team_push— Push to configured webhook (37 total tools)
Multi-Channel Push (auto-detected from URL)
- 🟣 Slack: Block Kit with header, fields, model list
- 🟦 Discord: Rich embed with color and fields
- 📱 Telegram: MarkdownV2 with proper escaping
- 🔗 Generic: Raw JSON POST for custom webhooks
Ollama Always Injected
Ollama is now always prepended to routing chains when OLLAMA_BASE_URL is set — it's free, so there's no reason to ever skip it.
Smart Enforcement Mode (default for new users)
- Q&A tasks (
query,research,generate,analyze) → hard block untilllm_*called - Code tasks → soft (file tools are allowed for actual editing)
llm-router set-enforce hard # Block all tools until routing used
llm-router set-enforce smart # Default: hard for Q&A, soft for code
llm-router set-enforce soft # Suggest routing but never block
llm-router set-enforce off # Disable enforcement entirely
Upgrade
pip install --upgrade claude-code-llm-router
llm-router install
Configuration
# Set team webhook (Slack, Discord, Telegram, or any HTTP endpoint)
export LLM_ROUTER_TEAM_ENDPOINT="https://hooks.slack.com/..."
export LLM_ROUTER_TEAM_CHAT_ID="..." # Telegram only
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]