This release includes breaking changes for platform teams planning a safe upgrade.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryAdded system context for Ollama, Gemini, and OpenAI direct calls and simplified Direct Execution output format.
Full changelog
v9.0.7 — Polished Direct Execution & System Context (2026-05-24)
Added
- System Context for Direct Exec — added a specialized system prompt for Ollama, Gemini, and OpenAI direct calls so they know they are providing responses for a Claude Code user within
llm-router. - Cleaner Response Format — simplified the Direct Execution output in the terminal. Removed verbose headers and separators in favor of a clean message with a minimal, dimmed metadata footer.
Fixed
- Response Rendering — improved ANSI styling for direct responses to feel more integrated into the conversation.
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]