This release includes breaking changes for platform teams planning a safe upgrade.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryAutomatic quality scoring added for routed responses with a per-model threshold.
Full changelog
Major release: Automatic quality scoring, developer-first README rewrite, documentation consistency pass, stale asset cleanup.
Added
- Quality Feedback Loop (Sprint 4) —
src/llm_router/quality_feedback.py- Auto-scores every routed response using content heuristics (code blocks, structure, refusals, citations)
- Per-model quality tracking with minimum-calls threshold (3) before trusting signal
should_skip_model()— routing engine skips models with avg quality < 0.4 for specific task patterns- Integrated into
router.pydispatch loop and alltext.pytools (query, research, generate, analyze, code) - 23 new tests in
tests/test_quality_feedback.py
Changed
- Complete README rewrite — developer-first, text-based, high-trust landing page
- No images/SVGs — shields.io badges only
- Honest "Use this if / Don't use this if" section
- Accurate tool count (60 MCP tools), package names, provider list
- ASCII architecture diagram, markdown tables throughout
- Documentation consistency pass — corrected "48 tools" → "60 tools" across 10+ docs
- Package name corrections —
pip install llm-routingconsistently referenced - Tool count standardized — 60 MCP tools (56 llm_* + 4 agoragentic_*) across all docs
Removed
- 18 orphaned SVG assets from
docs/readme/(stale claims, zero references)
Fixed
SECURITY.mdreferenced wrong package name (claude-code-llm-router→llm-routing)HOST_SUPPORT_MATRIX.mdreferenced wrong install commandserver.pyanddocs/TOOLS.mdhad outdated tool counts
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]