This release adds 5 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryComplete agent resource budgeting system with session allocation, tracking, reconciliation and cost estimation.
Full changelog
Agent Resource Budgeting
Complete agent resource budgeting system with provisional tracking and reconciliation.
Key Features
- Session Budget Allocation — Smart budget carving based on quota pressure (30% allocation)
- Provisional Spend Tracking — Real-time budget management preventing budget over-commitment
- Budget Reconciliation — 50% refunds on failure, final cost on success
- Cost Estimation — Accurate estimates by complexity (simple $0.15, moderate $1.00, complex $4.00)
- Hard Limits — Per-agent max $5.00, session max $50.00
What's Changed
src/llm_router/hooks/agent-route.py: Session budget initialization + provisional trackingsrc/llm_router/hooks/agent-error.py: Budget reconciliation with smart refundstests/test_agent_resource_budgeting.py: 12 comprehensive tests (all passing)
Fixed
- Test helper bug where budget file was reinitializing on every call instead of persisting
See CHANGELOG.md for full details.
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About ypollak2/llm-router
Subscription-aware LLM router for Claude Code. Routes tasks to 20+ providers (OpenAI, Gemini, Groq, Ollama, Codex) based on complexity classification, Claude subscription pressure, and cost. Free tasks stay on Claude subscription; expensive tasks fall back to the cheapest capable model. Includes 30 MCP tools, 6 auto-routing hooks, semantic dedup cache, prompt caching, daily spend cap, and a live web dashboard.
Related context
Related tools
Beta — feedback welcome: [email protected]