This release adds 5 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+12 more
Summary
AI summaryNew commands and native adapters add live regression watching, badge generation, dashboard monitoring, Pydantic AI and CrewAI integration.
Full changelog
What's New
New Commands
evalview watch— re-run regression checks on every file save with live scorecard ($0 in quick mode)evalview badge— shields.io status badge, auto-updates on every checkevalview monitor --dashboard— live terminal dashboard with per-test history dots
Native Adapters
- Pydantic AI (
pydantic-ai) — callsagent.run()in-process, extracts tool calls from typed messages - CrewAI (
crewai-native) — callscrew.kickoff()in-process, captures tools via event bus
Smart DX
- Assertion wizard — capture real traffic, get pre-configured assertions automatically
- Auto-variant discovery —
--statistical N --auto-variantfinds and saves non-deterministic paths - Budget circuit breaker —
--budget 0.50enforces spend limits mid-execution - Eval profiles —
initauto-detects agent type and configures evaluators
Python API
gate(),gate_async(),gate_or_revert()— programmatic regression checks- OpenClaw integration with
check_and_decide()for autonomous loops
GitHub Action
- Auto PR comments, artifact uploads, version pinning — all in one step
Documentation
- CrewAI, Pydantic AI, and OpenClaw integration guides
- README rewritten for conversion
- 26 community issues for contributors
Full changelog: https://github.com/hidai25/eval-view/blob/main/CHANGELOG.md
Install: pip install evalview==0.5.5 or curl -fsSL https://raw.githubusercontent.com/hidai25/eval-view/main/install.sh | bash
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About hidai25/eval-view
Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
Related context
Related tools
Beta — feedback welcome: [email protected]