hidai25/eval-view
Developer Productivity
Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
This package is deprecated
This package has moved to evalview. Install with: npm install evalview
Features
- Detect silent behavior regressions in AI agents beyond simple pass/fail tests
- Classify drift as provider/model change vs. system regression with graded confidence
- Auto‑heal flaky failures and provide deterministic replay via captured tool calls
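The golden-baseline idea behind these features can be illustrated with a minimal sketch. This is not the evalview API: `save_baseline` and `detect_drift` are hypothetical helper names, and the assumption that a run is captured as a list of JSON-serializable tool-call dictionaries is made only for illustration.

```python
# Hypothetical sketch of golden-baseline drift detection.
# These helpers are illustrative only and are NOT part of evalview.
import json
import tempfile
from pathlib import Path


def save_baseline(path, tool_calls):
    """Store a captured tool-call sequence as the golden baseline."""
    Path(path).write_text(json.dumps(tool_calls, indent=2))


def detect_drift(path, tool_calls):
    """Return human-readable differences between a run and the baseline."""
    baseline = json.loads(Path(path).read_text())
    diffs = []
    if len(baseline) != len(tool_calls):
        diffs.append(f"call count changed: {len(baseline)} -> {len(tool_calls)}")
    # Compare calls position by position over the shared prefix.
    for i, (old, new) in enumerate(zip(baseline, tool_calls)):
        if old != new:
            diffs.append(f"call {i} changed: {old} -> {new}")
    return diffs


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        golden = Path(tmp) / "golden.json"
        save_baseline(golden, [{"tool": "search", "args": {"q": "weather"}}])
        current = [{"tool": "search", "args": {"q": "news"}}]
        for line in detect_drift(golden, current):
            print(line)
```

In a CI job, a non-empty result from a check like `detect_drift` would typically fail the build, which is one way a regression could be blocked even when the agent's final answer still passes a simple pass/fail test.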
Recent releases
View all 32 releases →
About
Stars: 112
Forks: 20
Languages: Python, Makefile, TypeScript
Downloads/week: 17 (↑5%)
NPM maintainers: 1
Contributors: 15
TypeScript: types included ✓
Install & Platforms
Install via: pip