Skip to content

Release history

hidai25/eval-view releases

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.

All releases

32 shown

No immediate action
v0.8.0 New feature

Cassettes + schedule cron

No immediate action
v0.7.1 Breaking risk

TOML test cases + CSV log import

No immediate action
v0.7.0 New feature

Aider CLI adapter

No immediate action
v0.6.2 New feature

Closed-model drift detection

No immediate action
v0.6.1 New feature

MCP feature parity + new tools

No immediate action
v0.6.0 New feature

Auto-heal engine

No immediate action
v0.5.5 New feature

Commands + Native adapters

No immediate action
v0.5.4 New feature

LLM model update + OpenClaw

No immediate action
v0.5.3 Breaking risk

HTML report redesign

No immediate action
v0.5.2 New feature

Cold‑start test generation + GPT‑5 support

No immediate action
v0.5.1 New feature

evalview generate + approvals

No immediate action
v0.5.0 New feature

Regression monitoring + Slack alerts

Review required
v0.4.1 New feature
RCE / SSRF Auth

PII evaluation

No immediate action
v0.4.0 New feature

Multi-turn conversation testing

Review required
v0.3.2 Bug fix
Auth

Auth fix + timeout increase

No immediate action
v0.3.0 Breaking risk

Claude Code MCP integration

No immediate action
v0.2.9 Bug fix

Strip ANSI from MCP output

Upgrade now
v0.2.8 Bug fix
Breaking upgrade

Bug fixes enable full workflow

No immediate action
v0.2.7 Bug fix

Adapter method fix + CLI version

No immediate action
v0.2.6 New feature

Claude Code integration

No immediate action
v0.2.5 New feature

AGENT HEALTHY/REGRESSION DETECTED

Config change
v0.2.4 New feature
Auth

/skill command + --dangerously-skip-permissions

No immediate action
v0.2.3 New feature

Partial credit for sequence evaluation

No immediate action
v0.2.1 New feature

/run, /test, /adapters, /compare

No immediate action
v0.2.0 New feature

Subsequence matching + reliability metrics

No immediate action
v0.1.9 Feature

Interactive chat + Ollama

No immediate action
v0.1.8 Bug fix

Division fix + list shadowing

No immediate action
v0.1.7 Bug fix

Goose fix + Skill Doctor example

No immediate action
v0.1.6 New feature

Claude Code & OpenAI Codex testing

No immediate action
v0.1.5 New feature

Statistical Pass/Fail System

No immediate action
v0.1.4 New feature

Ollama support

No immediate action
v0.1.3 New feature

EvalView GitHub Action

Beta — feedback welcome: [email protected]