hidai25/eval-view

v0.5.5 Feature

This release adds 5 notable features for engineering teams evaluating rollout.

✓ No known CVEs patched in this version

Topics

agent-benchmark agent-evaluation agentic-ai ai-agents anthropic autogen cli crewai evaluation langchain-agent langgraph llm mcp openai-assistants pytest python regression-testing testing

Summary

New commands add live regression watching, badge generation, and dashboard monitoring; native adapters add Pydantic AI and CrewAI integration.

Full changelog

What's New

New Commands

  • evalview watch — re-run regression checks on every file save with live scorecard ($0 in quick mode)
  • evalview badge — shields.io status badge, auto-updates on every check
  • evalview monitor --dashboard — live terminal dashboard with per-test history dots
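
As a rough illustration of how a shields.io status badge can be driven by check results, the sketch below builds a payload in the shields.io "endpoint badge" JSON format. This is an assumption about the general mechanism, not evalview's actual implementation: the function name `badge_payload`, the label, and the colour choices are all hypothetical.

```python
import json


def badge_payload(passed: int, total: int, label: str = "evalview") -> dict:
    """Build a shields.io endpoint-badge payload (hypothetical helper, not part of evalview)."""
    if total <= 0:
        # Nothing ran, so show a neutral badge.
        message, color = "no tests", "lightgrey"
    elif passed == total:
        message, color = f"{passed}/{total} passing", "brightgreen"
    else:
        message, color = f"{passed}/{total} passing", "red"
    # schemaVersion, label, message and color are the fields of the
    # shields.io endpoint schema.
    return {"schemaVersion": 1, "label": label, "message": message, "color": color}


if __name__ == "__main__":
    print(json.dumps(badge_payload(9, 10), indent=2))
```

Serving this JSON at a public URL and pointing a shields.io endpoint badge at it would refresh the badge whenever the file is regenerated.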

Native Adapters

  • Pydantic AI (pydantic-ai) — calls agent.run() in-process, extracts tool calls from typed messages
  • CrewAI (crewai-native) — calls crew.kickoff() in-process, captures tools via event bus

Smart DX

  • Assertion wizard — capture real traffic, get pre-configured assertions automatically
  • Auto-variant discovery: --statistical N --auto-variant finds and saves non-deterministic paths
  • Budget circuit breaker: --budget 0.50 enforces spend limits mid-execution
  • Eval profiles: init auto-detects agent type and configures evaluators
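
The idea behind a budget circuit breaker can be sketched in a few lines: track cumulative spend and stop the run once the limit is crossed. This is a minimal sketch of the concept only; `BudgetBreaker`, `BudgetExceeded`, and `run_tests` are hypothetical names, and how evalview measures cost internally is not described in these notes.

```python
class BudgetExceeded(RuntimeError):
    """Raised when cumulative spend passes the configured limit."""


class BudgetBreaker:
    """Hypothetical spend tracker; not evalview's internal class."""

    def __init__(self, limit_usd: float) -> None:
        self.limit_usd = limit_usd
        self.spent_usd = 0.0

    def charge(self, cost_usd: float) -> None:
        # Record the cost, then trip if the running total is over the limit.
        self.spent_usd += cost_usd
        if self.spent_usd > self.limit_usd:
            raise BudgetExceeded(
                f"spent ${self.spent_usd:.2f} of ${self.limit_usd:.2f} budget"
            )


def run_tests(costs: list[float], breaker: BudgetBreaker) -> int:
    """Return how many tests stayed within budget before the breaker tripped."""
    completed = 0
    for cost in costs:
        try:
            breaker.charge(cost)
        except BudgetExceeded:
            break  # stop the run mid-execution instead of finishing the suite
        completed += 1
    return completed


if __name__ == "__main__":
    # Three tests at $0.20 each against a $0.50 limit: the third trips the breaker.
    print(run_tests([0.20, 0.20, 0.20], BudgetBreaker(limit_usd=0.50)))
```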

Python API

  • gate(), gate_async(), gate_or_revert() — programmatic regression checks
  • OpenClaw integration with check_and_decide() for autonomous loops
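
To make "programmatic regression check" concrete, the sketch below compares the tool-call sequence of each test against a saved golden baseline and reports any drift. It does not use or reproduce the signatures of gate(), gate_async(), or gate_or_revert(), which are not given in these notes; `check_against_baseline` and `GateResult` are hypothetical names chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class GateResult:
    """Outcome of a baseline comparison (hypothetical type, not evalview's)."""

    passed: bool
    diffs: list[str] = field(default_factory=list)


def check_against_baseline(
    baseline: dict[str, list[str]],
    current: dict[str, list[str]],
) -> GateResult:
    """Compare per-test tool-call sequences against a golden baseline."""
    diffs: list[str] = []
    for test_name, expected_tools in baseline.items():
        actual_tools = current.get(test_name)
        if actual_tools is None:
            diffs.append(f"{test_name}: missing from current run")
        elif actual_tools != expected_tools:
            diffs.append(f"{test_name}: expected {expected_tools}, got {actual_tools}")
    return GateResult(passed=not diffs, diffs=diffs)


if __name__ == "__main__":
    golden = {"refund_flow": ["lookup_order", "issue_refund"]}
    latest = {"refund_flow": ["lookup_order", "escalate"]}
    result = check_against_baseline(golden, latest)
    print(result.passed, result.diffs)
```

A caller could use the boolean to fail a CI job, or, in an autonomous loop, to decide whether to keep or roll back a change.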

GitHub Action

  • Auto PR comments, artifact uploads, version pinning — all in one step

Documentation

  • CrewAI, Pydantic AI, and OpenClaw integration guides
  • README rewritten for conversion
  • 26 community issues for contributors

Full changelog: https://github.com/hidai25/eval-view/blob/main/CHANGELOG.md

Install: pip install evalview==0.5.5 or curl -fsSL https://raw.githubusercontent.com/hidai25/eval-view/main/install.sh | bash

About hidai25/eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
