hidai25/eval-view

v0.1.8 Feature

This release adds three notable features: optional uv support, CI hardening, and a LangGraph example agent.

✓ No known CVEs patched

Topics

agent-benchmark agent-evaluation agentic-ai ai-agents anthropic autogen cli crewai evaluation langchain-agent langgraph llm mcp openai-assistants pytest python regression-testing testing

Summary

AI summary

Fixed the demo agent's division operation and unshadowed the list builtin.

Full changelog

What's New

uv Support for Faster Development

  • Added uv as an optional package manager for 2x faster installs
  • Both pip and uv are fully supported - use whichever you prefer
  • New Makefile targets: make install (uv) and make pip-install (pip)

CI Hardening

  • Config file support for ci.fail_on and ci.warn_on
  • Exit code 2 for execution errors (network, timeout) vs exit code 1 for test failures
  • Model/provider validation to catch mismatches early (e.g., llama3.2 + openai)
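
The exit code split above lets a CI wrapper treat infrastructure problems differently from real test failures. The following is a minimal sketch, not part of evalview: the helper names `classify` and `run_with_retry` are hypothetical, and treating exit code 0 as success is an assumption based on the usual convention. It retries a command only when it exits with code 2.

```python
import subprocess
import sys

# Codes 1 and 2 follow the release notes; 0 meaning success is an assumption.
PASSED, TEST_FAILURE, EXECUTION_ERROR = 0, 1, 2


def classify(returncode: int) -> str:
    """Map a process exit code to a human-readable outcome."""
    if returncode == PASSED:
        return "passed"
    if returncode == TEST_FAILURE:
        return "test failure"
    if returncode == EXECUTION_ERROR:
        return "execution error"
    return "unknown"


def run_with_retry(cmd, max_attempts=3):
    """Run cmd, retrying only on EXECUTION_ERROR; return the final exit code."""
    code = EXECUTION_ERROR
    for _ in range(max_attempts):
        code = subprocess.run(cmd).returncode
        if code != EXECUTION_ERROR:
            # A pass or a genuine test failure is final; do not retry it.
            return code
    return code


if __name__ == "__main__":
    # Pass the command to run as arguments, e.g. your pipeline's evalview call.
    final = run_with_retry(sys.argv[1:])
    print(classify(final))
    sys.exit(final)
```

Retrying only on code 2 keeps flaky networks from failing a build while still surfacing real regressions (code 1) on the first run.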

LangGraph Example

  • Added complete LangGraph example agent at examples/langgraph/agent/
  • Includes ReAct agent with search and calculator tools

UX Improvements

  • Diverse quickstart test cases (division, weather, multi-tool, multiplication)
  • Demo agent now supports division operations
  • GitHub star CTA always shows after quickstart
  • evalview connect shows actual paths instead of placeholders

Bug Fixes

  • Fixed list builtin being shadowed by CLI command
  • Fixed demo agent division support
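
The shadowing bug can be illustrated with a small sketch. This is a hypothetical reconstruction of the general problem, not evalview's actual code: a command handler named `list` hides Python's builtin `list` for any later code in the same scope. All function names below are made up for illustration.

```python
def collect_broken(items):
    # A handler named `list` defined in this scope hides the builtin
    # for every later reference to `list` in the same scope.
    def list():
        return "listing test cases"

    try:
        return list(items)  # calls the handler, not the builtin
    except TypeError:
        return None  # the handler accepts no arguments, so the call fails


def list_cmd():
    # One possible fix: give the handler a different Python name. Many CLI
    # frameworks allow registering it under the command name "list" anyway.
    return "listing test cases"


def collect_fixed(items):
    return list(items)  # the builtin is visible again
```

Renaming the handler is only one option; referring to `builtins.list` explicitly would also work, at the cost of readability.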

Installation

pip install evalview==0.1.8

Full Changelog

https://github.com/hidai25/eval-view/compare/v0.1.7...v0.1.8

About hidai25/eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
