hidai25/eval-view

v0.4.1 Security

This release includes 1 security fix, relevant to security teams reviewing exposed deployments.

✓ No known CVEs patched; 1 security fix included

Topics

agent-benchmark agent-evaluation agentic-ai ai-agents anthropic autogen cli crewai evaluation langchain-agent langgraph llm mcp openai-assistants pytest python regression-testing testing

Affected surfaces

rce_ssrf auth

Summary

AI summary

The PII evaluator adds opt-in detection of emails, phone numbers, SSNs, and credit card numbers, with Luhn validation for credit cards.

Full changelog

What's New

Mistral Adapter

  • Direct Mistral API support via pip install evalview[mistral]
  • Lazy import — no dependency unless you use it

PII Evaluator

  • Opt-in detection for emails, phones, SSNs, credit cards, addresses
  • Luhn algorithm validation for credit cards to reduce false positives
  • Enable with checks: { pii: true } in test YAML
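
To illustrate why Luhn validation reduces false positives, here is a minimal Python sketch of the standard Luhn checksum applied to regex candidates. It is not the evaluator's actual code: the names `luhn_valid`, `find_card_numbers`, and `CARD_CANDIDATE`, the regex, and the 13 to 19 digit length bound are all assumptions made for illustration.

```python
import re
from typing import List

# Hypothetical candidate pattern: 13 to 19 digits, optionally separated
# by single spaces or hyphens. Not taken from the evalview source.
CARD_CANDIDATE = re.compile(r"\b(?:\d[ -]?){12,18}\d\b")


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not 13 <= len(digits) <= 19:
        return False
    total = 0
    # Walk right to left, doubling every second digit and subtracting 9
    # from any doubled value above 9 (same as summing its two digits).
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_card_numbers(text: str) -> List[str]:
    """Return digit runs that look like card numbers AND pass Luhn."""
    return [
        m.group(0)
        for m in CARD_CANDIDATE.finditer(text)
        if luhn_valid(m.group(0))
    ]
```

A 16-digit order ID such as `1234 5678 9012 3456` matches the pattern but fails the checksum, so it would not be reported, while the well-known test number `4111 1111 1111 1111` passes.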

Multi-Turn HTML Reports

  • Mermaid sequence diagrams showing conversation turns with tool calls
  • Per-turn query and tool breakdown in the Execution Trace tab

Security

  • GitHub Action: replaced eval $CMD with bash arrays, moved inputs to env vars
  • Mermaid diagrams: fixed autoescape breaking arrows, sanitized user content
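
The GitHub Action fix follows a general rule: never let a shell re-parse untrusted input. The sketch below is a Python analogue of that idea, not the Action's bash code. It shows the two techniques named above, an argument list instead of an evaluated string and an environment variable instead of interpolation. The function names and the `USER_INPUT` variable are hypothetical.

```python
import os
import subprocess
import sys


def run_with_args(user_value: str) -> str:
    """Pass untrusted text as one argv element, with no shell involved."""
    # Each list element reaches the child process verbatim, so shell
    # metacharacters such as ';' or '$(...)' are never interpreted.
    cmd = [sys.executable, "-c", "import sys; print(sys.argv[1])", user_value]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.rstrip("\n")


def run_with_env(user_value: str) -> str:
    """Pass untrusted text through an environment variable."""
    # The child reads the value as data; it is never spliced into the
    # command text itself.
    env = dict(os.environ, USER_INPUT=user_value)
    cmd = [sys.executable, "-c", "import os; print(os.environ['USER_INPUT'])"]
    result = subprocess.run(
        cmd, capture_output=True, text=True, check=True, env=env
    )
    return result.stdout.rstrip("\n")
```

With a payload like `x; echo INJECTED $(whoami)`, both functions return the payload unchanged as plain text, whereas building a single command string and evaluating it would execute the injected command.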

README

  • New hero section with logo, sequence diagram screenshot, data flow diagram
  • "Your data stays local" privacy explanation
  • Updated model version examples to Claude 4.5/4.6

Full Changelog: https://github.com/hidai25/eval-view/compare/v0.4.0...v0.4.1

Security Fixes

  • Mermaid diagram autoescape issue fixed; user content now sanitized to prevent injection attacks
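
As a rough illustration of what sanitizing user content for a Mermaid sequence diagram can involve, the sketch below replaces characters that carry meaning in Mermaid message text with Mermaid's `#NN;` decimal entity codes. This is an assumption about one reasonable approach, not the project's implementation; `sanitize_mermaid_text`, `message_line`, and the 80-character cap are hypothetical.

```python
# Characters with special meaning in Mermaid message text, mapped to
# Mermaid's decimal entity syntax (e.g. '#59;' renders as ';').
_MERMAID_ESCAPES = {
    ord("#"): "#35;",   # entity prefix itself
    ord(";"): "#59;",   # statement separator
    ord("<"): "#60;",   # HTML tag open
    ord(">"): "#62;",   # HTML tag close / arrow head
    ord('"'): "#34;",   # string delimiter
    ord("\n"): " ",     # a newline would start a new statement
    ord("\r"): " ",
}


def sanitize_mermaid_text(text: str, max_len: int = 80) -> str:
    """Escape untrusted text for use as a sequence-diagram message."""
    # Truncate first so an entity code is never cut in half.
    return text[:max_len].translate(_MERMAID_ESCAPES)


def message_line(sender: str, receiver: str, text: str) -> str:
    """Build one message line; sender and receiver are assumed trusted."""
    return f"    {sender}->>{receiver}: {sanitize_mermaid_text(text)}"
```

Because `str.translate` maps each character exactly once, the `#` in an emitted entity code is not escaped a second time. HTML escaping of the surrounding report template is a separate concern and still applies outside the diagram source.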


About hidai25/eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
