hidai25/eval-view

v0.4.1 Security

This release includes 1 security fix for security teams reviewing exposed deployments.

Published 4mo Developer Productivity

✓ No known CVEs patched

This release patches 1 known CVE

Topics

agent-benchmark agent-evaluation agentic-ai ai-agents anthropic autogen

cli crewai evaluation langchain-agent langgraph llm mcp openai-assistants pytest python regression-testing testing

Affected surfaces

rce_ssrf auth

Summary

AI summary

The PII evaluator adds opt-in detection for emails, phone numbers, SSNs, and credit cards, with Luhn validation applied to card numbers.

Full changelog

What's New

Mistral Adapter

Direct Mistral API support via pip install evalview[mistral]
Lazy import — no dependency unless you use it
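The lazy-import behaviour described above can be illustrated with a minimal sketch. This is not the adapter's actual code: `load_optional` and `LazyAdapter` are hypothetical names, and the only detail taken from the release notes is the `evalview[mistral]` extra. The idea is that the optional SDK is imported on first use, so users who never touch the adapter never need the dependency.

```python
import importlib


def load_optional(module_name: str, extra: str):
    """Import an optional dependency on demand (hypothetical helper)."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        # Point the user at the extra that provides the missing package.
        raise ImportError(
            f"{module_name!r} is not installed. "
            f"Install it with: pip install evalview[{extra}]"
        ) from exc


class LazyAdapter:
    """Hypothetical adapter that defers its SDK import until first access."""

    def __init__(self, module_name: str, extra: str):
        self._module_name = module_name
        self._extra = extra
        self._sdk = None  # nothing is imported at construction time

    @property
    def sdk(self):
        # Import once, on first access, then reuse the cached module.
        if self._sdk is None:
            self._sdk = load_optional(self._module_name, self._extra)
        return self._sdk
```

Constructing `LazyAdapter(...)` never fails on a missing package; the `ImportError` with the install hint surfaces only when `.sdk` is first read.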

PII Evaluator

Opt-in detection for emails, phones, SSNs, credit cards, addresses
Luhn algorithm validation for credit cards to reduce false positives
Enable with checks: { pii: true } in test YAML
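To show why Luhn validation reduces false positives, here is a minimal sketch of a checksum-gated card detector. It is an illustration only, not the evaluator's implementation: the regular expression, the 13 to 19 digit length range, and the function names are assumptions. A digit run that merely looks like a card number is reported only if its Luhn checksum is valid.

```python
import re

# Assumed candidate pattern: 13-19 digits, optionally separated by spaces or hyphens.
CARD_CANDIDATE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not 13 <= len(digits) <= 19:
        return False
    total = 0
    # Walk from the rightmost digit and double every second digit.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9  # same as summing the two digits of the product
        total += digit
    return total % 10 == 0


def find_card_numbers(text: str) -> list[str]:
    """Return candidate card numbers in `text` that also pass the Luhn check."""
    return [
        match.group(0)
        for match in CARD_CANDIDATE.finditer(text)
        if luhn_valid(match.group(0))
    ]
```

With this gate, a 16-digit order ID such as `1234567890123456` is ignored because its checksum fails, while the well-known test number `4111-1111-1111-1111` is still flagged.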

Multi-Turn HTML Reports

Mermaid sequence diagrams showing conversation turns with tool calls
Per-turn query and tool breakdown in the Execution Trace tab

Security

GitHub Action: replaced eval $CMD with bash arrays, moved inputs to env vars
Mermaid diagrams: fixed autoescape breaking arrows, sanitized user content
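The GitHub Action fix itself is in bash, but the principle behind replacing `eval $CMD` with arrays carries over to any language. The sketch below is a Python analogy, not the action's code: the `INPUT_TEST_PATH` variable name and the `build_command` helper are hypothetical. Inputs arrive through an environment mapping and are appended as separate elements of an argument list, so no shell ever re-parses them.

```python
import subprocess
import sys


def build_command(env: dict) -> list[str]:
    """Build an argument vector from env-provided inputs (hypothetical names)."""
    # The child process simply echoes the arguments it received.
    cmd = [sys.executable, "-c", "import sys; print(sys.argv[1:])"]
    test_path = env.get("INPUT_TEST_PATH", "")
    if test_path:
        cmd.append(test_path)  # one list element is one argument, never re-parsed
    return cmd


def run(env: dict) -> str:
    """Run the command without a shell and return its stripped stdout."""
    result = subprocess.run(
        build_command(env), capture_output=True, text=True, check=True
    )
    return result.stdout.strip()
```

Calling `run({"INPUT_TEST_PATH": "tests; echo pwned"})` passes the whole string as a single argument. The `; echo pwned` part is never executed as a second command, which is the property that building a command string and evaluating it cannot guarantee.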

README

New hero section with logo, sequence diagram screenshot, data flow diagram
"Your data stays local" privacy explanation
Updated model version examples to Claude 4.5/4.6

Full Changelog: https://github.com/hidai25/eval-view/compare/v0.4.0...v0.4.1

Security Fixes

Mermaid diagram autoescape issue fixed; user content now sanitized to prevent injection attacks
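As an illustration of what sanitizing user content for a Mermaid sequence diagram can involve, here is a minimal sketch. It is not the project's implementation: the allow-list, the length cap, and the function names are assumptions, and it relies on Mermaid's numeric entity codes (for example `#59;` for a semicolon). Whitespace is collapsed so user text cannot start a new diagram statement, and every character outside a small allow-list is encoded.

```python
import re

# Assumed allow-list: characters that are passed through unchanged.
_SAFE = re.compile(r"[A-Za-z0-9 _.,!?()/-]")


def sanitize_label(text: str, max_len: int = 80) -> str:
    """Make untrusted text safe to use as a single-line Mermaid message."""
    # Collapse newlines and runs of whitespace into single spaces.
    collapsed = " ".join(text.split())
    if len(collapsed) > max_len:
        collapsed = collapsed[: max_len - 3] + "..."
    # Encode everything outside the allow-list as a numeric entity code.
    return "".join(
        ch if _SAFE.fullmatch(ch) else f"#{ord(ch)};" for ch in collapsed
    )


def message_line(sender: str, receiver: str, text: str) -> str:
    """Render one sequence-diagram message; sender and receiver are assumed trusted."""
    return f"    {sender}->>{receiver}: {sanitize_label(text)}"
```

The diagram's own arrows are written by the template (`->>` in `message_line`), while arrow-like or tag-like sequences inside user text are encoded. Keeping those two paths separate avoids the failure mode where blanket escaping also mangles the arrows the diagram needs.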

About hidai25/eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
