AI Security

LLM security: prompt injection, jailbreak detection, guardrails, and adversarial evaluation.

No immediate action
beelzebub v3.8.0 New feature

Preserve TCP raw bytes

Review required
msaad00/agent-bom v0.88.5 New feature
Auth RBAC Breaking upgrade

UI, onboarding, gateway, findings, observability, graph, hardening, deps, runtime

No immediate action
HookGuard v0.1.0 New feature

HG-001‑004 detectors + scanner + CLI

v0.85.0 (1mo) Inter-agent firewall foundation
beelzebub v3.7.1 New feature
Notable features
  • Add realClientAddr configuration option
  • Improve CLI functionality and increase code coverage
v3.7.0 (1mo) Plugin system
beelzebub v3.6.10 New feature
Notable features
  • Interactive command matching for TCP
  • LLM integration for responses
v3.6.9 (1mo) Maze plugin improvements
AI-Infra-Guard by Tencent Zhuque Lab v4.1.4 New feature
Notable features
  • MCP Scan: multi‑turn red team attack module with TAP and Crescendo strategies
  • System API: data auto‑sync endpoints (`POST /api/v1/system/update-data`, `GET /api/v1/system/update-status`)
  • Agent Scan API: inline `agent_config` support, optional verify flag
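The two data auto-sync endpoints listed above can be exercised with a couple of plain HTTP requests. The sketch below only builds the requests using the standard library. The base URL, the empty POST body, and the helper name `build_update_requests` are assumptions for illustration; the release notes give only the paths and methods, not the request or response schema.

```python
from urllib.request import Request

# Assumption: the service is reachable at this base URL; adjust to your deployment.
BASE_URL = "http://localhost:8088"


def build_update_requests(base_url: str = BASE_URL):
    """Build (but do not send) the trigger and status requests for data auto-sync."""
    # Assumption: the trigger endpoint accepts an empty body.
    trigger = Request(f"{base_url}/api/v1/system/update-data", data=b"", method="POST")
    status = Request(f"{base_url}/api/v1/system/update-status", method="GET")
    return trigger, status


trigger, status = build_update_requests()
print(trigger.get_method(), trigger.full_url)
print(status.get_method(), status.full_url)
```

To actually send them, pass each request to `urllib.request.urlopen` and inspect the response; the response format is not described in the release notes.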
Verdict v0.2.0 New feature
Notable features
  • CSV support via `Dataset.from_csv()`; column names default to `input` and `ideal` and can be overridden with `input_field` / `output_field`
  • Arbitrary JSONL field mapping through the CLI flags `--input-field` / `--output-field` and the Python API
  • Label-free evaluation: datasets without reference answers are accepted, and reference-based metrics emit a clear error upfront
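To make the field-mapping and label-free behavior concrete, here is a minimal standard-library sketch of the idea. It is not Verdict's implementation or API: `load_rows` and `require_references` are hypothetical helpers, and only the default column names `input` and `ideal` come from the release notes.

```python
import csv
import io


def load_rows(text, input_field="input", output_field="ideal"):
    """Map CSV columns onto input/ideal pairs; ideal is None when the column is absent."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        if input_field not in record:
            raise KeyError(f"missing input column: {input_field!r}")
        rows.append({"input": record[input_field], "ideal": record.get(output_field)})
    return rows


def require_references(rows, metric_name):
    """Fail before evaluation starts if a reference-based metric has no references."""
    if any(row["ideal"] is None for row in rows):
        raise ValueError(f"{metric_name} needs reference answers, but some rows have none")


# Custom column names mapped onto the default input/ideal roles.
labeled = load_rows("question,answer\n2+2?,4\n", input_field="question", output_field="answer")

# A label-free dataset: only the input column is present.
unlabeled = load_rows("input\nSummarize this text\n")
```

Calling `require_references(unlabeled, "exact_match")` raises `ValueError` immediately, which mirrors the upfront error described for reference-based metrics, while `require_references(labeled, "exact_match")` passes silently.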
v0.9.3 (1mo) pytest plugin, pre-commit hook, ASGI proxy, ActionClaim
Armorer v0.0.1-urlcheck-20260404113020 New feature
Notable features
  • Persisted alert and runtime monitoring added
  • GitHub issue workflow planning pipeline introduced
  • Automated GitHub issue execution with Claude Code
v0.6.1 (2mo) Regex combination + LRU cache
v0.4.1 (2mo) Async guardrails + performance
