chernistry/bernstein

v1.8.4 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 3mo AI Agents & Assistants

✓ No known CVEs patched

Topics

agent-orchestrator agentic-ai ai-agents aider air-gap audit-trail

claude-code cli-tool codex-cli coding-agent deterministic-replay deterministic-scheduler hmac-audit mcp-server model-context-protocol multi-agent parallel-worktrees provenance python reproducibility

Affected surfaces

auth rbac

Summary

AI summary

Plan-and-Execute architecture formalized, agent identity cards with capability enforcement, and built-in eval framework introduced.

Full changelog

v1.8.4

Planning, identity, and evaluation.

Features

Plan-and-Execute architecture formalized. Planning and execution are now explicit phases with typed interfaces, so you can swap planners without touching executors.
Agent identity cards with capability enforcement. Every spawn carries a signed identity card; the orchestrator refuses tool calls outside the card's declared capabilities.
Built-in eval framework with per-model accuracy reporting. Useful for A/B-testing planners, routers, or adapter configs.
Canary deployments for prompt/model versions. Route a fraction of traffic to a new prompt or model; promote automatically on metric parity.
Opus alias upgraded to Claude Opus 4.7. The default "opus" alias now resolves to the 4.7 snapshot.
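
To make the identity-card item above concrete, here is a minimal sketch of how a signed card with capability enforcement could work. It is not Bernstein's implementation: the names `IdentityCard`, `sign_card`, and `authorize_tool_call`, the HMAC-SHA256 signing scheme, and the capability strings are all assumptions made for illustration.

```python
import hashlib
import hmac
import json
from dataclasses import dataclass

# Hypothetical names throughout; none of these come from Bernstein's codebase.


@dataclass(frozen=True)
class IdentityCard:
    agent_id: str
    capabilities: frozenset[str]
    signature: str = ""


def _payload(agent_id: str, capabilities: frozenset[str]) -> bytes:
    # Canonical serialization so the same card always signs to the same bytes.
    return json.dumps(
        {"agent_id": agent_id, "capabilities": sorted(capabilities)},
        sort_keys=True,
        separators=(",", ":"),
    ).encode("utf-8")


def sign_card(agent_id: str, capabilities: set[str], key: bytes) -> IdentityCard:
    # Issue a card whose signature covers both the agent id and its capabilities.
    caps = frozenset(capabilities)
    sig = hmac.new(key, _payload(agent_id, caps), hashlib.sha256).hexdigest()
    return IdentityCard(agent_id, caps, sig)


def authorize_tool_call(card: IdentityCard, tool: str, key: bytes) -> bool:
    # Reject cards whose signature does not match their contents,
    # then reject tools that the card does not declare.
    expected = hmac.new(
        key, _payload(card.agent_id, card.capabilities), hashlib.sha256
    ).hexdigest()
    if not hmac.compare_digest(expected, card.signature):
        return False
    return tool in card.capabilities
```

In this sketch, a card edited after signing (for example, with an extra capability appended) fails the signature check, so the added capability is never honored.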

Docs

Added growth metrics breakdown.
Small README polish.

CI

npm publish: NPM_TOKEN is exported so .npmrc interpolation works under GitHub Actions.

Full changelog: https://github.com/chernistry/bernstein/compare/v1.8.3...v1.8.4

About chernistry/bernstein

Deterministic multi-agent orchestrator for 18 CLI coding agents (Claude Code, Codex, Cursor, Aider, Gemini CLI, OpenAI Agents SDK, and more). MCP server mode (stdio + HTTP/SSE) exposes the orchestrator to any MCP client. Git worktree isolation per agent, HMAC-chained audit trail, cost-aware model routing via contextual bandit. ~11K monthly PyPI downloads, Apache 2.0.
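
For readers unfamiliar with the term, an HMAC-chained audit trail can be sketched as follows. This is a generic illustration of the technique, not Bernstein's actual log format: the entry fields (`event`, `prev`, `mac`), the genesis value, and the function names are assumptions.

```python
import hashlib
import hmac
import json

# Hypothetical starting value for the chain (an assumption, not Bernstein's).
GENESIS = "0" * 64


def _mac(prev_mac: str, event: dict, key: bytes) -> str:
    # Each MAC covers the previous MAC plus the canonical JSON of the event.
    body = json.dumps(event, sort_keys=True, separators=(",", ":"))
    return hmac.new(key, (prev_mac + body).encode("utf-8"), hashlib.sha256).hexdigest()


def append_entry(log: list[dict], event: dict, key: bytes) -> dict:
    # Link the new entry to the MAC of the last entry (or to GENESIS).
    prev_mac = log[-1]["mac"] if log else GENESIS
    entry = {"event": event, "prev": prev_mac, "mac": _mac(prev_mac, event, key)}
    log.append(entry)
    return entry


def verify_chain(log: list[dict], key: bytes) -> bool:
    # Recompute every MAC in order; any edit or removal breaks the chain.
    prev_mac = GENESIS
    for entry in log:
        expected = _mac(prev_mac, entry["event"], key)
        if entry["prev"] != prev_mac or not hmac.compare_digest(expected, entry["mac"]):
            return False
        prev_mac = entry["mac"]
    return True
```

Because every MAC depends on the one before it, changing or dropping an entry invalidates that entry or the one that follows it. Truncating entries from the end is not detected by this sketch without an external record of the latest MAC.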

Related context

Earlier breaking changes

v3.7.1 `bernstein approve` and `bernstein reject` now enforce identifier regex `[A-Za-z0-9._-]{1,64}`.
v3.7.1 Tampered mission ledger reports as unverified rather than not-found.
v3.7.1 `mission define` now refuses phases without gate tasks.
v3.5.0 MCP client, transport, and gateway become stateless; calls carry content‑derived trace IDs in _meta.
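
To see in advance which identifiers the v3.7.1 regex would accept, the pattern can be checked locally. The sketch below assumes the pattern must match the entire identifier (hence `re.fullmatch`); the helper name `is_valid_identifier` is hypothetical and not part of the Bernstein CLI.

```python
import re

# Pattern copied from the v3.7.1 changelog entry above.
IDENTIFIER_RE = re.compile(r"[A-Za-z0-9._-]{1,64}")


def is_valid_identifier(value: str) -> bool:
    # fullmatch is an assumption: it requires the whole string to match,
    # so path separators, whitespace, and over-long values are rejected.
    return IDENTIFIER_RE.fullmatch(value) is not None
```

Under that assumption, `task-42.v1` passes, while an empty string, a 65-character string, or a value containing `/` does not.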