This release includes 2 breaking changes for platform teams planning a safe upgrade.
✓ No known CVEs patched in this version
Topics
+13 more
Affected surfaces
ReleasePort's take
Light signalThe v0.1.8 release adds a Red‑Blue Testing adversarial probe suite to the core and removes several ancillary artifacts (blog exports, npm auth steps) from the public package and CI pipelines.
Why it matters: Test the new Red-Blue Testing suite in dev environments before production use; no immediate action required for removed artifacts.
Summary
AI summaryUpdates core, ci, and T02 across a mixed release.
Changes in this release
| Type | Severity | Summary | CVE |
|---|---|---|---|
| Feature | Medium |
Adds Red-Blue Testing adversarial probe suite to core by GobiShanthan. Adds Red-Blue Testing adversarial probe suite to core by GobiShanthan. Source: llm_adapter@2026-05-22 Confidence: high |
— |
| Feature | Medium |
Removes blog export artifacts from public OSS package by Keesan12. Removes blog export artifacts from public OSS package by Keesan12. Source: llm_adapter@2026-05-22 Confidence: low |
— |
| Refactor | Medium |
Clarifies README positioning for agent proof workflows by Keesan12. Clarifies README positioning for agent proof workflows by Keesan12. Source: llm_adapter@2026-05-22 Confidence: low |
— |
| Refactor | Medium |
Removes npm auth setup from publish workflows by Keesan12. Removes npm auth setup from publish workflows by Keesan12. Source: llm_adapter@2026-05-22 Confidence: low |
— |
Full changelog
What's Changed
- feat(core): add Red-Blue Testing adversarial probe suite by @GobiShanthan
- ci: remove npm auth setup from publish workflows by @Keesan12 in https://github.com/Keesan12/martin-loop/pull/57
- Clarify README positioning for agent proof workflows by @Keesan12 in https://github.com/Keesan12/martin-loop/pull/59
- Remove blog export artifacts from the public OSS package by @Keesan12 in https://github.com/Keesan12/martin-loop/pull/58
🔴🔵 Red-Blue Testing
Adversarial probe suite that runs before a patch is accepted. Six deterministic probes detect patch-level cheating: assertion deletion (T01), silent export reverts (T02), manifest scope creep (T03), context directory poisoning (T07), budget self-reporting (T10), and grounding evasion pragmas (T11).
Three risk tiers:
baseline— 6-probe sweep, no model callhigh_risk— paranoid 12-probe scan, no model callrelease_critical— paranoid scan + optional Haiku model call for deeper inspection
A single block-severity finding rejects the patch. warn findings are recorded but do not block.
Exported from martin-loop SDK: runRedPhase, shouldAcceptPatch, buildRedFindings, resolveRedBudgetPolicy
Full Changelog: https://github.com/Keesan12/martin-loop/compare/v0.1.7...v0.1.8
Breaking Changes
- Removed npm authentication setup from publish workflows (CI)
- Removed blog export artifacts from the public OSS package
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About Keesan12/Martin-Loop
All releases →Related context
Related tools
Earlier breaking changes
- vmcp-v0.1.3 martin_status uses oneOf for selector exclusivity, latest as const.
- vmcp-v0.1.3 maxIterations and maxTokens modeled as integers in tool schemas.
- vmcp-v0.1.3 Tool schemas enforce additionalProperties: false on public contracts.
- vmcp-v0.1.3 Packaged artifacts now require and ship server.json alongside package.json.
Beta — feedback welcome: [email protected]