This release adds 2 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Summary
AI summary: Added an evaluation framework for testing LLM tool selection accuracy in ProductPlan MCP.
Full changelog
What's New
MCP Evals Framework for Tool Selection Testing
Added an evaluation framework that measures how accurately an LLM selects the right ProductPlan MCP tool, and extracts the right arguments, for a natural language prompt.
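To make "tool selection accuracy" concrete, here is a minimal sketch of how such a score could be computed. The `EvalCase` and `Selector` types and the tool names are hypothetical illustrations, not the types exported by the `evals` package, whose actual definitions are not shown in these notes.

```go
package main

import "fmt"

// EvalCase is a hypothetical test-case shape; the real schema used by
// the evals package may differ.
type EvalCase struct {
	Prompt       string
	ExpectedTool string
}

// Selector is a hypothetical stand-in for an LLM-backed tool picker.
type Selector func(prompt string) string

// Accuracy returns the fraction of cases where the selector picked the
// expected tool. An empty case list yields 0.
func Accuracy(cases []EvalCase, sel Selector) float64 {
	if len(cases) == 0 {
		return 0
	}
	correct := 0
	for _, c := range cases {
		if sel(c.Prompt) == c.ExpectedTool {
			correct++
		}
	}
	return float64(correct) / float64(len(cases))
}

func main() {
	// Tool names here are made up for illustration.
	cases := []EvalCase{
		{Prompt: "Show me all roadmaps", ExpectedTool: "list_roadmaps"},
		{Prompt: "What launches are planned?", ExpectedTool: "list_launches"},
	}
	// A deliberately naive selector that always returns the same tool.
	always := func(string) string { return "list_roadmaps" }
	fmt.Printf("accuracy: %.2f\n", Accuracy(cases, always))
}
```

The naive selector gets one of the two prompts right, so the sketch reports an accuracy of 0.50.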
Test Suites
| Suite | Tests | Description |
|-------|-------|-------------|
| Tool Selection | 69 | Tests correct tool selection for natural language prompts |
| Confusion Pairs | 40 | Tests disambiguation between commonly confused tools |
| Argument Correctness | 25 | Tests argument extraction from prompts |
| Total | 134 | |
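The Confusion Pairs suite differs from plain tool selection in that a wrong answer is most informative when it lands on the look-alike tool. A rough sketch of that bookkeeping follows, assuming a hypothetical `PairCase` shape and made-up tool names; the real suite may record results differently.

```go
package main

import "fmt"

// PairCase is a hypothetical disambiguation test: one prompt, the tool
// that should be chosen, and the tool it is commonly confused with.
type PairCase struct {
	Prompt   string
	Correct  string
	Confused string
}

// PairStats counts where the selector's answers landed.
type PairStats struct {
	Correct  int // picked the intended tool
	Confused int // picked the look-alike tool
	Other    int // picked something else entirely
}

// ScorePair runs every case through sel and tallies the outcomes.
func ScorePair(cases []PairCase, sel func(string) string) PairStats {
	var s PairStats
	for _, c := range cases {
		switch sel(c.Prompt) {
		case c.Correct:
			s.Correct++
		case c.Confused:
			s.Confused++
		default:
			s.Other++
		}
	}
	return s
}

func main() {
	// Tool names are illustrative only.
	cases := []PairCase{
		{Prompt: "List every roadmap", Correct: "list_roadmaps", Confused: "get_roadmap"},
		{Prompt: "Open roadmap 42", Correct: "get_roadmap", Confused: "list_roadmaps"},
	}
	always := func(string) string { return "list_roadmaps" }
	fmt.Printf("%+v\n", ScorePair(cases, always))
}
```

Separating "confused" from "other" shows whether a selector is failing on the specific pair under test or failing more broadly.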
Test Coverage by Category
| Category | Tests |
|----------|-------|
| Roadmaps | 8 |
| Bars | 15 |
| Lanes | 3 |
| Milestones | 3 |
| Objectives | 9 |
| Ideas | 10 |
| Opportunities | 4 |
| Launches | 3 |
| Status | 2 |
| Create | 14 |
| Update | 8 |
| Delete | 5 |
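Because the categories are covered unevenly, a single overall score can hide a weak category. One way to break results down per category is sketched below; the `Result` type is a hypothetical illustration and is not taken from the `evals` package.

```go
package main

import (
	"fmt"
	"sort"
)

// Result is a hypothetical per-test outcome tagged with its category.
type Result struct {
	Category string
	Passed   bool
}

// PassRateByCategory returns the fraction of passing tests per category.
func PassRateByCategory(results []Result) map[string]float64 {
	total := map[string]int{}
	passed := map[string]int{}
	for _, r := range results {
		total[r.Category]++
		if r.Passed {
			passed[r.Category]++
		}
	}
	rates := make(map[string]float64, len(total))
	for cat, n := range total {
		rates[cat] = float64(passed[cat]) / float64(n)
	}
	return rates
}

func main() {
	results := []Result{
		{Category: "Roadmaps", Passed: true},
		{Category: "Roadmaps", Passed: false},
		{Category: "Bars", Passed: true},
	}
	rates := PassRateByCategory(results)

	// Sort category names so the report order is stable.
	cats := make([]string, 0, len(rates))
	for c := range rates {
		cats = append(cats, c)
	}
	sort.Strings(cats)
	for _, c := range cats {
		fmt.Printf("%-10s %.0f%%\n", c, rates[c]*100)
	}
}
```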
Usage
```go
import "github.com/olgasafonova/productplan-mcp-server/evals"

// Load all test suites
toolSel, confPairs, args, _ := evals.LoadAllEvals("evals/")

// Run with your LLM selector
metrics, _ := evals.EvaluateToolSelection(toolSel, yourSelector)
fmt.Println(evals.FormatMetrics(metrics, "Tool Selection"))
```
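The usage snippet leaves `yourSelector` undefined. Before wiring in a real LLM, a deterministic baseline can be handy for checking the harness end to end. The sketch below is a keyword matcher; its `func(string) string` signature and the tool names are assumptions, since the selector type that `evals.EvaluateToolSelection` expects is not shown in these notes.

```go
package main

import (
	"fmt"
	"strings"
)

// rule maps a keyword to a tool name. Both are hypothetical examples.
type rule struct {
	keyword string
	tool    string
}

// rules are checked in order, so earlier entries win on overlap.
var rules = []rule{
	{keyword: "roadmap", tool: "list_roadmaps"},
	{keyword: "idea", tool: "list_ideas"},
	{keyword: "launch", tool: "list_launches"},
}

// KeywordSelector returns the tool for the first keyword found in the
// prompt (case-insensitive), or an empty string when nothing matches.
func KeywordSelector(prompt string) string {
	p := strings.ToLower(prompt)
	for _, r := range rules {
		if strings.Contains(p, r.keyword) {
			return r.tool
		}
	}
	return ""
}

func main() {
	fmt.Println(KeywordSelector("Which launches are coming up?"))
}
```

A baseline like this gives a floor to compare an LLM-backed selector against. If the real selector type differs, it may need a thin adapter.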
Files Added
- `evals/runner.go` - Evaluation framework
- `evals/runner_test.go` - Unit tests
- `evals/tool_selection.json` - 69 tool selection tests
- `evals/confusion_pairs.json` - 10 pairs with 40 disambiguation tests
- `evals/argument_correctness.json` - 25 argument extraction tests
- `evals/README.md` - Documentation
Full Changelog: https://github.com/olgasafonova/productplan-mcp-server/compare/v4.2.0...v4.3.0
About olgasafonova/productplan-mcp-server
Query ProductPlan roadmaps. Access OKRs, ideas, launches, and timeline data.