webpeel/webpeel

v0.12.0 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 5mo MCP Developer Tools

View tool

✓ No known CVEs patched

Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai-agent ai-agents browser-automation claude-code codex content-extraction

+14 more

cursor firecrawl-alternative mcp mcp-server model-context-protocol llm screenshot structured-data token-efficiency typescript web-crawler web-data web-scraping web-search

Summary

AI summary

Deep Research Agent adds an autonomous 5‑phase pipeline for CLI and Claude Desktop.

Full changelog

What's New in v0.12.0

🔬 Deep Research Agent

webpeel research "<query>" — autonomous 5-phase pipeline: Search → Fetch → BM25 Extract → Follow Links → Synthesize
CLI + MCP tool (webpeel_research) for Claude Desktop / Cursor
85% average relevance, 5/5 source success rate, ~5s per query
Graceful degradation: no LLM key = ranked sources with BM25-filtered findings

🧹 Content Pruner v2 (Complete Rewrite)

Two-pass architecture: semantic removal + density scoring
Real-world token savings: Wikipedia 22%, BBC 21%, MDN 20%, StackOverflow 33%
Enabled by default — opt out with --full-content

📊 BM25 Relevance Scoring

computeRelevanceScore() for document-level relevance (0-1)
Calibrated: relevant 70-94%, irrelevant 0%

🐍 Python SDK

pip install webpeel — full client: scrape, search, batch, crawl, map, extract, screenshot, research
Sync + async support, type hints, httpx-based

📜 Smart Infinite Scroll

--scroll-extract now auto-detects content loading (height stabilization)
Stops after stable height, max scrolls, or timeout

🧠 Token Efficiency Pipeline

BM25 Query Filtering (--focus "query") — 54% savings on targeted queries
Smart Chunking (--chunk N) — semantic/fixed/paragraph strategies
Combined pipeline: Prune → Focus → Budget = up to 77% savings

📈 Stats

654 Node.js + 39 Python = 693 tests, 0 failures
13 MCP tools, 7 CSS schemas
Zero TypeScript errors

View diff on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Share on X Share on Bluesky

Track webpeel/webpeel

Get notified when new releases ship.

About webpeel/webpeel

Smart web fetcher for AI agents with auto-escalation from HTTP to headless browser to stealth mode. Includes 9 MCP tools: fetch, search, crawl, map, extract, batch, screenshot, jobs, and agent. Achieved 100% success rate on a 30-URL benchmark.

All releases →