webpeel/webpeel

v0.12.0 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 3 months ago · MCP Developer Tools
✓ No known CVEs patched

Topics

ai-agent ai-agents browser-automation claude-code codex content-extraction cursor firecrawl-alternative mcp mcp-server model-context-protocol llm screenshot structured-data token-efficiency typescript web-crawler web-data web-scraping web-search

Summary

AI summary

Deep Research Agent adds an autonomous 5‑phase pipeline for CLI and Claude Desktop.

Full changelog

What's New in v0.12.0

🔬 Deep Research Agent

  • webpeel research "<query>" — autonomous 5-phase pipeline: Search → Fetch → BM25 Extract → Follow Links → Synthesize
  • CLI + MCP tool (webpeel_research) for Claude Desktop / Cursor
  • 85% average relevance, 5/5 source success rate, ~5s per query
  • Graceful degradation: without an LLM key, returns ranked sources with BM25-filtered findings
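
The graceful-degradation behaviour described above can be sketched as follows. This is a minimal illustration, not webpeel's implementation: `Finding`, `finish_research`, and the `synthesize` callable are hypothetical names, and the relevance scores are assumed to come from an earlier ranking pass.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Finding:
    url: str
    text: str
    score: float  # relevance in [0, 1], assumed to come from a BM25-style pass


def finish_research(
    findings: list[Finding],
    synthesize: Optional[Callable[[list[Finding]], str]] = None,
) -> dict:
    """Return a synthesized report if an LLM callable is given, else ranked sources."""
    ranked = sorted(findings, key=lambda f: f.score, reverse=True)
    if synthesize is None:
        # No LLM available: degrade to ranked sources with their findings.
        return {"mode": "ranked", "sources": ranked, "report": None}
    return {"mode": "synthesized", "sources": ranked, "report": synthesize(ranked)}
```

The caller always gets the ranked sources back, so a missing key changes the shape of the answer rather than causing a failure.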

🧹 Content Pruner v2 (Complete Rewrite)

  • Two-pass architecture: semantic removal + density scoring
  • Real-world token savings: Wikipedia 22%, BBC 21%, MDN 20%, StackOverflow 33%
  • Enabled by default — opt out with --full-content
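
A two-pass pruner of the kind described above might look roughly like this. The sketch assumes the page has already been split into blocks; `Block`, `BOILERPLATE_TAGS`, the link-density heuristic, and both thresholds are illustrative assumptions, not details taken from webpeel.

```python
from dataclasses import dataclass

# Assumed set of element names treated as boilerplate in pass 1.
BOILERPLATE_TAGS = {"nav", "footer", "aside", "form"}


@dataclass
class Block:
    tag: str
    text: str
    link_chars: int = 0  # characters of text that sit inside <a> elements


def density(block: Block) -> float:
    """Share of a block's text that is not link text, in [0, 1]."""
    total = len(block.text.strip())
    if total == 0:
        return 0.0
    return max(0.0, (total - block.link_chars) / total)


def prune(blocks: list[Block], min_density: float = 0.5, min_chars: int = 25) -> list[Block]:
    # Pass 1: semantic removal by element role.
    kept = [b for b in blocks if b.tag not in BOILERPLATE_TAGS]
    # Pass 2: density scoring drops very short or link-heavy blocks.
    return [
        b for b in kept
        if len(b.text.strip()) >= min_chars and density(b) >= min_density
    ]
```

Running the cheap structural pass first means the density scorer only sees blocks that could plausibly be content.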

📊 BM25 Relevance Scoring

  • computeRelevanceScore() for document-level relevance (0-1)
  • Calibrated: relevant documents score 70-94%, irrelevant documents 0%
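
For readers unfamiliar with BM25, here is a small sketch of a document-level score mapped into the 0-1 range. It uses the standard Okapi BM25 term formula; the normalization (dividing by the score's saturation ceiling) and the name `bm25_relevance` are assumptions for illustration and may differ from what `computeRelevanceScore()` does.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_relevance(query: str, doc: str, corpus: list[str],
                   k1: float = 1.5, b: float = 0.75) -> float:
    """BM25 score of `doc` for `query`, divided by its upper bound (0 to 1)."""
    docs = [tokenize(d) for d in corpus]
    n_docs = len(docs)
    avgdl = sum(len(d) for d in docs) / n_docs if n_docs else 0.0
    if avgdl == 0:
        return 0.0
    terms = tokenize(doc)
    tf = Counter(terms)
    score = 0.0
    ceiling = 0.0
    for term in set(tokenize(query)):
        n_t = sum(1 for d in docs if term in d)
        # Non-negative IDF variant.
        idf = math.log(1 + (n_docs - n_t + 0.5) / (n_t + 0.5))
        norm = tf[term] + k1 * (1 - b + b * len(terms) / avgdl)
        score += idf * tf[term] * (k1 + 1) / norm
        # Each term's contribution saturates at idf * (k1 + 1).
        ceiling += idf * (k1 + 1)
    return score / ceiling if ceiling else 0.0
```

In this sketch a document that shares no terms with the query scores exactly 0, and scores approach 1 as query terms become more frequent in the document.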

🐍 Python SDK

  • pip install webpeel — full client: scrape, search, batch, crawl, map, extract, screenshot, research
  • Sync + async support, type hints, httpx-based

📜 Smart Infinite Scroll

  • --scroll-extract now auto-detects content loading (height stabilization)
  • Stops after stable height, max scrolls, or timeout
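
The three stop conditions above can be sketched as a loop that is independent of any browser driver. `get_height` and `scroll_down` are hypothetical callables the caller supplies (for example, wrappers around a page object), and the default limits are illustrative rather than webpeel's actual values.

```python
import time
from typing import Callable


def scroll_until_stable(
    get_height: Callable[[], int],
    scroll_down: Callable[[], None],
    stable_rounds: int = 3,
    max_scrolls: int = 20,
    timeout_s: float = 30.0,
    wait_s: float = 0.5,
) -> str:
    """Scroll until the page height stops changing; return the stop reason."""
    deadline = time.monotonic() + timeout_s
    last_height = get_height()
    unchanged = 0
    for _ in range(max_scrolls):
        if time.monotonic() >= deadline:
            return "timeout"
        scroll_down()
        time.sleep(wait_s)  # give lazy-loaded content time to arrive
        height = get_height()
        if height == last_height:
            unchanged += 1
            if unchanged >= stable_rounds:
                return "stable"
        else:
            # New content arrived, so restart the stability counter.
            unchanged = 0
            last_height = height
    return "max_scrolls"
```

Requiring several unchanged rounds in a row, rather than one, guards against stopping during a slow network response.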

🧠 Token Efficiency Pipeline

  • BM25 Query Filtering (--focus "query") — 54% savings on targeted queries
  • Smart Chunking (--chunk N) — semantic/fixed/paragraph strategies
  • Combined pipeline: Prune → Focus → Budget = up to 77% savings
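
To make the Focus and Budget stages concrete, here is a rough sketch over plain text. It substitutes simple query-term overlap for BM25 and whitespace word counts for real tokens; `focus_and_budget`, `savings`, and `count_tokens` are hypothetical helpers, not webpeel functions.

```python
import re


def count_tokens(text: str) -> int:
    # Crude whitespace count; real tokenizers give different numbers.
    return len(text.split())


def focus_and_budget(text: str, query: str, budget: int) -> str:
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    q_terms = set(re.findall(r"[a-z0-9]+", query.lower()))

    def overlap(p: str) -> int:
        return len(q_terms & set(re.findall(r"[a-z0-9]+", p.lower())))

    # Focus: keep only paragraphs sharing a term with the query, best first.
    ranked = sorted(
        (i for i, p in enumerate(paragraphs) if overlap(p) > 0),
        key=lambda i: -overlap(paragraphs[i]),
    )
    # Budget: greedily admit paragraphs until the token budget is spent.
    chosen, used = [], 0
    for i in ranked:
        cost = count_tokens(paragraphs[i])
        if used + cost <= budget:
            chosen.append(i)
            used += cost
    # Emit the survivors in their original document order.
    return "\n\n".join(paragraphs[i] for i in sorted(chosen))


def savings(original: str, reduced: str) -> float:
    """Fraction of tokens removed, between 0 and 1."""
    total = count_tokens(original)
    return 1 - count_tokens(reduced) / total if total else 0.0
```

Because each stage only sees what the previous one kept, the per-stage savings compound rather than add.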

📈 Stats

  • 654 Node.js + 39 Python = 693 tests, 0 failures
  • 13 MCP tools, 7 CSS schemas
  • Zero TypeScript errors

About webpeel/webpeel

Smart web fetcher for AI agents with auto-escalation from HTTP to headless browser to stealth mode. Includes 9 MCP tools: fetch, search, crawl, map, extract, batch, screenshot, jobs, and agent. Achieved 100% success rate on a 30-URL benchmark.
