0xMassi/webclaw

v0.1.5 Feature

This release adds the --output-dir option, which saves each extracted page to its own file in single-URL, crawl, and batch modes.

Published 2mo MCP Developer Tools
✓ No known CVEs patched

Topics

ai ai-agents ai-scraping cli crawler data-extraction firecrawl-alternative html-to-markdown llm markdown mcp mcp-server rust self-hosted tls-fingerprinting web-crawler web-extraction web-scraper web-scraping

Summary

AI summary

Added --output-dir to save each extracted page as a separate file with filenames derived from URL paths.

Full changelog

Added

  • --output-dir: save each extracted page to its own file instead of printing to stdout
  • Works with single URL, crawl, and batch modes
  • Filenames derived from URL path: /docs/api → docs/api.md
  • Root URLs use hostname/index.md to avoid collisions
  • Subdirectories created automatically
  • CSV input with custom filenames: url,filename format in --urls-file
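
The naming rules in the list above can be approximated with a small shell function. This is only a sketch of the documented behavior (a path such as /docs/api becoming docs/api.md, and root URLs becoming hostname/index.md), not webclaw's actual implementation. The function name url_to_filename is hypothetical, and the handling of query strings, fragments, and trailing slashes is an assumption, since the changelog does not describe those cases.

```shell
# Hypothetical helper that mirrors the documented naming rules.
# It is an illustration only, not code taken from webclaw.
url_to_filename() {
  local url="$1"
  local rest="${url#*://}"     # strip the scheme (https://)
  rest="${rest%%[?#]*}"        # drop query string and fragment (assumption)
  local host="${rest%%/*}"     # hostname is everything before the first slash
  local path=""
  case "$rest" in
    */*) path="${rest#*/}" ;;  # path is everything after the first slash
  esac
  path="${path%/}"             # ignore one trailing slash (assumption)
  if [ -z "$path" ]; then
    # Root URL: use hostname/index.md so different sites do not collide
    printf '%s/index.md\n' "$host"
  else
    printf '%s.md\n' "$path"
  fi
}

url_to_filename "https://docs.example.com/docs/api"   # prints docs/api.md
url_to_filename "https://example.com"                 # prints example.com/index.md
```

The printed paths are relative to whatever directory is passed to --output-dir.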

Examples

# Crawl a docs site, one file per page
webclaw https://docs.example.com --crawl --depth 2 --output-dir ./docs

# Batch with custom filenames
echo "https://example.com,home.md
https://example.com/about,about.md" > urls.csv
webclaw --urls-file urls.csv --output-dir ./output

# JSON format
webclaw https://example.com --crawl -f json --output-dir ./data
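
For the batch example above, it may help to validate the url,filename file before handing it to webclaw. The sketch below writes the same two entries with a heredoc and then checks that every line has exactly two comma-separated fields, with an http(s) URL first and a non-empty filename second. The check_urls_csv function is a hypothetical pre-flight helper built on awk; it is not part of webclaw, and webclaw's own parsing of the file may be stricter or more lenient than this.

```shell
# Write the url,filename pairs (same entries as the batch example)
cat > urls.csv <<'EOF'
https://example.com,home.md
https://example.com/about,about.md
EOF

# Hypothetical pre-flight check, not part of webclaw:
# each line must be "<http(s) url>,<non-empty filename>".
check_urls_csv() {
  awk -F',' '
    NF != 2 || $1 !~ /^https?:\/\// || $2 == "" {
      printf "line %d: expected url,filename\n", NR
      bad = 1
    }
    END { exit bad }
  ' "$1"
}

if check_urls_csv urls.csv; then
  echo "urls.csv looks well-formed"
  # Then run the batch as in the example:
  # webclaw --urls-file urls.csv --output-dir ./output
fi
```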

Full changelog: https://github.com/0xMassi/webclaw/blob/main/CHANGELOG.md

Full Changelog: https://github.com/0xMassi/webclaw/compare/v0.1.4...v0.1.5

About 0xMassi/webclaw

Web content extraction for AI agents. 10 tools: scrape, crawl, map, batch, extract, summarize, diff, brand, search, research. TLS fingerprinting bypasses anti-bot without a browser. 67% fewer tokens than raw HTML. `npx create-webclaw` auto-configures Claude, Cursor, Windsurf, Codex, OpenCode.
