0xMassi/webclaw

v0.1.5 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 4mo MCP Developer Tools

View tool

✓ No known CVEs patched

Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai ai-agents ai-scraping cli crawler data-extraction

+13 more

firecrawl-alternative html-to-markdown llm markdown mcp mcp-server rust self-hosted tls-fingerprinting web-crawler web-extraction web-scraper web-scraping

Summary

AI summary

Added --output-dir to save each extracted page as a separate file with filenames derived from URL paths.

Full changelog

Added

--output-dir: save each extracted page to its own file instead of printing to stdout
Works with single URL, crawl, and batch modes
Filenames derived from URL path: /docs/api → docs/api.md
Root URLs use hostname/index.md to avoid collisions
Subdirectories created automatically
CSV input with custom filenames: url,filename format in --urls-file

Examples

# Crawl a docs site, one file per page
webclaw https://docs.example.com --crawl --depth 2 --output-dir ./docs

# Batch with custom filenames
echo "https://example.com,home.md
https://example.com/about,about.md" > urls.csv
webclaw --urls-file urls.csv --output-dir ./output

# JSON format
webclaw https://example.com --crawl -f json --output-dir ./data

Full changelog: https://github.com/0xMassi/webclaw/blob/main/CHANGELOG.md

Full Changelog: https://github.com/0xMassi/webclaw/compare/v0.1.4...v0.1.5

View diff on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Share on X Share on Bluesky

Track 0xMassi/webclaw

Get notified when new releases ship.

About 0xMassi/webclaw

Web content extraction for AI agents. 10 tools: scrape, crawl, map, batch, extract, summarize, diff, brand, search, research. TLS fingerprinting bypasses anti-bot without a browser. 67% fewer tokens than raw HTML. `npx create-webclaw` auto-configures Claude, Cursor, Windsurf, Codex, OpenCode.

All releases →

0xMassi/webclaw

Summary

Added

Examples

Related context

Related tools