This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+13 more
Summary
AI summaryAdded --output-dir to save each extracted page as a separate file with filenames derived from URL paths.
Full changelog
Added
--output-dir: save each extracted page to its own file instead of printing to stdout- Works with single URL, crawl, and batch modes
- Filenames derived from URL path:
/docs/api→docs/api.md - Root URLs use
hostname/index.mdto avoid collisions - Subdirectories created automatically
- CSV input with custom filenames:
url,filenameformat in--urls-file
Examples
# Crawl a docs site, one file per page
webclaw https://docs.example.com --crawl --depth 2 --output-dir ./docs
# Batch with custom filenames
echo "https://example.com,home.md
https://example.com/about,about.md" > urls.csv
webclaw --urls-file urls.csv --output-dir ./output
# JSON format
webclaw https://example.com --crawl -f json --output-dir ./data
Full changelog: https://github.com/0xMassi/webclaw/blob/main/CHANGELOG.md
Full Changelog: https://github.com/0xMassi/webclaw/compare/v0.1.4...v0.1.5
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About 0xMassi/webclaw
Web content extraction for AI agents. 10 tools: scrape, crawl, map, batch, extract, summarize, diff, brand, search, research. TLS fingerprinting bypasses anti-bot without a browser. 67% fewer tokens than raw HTML. `npx create-webclaw` auto-configures Claude, Cursor, Windsurf, Codex, OpenCode.
Related context
Beta — feedback welcome: [email protected]