Erodenn/fetch-guard

v1.2.0 Breaking

This release includes breaking changes for platform teams planning a safe upgrade.

Published 4mo MCP Developer Tools

View tool

✓ No known CVEs patched

Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Affected surfaces

rce_ssrf

Summary

AI summary

Expanded multilingual injection detection with four new scan phases and extended CSS sanitization techniques.

Full changelog

What's new

Expanded injection detection

Four new scan phases and a broader pattern library.

Multilingual patterns. The pattern library now covers 6 high-severity injection phrases in English, Spanish, French, German, Japanese, Simplified Chinese, and Portuguese — 50 compiled patterns in total, up from 14. Multilingual payloads pass through all existing scan phases (homoglyph normalization, decode-and-scan) automatically.

URL percent-encoding decode-and-scan. Detects 3+ consecutive %XX sequences, decodes them, and scans the decoded content against high-severity patterns. The 3-token threshold avoids false positives from incidental single-character encoding.

ROT13 whole-document scan. Decodes the full document with ROT13 and scans against high-severity patterns only. Medium-severity patterns are excluded to keep false positives low on legitimate rotated text.

Metadata field scanning. Metadata fields (title, description, og:title, etc.) are now scanned independently. Matches are namespaced to their source field (e.g. metadata:title:ignore_previous) and merged into the overall risk level and injection_matches output.

Extended pre-extraction sanitization

Nine additional CSS hidden-element techniques are now stripped before trafilatura sees the HTML, based on the Unit 42 2026 taxonomy:

font-size: 0
color: transparent / rgba(..., 0) / hsla(..., 0)
height: 0 + overflow: hidden
max-height: 0 + overflow: hidden
clip: rect(0, 0, 0, 0)
transform: scale(0)
Color-match hidden text: elements where inline color and background-color resolve to the same RGB value are removed. Supports cross-format comparisons (white == #ffffff == rgb(255,255,255)).
<template> tag removal: template content is never rendered by browsers and is an active injection vector.

Test coverage

358 unit tests, up from 262.

View diff on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Share on X Share on Bluesky

Track Erodenn/fetch-guard

Get notified when new releases ship.

About Erodenn/fetch-guard

URL fetcher and HTML-to-markdown converter with three-layer prompt injection defense: pre-extraction sanitization of hidden/off-screen elements and non-printing Unicode, 15-pattern risk scanning (HIGH/MEDIUM/OK), and per-request session-salt content boundary wrapping.

All releases →