Scrapegraph-ai
AI Agents & AssistantsA Python library that builds AI‑driven web scraping pipelines with just a prompt, supporting many file formats and integrations.
Features
- Generates scraping pipelines from natural‑language prompts using LLMs
- Handles multiple input types (web pages, XML, HTML, JSON, Markdown, etc.)
- Provides Python SDK and Node.js SDK for easy integration
- Integrates with popular frameworks like Langchain, Llama Index, Crew.ai, Agno, CamelAI
Recent releases
View all 13 releases →
v2.0.0
Breaking risk
Breaking changes
- Requires scrapegraph-py v2.0.0 or later
- API surface aligned with scrapegraph-py v2
- Minimum Python version raised to 3.12
Full changelog
2.0.0 (2026-04-19)
⚠ BREAKING CHANGES
- requires scrapegraph-py v2.0.0+
Co-Authored-By: Claude Opus 4.6 (1M context) [email protected]
Features
- add scrapegraph-py PR #84 SDK compatibility (e8b2a28), closes #82
- align with scrapegraph-py v2 API surface from PR #82 (c0f5fd5)
- migrate to scrapegraph-py v2 API surface (fd23bb0), closes ScrapeGraphAI/scrapegraph-py#82
CI
- bump min Python to 3.12 and trim test suite (5fda03f)
v1.76.0
New feature
Notable features
- PlasmateLoader: lightweight scraping backend with no Chrome dependency
Full changelog
v1.73.1
Bug fix
## 1.73.1 (2026-02-16) ### Bug Fixes * handle list content in telemetry event validation
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
About
Stars
26,619
Forks
2,479
Languages
Python
Makefile
Dockerfile
Install & Platforms
Install via
pip