
Scrapegraph-ai

AI Agents & Assistants

A Python library that builds AI‑driven web scraping pipelines with just a prompt, supporting many file formats and integrations.

Python · Latest v2.1.3 · 1d ago

Features

  • Generates scraping pipelines from natural‑language prompts using LLMs
  • Handles multiple input types (web pages, XML, HTML, JSON, Markdown, etc.)
  • Provides Python and Node.js SDKs for easy integration
  • Integrates with popular frameworks such as LangChain, LlamaIndex, Crew.ai, Agno, and CamelAI

Recent releases

No immediate action
v2.1.3 Bug fix

SearchGraph restored

v2.0.0 Breaking risk
Breaking changes
  • Requires scrapegraph-py v2.0.0 or later
  • API surface aligned with scrapegraph-py v2
  • Minimum Python version raised to 3.12
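Before upgrading, the two version requirements above can be checked from the target environment. The following is a minimal sketch, not part of the library: the helper names `parse_version` and `check_upgrade` are hypothetical, it assumes that `scrapegraph-py` is also the installed distribution name, and the version parsing is deliberately simplified (pre-release suffixes such as `rc1` are ignored).

```python
import sys
from importlib.metadata import PackageNotFoundError, version

MIN_PYTHON = (3, 12)   # from the v2.0.0 breaking changes
MIN_SDK = (2, 0, 0)    # scrapegraph-py v2.0.0 or later


def parse_version(text):
    """Turn '2.1.3' into (2, 1, 3). Simplified: stops at the first
    non-numeric character and pads to three components."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
        if digits != piece:  # suffix such as 'rc1' found, stop here
            break
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def check_upgrade(python_version=None, sdk_version=None):
    """Return a list of problems; an empty list means both checks pass.

    Arguments are optional overrides (hypothetical, for testing); by
    default the running interpreter and installed package are inspected.
    """
    problems = []
    py = tuple(python_version or sys.version_info[:2])
    if py < MIN_PYTHON:
        problems.append(f"Python {py[0]}.{py[1]} is below 3.12")

    if sdk_version is None:
        try:
            # Assumption: the distribution is installed as 'scrapegraph-py'.
            sdk_version = version("scrapegraph-py")
        except PackageNotFoundError:
            sdk_version = None

    if sdk_version is None:
        problems.append("scrapegraph-py is not installed")
    elif parse_version(sdk_version) < MIN_SDK:
        problems.append(f"scrapegraph-py {sdk_version} is below 2.0.0")
    return problems


if __name__ == "__main__":
    issues = check_upgrade()
    print("ready for v2.0.0" if not issues else "\n".join(issues))
```

Returning a list of problems rather than raising on the first one lets a CI step report both the interpreter and the SDK mismatch in a single run.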
Full changelog

2.0.0 (2026-04-19)

⚠ BREAKING CHANGES

  • requires scrapegraph-py v2.0.0+


Features

CI

  • bump min Python to 3.12 and trim test suite (5fda03f)

v1.76.0 New feature
Notable features
  • PlasmateLoader: lightweight scraping backend with no Chrome dependency
Full changelog

1.76.0 (2026-04-09)

Features

  • add PlasmateLoader as lightweight scraping backend (no Chrome needed) (9dd1fb5), closes #1055

CI

  • reduce GitHub Actions costs by ~85% on PRs (403080a)

v1.73.1 Bug fix

1.73.1 (2026-02-16)

Bug Fixes

  • handle list content in telemetry event validation


About

Stars: 26,619
Forks: 2,479
Languages: Python, Makefile, Dockerfile

Install & Platforms

Install via: pip
