Summary
AI summary: First public release of mem0-mcp-selfhosted adds 11 tools, dual LLM support, knowledge graph integration, and numerous bug fixes.
Full changelog
Changelog
All notable changes to mem0-mcp-selfhosted are documented here.
[0.1.0] - 2026-02-27
First public release. Self-hosted mem0 MCP server for Claude Code with 11 tools, dual LLM provider support, and knowledge graph integration.
New Features
- 11 MCP tools: 9 memory tools (`add_memory`, `search_memories`, `get_memories`, `get_memory`, `update_memory`, `delete_memory`, `delete_all_memories`, `list_entities`, `delete_entities`) + 2 graph tools (`search_graph`, `get_entity`) + `memory_assistant` prompt
- Dual LLM providers: Anthropic (Claude) and Ollama as configurable main LLM for fact extraction and memory updates. Set `MEM0_PROVIDER=ollama` for a fully local setup with no cloud dependencies
- Knowledge graph: Neo4j-backed entity and relationship extraction via `enable_graph` toggle. Supports 5 graph LLM providers: `anthropic`, `ollama`, `gemini`, `gemini_split`, and `anthropic_oat`
- Split-model graph pipeline: `gemini_split` routes entity extraction to Gemini and contradiction detection to Claude, combining Gemini's extraction quality with Claude's reasoning
- Zero-config Anthropic auth: Automatically reads Claude Code's OAT token from `~/.claude/.credentials.json`. No API key needed for Claude Code users
- OAT token self-refresh: Proactive pre-expiry refresh + 3-step defensive retry (piggyback on credentials file, self-refresh via OAuth, wait-and-retry). Long-running sessions survive token rotation seamlessly
- `MEM0_PROVIDER` cascade: Single env var configures both main LLM and graph LLM providers. `MEM0_OLLAMA_URL` cascades to all Ollama-backed services. Per-service overrides still work
- Structured outputs: Claude Opus/Sonnet/Haiku 4.x models use native JSON schema via `output_config` for reliable fact extraction
- Ollama defense-in-depth: 6 layers for reliable structured output from Ollama, including `/no_think` injection, deterministic params, think-tag stripping, JSON extraction, and retry on empty responses
- Per-call graph toggle: `enable_graph` parameter on `add_memory` and `search_memories` with thread-safe locking
- Wildcard graph search: Pass `*` to `search_graph` to list all entities
- Qdrant Facet API: `list_entities` uses server-side aggregation (Qdrant v1.12+) with scroll+dedupe fallback for older versions
- Safe bulk delete: Never calls `memory.delete_all()`. Iterates and deletes individually with explicit graph cleanup
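The think-tag stripping and JSON extraction layers mentioned for Ollama can be illustrated with a small sketch. This is not the project's code: the function names (`strip_think_tags`, `extract_json`, `parse_model_output`) are hypothetical, and the sketch assumes the model wraps its reasoning in `<think>...</think>` tags and may surround the JSON payload with free text.

```python
import json
import re

# Hypothetical helper names; the changelog does not show the real implementation.
_THINK_RE = re.compile(r"<think>.*?</think>", re.DOTALL | re.IGNORECASE)


def strip_think_tags(text: str) -> str:
    """Remove <think>...</think> reasoning blocks that some local models emit."""
    return _THINK_RE.sub("", text).strip()


def extract_json(text: str):
    """Return the first JSON object found in text, or None if there is none."""
    decoder = json.JSONDecoder()
    for start, char in enumerate(text):
        if char != "{":
            continue
        try:
            # raw_decode parses one JSON value starting at `start` and
            # ignores any trailing text after it.
            obj, _ = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            continue  # not valid JSON from this brace; try the next one
        return obj
    return None


def parse_model_output(raw: str):
    """Strip think tags first, then pull a JSON object out of what remains."""
    return extract_json(strip_think_tags(raw))
```

A caller could treat a `None` result as an empty response and retry, which is roughly how the "retry on empty responses" layer might fit in.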
Bug Fixes
- Fix `anthropic_oat` provider not registered in `LlmFactory`, preventing explicit use
- Fix `is_oat_token(None)` crash in proactive refresh when no Anthropic token configured
- Fix `response.content[0]` IndexError when Anthropic API returns empty content
- Fix thread-safety race condition in `safe_bulk_delete` reading mutable `enable_graph` state
- Fix contradiction model defaulting to Ollama model name when sent to Anthropic API
- Fix Anthropic provider not registered for `gemini_split` contradiction LLM
- Fix `MEM0_QDRANT_TIMEOUT` rejected by Pydantic by using a pre-configured `QdrantClient` instead
- Fix Gemini `_parse_response` signature mismatch after upstream `tools` parameter addition
- Fix Neo4j `CypherSyntaxError` on LLM-generated relationship names with hyphens or leading digits
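To make the last fix concrete, here is a minimal sketch of how a relationship name with hyphens or a leading digit could be turned into something usable as an unquoted Cypher relationship type. The function name `sanitize_relationship`, the `rel_` prefix, and the `related_to` fallback are all assumptions for illustration; the project's actual sanitizer may behave differently.

```python
import re


def sanitize_relationship(name: str) -> str:
    """Hypothetical sanitizer for LLM-generated relationship names."""
    # Collapse every run of characters outside [A-Za-z0-9_] into one underscore.
    cleaned = re.sub(r"[^A-Za-z0-9_]+", "_", name.strip()).strip("_")
    if not cleaned:
        return "related_to"  # assumed fallback when nothing usable remains
    # Unquoted identifiers should not begin with a digit, so add a prefix.
    if cleaned[0].isdigit():
        cleaned = "rel_" + cleaned
    return cleaned
```

For example, `works-at` becomes `works_at` and `3rd-party-vendor` becomes `rel_3rd_party_vendor`.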
Infrastructure
- 301 tests: Unit, contract, integration, MCP protocol, and concurrency test suites
- Centralized env helpers: `env()`, `opt_env()`, `bool_env()` with consistent whitespace stripping across all modules
- Telemetry suppression: mem0ai PostHog telemetry disabled before any imports
- Relationship sanitizer: Monkey-patches mem0ai's sanitizer at startup for Neo4j identifier compliance
- Gemini null content guard: Patches `GeminiLLM._parse_response` to handle `content=None` responses
- Transient retry: Anthropic API 500/502/503/529 errors retried with exponential backoff
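The transient retry item can be sketched generically as follows. Only the status codes come from the changelog. Everything else is assumed for illustration: `ApiStatusError` is a hypothetical stand-in for whatever exception the API client raises, and the attempt count and base delay are assumed defaults.

```python
import time

# Status codes listed in the changelog as retryable.
RETRYABLE_STATUS = {500, 502, 503, 529}


class ApiStatusError(Exception):
    """Hypothetical stand-in for an HTTP status error from an API client."""

    def __init__(self, status_code: int):
        super().__init__(f"API returned {status_code}")
        self.status_code = status_code


def call_with_backoff(func, max_attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call func(), retrying retryable errors with doubling delays."""
    for attempt in range(max_attempts):
        try:
            return func()
        except ApiStatusError as exc:
            last_attempt = attempt == max_attempts - 1
            if exc.status_code not in RETRYABLE_STATUS or last_attempt:
                raise  # non-transient error, or retries exhausted
            # Delay doubles each time: base, 2*base, 4*base, ...
            sleep(base_delay * (2 ** attempt))
```

The `sleep` parameter is injectable so the delays can be observed in tests without waiting. A production version would often add random jitter to avoid synchronized retries.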
About elvismdev/mem0-mcp-selfhosted
Self-hosted mem0 MCP server for Claude Code with Qdrant vector search, Neo4j knowledge graph, and Ollama embeddings. Zero-config OAT auth, split-model graph routing, session hooks for automatic cross-session memory, and 11 tools. Supports both Anthropic and fully local Ollama setups.