Skip to content

LocalAI

Model Serving & MLOps

Open‑source AI engine that runs any model (LLM, vision, voice, etc.) locally on CPU or GPU hardware without needing a cloud service.

Go Latest v4.3.6 · 4d ago Security brief →

Features

  • Drop‑in API compatibility with OpenAI, Anthropic, ElevenLabs APIs
  • Supports +36 backends (llama.cpp, vLLM, transformers, whisper, diffusers, MLX…)
  • Runs on any hardware – NVIDIA/AMD/Intel GPUs, Apple Silicon, Vulkan or pure CPU
  • Multi‑user ready with API‑key auth, quotas and role‑based access
  • Built‑in autonomous AI agents (RAG, tool use, MCP, skills)

Recent releases

View all 24 releases →
Review required
v4.3.6 Mixed
Auth RBAC

HTTP redirect refusal + Parakeet ASR + dependency bumps

No immediate action
v4.3.5 Mixed

Tool‑call bugs + Reasoning effort + Dependency bumps

No immediate action
v4.3.4 Bug fix

Guard turboquant grpc fields

No immediate action
v4.3.3 Bug fix

OpenResponses content fix

Review required
v4.3.2 Mixed
Dependencies

Dependencies, Middleware, Distributed, Gallery, UI, Backend

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

About

Stars
46,587
Forks
4,124
Languages
Go JavaScript Python

Install & Platforms

Install via
docker binary
Platforms
macos linux arm64

Community & Support

Beta — feedback welcome: [email protected]