LLMKube
Containers & OrchestrationA Kubernetes operator that simplifies self‑hosted LLM inference, turning deployment into a two‑line YAML problem.
Features
- Runs LLMs as native Kubernetes workloads (llama.cpp, vLLM, etc.) with automatic GPU scheduling
- Provides an OpenAI‑compatible API for seamless integration with existing SDKs and frameworks
- Supports cross‑engine routing via ModelRouter for policy‑aware fallback to external providers (Anthropic, OpenAI, LiteLLM)
- Optional metal‑agent enables Apple Silicon (Metal) GPU support alongside Linux/NVIDIA GPUs
Recent releases
View all 89 releases →
v0.8.1
Breaking risk
⚠ Upgrade required
- After upgrading to v0.8.1, re‑apply all Agent CRs so existing Agents pick up explicit values for the new requestTimeoutSeconds and requestTurnTimeoutSeconds fields.
Breaking changes
- Agent.spec.requestTimeoutSeconds now represents a loop-wide wall-clock budget (default 3600) instead of per-request HTTP timeout; the former behavior is moved to Agent.spec.requestTurnTimeoutSeconds (default 120). Re‑apply Agent CRs after upgrade.
Notable features
- **inferenceservice:** adds typed spec.ropeScaling for RoPE/YaRN context extension
Full changelog
0.8.1 (2026-06-01)
⚠ BREAKING CHANGES
- foreman: Agent.spec.requestTimeoutSeconds changes meaning from a per-request HTTP timeout to a loop-wide wall-clock budget, and its default moves from 600 to 3600. The former per-request bound is now the new Agent.spec.requestTurnTimeoutSeconds (default 120). Re-apply your Agent CRs after upgrade so existing Agents pick up explicit values.
Features
Bug Fixes
- foreman: recover orphaned phase=Running tasks on agent restart (#542) (#598) (6dd2c44)
- foreman: split per-turn timeout from loop-wide budget (#532) (#602) (41e7663)
- foreman: warm-path reviewer scheduling on macOS (#578, #579) (#597) (a94d1ef)
- metal-agent: prefer routable interface for host-IP auto-detect (#526) (#599) (c780795)
Documentation
- foreman: absolute paths in overview README cross-refs (fix llmkube-web prerender) (#596) (b5f6f94)
- foreman: move docs/foreman to docs/site/foreman + register in site nav (#594) (9fd85bb)
Miscellaneous
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Install & Platforms
Install via
brew
helm
Platforms
linux
macos
arm64