LLMKube

v0.7.12 Feature

This release adds 1 notable feature for engineering teams evaluating rollout.

Published 2mo Containers & Orchestration

✓ No known CVEs patched

Topics

ai apple-silicon autoscaling edge-computing gguf gpu self-hosted inference kubernetes llama-cpp llm local-llm metal mlx multi-gpu nvidia tgi vllm

ReleasePort's take

Light signal

editorial:auto 2mo

Release v0.7.12 adds a workload reconciler with stub planner to foreman/m6 and fixes gate job handling in foreman/m4, while improving chart wiring and documentation across foreman.

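Why it matters: Developers using foreman/m6 get a new workload reconciler, though its planner is still a stub. In foreman/m4, the gate Job now honors payload.branch and clones from the URL passed via --git-remote-url, so teams running gate jobs should confirm that both values point where they expect. The chart now wires --workspace-dir and the native-mode required values are documented more tightly, which is worth reviewing before the next deployment cycle.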

Summary

AI summary

Version 0.7.12 (2026-05-24) is a mixed release: one feature and two bug fixes, all in foreman.

Changes in this release

Type	Severity	Summary	Source	Confidence	CVE
Feature	Medium	Adds workload reconciler with stub planner in foreman/m6.	llm_adapter@2026-05-24	high	—
Bugfix	Medium	Fixes gate Job to honor payload.branch and clone from --git-remote-url in foreman/m4.	llm_adapter@2026-05-24	high	—
Bugfix	Medium	Improves chart wiring of --workspace-dir and tightens native-mode documentation in foreman.	llm_adapter@2026-05-24	low	—

Full changelog

0.7.12 (2026-05-24)

Features

foreman/m6: Workload reconciler with stub planner (explicit pipeline + issue-batch shortcut) (#533) (dbdcd46)

Bug Fixes

foreman/m4: gate Job honors payload.branch + clones from --git-remote-url (#529) (905a269)
foreman: chart wires --workspace-dir + tightens docs for native-mode required values (#534) (1c43c69)
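
To make the foreman/m4 fix concrete, here is a minimal Python sketch of what honoring a branch and a remote URL amounts to at the git level. It is not LLMKube's code: the function name `build_clone_command`, the dictionary shape of the payload, the `dest_dir` argument, and the fallback to `main` are all assumptions made for illustration. Only the `git clone --branch ... --single-branch` invocation is standard git.

```python
from typing import List, Mapping


def build_clone_command(
    payload: Mapping[str, str],
    git_remote_url: str,
    dest_dir: str,
    default_branch: str = "main",  # hypothetical fallback, not from the release notes
) -> List[str]:
    """Build a git clone argv that respects the payload's branch and the given remote.

    Hypothetical helper: the real gate Job may assemble its command differently.
    """
    # Use the branch from the payload when present and non-empty.
    branch = payload.get("branch") or default_branch
    # Clone only the requested branch from the explicitly supplied remote URL.
    return [
        "git", "clone",
        "--branch", branch,
        "--single-branch",
        git_remote_url,
        dest_dir,
    ]
```

Returning an argument list rather than a shell string keeps branch names and URLs from being reinterpreted by a shell when the list is passed to something like `subprocess.run`.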

About LLMKube

Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API.

Related context

Earlier breaking changes

v0.8.1 foreman: requestTimeoutSeconds now sets a loop-wide budget, and its default changes from 600 to 3600.
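
The difference between a per-request timeout and a loop-wide budget can be sketched as follows. This is a rough Python illustration, assuming "loop-wide budget" means one deadline shared by every iteration of the loop rather than a fresh timeout per request; that reading, the function name `run_with_loop_budget`, and the step-callback shape are assumptions, not foreman's actual implementation.

```python
import time
from typing import Callable, Iterable


def run_with_loop_budget(
    steps: Iterable[Callable[[float], None]],
    budget_seconds: float = 3600.0,  # mirrors the new default named in the note
    clock: Callable[[], float] = time.monotonic,
) -> int:
    """Run steps in order until they finish or the shared budget is spent.

    Each step receives the seconds remaining so it can bound its own work.
    Returns the number of steps that were started.
    """
    deadline = clock() + budget_seconds  # one deadline for the whole loop
    started = 0
    for step in steps:
        remaining = deadline - clock()
        if remaining <= 0:
            break  # budget exhausted: do not start another step
        step(remaining)
        started += 1
    return started
```

Under this reading, a long early step eats into the time available to later ones, which is why a value that was adequate as a per-request timeout may need revisiting once it governs the whole loop.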