Skip to content

LLMKube

v0.7.12 Feature

This release adds 1 notable feature for engineering teams evaluating rollout.

✓ No known CVEs patched
Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai apple-silicon autoscaling edge-computing gguf gpu
+12 more
self-hosted inference kubernetes llama-cpp llm local-llm metal mlx multi-gpu nvidia tgi vllm

ReleasePort's take

Light signal
editorial:auto 10d

Release v0.7.12 adds a workload reconciler with stub planner to foreman/m6 and fixes gate job handling in foreman/m4, while improving chart wiring and documentation across foreman.

Why it matters: Affects developers using foreman/m6 for new reconciler workflows; SREs must update foreman/m4 jobs to respect payload.branch and remote URLs. Documentation updates require review before next deployment cycle.

Summary

AI summary

Updates 0.7.12, Bug Fixes, and 2026-05-24 across a mixed release.

Changes in this release

Feature Medium

Adds workload reconciler with stub planner in foreman/m6.

Adds workload reconciler with stub planner in foreman/m6.

Source: llm_adapter@2026-05-24

Confidence: high

Bugfix Medium

Fixes gate Job to honor payload.branch and clone from --git-remote-url in foreman/m4.

Fixes gate Job to honor payload.branch and clone from --git-remote-url in foreman/m4.

Source: llm_adapter@2026-05-24

Confidence: high

Bugfix Medium

Improves chart wiring of --workspace-dir and tightens native-mode documentation in foreman.

Improves chart wiring of --workspace-dir and tightens native-mode documentation in foreman.

Source: llm_adapter@2026-05-24

Confidence: low

Full changelog

0.7.12 (2026-05-24)

Features

  • foreman/m6: Workload reconciler with stub planner (explicit pipeline + issue-batch shortcut) (#533) (dbdcd46)

Bug Fixes

  • foreman/m4: gate Job honors payload.branch + clones from --git-remote-url (#529) (905a269)
  • foreman: chart wires --workspace-dir + tightens docs for native-mode required values (#534) (1c43c69)

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track LLMKube

Get notified when new releases ship.

Sign up free

About LLMKube

Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API.

All releases →

Related context

Earlier breaking changes

  • v0.8.1 foreman: requestTimeoutSeconds now sets loop-wide budget, default changes from 600 to 3600.

Beta — feedback welcome: [email protected]