LLMKube

vllmkube-0.8.1 scope: llmkube Maintenance

This release keeps dependencies and maintenance posture current for teams operating this tool.

Published 1mo Containers & Orchestration

✓ No known CVEs patched

✓ No known CVEs patched in this version

Topics

ai apple-silicon autoscaling edge-computing gguf gpu

+12 more

self-hosted inference kubernetes llama-cpp llm local-llm metal mlx multi-gpu nvidia tgi vllm

Summary

AI summary

Minor fixes and improvements.

Changelog

A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track LLMKube

Get notified when new releases ship.

About LLMKube

Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API.

v0.8.1 foreman: requestTimeoutSeconds now sets loop-wide budget, default changes from 600 to 3600.

Beta — feedback welcome: [email protected]