This release adds 2 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+12 more
ReleasePort's take
Moderate signalVersion v4.2.4 corrects X‑Forwarded-Prefix handling so proxied requests are stripped properly.
Why it matters: If your deployment uses a reverse proxy that strips request prefixes, upgrade to v4.2.4 immediately to avoid routing errors.
Summary
AI summaryFixed HTTP X-Forwarded-Prefix handling to correctly strip proxy prefixes.
Changes in this release
| Type | Severity | Summary | CVE |
|---|---|---|---|
| Feature | Medium |
parse VRAM budget/usage from vulkaninfo parse VRAM budget/usage from vulkaninfo Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Feature | Medium |
Add Liquid Audio s2s model and assistant mode on talk page Add Liquid Audio s2s model and assistant mode on talk page Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Dependency | Medium |
Update ggml-org/llama.cpp to a9883db8ee021cf16783016a60996d41820b5195 Update ggml-org/llama.cpp to a9883db8ee021cf16783016a60996d41820b5195 Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Dependency | Medium |
Update TheTom/llama-cpp-turboquant to 5aeb2fdbe26cd4c534c6fa15de73cb5749bd0403 Update TheTom/llama-cpp-turboquant to 5aeb2fdbe26cd4c534c6fa15de73cb5749bd0403 Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Dependency | Medium |
Update antirez/ds4 to 0cba357ca1bc0e7510421cc26888e420ea942123 Update antirez/ds4 to 0cba357ca1bc0e7510421cc26888e420ea942123 Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Dependency | Medium |
Update ikawrakow/ik_llama.cpp to 949bb8f1d660fc1264c137a6f3dbd619375f6134 Update ikawrakow/ik_llama.cpp to 949bb8f1d660fc1264c137a6f3dbd619375f6134 Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Dependency | Medium |
Update ggml-org/whisper.cpp to 3e9b7d0fef3528ee2208da3cdb873a2c53d2ae2f Update ggml-org/whisper.cpp to 3e9b7d0fef3528ee2208da3cdb873a2c53d2ae2f Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Bugfix | Medium |
cascade-clean stale node_models rows and filter routing by healthy status cascade-clean stale node_models rows and filter routing by healthy status Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Bugfix | Medium |
honor X-Forwarded-Prefix when proxy strips the prefix honor X-Forwarded-Prefix when proxy strips the prefix Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Bugfix | Medium |
close truncate-then-read race in agent_jobs.json persistence close truncate-then-read race in agent_jobs.json persistence Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Bugfix | Medium |
parse OpenAI-spec tool_choice in /v1/chat/completions parse OpenAI-spec tool_choice in /v1/chat/completions Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Other | Medium |
update docs version mudler/LocalAI update docs version mudler/LocalAI Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Other | Medium |
publish missing :latest-* and :v<X>-* singleton image tags publish missing :latest-* and :v<X>-* singleton image tags Source: llm_adapter@2026-05-21 Confidence: low |
— |
Full changelog
What's Changed
Bug fixes :bug:
- fix(distributed): cascade-clean stale node_models rows + filter routing by healthy status by @localai-bot in https://github.com/mudler/LocalAI/pull/9754
- fix(http): honor X-Forwarded-Prefix when proxy strips the prefix by @Dennisadira in https://github.com/mudler/LocalAI/pull/9614
- fix(agentpool): close truncate-then-read race in agent_jobs.json persistence by @localai-bot in https://github.com/mudler/LocalAI/pull/9811
- fix(middleware): parse OpenAI-spec tool_choice in /v1/chat/completions by @Anai-Guo in https://github.com/mudler/LocalAI/pull/9559
Exciting New Features 🎉
- feat: also parse VRAM budget/usage from vulkaninfo by @eglia in https://github.com/mudler/LocalAI/pull/9800
- feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page by @richiejp in https://github.com/mudler/LocalAI/pull/9801
Other Changes
- chore: :arrow_up: Update ggml-org/llama.cpp to
a9883db8ee021cf16783016a60996d41820b5195by @localai-bot in https://github.com/mudler/LocalAI/pull/9796 - chore: :arrow_up: Update TheTom/llama-cpp-turboquant to
5aeb2fdbe26cd4c534c6fa15de73cb5749bd0403by @localai-bot in https://github.com/mudler/LocalAI/pull/9740 - docs: :arrow_up: update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/9805
- chore: :arrow_up: Update antirez/ds4 to
0cba357ca1bc0e7510421cc26888e420ea942123by @localai-bot in https://github.com/mudler/LocalAI/pull/9806 - chore: :arrow_up: Update ikawrakow/ik_llama.cpp to
949bb8f1d660fc1264c137a6f3dbd619375f6134by @localai-bot in https://github.com/mudler/LocalAI/pull/9807 - chore: :arrow_up: Update ggml-org/whisper.cpp to
3e9b7d0fef3528ee2208da3cdb873a2c53d2ae2fby @localai-bot in https://github.com/mudler/LocalAI/pull/9808 - ci(image): publish missing :latest-* and :v-* singleton image tags by @localai-bot in https://github.com/mudler/LocalAI/pull/9812
Full Changelog: https://github.com/mudler/LocalAI/compare/v4.2.3...v4.2.4
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Related context
Related tools
Beta — feedback welcome: [email protected]