This release adds 5 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+12 more
Affected surfaces
Summary
AI summaryBroad release touches Other Changes, chore, π Dependencies, and deps.
Changes in this release
| Type | Severity | Summary | CVE |
|---|---|---|---|
| Feature | Medium |
Add Model routing, PII filtering, and Cloud model proxies middleware. Add Model routing, PII filtering, and Cloud model proxies middleware. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Feature | Medium |
Introduce gated XβLocalAIβNode response header (middleware + wrapper). Introduce gated XβLocalAIβNode response header (middleware + wrapper). Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Feature | Medium |
Add LTXβ2 support and LTXβ2.3 GGUF gallery entries for stablediffusionβggml. Add LTXβ2 support and LTXβ2.3 GGUF gallery entries for stablediffusionβggml. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Feature | Low |
Mux LTXβ2 audio into the output MP4 for stablediffusionβggml. Mux LTXβ2 audio into the output MP4 for stablediffusionβggml. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Feature | Low |
Update Swagger documentation. Update Swagger documentation. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Feature | Low |
Add native object detection and segmentation backend (rfdetrβcpp). Add native object detection and segmentation backend (rfdetrβcpp). Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Dependency | Low |
Bump github.com/nats-io/nats.go from 1.50.0 to 1.52.0. Bump github.com/nats-io/nats.go from 1.50.0 to 1.52.0. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Dependency | Low |
Bump github.com/aws/aws-sdk-go-v2/credentials from 1.19.15 to 1.19.17. Bump github.com/aws/aws-sdk-go-v2/credentials from 1.19.15 to 1.19.17. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Dependency | Low |
Bump actions/stale from 10.2.0 to 10.3.0. Bump actions/stale from 10.2.0 to 10.3.0. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Dependency | Low |
Bump sentence-transformers from 5.5.0 to 5.5.1 in /backend/python/transformers. Bump sentence-transformers from 5.5.0 to 5.5.1 in /backend/python/transformers. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Dependency | Low |
Update transformers requirement from >=5.8.1 to >=5.9.0 in /backend/python/transformers. Update transformers requirement from >=5.8.1 to >=5.9.0 in /backend/python/transformers. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Dependency | Low |
Bump protobuf from 6.33.5 to 7.35.0 in /backend/python/transformers. Bump protobuf from 6.33.5 to 7.35.0 in /backend/python/transformers. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Bugfix | Medium |
Fix VRAM detection in intel component. Fix VRAM detection in intel component. Source: llm_adapter@2026-05-27 Confidence: high |
β |
| Bugfix | Low |
Persist per-model load info so distributed reconciler survives frontend restart. Persist per-model load info so distributed reconciler survives frontend restart. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Sync gallery OpCache and caches across frontend replicas in distributed mode. Sync gallery OpCache and caches across frontend replicas in distributed mode. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Add diffusion_model flag to all LTXβ2.3 variants in the model gallery. Add diffusion_model flag to all LTXβ2.3 variants in the model gallery. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Set vae_decode_only:false for i2v and flf2v variants of LTXβ2.3. Set vae_decode_only:false for i2v and flf2v variants of LTXβ2.3. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Prevent thinkstep leakage into content when autoparser is in pure-content mode (reasoning). Prevent thinkstep leakage into content when autoparser is in pure-content mode (reasoning). Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Exclude localβonly artifacts from Docker build context via .dockerignore. Exclude localβonly artifacts from Docker build context via .dockerignore. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Share a single /api/operations poller across ReactβUI consumers to reduce load. Share a single /api/operations poller across ReactβUI consumers to reduce load. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Polish 'Fits in my GPU' filter to use designβsystem Toggle component. Polish 'Fits in my GPU' filter to use designβsystem Toggle component. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Bugfix | Low |
Force .check() on hidden Toggle input for e2e testing of the fitsβfilter feature. Force .check() on hidden Toggle input for e2e testing of the fitsβfilter feature. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
| Other | Low |
Add 'Fits in my GPU' filter on Install Models UI page. Add 'Fits in my GPU' filter on Install Models UI page. Source: granite4.1:30b@2026-05-27-audit Confidence: low |
β |
Full changelog
What's Changed
π Dependencies
- chore(deps): bump github.com/nats-io/nats.go from 1.50.0 to 1.52.0 by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10003
- chore(deps): bump github.com/aws/aws-sdk-go-v2/credentials from 1.19.15 to 1.19.17 by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10008
- chore(deps): bump actions/stale from 10.2.0 to 10.3.0 by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10002
- chore(deps): bump sentence-transformers from 5.5.0 to 5.5.1 in /backend/python/transformers by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10007
- chore(deps): update transformers requirement from >=5.8.1 to >=5.9.0 in /backend/python/transformers by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10005
- chore(deps): bump protobuf from 6.33.5 to 7.35.0 in /backend/python/transformers by @dependabot[bot] in https://github.com/mudler/LocalAI/pull/10004
Other Changes
- feat(middleware): Model routing, PII filtering, Cloud model proxies by @richiejp in https://github.com/mudler/LocalAI/pull/9802
- fix(intel): VRAM detection by @richiejp in https://github.com/mudler/LocalAI/pull/9944
- feat(distributed): gated X-LocalAI-Node response header (middleware + wrapper) by @localai-bot in https://github.com/mudler/LocalAI/pull/9976
- fix(distributed): persist per-model load info so reconciler survives frontend restart by @localai-bot in https://github.com/mudler/LocalAI/pull/9981
- feat(stablediffusion-ggml): LTX-2 support + LTX-2.3 GGUF gallery entries by @localai-bot in https://github.com/mudler/LocalAI/pull/9980
- fix(distributed): sync gallery OpCache + caches across frontend replicas by @localai-bot in https://github.com/mudler/LocalAI/pull/9983
- fix(gallery/ltx-2.3): add diffusion_model flag to all variants by @mudler in https://github.com/mudler/LocalAI/pull/9986
- fix(gallery/ltx-2.3): add vae_decode_only:false for i2v / flf2v by @mudler in https://github.com/mudler/LocalAI/pull/9987
- fix(reasoning): stop leaking into content when autoparser is in pure-content mode by @localai-bot in https://github.com/mudler/LocalAI/pull/9991
- fix(stablediffusion-ggml): mux LTX-2 audio into output MP4 by @localai-bot in https://github.com/mudler/LocalAI/pull/9990
- feat(swagger): update swagger by @localai-bot in https://github.com/mudler/LocalAI/pull/9992
- docs: :arrow_up: update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/9993
- fix(streaming/tools): stop healing-marker stubs from gating off content by @localai-bot in https://github.com/mudler/LocalAI/pull/9999
- chore: :arrow_up: Update antirez/ds4 to
ad0209f6a4b067574d2b4afe896c08c177156b31by @localai-bot in https://github.com/mudler/LocalAI/pull/9996 - chore: :arrow_up: Update ikawrakow/ik_llama.cpp to
b4e1d916c5ec7e75ea3c124dd090425a99fc613fby @localai-bot in https://github.com/mudler/LocalAI/pull/9995 - chore: :arrow_up: Update leejet/stable-diffusion.cpp to
1ceb5bd9df7784bcdf67dd9ed8bf0198b542ebc9by @localai-bot in https://github.com/mudler/LocalAI/pull/9994 - chore: :arrow_up: Update ggml-org/whisper.cpp to
e0fd1f6787a5bd4a4957dd97c5b64df882ee7b0cby @localai-bot in https://github.com/mudler/LocalAI/pull/9997 - fix(streaming/tools): don't leak prefill-misclassified content as trailing reasoning chunk by @localai-bot in https://github.com/mudler/LocalAI/pull/10000
- chore: :arrow_up: Update ggml-org/llama.cpp to
35c9b1f39ebe5a7bb83986d64415a079218be78dby @localai-bot in https://github.com/mudler/LocalAI/pull/9998 - chore(model gallery): :robot: add 1 new models via gallery agent by @localai-bot in https://github.com/mudler/LocalAI/pull/10011
- fix(dockerignore): exclude local-only artifacts from build context by @richiejp in https://github.com/mudler/LocalAI/pull/10015
- chore(model gallery): :robot: add 1 new models via gallery agent by @localai-bot in https://github.com/mudler/LocalAI/pull/10016
- test: add Go + React UI coverage gates and fill test gaps by @richiejp in https://github.com/mudler/LocalAI/pull/9989
- fix(qwen-asr): enable timestamp output when forced_aligner is configured by @fqscfqj in https://github.com/mudler/LocalAI/pull/10013
- fix(nemo): extract Hypothesis.text for TDT/RNNT ASR models by @fqscfqj in https://github.com/mudler/LocalAI/pull/10012
- chore: :arrow_up: Update ikawrakow/ik_llama.cpp to
d2da6da05c73aeb658a3d1751f386c24e6963856by @localai-bot in https://github.com/mudler/LocalAI/pull/10020 - chore: :arrow_up: Update ggml-org/whisper.cpp to
27101c01dcac1676e2b6422256233cd0f1f9ae28by @localai-bot in https://github.com/mudler/LocalAI/pull/10021 - chore: :arrow_up: Update ggml-org/llama.cpp to
0d18aaa9d1a8af3df9abccd828e22eeaac7f840bby @localai-bot in https://github.com/mudler/LocalAI/pull/10022 - chore: :arrow_up: Update leejet/stable-diffusion.cpp to
92dc7268fc4ffb0c0cc0bd52dfcefea91326e797by @localai-bot in https://github.com/mudler/LocalAI/pull/10023 - chore: :arrow_up: Update antirez/ds4 to
e8e8779b261c10f36ad6270ba732c8f0be5b62e3by @localai-bot in https://github.com/mudler/LocalAI/pull/10024 - UI: add 'Fits in my GPU' filter on Install Models by @siddimore in https://github.com/mudler/LocalAI/pull/10017
- fix(react-ui): share single /api/operations poller across consumers by @localai-bot in https://github.com/mudler/LocalAI/pull/10029
- feat(backend): rfdetr-cpp native object detection + segmentation backend by @localai-bot in https://github.com/mudler/LocalAI/pull/10028
- fix(react-ui): polish 'Fits in my GPU' filter to use design-system Toggle by @localai-bot in https://github.com/mudler/LocalAI/pull/10030
- fix(react-ui): force .check() on hidden Toggle input in fits-filter e2e by @localai-bot in https://github.com/mudler/LocalAI/pull/10031
New Contributors
- @fqscfqj made their first contribution in https://github.com/mudler/LocalAI/pull/10013
Full Changelog: https://github.com/mudler/LocalAI/compare/v4.3.1...v4.3.2
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Related context
Related tools
Beta — feedback welcome: [email protected]