Skip to content

noonghunna/club-3090

v0.8.1 Bugfix

This release fixes issues for SREs watching stability and regressions.

Published 17d Model Serving & MLOps
โœ“ No known CVEs patched
Read the diff โ†’ Tool health โ†’ What is this tool? โ†’

✓ No known CVEs patched in this version

Summary

AI summary

Updates ๐Ÿ“ Documentation, ๐Ÿ› Bug fixes, and v0.8.1 โ€” 2026-05-17 across a mixed release.

Full changelog

v0.8.1 โ€” fix / docs-fidelity release. A focused patch stack shipped deliberately before any v0.8.x feature work: pull argparse usage errors now exit 64 not 2 (#370), the missing Gemma-4-31b vllm-pr41800 overlay vendored + registered (#153 / #155), the newer-driver 3090 MAX_MODEL_LEN=105000 HARDWARE anchor (#149), the EXAMPLES.md thinking-default correction (#372), plus the v0.8.0 docs-fidelity cleanup. No model-behavior change; all gated by the full pre-tag suite at the tag commit.

Post-tag follow-ups โ€” NOT in the v0.8.1 tree (transparency note).
These landed on master after this tag and ship in the next release; they are listed here only so a v0.8.1-pinned reader isn't surprised. They are not part of the v0.8.1 release tree:

  • #156 โ€” report.sh lspci PCIe/P2P diagnostics (LnkSta / ACS / topology) for #137/#351 (closes #148)
  • #157 โ€” froggeric chat-template v19 adopted (closes #150): +10pp hermesagent-20 (50โ†’60%), 7 packs flat, streaming/โ€‹soak clean, TPS-neutral

If you track master you already have both. If you pin the v0.8.1 tag or GHCR image, you get them at the next release. The CHANGELOG/Bug-fix list below is auto-generated and correctly reflects only the v0.8.1 tree.


v0.8.1 โ€” 2026-05-17

๐Ÿ› Bug fixes

  • fix(patch-attribution): register vendored gemma-4-31b pr41800 overlay (follow-up to #153/#154) (b4b20ff)
  • fix(gemma-4-31b): vendor missing vllm-pr41800 overlay into the model tree (closes #153) (9c79192)
  • fix(pull): argparse usage errors exit 64, not 2 โ€” distinguishable from honest hard-stop (#370) (820eb38)

๐Ÿ“ Documentation

  • docs(examples): correct "thinking on by default" โ€” shipped composes set enable_thinking=false (#372) (46bb271)
  • docs(hardware): newer-driver 3090 caps long-text.yml at MAX_MODEL_LEN=105000 (#149) (b0774f9)
  • docs: fix v0.8.0 docs-fidelity gaps (trc-ack first-run heads-up, exit-code honesty, GGUF message claim) (78a7dee)
  • docs: cross-link the v0.8.0 universal pull flow from the existing user guides (afe56f7)

[Pin: git checkout v0.8.1] ยท Full diff

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track noonghunna/club-3090

Get notified when new releases ship.

Sign up free

About noonghunna/club-3090

All releases โ†’

Related context

Earlier breaking changes

  • v0.8.7 Genesis vLLM composes deprecated; default to `vllm/minimal`.
  • v0.8.6 Compose paths moved to `models/<model>/<engine>/compose/<topology>/<quant>/<serving>.yml`.

Beta — feedback welcome: [email protected]