This release adds 3 notable features for engineering teams evaluating rollout.
Published 22d
Model Serving & MLOps
โ No known CVEs patched
✓ No known CVEs patched in this version
Summary
AI summaryUpdates โจ Features, ๐ Documentation, and qwen-tq3 across a mixed release.
Full changelog
v0.5.0 โ 2026-05-12
โจ Features
- feat(qwen): ship froggeric chat-template fixes as default-on (84498d4)
- feat(vllm): add PR #35936 required-tool fallback overlay (28b16b5)
- feat(qwen-tq3): add CLUB3090_TQ_K1_SKIP_MTP layer-filter for PR #40914 K+1 dispatch (6b2a7d5)
๐ฏ New models + serving paths
- compose(tq3-mtp-genesis): pin to Genesis v7.72.2 known-good vLLM nightly (570fa71)
๐ Benchmarks + cross-rig data
- bench(matrix): @ygafarov first heterogeneous Ampere + Blackwell eGPU dual (1770931)
๐ Documentation
- docs(dtype-matrix): more polish โ RDNA naming, FP8 maturity caveats, AMD detection (62b3b45)
- docs(dtype-matrix): polish nuances + add Intel and AMD vendor sections (3d4548c)
- docs(dtype-matrix): per-arch hardware accelerator matrix for compose optimization (9c6d3cf)
- docs(faq): add 'INT8 PTH doesn't scale at concurrency โ is that a bug?' (df53287)
- docs(tq3-mtp): writeup + charts for the Genesis-backed TQ3+MTP path (c2b1c93)
- docs(qwen-tq3): close round-4 โ #40914 not shippable, route to nomtp + Genesis (9fba037)
- docs(qwen-tq3): re-tombstone tq3-mtp.yml after round-3 MTP-skip validation (063d3e9)
๐งน Maintenance
- refactor(qwen): rename int8-tq3 โ tq3-* family + add no-MTP + Genesis variants (6182922)
[Pin: git checkout v0.5.0] ยท Full diff
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About noonghunna/club-3090
All releases โBeta — feedback welcome: [email protected]