Text Generation Web UI releases

Config change

v4.9 Mixed 2mo

Auth RCE / SSRF

MTP, Web search snippets, Electron UI

Open

v4.8 Security relevant patches CVE-2023-4863 2mo

⚠ Upgrade required

Use cuda13.1 build if `nvidia-smi` reports CUDA Version >= 13.1; otherwise use cuda12.4
ik_llama.cpp offers new quant types – choose based on preference
Portable builds now support Windows, Linux, macOS with specific GPU/ROCm/CPU variants

Notable features

Redesigned chat composer: taller input area with paperclip and action buttons pinned to bottom (Gemini/DeepSeek style)
Smooth scroll animation when sending a new message
Electron improvements – persist window bounds, add --no-electron flag, disable spellcheck in chat input

Full changelog

Changes

Redesigned chat composer: Taller input area with the paperclip and message-action buttons pinned to the bottom, similar to Gemini and DeepSeek.
Smooth scroll animation when sending a new message: Inspired by Gemini's chat UI.
Electron improvements:
- Persist window bounds and maximize state across launches.
- Add a --no-electron flag to skip the desktop window and use the web UI in the browser instead.
- Disable spellcheck in the chat input.
API: Add support for list-format content in tool and assistant messages.
Add more space below the last chat/chat-instruct message so its action buttons have breathing room.

Bug fixes

Fix speculative decoding broken by upstream llama.cpp arg renames (#7541).
Fix truncation length reverting after model load on UI reload (#7540).
Don't clear the chat input when sending a message with no model loaded (#7542).
Electron:
- Fix big character picture failing to load (#7540).
- Fix --listen mode in the launcher.
- Fix missing log colors on Windows.

Dependency updates

Update llama.cpp to https://github.com/ggml-org/llama.cpp/commit/68380ae11b564af67196afc70f10c99dbb532fa9
Update ik_llama.cpp to https://github.com/ikawrakow/ik_llama.cpp/commit/9a26522af234f8db079ae3735f35ab6c20fe2c66

Portable builds

TextGen is now a desktop app for local LLMs. Download, unzip, double-click.

[!NOTE]
NVIDIA GPU: If nvidia-smi reports CUDA Version >= 13.1, use the cuda13.1 build. Otherwise, use cuda12.4.

ik_llama.cpp is a llama.cpp fork with new quant types. If unsure, use the llama.cpp column.

Windows

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (891 MB) | Download (1.23 GB) |
| NVIDIA (CUDA 13.1) | Download (817 MB) | Download (1.33 GB) |
| AMD/Intel (Vulkan) | Download (336 MB) | — |
| AMD (ROCm 7.2) | Download (604 MB) | — |
| CPU only | Download (319 MB) | Download (334 MB) |

Linux

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (848 MB) | Download (1.20 GB) |
| NVIDIA (CUDA 13.1) | Download (803 MB) | Download (1.33 GB) |
| AMD/Intel (Vulkan) | Download (324 MB) | — |
| AMD (ROCm 7.2) | Download (396 MB) | — |
| CPU only | Download (307 MB) | Download (334 MB) |

macOS

| Architecture | llama.cpp |
|---|---|
| Apple Silicon (arm64) | Download (271 MB) |
| Intel (x86_64) | Download (283 MB) |

Updating a portable install:

Download and extract the latest version.
Replace the user_data folder with the one in your existing install. All your settings and models will be moved.

Starting with 4.0, you can also move user_data one folder up, next to the install folder. It will be detected automatically, making updates easier:

textgen-4.6/
textgen-4.7/
user_data/    <-- shared by both installs

All releases

Changes

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Security

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Security

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install:

Changes

Security

Bug fixes

Dependency updates

Portable builds

Windows

Linux

macOS

Updating a portable install: