Skip to content

LLM Harbor

v0.4.19 Feature

This release adds 3 notable features for engineering teams evaluating rollout.

Published 8d LLM Frameworks
✓ No known CVEs patched
Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai automation shell cli container docker
+10 more
self-hosted llm local mcp npm package pypi safetensors server tools

Summary

AI summary

New harbor launch command adds configurable local agentic coding tools with backend selection.

Changes in this release

Feature Medium

Adds `harbor launch` command to spawn pre-configured local agentic coding tools.

Adds `harbor launch` command to spawn pre-configured local agentic coding tools.

Source: llm_adapter@2026-05-26

Confidence: high

Feature Low

Adds Anthropic-compatible Messages API and OpenAI Responses API compatibility layers for Boost.

Adds Anthropic-compatible Messages API and OpenAI Responses API compatibility layers for Boost.

Source: llm_adapter@2026-05-26

Confidence: high

Feature Low

Adds ik_llama.cpp service integration.

Adds ik_llama.cpp service integration.

Source: llm_adapter@2026-05-26

Confidence: high

Feature Low

Adds new guides and optimized documentation screenshots.

Adds new guides and optimized documentation screenshots.

Source: llm_adapter@2026-05-26

Confidence: high

Refactor Low

Extensive Boost hardening: streaming fixes, SDK compatibility audit, request validation improvements.

Extensive Boost hardening: streaming fixes, SDK compatibility audit, request validation improvements.

Source: llm_adapter@2026-05-26

Confidence: high

Full changelog

Changes

https://github.com/user-attachments/assets/e4897391-c5a8-4391-93c3-9f8b76155f11

A new launch command to spawn pre-configured local agentic coding tools on host pointing to inference backends in Harbor.

# Uses the first reachable running backend and discovered model
harbor launch codex

# Explicit backend and model selection; launch options go before the tool name
harbor launch --backend ollama --model qwen3.5:4b codex
harbor launch --backend ollama --model qwen3.5:4b mi
harbor launch --backend llamacpp --model Qwen3.5-4B opencode
harbor launch --backend ollama --model qwen3.5:4b copilot -p "explain this repo"

# Route through Harbor Boost with web search and URL-reading tools
harbor launch --web --backend ollama --model qwen3.5:4b codex

# Claude Code uses Harbor's Ollama backend through Ollama's Anthropic-compatible API
harbor launch --backend ollama --model qwen3.5:4b claude -p "explain this repo"

# Print the computed launch config without starting the tool
harbor launch --config opencode
harbor launch --config openclaw
  • Anthropic-compatible Messages API and OpenAI Responses API compat layers for Boost
  • harbor launch command for starting services with host tool adapters
  • ik_llama.cpp service
  • Boost workflows and tools module
  • Adaptive thinking, annotations, rate limit forwarding, SSE keep-alive in compat layers
  • Extensive Boost hardening: streaming fixes, SDK compat audit, request validation
  • new guides, docs screenshots optimised

Full Changelog: https://github.com/av/harbor/compare/v0.4.18...v0.4.19

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Track LLM Harbor

Get notified when new releases ship.

Sign up free

About LLM Harbor

Containerized LLM toolkit. Run LLM backends, APIs, frontends, and additional services via a concise CLI. `Apache-2.0` `Docker/Shell`

All releases →

Beta — feedback welcome: [email protected]