LiteRT-LM

LLM Frameworks

A production‑ready orchestration layer from Google for running large language models (LLMs) with high performance and cross‑platform support.

Track releases GitHub

C++ Latest v0.14.0 · 24d ago Security brief →

Features

Cross‑platform deployment on Android, iOS, Web, Desktop and IoT devices
Hardware acceleration via GPU and NPU for peak inference speed
Multi‑modality support (vision and audio inputs)
Tool use / function calling for agentic workflows
Broad model compatibility (Gemma, Llama, Phi‑4, Qwen, etc.)

Recent releases

View all 8 releases →

No immediate action

v0.13.0 New feature 1mo

Agent skill + CLI + macOS Swift

Open

No immediate action

v0.12.0 New feature 2mo

Swift, Web JS, CLI NPU, Flutter

Open

v0.11.0 New feature 2mo

Notable features

Windows Native Support: LiteRT-LM CLI runs natively on Windows with CPU and GPU backends

Full changelog

🔥 What's New: `v0.11.0`

Gemma 4 Multi-token Prediction (MTP) Support: Supercharge Gemma 4 on-device inference with Single Position Multi Token Prediction (MTP), delivering >2x faster decode speeds on mobile GPUs with zero quality degradation (blog, documentation).
Windows Native Support: The LiteRT-LM CLI now runs natively on Windows with both CPU and GPU backend support.

View release on GitHub

v0.10.2 Feature 3mo

Notable features

Improved UI smoothness

Changelog

Various Bug fixes
Improve the UI smoothness

View release on GitHub

v0.10.1 New feature 3mo

Notable features

Support for deploying and running Gemma 4 across Linux, macOS, Windows (WSL) and Raspberry Pi
Migrated CLI from `fire` to `click`, adding `--verbose`, `--version`, improved help formatting and styled terminal output
Added direct Hugging Face model import with auto‑conversion for missing models during `run`

Full changelog

🔥 Gemma 4 support

Deploy Gemma 4 across a broad range of hardware with stellar performance (blog).

👉 Try on Linux, macOS, Windows (WSL) or Raspberry Pi with the
LiteRT-LM CLI:

litert-lm run  \
   --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
   gemma-4-E2B-it.litertlm \
   --prompt="What is the capital of France?"

Release Notes

CLI Enhancements & Migration: Migrated the CLI from fire to click, adding features like --verbose, --version, improved help formatting, and enhanced terminal output styling (#1784, #1733, #1791, #1792).
Hugging Face Integration: Added support for importing models directly from Hugging Face and implemented auto-conversion for missing models during "run" commands (#1797, #1735).
Core Performance & Features: Introduced a LiteRT-based KV cache implementation, speculative decoding support, and improved context merging for conversation history (#1601, #1793, #1742).
Platform & Build Improvements: Refactored CMake for better Android/cross-compilation support, updated the Windows build with a CPU sampler workaround, and transitioned nightly releases to Ubuntu-22.04 (#1741, #1734, #1772).
API & Documentation: Expanded the Kotlin API for response channel configuration and launched new Python API resources, including a "Getting Started" guide and a Colab notebook (#1724, #1737, #1757).

View release on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Releases

View all →

Releases per month

Cadence 2.0 / wk

Last release 4d

Tracked 8

Security

Full profile →

Security score 6.5/10

OpenSSF —

Open CVEs 0

Active maintainer

Community

GitHub stars 5,950

Forks 640

Contributors 90d 20

Open issues 457

Open PRs 300

Stars/wk velocity 0.0

HN peak 5

About

Stars

5,950

Forks

640

Languages

C++ Python TypeScript

Downloads/week

345 ↑674%

NPM Maintainers

Contributors

TypeScript

Types included ✓

View on GitHub View on npm Live demo Documentation

Install & Platforms

Platforms

linux macos windows arm64

Mobile

Android IOS

Similar tools

Luml

Langflow

Lich

Corterm

About

Stars

5,950

Forks

640

Languages

C++ Python TypeScript

Downloads/week

345 ↑674%

NPM Maintainers

Contributors

TypeScript

Types included ✓

View on GitHub View on npm Live demo Documentation

Install & Platforms

Platforms

linux macos windows arm64

Mobile

Android IOS

Similar tools

Luml

Langflow

Lich

Corterm

LiteRT-LM

Features

Recent releases

🔥 What's New: v0.11.0

🔥 Gemma 4 support

Release Notes

About

Install & Platforms

Similar tools

🔥 What's New: `v0.11.0`