Find the best local LLM for your hardware, ranked by benchmarks

LLM Frameworks

A command‑line tool that auto‑detects your hardware and ranks the local LLMs on HuggingFace that best fit it, based on real benchmark scores.

Python · Latest v0.5.7 · 15d ago

Features

  • Auto‑detects GPU/CPU/RAM and matches suitable models
  • Ranks models using live benchmark data with confidence weighting
  • One‑command chat (`whichllm run`) for instant inference
  • Generates ready‑to‑use Python snippets (`whichllm snippet`)
  • Simulates different GPUs before purchase (`--gpu "RTX 4090"`)
  • Provides JSON output for script integration
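
For script integration, a minimal Python sketch might wrap the CLI as shown here. The `--gpu` option comes from the feature list above; the `--json` flag name, and the placement of both options on the bare `whichllm` command, are assumptions that should be checked against the tool's own help output. The structure of the JSON payload is not described on this page, so the sketch returns whatever `json.loads` produces without assuming any field names.

```python
import json
import shutil
import subprocess

# ASSUMPTION: the flag name for JSON output is not given on this page.
JSON_FLAG = "--json"


def build_command(gpu=None, json_flag=JSON_FLAG):
    """Build the argv list for a ranking run.

    `gpu` maps to the documented `--gpu "RTX 4090"` simulation option.
    Attaching it to the bare `whichllm` command is an assumption.
    """
    cmd = ["whichllm"]
    if gpu:
        cmd += ["--gpu", gpu]
    if json_flag:
        cmd.append(json_flag)
    return cmd


def rank_models(gpu=None):
    """Run the CLI and return its parsed JSON output (schema unknown here)."""
    if shutil.which("whichllm") is None:
        raise RuntimeError("whichllm is not installed or not on PATH")
    result = subprocess.run(
        build_command(gpu),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return json.loads(result.stdout)
```

Calling `build_command("RTX 4090")` only assembles the argument list, so it can be inspected without the tool installed; `rank_models` is the part that actually invokes the CLI.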

Recent releases

Showing the 5 most recent of 8 releases.

  • v0.5.7 (Bug fix): Offload folder fixes crashes. No immediate action.
  • v0.5.6 (New feature): Speed metadata + GPU detection. No immediate action.
  • v0.5.5 (Bug fix): GGUF resolution fix. No immediate action.
  • v0.5.4 (Bug fix): APU handling fix. No immediate action.
  • v0.5.3 (Breaking risk): Tokenizer mapping fix. No immediate action.

About

  • Stars: 2,456
  • Forks: 126
  • Language: Python

Install & Platforms

Install via pip or brew.
