This release includes breaking changes for platform teams planning a safe upgrade.
✓ No known CVEs patched in this version
ReleasePort's take
Light signalTinySearch v0.1.1 moves ONNX and tokenizer blobs from repository to first-start download. Embedding runtime adds token_type_ids support for graph operations.
Why it matters: Reduces repository footprint for faster clones and checkouts. First deployment may experience download latency; test in dev. Routine maintenance; no breaking changes or migration deadline.
Summary
AI summaryUpdates Highlights, Notes for upgraders, and TinySearch v0.1.1 across a mixed release.
Changes in this release
| Type | Severity | Summary | CVE |
|---|---|---|---|
| Feature | Medium |
ONNX bundle downloads on first start when embedding_backend is default. ONNX bundle downloads on first start when embedding_backend is default. Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Feature | Medium |
Embedding runtime supports graphs expecting token_type_ids (zeros when required). Embedding runtime supports graphs expecting token_type_ids (zeros when required). Source: llm_adapter@2026-05-21 Confidence: high |
— |
| Feature | Medium |
Export script shares HF model ID via services/onnx_bundle_constants.py. Export script shares HF model ID via services/onnx_bundle_constants.py. Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Performance | Medium |
Benchmark prefetches ONNX bundle using ensure logic when required. Benchmark prefetches ONNX bundle using ensure logic when required. Source: llm_adapter@2026-05-21 Confidence: low |
— |
| Deprecation | Medium |
ONNX and tokenizer blobs removed from git repository. ONNX and tokenizer blobs removed from git repository. Source: llm_adapter@2026-05-21 Confidence: low |
— |
Full changelog
TinySearch v0.1.1
Highlights
- ONNX bundle on first start: When
embedding_backendisdefault, MCP and FastAPI callensure_onnx_bundle_sync()so the MiniLM ONNX bundle is downloaded once intomodels/all-minilm-l6-v2-onnx/(orTINYSEARCH_ONNX_MODEL_DIR) instead of shipping large weights in the repo. - Smaller repo / clearer licensing: ONNX and tokenizer blobs are removed from git (
.gitkeepkeeps the folder). README adds a License section: MIT for project code; downloaded / optional ONNX weights remain Apache-2.0 (Hugging Face model cards apply). - Embedding runtime: ONNX inference supports graphs that expect
token_type_ids(zeros when required), matching the prebuilt bundle fromonnx-models/all-MiniLM-L6-v2-onnx. - Export script: Shares HF model id via
services/onnx_bundle_constants.pywith the export path. - Benchmark:
scripts/benchmark_mcp_research.pyprefetches the bundle with the same ensure logic when_REQUIRE_ONNX_BUNDLEis true and the resolved backend isdefault; errors if ONNX is required but the config uses a non-default embedding backend.
Notes for upgraders
- After pull, run the app or benchmark once (with network) so the bundle downloads, or run
scripts/export_embedding_onnx.pyto generate files locally. .gitignorenow ignores the bundle files undermodels/all-minilm-l6-v2-onnx/; do not expectmodel.onnxin the clone anymore.
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About TinySearch
All releases →Related context
Beta — feedback welcome: [email protected]