model2vec
Model Serving & MLOpsFeatures
- Converts any Sentence Transformer into a small static embedding model
- Reduces model size by up to 50× and speeds inference up to 500× with minimal performance loss
- Provides flagship pre‑trained models (e.g., potion-base-32M) for immediate use
Recent releases
View all 5 releases →
v0.8.1
Bug fix
Fixed POSIX path handling for Windows.
Full changelog
What's Changed
- fix: Use posix path for Windows compatibility by @stephantul in https://github.com/MinishLab/model2vec/pull/314
- increase version to 0.8.1 by @stephantul in https://github.com/MinishLab/model2vec/pull/315
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.8.0...v0.8.1
v0.8.0
Breaking risk
Breaking changes
- Python 3.9 support deprecated
Notable features
- Replace tokenizer with skeletoken
Full changelog
What's Changed
- docs: Added CITATION.cff, updated citation info by @Pringled in https://github.com/MinishLab/model2vec/pull/293
- docs: Added modelcard logo by @Pringled in https://github.com/MinishLab/model2vec/pull/294
- docs: Update default model for example by @Pringled in https://github.com/MinishLab/model2vec/pull/299
- chore: Deprecated Python 3.9 support by @Pringled in https://github.com/MinishLab/model2vec/pull/301
- chore: Update CI to run on external PRs, fix test import by @Pringled in https://github.com/MinishLab/model2vec/pull/303
- feat: replace tokenizer code with skeletoken by @stephantul in https://github.com/MinishLab/model2vec/pull/302
- feat: refactor persistence, remove from_sentence_transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/305
- fix: refix tokenizer with added token shenanigans by @stephantul in https://github.com/MinishLab/model2vec/pull/304
- fix: if layers == 0, layers were not initialized by @stephantul in https://github.com/MinishLab/model2vec/pull/306
- feat: expose random seed, small typing fix by @stephantul in https://github.com/MinishLab/model2vec/pull/308
- feat: add util for tokenizer pad id by @stephantul in https://github.com/MinishLab/model2vec/pull/310
- docs: Update results to MTEB V2 by @Pringled in https://github.com/MinishLab/model2vec/pull/311
- chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/312
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.7.0...v0.8.0
v0.7.0
Breaking risk
Breaking changes
- Rename PoolingType to PoolingMode
- Deprecate apply_zipf and use_subword parameters
Notable features
- Add vocabulary quantization
Full changelog
What's Changed
- add support for passing weight to the loss functions by @volker48 in https://github.com/MinishLab/model2vec/pull/260
- fix: padding token not recognized, update transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/265
- Fix tag train documentation by @Lhemamou in https://github.com/MinishLab/model2vec/pull/269
- chore: Added python 3.13 to pyproject and CI by @Pringled in https://github.com/MinishLab/model2vec/pull/270
- feat: add classifier freezing by @stephantul in https://github.com/MinishLab/model2vec/pull/274
- fix: remove windows tests by @stephantul in https://github.com/MinishLab/model2vec/pull/277
- feat: add configurable pad token by @stephantul in https://github.com/MinishLab/model2vec/pull/276
- feat: faster loading if model already cached by @stephantul in https://github.com/MinishLab/model2vec/pull/278
- feat: add vocabulary quantization by @stephantul in https://github.com/MinishLab/model2vec/pull/271
- fix: load faster, make quantization better by @stephantul in https://github.com/MinishLab/model2vec/pull/279
- fix: F rule, A rule, update ruff by @stephantul in https://github.com/MinishLab/model2vec/pull/281
- feat: Added embedding_dtype and vocabulary_quantization to config by @Pringled in https://github.com/MinishLab/model2vec/pull/280
- fix: Disable MPS for Torch versions >=2.8.0 by @Pringled in https://github.com/MinishLab/model2vec/pull/287
- feat: Add configurable pooling for distillation by @Pringled in https://github.com/MinishLab/model2vec/pull/288
- chore: Deprecate apply_zipf and use_subword parameters by @Pringled in https://github.com/MinishLab/model2vec/pull/289
- chore: Rename PoolingType to PoolingMode by @Pringled in https://github.com/MinishLab/model2vec/pull/290
- docs: Update main docs by @Pringled in https://github.com/MinishLab/model2vec/pull/291
- chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/292
New Contributors
- @volker48 made their first contribution in https://github.com/MinishLab/model2vec/pull/260
- @Lhemamou made their first contribution in https://github.com/MinishLab/model2vec/pull/269
Deprecation warnings ⚠️
apply_zipfanduse_subwordare now officially deprecated from distill
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.6.0...v0.7.0
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
About
Stars
2,108
Forks
121
Languages
Python
Jupyter Notebook
Makefile
Install & Platforms
Install via
pip