Release history
model2vec releases
All releases
5 shown
v0.8.1
Bug fix
Fixed POSIX path handling for Windows.
Full changelog
What's Changed
- fix: Use posix path for Windows compatibility by @stephantul in https://github.com/MinishLab/model2vec/pull/314
- increase version to 0.8.1 by @stephantul in https://github.com/MinishLab/model2vec/pull/315
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.8.0...v0.8.1
v0.8.0
Breaking risk
Breaking changes
- Python 3.9 support deprecated
Notable features
- Replace tokenizer with skeletoken
Full changelog
What's Changed
- docs: Added CITATION.cff, updated citation info by @Pringled in https://github.com/MinishLab/model2vec/pull/293
- docs: Added modelcard logo by @Pringled in https://github.com/MinishLab/model2vec/pull/294
- docs: Update default model for example by @Pringled in https://github.com/MinishLab/model2vec/pull/299
- chore: Deprecated Python 3.9 support by @Pringled in https://github.com/MinishLab/model2vec/pull/301
- chore: Update CI to run on external PRs, fix test import by @Pringled in https://github.com/MinishLab/model2vec/pull/303
- feat: replace tokenizer code with skeletoken by @stephantul in https://github.com/MinishLab/model2vec/pull/302
- feat: refactor persistence, remove from_sentence_transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/305
- fix: refix tokenizer with added token shenanigans by @stephantul in https://github.com/MinishLab/model2vec/pull/304
- fix: if layers == 0, layers were not initialized by @stephantul in https://github.com/MinishLab/model2vec/pull/306
- feat: expose random seed, small typing fix by @stephantul in https://github.com/MinishLab/model2vec/pull/308
- feat: add util for tokenizer pad id by @stephantul in https://github.com/MinishLab/model2vec/pull/310
- docs: Update results to MTEB V2 by @Pringled in https://github.com/MinishLab/model2vec/pull/311
- chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/312
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.7.0...v0.8.0
v0.7.0
Breaking risk
Breaking changes
- Rename PoolingType to PoolingMode
- Deprecate apply_zipf and use_subword parameters
Notable features
- Add vocabulary quantization
Full changelog
What's Changed
- add support for passing weight to the loss functions by @volker48 in https://github.com/MinishLab/model2vec/pull/260
- fix: padding token not recognized, update transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/265
- Fix tag train documentation by @Lhemamou in https://github.com/MinishLab/model2vec/pull/269
- chore: Added python 3.13 to pyproject and CI by @Pringled in https://github.com/MinishLab/model2vec/pull/270
- feat: add classifier freezing by @stephantul in https://github.com/MinishLab/model2vec/pull/274
- fix: remove windows tests by @stephantul in https://github.com/MinishLab/model2vec/pull/277
- feat: add configurable pad token by @stephantul in https://github.com/MinishLab/model2vec/pull/276
- feat: faster loading if model already cached by @stephantul in https://github.com/MinishLab/model2vec/pull/278
- feat: add vocabulary quantization by @stephantul in https://github.com/MinishLab/model2vec/pull/271
- fix: load faster, make quantization better by @stephantul in https://github.com/MinishLab/model2vec/pull/279
- fix: F rule, A rule, update ruff by @stephantul in https://github.com/MinishLab/model2vec/pull/281
- feat: Added embedding_dtype and vocabulary_quantization to config by @Pringled in https://github.com/MinishLab/model2vec/pull/280
- fix: Disable MPS for Torch versions >=2.8.0 by @Pringled in https://github.com/MinishLab/model2vec/pull/287
- feat: Add configurable pooling for distillation by @Pringled in https://github.com/MinishLab/model2vec/pull/288
- chore: Deprecate apply_zipf and use_subword parameters by @Pringled in https://github.com/MinishLab/model2vec/pull/289
- chore: Rename PoolingType to PoolingMode by @Pringled in https://github.com/MinishLab/model2vec/pull/290
- docs: Update main docs by @Pringled in https://github.com/MinishLab/model2vec/pull/291
- chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/292
New Contributors
- @volker48 made their first contribution in https://github.com/MinishLab/model2vec/pull/260
- @Lhemamou made their first contribution in https://github.com/MinishLab/model2vec/pull/269
Deprecation warnings ⚠️
apply_zipfanduse_subwordare now officially deprecated from distill
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.6.0...v0.7.0
v0.6.0
New feature
Notable features
- Smaller tokenizers
- Multilingual results support
Full changelog
What's Changed
- docs: update chonkie link on tutorial readme by @iaurg in https://github.com/MinishLab/model2vec/pull/235
- Fix dates in README.md by @Pringled in https://github.com/MinishLab/model2vec/pull/238
- fix: add default arg for push_to_hub by @stephantul in https://github.com/MinishLab/model2vec/pull/240
- fix: remove direct dependency on specific hf utils by @stephantul in https://github.com/MinishLab/model2vec/pull/244
- feat: smaller tokenizers by @stephantul in https://github.com/MinishLab/model2vec/pull/243
- feat: update lock by @stephantul in https://github.com/MinishLab/model2vec/pull/246
- feat: allow passing validation set explicitly by @JarbasAl in https://github.com/MinishLab/model2vec/pull/245
- docs: Added multilingual results by @Pringled in https://github.com/MinishLab/model2vec/pull/247
- fix: distillation for models without card by @JarbasAl in https://github.com/MinishLab/model2vec/pull/248
- feat: add supertokenizers by @stephantul in https://github.com/MinishLab/model2vec/pull/236
- clean-up print statement by @stephantul in https://github.com/MinishLab/model2vec/pull/249
- fix: small typing issue by @stephantul in https://github.com/MinishLab/model2vec/pull/250
- docs: Added new logo by @Pringled in https://github.com/MinishLab/model2vec/pull/252
- fix: missing unk, fix bug by @stephantul in https://github.com/MinishLab/model2vec/pull/251
- bump version by @stephantul in https://github.com/MinishLab/model2vec/pull/258
- feat: make normalization dependent on spacing by @stephantul in https://github.com/MinishLab/model2vec/pull/259
New Contributors
- @iaurg made their first contribution in https://github.com/MinishLab/model2vec/pull/235
- @JarbasAl made their first contribution in https://github.com/MinishLab/model2vec/pull/245
Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.5.0...v0.6.0