Skip to content

Release history

model2vec releases

All releases

5 shown

No immediate action
v0.8.2 Mixed

Training refactor + new features

v0.8.1 Bug fix

Fixed POSIX path handling for Windows.

Full changelog

What's Changed

  • fix: Use posix path for Windows compatibility by @stephantul in https://github.com/MinishLab/model2vec/pull/314
  • increase version to 0.8.1 by @stephantul in https://github.com/MinishLab/model2vec/pull/315

Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.8.0...v0.8.1

v0.8.0 Breaking risk
Breaking changes
  • Python 3.9 support deprecated
Notable features
  • Replace tokenizer with skeletoken
Full changelog

What's Changed

  • docs: Added CITATION.cff, updated citation info by @Pringled in https://github.com/MinishLab/model2vec/pull/293
  • docs: Added modelcard logo by @Pringled in https://github.com/MinishLab/model2vec/pull/294
  • docs: Update default model for example by @Pringled in https://github.com/MinishLab/model2vec/pull/299
  • chore: Deprecated Python 3.9 support by @Pringled in https://github.com/MinishLab/model2vec/pull/301
  • chore: Update CI to run on external PRs, fix test import by @Pringled in https://github.com/MinishLab/model2vec/pull/303
  • feat: replace tokenizer code with skeletoken by @stephantul in https://github.com/MinishLab/model2vec/pull/302
  • feat: refactor persistence, remove from_sentence_transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/305
  • fix: refix tokenizer with added token shenanigans by @stephantul in https://github.com/MinishLab/model2vec/pull/304
  • fix: if layers == 0, layers were not initialized by @stephantul in https://github.com/MinishLab/model2vec/pull/306
  • feat: expose random seed, small typing fix by @stephantul in https://github.com/MinishLab/model2vec/pull/308
  • feat: add util for tokenizer pad id by @stephantul in https://github.com/MinishLab/model2vec/pull/310
  • docs: Update results to MTEB V2 by @Pringled in https://github.com/MinishLab/model2vec/pull/311
  • chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/312

Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.7.0...v0.8.0

v0.7.0 Breaking risk
Breaking changes
  • Rename PoolingType to PoolingMode
  • Deprecate apply_zipf and use_subword parameters
Notable features
  • Add vocabulary quantization
Full changelog

What's Changed

  • add support for passing weight to the loss functions by @volker48 in https://github.com/MinishLab/model2vec/pull/260
  • fix: padding token not recognized, update transformers by @stephantul in https://github.com/MinishLab/model2vec/pull/265
  • Fix tag train documentation by @Lhemamou in https://github.com/MinishLab/model2vec/pull/269
  • chore: Added python 3.13 to pyproject and CI by @Pringled in https://github.com/MinishLab/model2vec/pull/270
  • feat: add classifier freezing by @stephantul in https://github.com/MinishLab/model2vec/pull/274
  • fix: remove windows tests by @stephantul in https://github.com/MinishLab/model2vec/pull/277
  • feat: add configurable pad token by @stephantul in https://github.com/MinishLab/model2vec/pull/276
  • feat: faster loading if model already cached by @stephantul in https://github.com/MinishLab/model2vec/pull/278
  • feat: add vocabulary quantization by @stephantul in https://github.com/MinishLab/model2vec/pull/271
  • fix: load faster, make quantization better by @stephantul in https://github.com/MinishLab/model2vec/pull/279
  • fix: F rule, A rule, update ruff by @stephantul in https://github.com/MinishLab/model2vec/pull/281
  • feat: Added embedding_dtype and vocabulary_quantization to config by @Pringled in https://github.com/MinishLab/model2vec/pull/280
  • fix: Disable MPS for Torch versions >=2.8.0 by @Pringled in https://github.com/MinishLab/model2vec/pull/287
  • feat: Add configurable pooling for distillation by @Pringled in https://github.com/MinishLab/model2vec/pull/288
  • chore: Deprecate apply_zipf and use_subword parameters by @Pringled in https://github.com/MinishLab/model2vec/pull/289
  • chore: Rename PoolingType to PoolingMode by @Pringled in https://github.com/MinishLab/model2vec/pull/290
  • docs: Update main docs by @Pringled in https://github.com/MinishLab/model2vec/pull/291
  • chore: Bump version by @Pringled in https://github.com/MinishLab/model2vec/pull/292

New Contributors

  • @volker48 made their first contribution in https://github.com/MinishLab/model2vec/pull/260
  • @Lhemamou made their first contribution in https://github.com/MinishLab/model2vec/pull/269

Deprecation warnings ⚠️

  • apply_zipf and use_subword are now officially deprecated from distill

Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.6.0...v0.7.0

v0.6.0 New feature
Notable features
  • Smaller tokenizers
  • Multilingual results support
Full changelog

What's Changed

  • docs: update chonkie link on tutorial readme by @iaurg in https://github.com/MinishLab/model2vec/pull/235
  • Fix dates in README.md by @Pringled in https://github.com/MinishLab/model2vec/pull/238
  • fix: add default arg for push_to_hub by @stephantul in https://github.com/MinishLab/model2vec/pull/240
  • fix: remove direct dependency on specific hf utils by @stephantul in https://github.com/MinishLab/model2vec/pull/244
  • feat: smaller tokenizers by @stephantul in https://github.com/MinishLab/model2vec/pull/243
  • feat: update lock by @stephantul in https://github.com/MinishLab/model2vec/pull/246
  • feat: allow passing validation set explicitly by @JarbasAl in https://github.com/MinishLab/model2vec/pull/245
  • docs: Added multilingual results by @Pringled in https://github.com/MinishLab/model2vec/pull/247
  • fix: distillation for models without card by @JarbasAl in https://github.com/MinishLab/model2vec/pull/248
  • feat: add supertokenizers by @stephantul in https://github.com/MinishLab/model2vec/pull/236
  • clean-up print statement by @stephantul in https://github.com/MinishLab/model2vec/pull/249
  • fix: small typing issue by @stephantul in https://github.com/MinishLab/model2vec/pull/250
  • docs: Added new logo by @Pringled in https://github.com/MinishLab/model2vec/pull/252
  • fix: missing unk, fix bug by @stephantul in https://github.com/MinishLab/model2vec/pull/251
  • bump version by @stephantul in https://github.com/MinishLab/model2vec/pull/258
  • feat: make normalization dependent on spacing by @stephantul in https://github.com/MinishLab/model2vec/pull/259

New Contributors

  • @iaurg made their first contribution in https://github.com/MinishLab/model2vec/pull/235
  • @JarbasAl made their first contribution in https://github.com/MinishLab/model2vec/pull/245

Full Changelog: https://github.com/MinishLab/model2vec/compare/v0.5.0...v0.6.0

Beta — feedback welcome: [email protected]