2.26.0 Breaking risk 3mo

Notable features

Custom score rerankers in scoreModifiers via BM25 or vector closeness
lexicalOperand parameter for controlling lexical operators (or, and, weakAnd)
CONTAINS filter for token-level matching on lexical fields

View release on GitHub

2.25.3 Bug fix 4mo

Minor fixes and improvements.

View release on GitHub

2.25.2 Bug fix 4mo

Adds configurable connection recycling to mitigate connection imbalance in long-running deployments.

View release on GitHub

2.25.1 New feature 4mo

Notable features

center parameter for reproducible recency scores
applyToSubqueries to control recency in hybrid search

Full changelog

2.25.1 New Features

Add center and applyToSubqueries parameters to recency scoring (https://github.com/marqo-ai/marqo/pull/1376)
- center — A fixed Unix epoch timestamp (seconds) to use as the reference point instead of now(), enabling reproducible recency scores across queries
- applyToSubqueries — Control which hybrid subqueries ("tensor", "lexical", or both) receive recency boosting in RRF hybrid search

View release on GitHub

2.25.0 New feature 4mo

Notable features

Triton-based inference orchestrator
Model Management Container with lifecycle management
Centralized model registry in marqo-common

Full changelog

2.25.0 Changes

Triton-based Inference Architecture

Marqo's inference layer has been restructured from a monolithic design into three dedicated components:

Inference Orchestrator — A FastAPI service that coordinates inference requests. (https://github.com/marqo-ai/marqo/pull/1315)
Model Management Container — A FastAPI service for managing ML model lifecycles with Triton Inference Server, including model loading/unloading, health checks, and environment variable consistency. (https://github.com/marqo-ai/marqo/pull/1322)
Marqo API adaptations — The core Marqo API has been updated to work with the new Triton-backed components. (https://github.com/marqo-ai/marqo/pull/1317)

This architecture enables independent scaling and deployment of inference, model management, and search API layers.

Other Changes

Centralize model registry into a shared components/common package (marqo-common). (https://github.com/marqo-ai/marqo/pull/1356)
Fix model download auth handling to support public S3 buckets. (https://github.com/marqo-ai/marqo/pull/1352)

View release on GitHub

2.24.15 Bug fix 4mo

Minor fixes and improvements.

Full changelog

2.24.15

Bug Fixes and Minor Changes

Back-port update_index_settings API to 2.24 release branch to support modifying modelProperties of an existing index (https://github.com/marqo-ai/marqo/pull/1369)

View release on GitHub

2.24.14 Bug fix 4mo

Minor fixes and improvements.

View release on GitHub

2.24.13 Bug fix 5mo

Minor fixes and improvements.

Full changelog

2.24.13

Bug Fixes and Minor Changes

Use orjson in get_document(s) endpoints (https://github.com/marqo-ai/marqo/pull/1349)
Support picking the representative document within each collapsed group based on a numeric field sorting result (https://github.com/marqo-ai/marqo/pull/1350)

View release on GitHub

2.24.12 Bug fix 6mo

Added support for dual-stack endpoints in S3 model downloads.

View release on GitHub

2.24.11 New feature 6mo

Notable features