This release adds 3 notable features for engineering teams evaluating rollout.
✓ No known CVEs patched in this version
Topics
+7 more
Summary
AI summaryUpdates Enhancements, New Features, and Bug Fixes across a mixed release.
Changes in this release
| Type | Severity | Summary | CVE |
|---|---|---|---|
| Feature | Low |
Adds document versioning with diff-style changelog preview. Adds document versioning with diff-style changelog preview. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Reintroduces priority rules to pin matching URLs at top of search results. Reintroduces priority rules to pin matching URLs at top of search results. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Enables local PDF indexing with full‑text extraction via new browser extension endpoint. Enables local PDF indexing with full‑text extraction via new browser extension endpoint. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Introduces faceted filtering for fields like language, domain, and date ranges. Introduces faceted filtering for fields like language, domain, and date ranges. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Detects embedded videos (YouTube, Vimeo) and renders them in preview panels. Detects embedded videos (YouTube, Vimeo) and renders them in preview panels. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Adds Notion page extractor for clean article content extraction from public URLs. Adds Notion page extractor for clean article content extraction from public URLs. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Introduces config option to disable full HTML storage and preview generation. Introduces config option to disable full HTML storage and preview generation. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Allows per‑directory user‑specific indexing via `user` field in config. Allows per‑directory user‑specific indexing via `user` field in config. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Feature | Low |
Enables inline editing of document labels from search result cards. Enables inline editing of document labels from search result cards. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Adds MCP endpoint exposing document previews to LLM agents and MCP tools. Adds MCP endpoint exposing document previews to LLM agents and MCP tools. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Supports importing browser history from Ladybird in `import-browser` command. Supports importing browser history from Ladybird in `import-browser` command. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Automatically detects browser database file path for supported browsers in `import-browser`. Automatically detects browser database file path for supported browsers in `import-browser`. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Adds copy‑to‑clipboard button next to each result URL. Adds copy‑to‑clipboard button next to each result URL. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Allows switching between available extractors when viewing a document preview without re‑indexing. Allows switching between available extractors when viewing a document preview without re‑indexing. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Makes rules table columns sortable, filterable, and adds regexp validation before saving. Makes rules table columns sortable, filterable, and adds regexp validation before saving. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Adds `--delay`, `--timeout`, and `--user-agent` flags directly to `hister index` CLI. Adds `--delay`, `--timeout`, and `--user-agent` flags directly to `hister index` CLI. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Makes HTTP client timeout during indexing configurable. Makes HTTP client timeout during indexing configurable. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Standardizes config file lookup order across all platforms. Standardizes config file lookup order across all platforms. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Keeps selected entry visible by autoscrolling in the history view. Keeps selected entry visible by autoscrolling in the history view. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Displays running Hister version on admin profile page. Displays running Hister version on admin profile page. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Accepts common short aliases for log level values. Accepts common short aliases for log level values. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Shows rotating set of usage tips on front page when navigating back from search results. Shows rotating set of usage tips on front page when navigating back from search results. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Feature | Low |
Makes popup header a link to the configured Hister server. Makes popup header a link to the configured Hister server. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Performance | Medium |
Stores HTML content and favicons gzip‑compressed on disk to reduce index size and memory usage. Stores HTML content and favicons gzip‑compressed on disk to reduce index size and memory usage. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Performance | Low |
Inserts discovered URLs in batches during recursive crawl for better throughput. Inserts discovered URLs in batches during recursive crawl for better throughput. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Performance | Low |
Iterates each language sub-index independently to prevent paging gaps in multi-language setups. Iterates each language sub-index independently to prevent paging gaps in multi-language setups. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Bugfix | Medium |
Fixes Docker `BASE_URL` environment variable override by config file. Fixes Docker `BASE_URL` environment variable override by config file. Source: llm_adapter@2026-06-03 Confidence: high |
— |
| Bugfix | Medium |
Ensures search results contain all queried terms rather than any of them. Ensures search results contain all queried terms rather than any of them. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Bugfix | Low |
Automatically removes deleted locally‑watched files from the index and disk storage. Automatically removes deleted locally‑watched files from the index and disk storage. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
| Bugfix | Low |
Improves yt-dlp error messages and debug logging for configuration problems. Improves yt-dlp error messages and debug logging for configuration problems. Source: granite4.1:30b@2026-06-03-audit Confidence: high |
— |
Full changelog
New Features
Document Versioning
A new versioning rule type instructs Hister to track changes to a document each
time it is re-indexed. A diff-style changelog appears inside the preview panel so
you can see what changed between versions, and the preview endpoint returns the
full version history. Combine with a priority or skip rule to version only the
pages that matter to you.
Priority Rules
Priority rules have been reintroduced (closes #222). Documents whose URLs match a
priority rule are pinned to the top of search results regardless of their relevance
score. Rules can be created, edited, and sorted directly in the web UI.
PDF Indexing
Local PDF files can now be indexed with full text extraction. The browser extension
gains a new endpoint for sending PDFs directly to the server (closes #55). Indexed
PDFs are stored as a distinct document type and can be filtered with
metadata.type:pdf in search queries.
Faceted Filtering
Search results can now be filtered through facets that group documents by common
fields such as language, and domain. Facet counts update live as you refine your
query. Date-range filters have been moved into the same filters dropdown for a
unified experience, and a "load more" control expands facets with many values.
Embedded Video Extractor
A new extractor detects embedded videos (YouTube, Vimeo, and similar platforms)
on indexed pages and stores the embedding metadata as a dedicated document type.
Embedded videos are rendered directly inside the result preview panel, so you can
watch them without leaving Hister (closes #446).
Notion Extractor
A dedicated extractor for Notion pages extracts article content cleanly from
public Notion URLs, removing navigation chrome and other non-content elements.
Compressed HTML and Favicon Storage
HTML content and favicons are now stored gzip-compressed in separate files on
disk rather than inline inside the Bleve index. This significantly reduces index
size on disk and lowers memory pressure during search (closes #384).
Disable HTML Storage
A new config option lets you turn off full HTML storage and preview generation
entirely. Disabling storage trades preview functionality for a smaller footprint,
useful for bulk or headless indexing workflows (closes #440).
User-Specific Directory Indexing
The indexer.directories config now accepts a user field per directory entry.
Files under that path are indexed only for the specified user, making it easy to
share a single Hister instance while keeping personal file indexes private.
Label Editing from Results
Document labels can be edited inline directly from search result cards without
navigating away. The browser extension can also apply a one-off label to a
document at index time (closes #407). Updating a document no longer clears
previously assigned labels.
File Deletion Tracking
When a locally-watched file is deleted from the filesystem, Hister now
automatically removes it from the index (closes #230). Batch and single-document
deletions also clean up the associated HTML and favicon files from disk.
MCP Document Preview Endpoint
A new MCP endpoint exposes document previews to LLM agents and MCP-compatible
tools, complementing the existing MCP search endpoint.
Browser Import: Ladybird Support
The import-browser command now supports importing history from the
Ladybird browser.
Browser Import: Auto-Detect Database
The import-browser command now automatically detects the browser database file
path, so passing an explicit path is no longer required for supported browsers.
Enhancements
- Copy URL button: a copy-to-clipboard icon appears next to each result URL
- Preview extractor selector: switch between available extractors when viewing
a document preview without re-indexing - Rules table: columns are sortable; values are filterable; filter toggle
buttons are visually distinct from column headers; regexp validation runs before
saving a rule - CLI index flags:
--delay,--timeout, and--user-agentare now
available directly onhister indexwithout needing a config file - Configurable client timeout: the HTTP client timeout used during indexing
is configurable (fixes #429) - Standardized config file lookup: config file discovery follows a consistent
search order across all platforms (closes #424) - History autoscroll: the history view scrolls to keep the selected entry
visible (#427) - Admin profile version: the admin profile page now shows the running Hister
version (closes #409) - Log level aliases: common short aliases are accepted for log level values
(fixes #411) - Random tips: a rotating set of usage tips is shown on the front page when
navigating back from search results - yt-dlp diagnostics: clearer error messages and debug logging help diagnose
yt-dlp configuration problems - Batch crawl insertion: URLs discovered during a recursive crawl are
inserted in batches for better throughput - Extension popup header: the popup header is now a link to the configured
Hister server - Per-sub-index paging: each language sub-index is iterated independently to
prevent paging gaps in multi-language setups - Improved embedding handling: fallow and embedding request processing
is more robust
Bug Fixes
- Docker
BASE_URLenvironment variable can now be correctly overridden by a
config file (closes #442) - Search results now contain all queried terms rather than any of them
- HTML is no longer re-written to the indexer during a reindex run
- yt-dlp subtitle download no longer skipped when the sub language differs from
the original language (#429) - History page stops loading more entries when the last page has been reached
- "Show all" in history now scrolls back to the top
- Rule type filter no longer incorrectly excludes rules
- Debug-level init messages are correctly suppressed at higher log levels
- Result paging uses a valid sort key, fixing out-of-order pages
- Times are displayed in the browser's configured timezone
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
Related context
Beta — feedback welcome: [email protected]