UQLM

v0.2.1 Feature

This patch release delivers bug fixes and test-stability improvements, most notably applying max_calls_per_min rate limiting to UQEnsemble judges.

Published 0y AI Agents & Assistants

✓ No known CVEs patched

Topics

ai-evaluation ai-safety confidence-estimation confidence-score hallucination hallucination-detection

hallucination-evaluation hallucination-mitigation llm llm-evaluation llm-hallucination llm-safety uncertainty-estimation uncertainty-quantification

Summary

AI summary

Fixes missing max_calls_per_min propagation to UQEnsemble judges.
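The kind of bug summarized above can be sketched in a few lines: an outer object accepts a rate limit but builds its inner judge panel without forwarding it, so the judges run unthrottled. The classes below (JudgePanel, Ensemble) are hypothetical stand-ins written for illustration; they are not UQLM's LLMPanel or UQEnsemble, and the real constructors may take different arguments.

```python
from typing import List, Optional


class JudgePanel:
    """Hypothetical stand-in for a panel of LLM judges (not UQLM's LLMPanel)."""

    def __init__(self, judges: List[str], max_calls_per_min: Optional[int] = None):
        self.judges = judges
        # None means "no rate limit applied".
        self.max_calls_per_min = max_calls_per_min


class Ensemble:
    """Hypothetical stand-in for an ensemble that owns a judge panel."""

    def __init__(self, judges: List[str], max_calls_per_min: Optional[int] = None):
        self.max_calls_per_min = max_calls_per_min
        # Bug pattern: JudgePanel(judges=judges) would silently drop the limit.
        # Forwarding the argument makes the limit apply to the judges too.
        self.panel = JudgePanel(judges=judges, max_calls_per_min=max_calls_per_min)


ensemble = Ensemble(judges=["judge-a", "judge-b"], max_calls_per_min=60)
print(ensemble.panel.max_calls_per_min)  # prints 60
```

A check like the final line is a cheap regression test for this class of bug: it asserts on the inner object rather than on the outer one that received the argument.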

Full changelog

If an exception is raised during generation (e.g. RateLimitError), the progress bar is now stopped to avoid a LiveError on retry.
Fix BERTScore printed text.
Fix Ensemble diagram for dark mode.
Pass max_calls_per_min to the LLMPanel constructor inside UQEnsemble, where it was previously omitted. With this fix, max_calls_per_min also applies to ensemble judges.
Add retry logic via @pytest.mark.flaky(retries=3) to tests that fail intermittently because of HuggingFace-related network issues.
Fix handling of missing logprobs with multiple responses in UQEnsemble.
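
The progress-bar fix in the first changelog entry follows a common pattern: a live display that allows only one active instance must be stopped when generation fails, or the next attempt cannot start its own display. The sketch below illustrates that pattern with a hypothetical ProgressDisplay class and a simulated failure. It is an assumption about the general shape of the fix, not UQLM's code, and it uses RuntimeError and TimeoutError as stand-ins for LiveError and RateLimitError.

```python
class ProgressDisplay:
    """Hypothetical stand-in for a live progress bar; only one may be active."""

    _active = False

    def start(self) -> None:
        if ProgressDisplay._active:
            # Stand-in for the LiveError mentioned in the changelog.
            raise RuntimeError("only one live display may be active at once")
        ProgressDisplay._active = True

    def stop(self) -> None:
        ProgressDisplay._active = False


def generate_with_progress(call):
    """Run `call` under a progress display, stopping the display even on error."""
    display = ProgressDisplay()
    display.start()
    try:
        return call()
    finally:
        # Without this, a failed call leaves the display active and the
        # retry below would fail in start() instead of re-running the call.
        display.stop()


attempts = {"count": 0}


def unreliable_call():
    """Fail on the first attempt (simulated rate limit), then succeed."""
    attempts["count"] += 1
    if attempts["count"] == 1:
        raise TimeoutError("simulated rate limit")
    return "ok"


def run_with_retry(call, retries=2):
    """Retry `call` up to `retries` extra times on the simulated error."""
    for attempt in range(retries + 1):
        try:
            return generate_with_progress(call)
        except TimeoutError:
            if attempt == retries:
                raise


result = run_with_retry(unreliable_call)
print(result, attempts["count"])  # prints: ok 2
```

Placing stop() in a finally clause means the cleanup runs for any exception type, so the retry path does not need to know which errors the generation step can raise.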

Patch release: v0.2.1 by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/129

Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.2.0...v0.2.1

About UQLM