This release adds 1 notable feature for engineering teams evaluating rollout.
Published 10mo
AI Agents & Assistants
✓ No known CVEs patched
✓ No known CVEs patched in this version
Topics
ai-evaluation
ai-safety
confidence-estimation
confidence-score
hallucination
hallucination-detection
+8 more
hallucination-evaluation
hallucination-mitigation
llm
llm-evaluation
llm-hallucination
llm-safety
uncertainty-estimation
uncertainty-quantification
Summary
AI summaryFixes missing max_calls_per_min propagation to UQEnsemble judges.
Full changelog
Highlights
- If exception is raised during generation (e.g.
RateLimitError), the progress bar is stopped to avoidLiveErrorupon retry. - Fix BERTScore printed text
- Fix Ensemble diagram for dark mode
- Fixes missing
max_calls_per_minbeing passed toLLMPanelconstructor inside ofUQEnsemble. After this fix,max_calls_per_minwill be applied to ensemble judges as well. - Add flaky retry logic using
@pytest.mark.flaky(retries=3)to tests that fail due to network issues related to HuggingFace. - Fix handling of missing
logprobswith multiple responses inUQEnsemble
What's Changed
- Patch release: v0.2.1 by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/129
Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.2.0...v0.2.1
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About UQLM
All releases →Related context
Related tools
Beta — feedback welcome: [email protected]