This release adds 2 notable features for engineering teams evaluating rollout.
Published 11mo
AI Agents & Assistants
✓ No known CVEs patched
✓ No known CVEs patched in this version
Topics
ai-evaluation
ai-safety
confidence-estimation
confidence-score
hallucination
hallucination-detection
+8 more
hallucination-evaluation
hallucination-mitigation
llm
llm-evaluation
llm-hallucination
llm-safety
uncertainty-estimation
uncertainty-quantification
Summary
AI summaryFixed floating point precision bug causing ensemble scores > 1, added brier_score and average_precision to Tuner/UQEnsemble.
Full changelog
Highlights
- Fixes bug related to floating point precision causing ensemble score greater than 1 (1.00000002). This was throwing an error when certain tuner metrics were being computed. Patched with
np.clip. - Allow use of
brier_scoreandaverage_precisionwithTunerandUQEnsemble
What's Changed
- v0.1.6 updates by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/68
- New metrics by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/69
- Patch/v0.1.7 by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/70
Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.1.6...v0.1.7
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About UQLM
All releases →Related context
Related tools
Beta — feedback welcome: [email protected]