UQLM

v0.1.7 Feature

This release fixes a floating-point precision bug in ensemble scoring and adds support for brier_score and average_precision in Tuner and UQEnsemble.

Published 1y AI Agents & Assistants

✓ No known CVEs patched in this version

Topics

ai-evaluation ai-safety confidence-estimation confidence-score hallucination hallucination-detection

hallucination-evaluation hallucination-mitigation llm llm-evaluation llm-hallucination llm-safety uncertainty-estimation uncertainty-quantification

Summary

AI summary

Fixed a floating-point precision bug that caused ensemble scores greater than 1, and added brier_score and average_precision support to Tuner and UQEnsemble.

Full changelog

Fixes a bug in which floating-point rounding could produce an ensemble score slightly greater than 1 (for example, 1.00000002). This raised an error when certain tuner metrics were computed. Patched with np.clip.
Allows use of brier_score and average_precision with Tuner and UQEnsemble.
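
To illustrate the kind of fix described above, here is a minimal sketch of clipping a weighted ensemble score into [0, 1] before it reaches a metric that rejects out-of-range values. The helpers `weighted_ensemble_score` and `brier_score` are hypothetical names written for this example; they are not UQLM's actual implementation, and only `np.clip` is taken from the changelog.

```python
import numpy as np


def weighted_ensemble_score(component_scores, weights):
    """Weighted average of component scores, clipped to [0, 1].

    Hypothetical helper for illustration only; not UQLM's code.
    """
    scores = np.asarray(component_scores, dtype=float)
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalize weights so they sum to (approximately) 1
    raw = scores @ w  # rounding can push this a hair outside [0, 1]
    return np.clip(raw, 0.0, 1.0)  # the guard named in the changelog


def brier_score(y_true, y_score):
    """Mean squared difference between scores and binary labels.

    Hypothetical helper; it rejects scores outside [0, 1], which is the
    sort of check that a value like 1.00000002 would trip.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_score = np.asarray(y_score, dtype=float)
    if y_score.min() < 0.0 or y_score.max() > 1.0:
        raise ValueError("scores must lie in [0, 1]")
    return float(np.mean((y_score - y_true) ** 2))


# A score that overshoots 1 by a tiny amount, as in the changelog example.
raw_scores = np.array([1.00000002, 0.5, 0.0])
clipped_scores = np.clip(raw_scores, 0.0, 1.0)
labels = [1, 1, 0]

# brier_score(labels, raw_scores) would raise ValueError;
# the clipped scores pass the range check.
print(brier_score(labels, clipped_scores))
```

Clipping changes the score only by the size of the rounding error, so it should not meaningfully affect tuning results while keeping range-checked metrics usable.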

Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.1.6...v0.1.7


About UQLM