UQLM

v0.5.2 Feature

This release adds two notable features: an LLM-based entailment classifier and support for it across the long-text scorers.

✓ No known CVEs patched

Topics

ai-evaluation ai-safety confidence-estimation confidence-score hallucination hallucination-detection hallucination-evaluation hallucination-mitigation llm llm-evaluation llm-hallucination llm-safety uncertainty-estimation uncertainty-quantification

Summary

AI summary

Add LLM-based entailment classifier for long‑text scoring.

Full changelog

Highlights

  • Create uqlm.nli.EntailmentClassifier class for LLM-based entailment classification. It is well suited to long-text scoring when responses exceed the input length that the Hugging Face NLI model can handle.
  • Update LongTextGraph, LongTextUQ, UnitResponseScorer, GraphScorer, and associated notebooks to allow for LLM-based entailment classification.
  • Update unit tests
  • Misc. docs site cleanup
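
To illustrate the general idea behind LLM-based entailment classification, here is a minimal sketch. It does not use or reproduce the uqlm.nli.EntailmentClassifier API, whose signature is not shown in these notes. The names build_prompt, parse_label, classify_entailment, and stub_llm are hypothetical, and the stub stands in for a real chat model so the example runs offline.

```python
from typing import Callable

# The three standard NLI labels.
LABELS = ("entailment", "contradiction", "neutral")

# Hypothetical prompt wording; a real system would tune this.
PROMPT_TEMPLATE = (
    "Premise:\n{premise}\n\n"
    "Hypothesis:\n{hypothesis}\n\n"
    "Does the premise entail the hypothesis, contradict it, or neither? "
    "Answer with exactly one word: entailment, contradiction, or neutral."
)


def build_prompt(premise: str, hypothesis: str) -> str:
    """Fill the template with a premise/hypothesis pair."""
    return PROMPT_TEMPLATE.format(premise=premise, hypothesis=hypothesis)


def parse_label(raw: str) -> str:
    """Map free-form model output onto one NLI label, defaulting to neutral."""
    text = raw.strip().lower()
    for label in LABELS:
        if label in text:
            return label
    return "neutral"


def classify_entailment(
    premise: str, hypothesis: str, llm: Callable[[str], str]
) -> str:
    """Ask the supplied LLM callable for a label and normalize its answer."""
    return parse_label(llm(build_prompt(premise, hypothesis)))


def stub_llm(prompt: str) -> str:
    """Stand-in for a real model (assumption): keys off one word in the prompt."""
    return "Contradiction." if "never" in prompt else "Entailment"


if __name__ == "__main__":
    premise = "The trial enrolled 120 patients across three sites."
    print(classify_entailment(premise, "The trial enrolled patients.", stub_llm))
    print(classify_entailment(premise, "The trial never enrolled anyone.", stub_llm))
```

Because the premise and hypothesis travel inside a prompt, their combined length is bounded by the LLM's context window rather than by the input limit of a dedicated NLI model, which is the motivation the highlights give for long-text scoring.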

What's Changed

  • Add LLM-based entailment classification + Docs cleanup by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/320
  • Patch release: v0.5.2 by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/321

Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.5.1...v0.5.2
