This release adds 2 notable features for engineering teams evaluating rollout.
Published 4mo
AI Agents & Assistants
✓ No known CVEs patched
✓ No known CVEs patched in this version
Topics
ai-evaluation
ai-safety
confidence-estimation
confidence-score
hallucination
hallucination-detection
+8 more
hallucination-evaluation
hallucination-mitigation
llm
llm-evaluation
llm-hallucination
llm-safety
uncertainty-estimation
uncertainty-quantification
Summary
AI summaryAdd LLM-based entailment classifier for long‑text scoring.
Full changelog
Highlights
- Create
uqlm.nli.EntailmentClassifierclass for LLM-based entailment classification. This is well-suited for long-text scoring when responses exceed the length that can be handled by the Hugging Face NLI model - Update
LongTextGraph,LongTexUQ,UnitResponseScorer,GraphScorerand associated notebooks to allow for LLM-based entailment classification. - Update unit tests
- Misc. docs site cleanup
What's Changed
- Add LLM-based entailment classification + Docs cleanup by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/320
- Patch release:
v0.5.2by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/321
Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.5.1...v0.5.2
Weekly OSS security release digest.
The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.
No spam, unsubscribe anytime.
Share this release
About UQLM
All releases →Related context
Related tools
Beta — feedback welcome: [email protected]