This release includes 1 breaking change for platform teams planning a safe upgrade.
Published 10mo
AI Agents & Assistants
✓ No known CVEs patched
Topics
ai-evaluation
ai-safety
confidence-estimation
confidence-score
hallucination
hallucination-detection
hallucination-evaluation
hallucination-mitigation
llm
llm-evaluation
llm-hallucination
llm-safety
uncertainty-estimation
uncertainty-quantification
Summary
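AI summary: Replaces the slower `bert_score.score` function with `bert_score.BERTScorer.score`, adds a `return_responses` option for choosing raw or postprocessed responses, and removes `[black]` formatting that produced inconsistent progress-bar colors.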
Full changelog
Highlights
- Replaces use of `bert_score.score` with `bert_score.BERTScorer.score` for a ~43x speedup. While the former (old approach) re-checks and re-assigns `torch.device` with each use of `score`, the latter (updated approach) assigns `torch.device` only once during instantiation.
- Creates the option for users to specify whether they want only postprocessed responses, only raw responses, or both versions when they specify a postprocessor. This applies to `BlackBoxUQ`, `UQEnsemble`, and `SemanticEntropy`. To do so, users can respectively specify `'postprocessed'`, `'raw'`, or `'all'` in the `return_responses` argument in the constructor of these classes. By default, `'all'` is specified.
- `[black]` is removed where specified in `rich` print statements to avoid inconsistent colors in progress bars.
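The first highlight comes down to where one-time setup happens. The sketch below is a dependency-free illustration of that pattern, not code from uqlm or `bert_score`: `ToyScorer`, `score_per_call`, and the token-overlap metric are hypothetical stand-ins, and the setup counter stands in for the model load and `torch.device` assignment that the release notes describe.

```python
class ToyScorer:
    """Hypothetical stand-in for a scorer whose setup is expensive."""

    setup_count = 0  # counts how often the (simulated) expensive setup runs

    def __init__(self):
        # Stands in for loading a model and assigning a device.
        ToyScorer.setup_count += 1

    def score(self, candidates, references):
        # Toy token-overlap F1 per pair; NOT the BERTScore metric.
        scores = []
        for cand, ref in zip(candidates, references):
            c, r = set(cand.lower().split()), set(ref.lower().split())
            overlap = len(c & r)
            if overlap == 0:
                scores.append(0.0)
                continue
            precision, recall = overlap / len(c), overlap / len(r)
            scores.append(2 * precision * recall / (precision + recall))
        return scores


def score_per_call(candidates, references):
    """Function-style API: the setup is repeated on every call."""
    return ToyScorer().score(candidates, references)


batches = [
    (["the cat sat"], ["the cat sat"]),
    (["a dog barked"], ["the dog barked loudly"]),
    (["hello"], ["goodbye"]),
]

# Old pattern: one setup per batch.
ToyScorer.setup_count = 0
per_call_results = [score_per_call(c, r) for c, r in batches]
per_call_setups = ToyScorer.setup_count

# Updated pattern: one setup, reused for every batch.
ToyScorer.setup_count = 0
scorer = ToyScorer()
reused_results = [scorer.score(c, r) for c, r in batches]
reused_setups = ToyScorer.setup_count
```

Both patterns return the same scores; only the number of setups differs (once per batch versus once in total). In `bert_score` itself, the analogous move is to construct a `BERTScorer` once and call its `score` method repeatedly. The ~43x figure is the release's own measurement and is not reproduced by this toy.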
What's Changed
- v0.2.2 updates by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/140
- use bert_score class rather than function for 43x speedup by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/141
- Enable different handling of raw vs postprocessed responses by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/143
- Patch release: `v0.2.3` by @dylanbouchard in https://github.com/cvs-health/uqlm/pull/144
Full Changelog: https://github.com/cvs-health/uqlm/compare/v0.2.2...v0.2.3
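To make the three `return_responses` values from the raw-versus-postprocessed change concrete, here is a minimal sketch of the selection logic they imply. It is not uqlm's implementation: `select_responses`, `strip_and_lower`, the dictionary keys, and the behavior when no postprocessor is given are all assumptions made for illustration. Only the option names and the `'all'` default come from the release notes.

```python
def select_responses(raw_responses, postprocessor=None, return_responses="all"):
    """Hypothetical helper mirroring the three documented option values."""
    allowed = {"postprocessed", "raw", "all"}
    if return_responses not in allowed:
        raise ValueError(f"return_responses must be one of {sorted(allowed)}")

    if postprocessor is None:
        # Assumption: without a postprocessor there is only one version to return.
        return {"raw": list(raw_responses)}

    processed = [postprocessor(r) for r in raw_responses]
    if return_responses == "raw":
        return {"raw": list(raw_responses)}
    if return_responses == "postprocessed":
        return {"postprocessed": processed}
    # Default 'all': keep both versions side by side.
    return {"raw": list(raw_responses), "postprocessed": processed}


def strip_and_lower(text):
    """Hypothetical postprocessor: trim whitespace and lowercase."""
    return text.strip().lower()


raw = ["  Paris ", "PARIS\n"]
both = select_responses(raw, postprocessor=strip_and_lower)
only_processed = select_responses(raw, strip_and_lower, return_responses="postprocessed")
only_raw = select_responses(raw, strip_and_lower, return_responses="raw")
```

In uqlm the option is passed to the constructors of `BlackBoxUQ`, `UQEnsemble`, and `SemanticEntropy`, not to a standalone function. The shape of the returned data there may differ from the dictionaries used in this sketch.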
Breaking Changes
- Removal of `[black]` formatting token in `rich` print statements