v0.2.0
New feature
Notable features
- CSV support via Dataset.from_csv() with default column names `input` and `ideal` and overrides `input_field`/`output_field`
- Arbitrary JSONL field mapping through CLI flags `--input-field` / `--output-field` and Python API
- Label‑free evaluation allowing datasets without reference answers; reference‑based metrics emit a clear upfront error
Full changelog
What's new in 0.2.0
Dataset
- CSV support via
Dataset.from_csv()— default column namesinputandideal, withinput_field/output_fieldoverrides for custom schemas - Arbitrary JSONL field mapping via
--input-field/--output-fieldCLI flags and Python API - Label-free evaluation — datasets without reference answers work end-to-end; reference-based metrics raise a clear error upfront
Metrics
- Multi-dimensional LLM-as-judge via the
dimensionsparameter — score multiple criteria (e.g. fluency, accuracy, safety) in a single judge call