Skip to content

feat(accuracy): HellaSwag DeepEval-backed benchmark + ExactMatch grader (AIP-877)#923

Merged
debermudez merged 10 commits into
mainfrom
dbermudez/aip-877-implement-hellaswag-benchmark-loader
May 22, 2026
Merged

feat(accuracy): HellaSwag DeepEval-backed benchmark + ExactMatch grader (AIP-877)#923
debermudez merged 10 commits into
mainfrom
dbermudez/aip-877-implement-hellaswag-benchmark-loader

Commits

Commits on May 22, 2026