feat(accuracy): HellaSwag DeepEval-backed benchmark + ExactMatch grader (AIP-877) #923
Merged
debermudez merged 10 commits on May 22, 2026
Commits on May 22, 2026