Skip to content

feat(accuracy): BigBench-Hard DeepEval-backed benchmark (AIP-878)#924

Merged
ajcasagrande merged 10 commits into
mainfrom
dbermudez/aip-878-implement-bigbench-benchmark-loader
May 23, 2026
Merged

feat(accuracy): BigBench-Hard DeepEval-backed benchmark (AIP-878)#924
ajcasagrande merged 10 commits into
mainfrom
dbermudez/aip-878-implement-bigbench-benchmark-loader

Commits

Commits on May 22, 2026