Skip to content

fix(benchmarks): correct evaluator bugs, add agent guidelines, establish honest baseline#117

Merged
jack-arturo merged 6 commits into
mainfrom
docs/benchmark-agent-guidelines
Mar 10, 2026
Merged

fix(benchmarks): correct evaluator bugs, add agent guidelines, establish honest baseline#117
jack-arturo merged 6 commits into
mainfrom
docs/benchmark-agent-guidelines

Commits

Commits on Mar 10, 2026