Commit 3ff80dc
GAIA: use evaluation_utils.py get_default_on_result_writer instead of _default_on_result_writer (#111)
* GAIA: use evaluation_utils.py get_default_on_result_writer instead of _default_on_result_writer
- Replace local _default_on_result_writer function with shared get_default_on_result_writer from evaluation_utils.py
- Remove unused fcntl import from GAIA run_infer.py
- Add comprehensive tests to verify the integration works correctly
- Ensures consistency across all benchmarks in using shared evaluation utilities
Fixes #110
Co-authored-by: openhands <[email protected]>
* Delete tests/test_gaia_evaluation_utils.py
---------
Co-authored-by: openhands <[email protected]>
1 parent 293778a commit 3ff80dc
1 file changed: 5 additions, 13 deletions
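The diff body is not preserved here, so the following is only a rough sketch of the pattern the commit message describes: a shared factory that returns an "on result" callback, replacing a per-benchmark local writer. Everything in it is an assumption made for illustration: the name `make_result_writer`, its `output_path` parameter, the JSON Lines output format, and the use of `threading.Lock` for serialization. It is not the actual `get_default_on_result_writer` from evaluation_utils.py, whose signature and behavior are not shown in this commit.

```python
import json
import threading
from typing import Any, Callable, Dict


def make_result_writer(output_path: str) -> Callable[[Dict[str, Any]], None]:
    """Hypothetical factory (not the real evaluation_utils API).

    Returns a callback that appends one JSON line per result to output_path.
    """
    # One lock per writer; it serializes appends from threads in this process.
    lock = threading.Lock()

    def on_result(result: Dict[str, Any]) -> None:
        # Serialize outside the lock so the critical section stays short.
        line = json.dumps(result)
        with lock:
            with open(output_path, "a", encoding="utf-8") as f:
                f.write(line + "\n")

    return on_result
```

With a shared factory of this shape, a benchmark script would request a writer once and pass it to its evaluation loop, rather than defining its own locking and file handling. That would be consistent with an import such as `fcntl` becoming unused in the calling script.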