tests(skill-evals): add test suite for runner.py by justinmclean · Pull Request #234 · apache/airflow-steward

justinmclean · 2026-05-20T03:29:42Z

tests(skill-evals): add test suite for runner.py

runner.py in tools/skill-evals previously had zero test coverage. This PR adds 37 tests covering all public functions, and makes a small refactor to main() to make it cleanly testable.

Changes

tests/test_runner.py — new test suite covering:

build_corpus_text / build_roster_text — empty and multi-item cases
find_repo_root — child dir, root itself, missing .git
extract_skill_section — heading levels, code fences, missing heading
load_step_config — step-config.json path, system-prompt.md fallback, output-spec appending, custom user-prompt-template, missing both
load_case — optional corpus/roster, required report/expected
find_cases — single case, fixtures dir, recursive skill dir, nested-fixtures deduplication
main CLI — no cases → exit 1, --quiet flag, prompt printing, caching, bad template raises

runner.py — main() now accepts an optional argv parameter (consistent with sandbox-lint and skill-validator) so tests can invoke it directly without patching sys.argv. CLI output capture in tests uses pytest's capsys fixture rather than monkeypatching sys.stdout.

37 tests covering all public functions in runner.py, which previously had zero test coverage: - build_corpus_text / build_roster_text (empty and multi-item cases) - find_repo_root (child dir, root itself, missing .git) - extract_skill_section (heading levels, code fences, missing heading) - load_step_config (step-config.json path, system-prompt.md fallback, output-spec appending, custom user-prompt-template, missing both) - load_case (optional corpus/roster, required report/expected) - find_cases (single case, fixtures dir, recursive skill dir, nested-fixtures deduplication) - main CLI (no cases → exit 1, --quiet flag, prompt printing, caching, bad template raises) https://claude.ai/code/session_01WEx58ofmTyCe2YhCppV3qN

Give main() an optional argv parameter (consistent with sandbox-lint and skill-validator) so tests can call it directly without patching sys.argv. Switch CLI test output capture to pytest's capsys fixture, removing the monkeypatch-on-sys.stdout approach entirely. https://claude.ai/code/session_01WEx58ofmTyCe2YhCppV3qN

https://claude.ai/code/session_01WEx58ofmTyCe2YhCppV3qN

claude added 3 commits May 19, 2026 08:58

revert(skill-evals): remove uv.lock changes

7df4b98

https://claude.ai/code/session_01WEx58ofmTyCe2YhCppV3qN

justinmclean self-assigned this May 20, 2026

potiuk approved these changes May 20, 2026

View reviewed changes

potiuk merged commit c713e0c into apache:main May 20, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests(skill-evals): add test suite for runner.py#234

tests(skill-evals): add test suite for runner.py#234
potiuk merged 3 commits into
apache:mainfrom
justinmclean:analyze-test-coverage

justinmclean commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

justinmclean commented May 20, 2026

tests(skill-evals): add test suite for runner.py

Changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants