Skip to content

[P1.5-S5-ARI-36] Test harness design + implementation #70

Description

@bjridicodes

Ticket: P1.5-S5-05

Type: Feature | Est: TBD — scope in dedicated session

Goal: Automated harness that runs all 30 incidents in both invocation modes (tool and API), collects per-incident results, and scores AC-01 through AC-06 against ground_truth.json.

Scope: This ticket is a placeholder. Do not start implementation without a dedicated scoping session that produces an implementation spec in .docs/.

Acceptance criteria:

  • Dedicated scoping session held before S5 execution starts
  • Implementation spec written to .docs/test_harness_spec.md
  • Harness implementation complete and tested on ≥ 3 incidents before running full 30

Metadata

Metadata

Assignees

No one assigned

    Labels

    P1.5-S5S5: Testing Prep + Round 2 ExecutionfeatureNew capability or user-facing functionalityphase-1.5All Phase 1.5 issues

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions