Summary
Decompose the monolithic single-Opus-session auditor into a multi-stage pipeline with model-appropriate task assignment.
See docs/architecture.md for the full design document.
Stages
- Stage 1, Ingestion + Parsing (Haiku): parse references into structured JSON
- Stage 2, Procedural Verification (Sonnet + web search): DOI resolution, PubMed queries, retraction checks
- Stage 3, Forensic Interpretation (Opus): heuristic application, risk classification, adversarial reasoning
- Stage 4, Report Generation (Haiku): HTML report assembly
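The stage list above could be captured as data, together with one possible shape for the structured JSON that the parsing stage hands on. This is only a sketch: the `Stage` and `ParsedReference` types and every field name are hypothetical (the real schemas are still to be defined), and the model strings are tier labels, not API model identifiers.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class Stage:
    number: int
    name: str
    model_tier: str  # tier label only ("haiku", "sonnet", "opus"), an assumption
    uses_web_search: bool = False


# Mirrors the four stages listed above, in order.
STAGES = [
    Stage(1, "Ingestion + Parsing", "haiku"),
    Stage(2, "Procedural Verification", "sonnet", uses_web_search=True),
    Stage(3, "Forensic Interpretation", "opus"),
    Stage(4, "Report Generation", "haiku"),
]


@dataclass
class ParsedReference:
    """Hypothetical Stage 1 output record; field names are illustrative."""
    ref_id: str
    raw_text: str
    authors: List[str] = field(default_factory=list)
    title: Optional[str] = None
    year: Optional[int] = None
    doi: Optional[str] = None


def to_json(ref: ParsedReference) -> str:
    # sort_keys keeps the serialized form stable for diffing between runs
    return json.dumps(asdict(ref), sort_keys=True)


def from_json(payload: str) -> ParsedReference:
    # Raises TypeError on unknown or missing required fields
    return ParsedReference(**json.loads(payload))
```

Keeping the inter-stage record as a plain dataclass that round-trips through JSON means each stage can be run and tested as a standalone prompt against saved fixtures.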
Key Design Decisions Needed
Implementation Phases
- Define and test JSON schemas for inter-stage data
- Implement Stage 1 (Haiku parsing) as standalone prompt
- Implement Stage 2 (Sonnet verification) as standalone prompt
- Implement Stage 3 (Opus forensics) consuming Stage 1+2 output
- Implement Stage 4 (Haiku report generation) consuming all prior stages
- Build orchestration script to chain stages
- Benchmark cost-per-run vs. monolithic v3
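The orchestration and benchmarking steps above might look roughly like the following. It is a minimal sketch under stated assumptions: the stage functions are stubs standing in for the real model calls, the payload keys (`raw_references`, `parsed`, `verified`, `findings`, `report_html`) are hypothetical, and `cost_per_run` takes token counts and per-million-token prices as inputs rather than assuming any actual pricing.

```python
from typing import Any, Callable, Dict, List, Tuple

Payload = Dict[str, Any]
StageFn = Callable[[Payload], Payload]


def run_pipeline(stages: List[Tuple[str, StageFn, List[str]]], initial: Payload) -> Payload:
    """Run stages in order, accumulating each stage's output into one payload.

    Each entry is (name, function, required_keys); the required keys are
    checked before the stage runs so a broken hand-off fails early.
    """
    payload = dict(initial)
    for name, fn, required in stages:
        missing = [key for key in required if key not in payload]
        if missing:
            raise KeyError(f"{name}: missing input keys {missing}")
        payload.update(fn(payload))
    return payload


# Stub stages (placeholders for the real prompts / model calls).
def parse_stage(p: Payload) -> Payload:
    return {"parsed": [{"ref_id": f"r{i}", "raw_text": text}
                       for i, text in enumerate(p["raw_references"], start=1)]}


def verify_stage(p: Payload) -> Payload:
    return {"verified": [{"ref_id": r["ref_id"], "doi_resolved": None}
                         for r in p["parsed"]]}


def forensic_stage(p: Payload) -> Payload:
    return {"findings": [{"ref_id": v["ref_id"], "risk": "unclassified"}
                         for v in p["verified"]]}


def report_stage(p: Payload) -> Payload:
    items = "".join(f"<li>{f['ref_id']}: {f['risk']}</li>" for f in p["findings"])
    return {"report_html": f"<ul>{items}</ul>"}


PIPELINE = [
    ("stage1_parse", parse_stage, ["raw_references"]),
    ("stage2_verify", verify_stage, ["parsed"]),
    ("stage3_forensics", forensic_stage, ["parsed", "verified"]),
    ("stage4_report", report_stage, ["parsed", "verified", "findings"]),
]


def cost_per_run(usage: Dict[str, Tuple[int, int]],
                 prices: Dict[str, Tuple[float, float]]) -> float:
    """Sum cost over model tiers.

    usage:  tier -> (input_tokens, output_tokens)
    prices: tier -> (price per million input tokens, price per million output tokens)
    """
    total = 0.0
    for tier, (tokens_in, tokens_out) in usage.items():
        price_in, price_out = prices[tier]
        total += tokens_in / 1_000_000 * price_in + tokens_out / 1_000_000 * price_out
    return total
```

Calling `run_pipeline(PIPELINE, {"raw_references": [...]})` returns the accumulated payload, including `report_html`. Running `cost_per_run` on the token usage of both the pipeline and the monolithic v3 baseline, with the same price table, gives the comparison the benchmark step asks for.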
Acceptance Criteria
Dependencies
- v3 prompt stable (baseline for comparison)
- Test sets committed for regression testing