v0.9.0 Harness/Profile cutline: model posture before automatic harness creation #2728

@Hmbown

Problem

The Agentic Harness Creator is important, but it is not the first harness-profile PR. CodeWhale needs explicit model posture and profile resolution before any agent tries to evolve profiles automatically.

MVP order

  1. HarnessPosture enum/policy surface — v0.9.0 HarnessPosture: model-specific context and subagent policy #2693
  2. HarnessProfile schema/registry — v0.9.0 HarnessProfile schema: provider/model/posture-aware profile registry #2696
  3. HarnessProfile resolver — v0.9.0 HarnessProfile resolver: apply prompt, tool, context, and subagent policy #2697
  4. Seed profiles — v0.9.0 Seed HarnessProfiles: PrefixCached DeepSeek/MiMo and LeanRoot generic #2701
    • PrefixCached DeepSeek/MiMo
    • LeanRoot generic/local/open-weight
    • explicit Arcee handling so Trinity Thinking does not get lost in DeepSeek/MiMo assumptions
  5. Repo overlay with .codewhale/constitution.json — v0.9.0 Repo Harness Overlay: compose .codewhale/constitution.json with model HarnessProfiles #2702
  6. UX/status display — v0.9.0 HarnessProfile UX: show provider, model, profile, and repo law clearly #2703
  7. Evidence store — v0.9.0 Harness Evidence Store: profile-tagged traces for harness evolution #2698
  8. Candidate manifest — v0.9.0 HarnessCandidate manifest: falsifiable profile changes from trace evidence #2699
  9. PromotionGate — v0.9.0 HarnessProfile PromotionGate: replay, eval, and rollback for profile updates #2700
  10. Harness Creator workflow — v0.9.0 Harness Creator workflow: inspect, propose, evaluate, promote, rollback #2704
  11. Full Agentic Harness Creator epic — v0.9.0 Agentic Harness Creator: evolve per-model CodeWhale harnesses from trace evidence #2695
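
A minimal sketch of how steps 1 to 4 could fit together, written in Python purely for illustration. Every name here other than HarnessPosture and HarnessProfile (the field names, the `model_markers` substring matching, the profile names, `resolve_profile`) is hypothetical, and the posture assigned to Arcee is a placeholder, since this issue only requires that Arcee be handled explicitly.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class HarnessPosture(Enum):
    PREFIX_CACHED = "prefix_cached"  # cache-heavy posture
    LEAN_ROOT = "lean_root"          # lean, subagent-oriented posture


@dataclass(frozen=True)
class HarnessProfile:
    name: str
    posture: HarnessPosture
    # Hypothetical match keys: substrings looked up in the lowercased model id.
    model_markers: tuple = ()


# Seed registry, ordered from most to least specific. Arcee comes first so
# Trinity Thinking never falls through to the DeepSeek/MiMo assumptions.
# Assumption: LEAN_ROOT for Arcee is a placeholder, not a decided policy.
SEED_PROFILES = (
    HarnessProfile("arcee-trinity", HarnessPosture.LEAN_ROOT, ("arcee", "trinity")),
    HarnessProfile("prefix-cached-deepseek-mimo", HarnessPosture.PREFIX_CACHED,
                   ("deepseek", "mimo")),
    HarnessProfile("lean-root-generic", HarnessPosture.LEAN_ROOT),
)


def resolve_profile(model_id: str, override: Optional[str] = None) -> HarnessProfile:
    """Return the profile for a model id; an explicit override wins."""
    if override is not None:
        for profile in SEED_PROFILES:
            if profile.name == override:
                return profile
        raise KeyError(f"unknown profile: {override}")
    lowered = model_id.lower()
    for profile in SEED_PROFILES:
        if any(marker in lowered for marker in profile.model_markers):
            return profile
    # No marker matched: fall back to the generic/local/open-weight profile.
    return SEED_PROFILES[-1]
```

The resolver only reads the registry and never writes to it, which keeps profile selection separate from the later evidence, candidate, and promotion steps.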

Acceptance criteria

  • The first implementation PR stops after schema/resolver/seed-profile/display unless tests justify more.
  • Provider route, endpoint, model id, HarnessProfile, and repo constitution are separately visible.
  • DeepSeek V4 and Xiaomi MiMo v2.5 can use cache-heavy posture; generic/HF/local models can use lean/subagent-oriented posture.
  • Arcee is explicitly covered in resolver tests and smoke docs.
  • No profile is silently promoted or mutated.
  • Docs cite AHE/GEPA as inspiration only when discussing future automatic evolution.
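
One way to read the "separately visible" criterion is that each of the five facts gets its own labeled line in any status output. The sketch below illustrates that idea in Python; `HarnessStatus`, its field names, and `format_status` are hypothetical and not taken from the CodeWhale codebase.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class HarnessStatus:
    provider_route: str
    endpoint: str
    model_id: str
    profile: str
    # Path to the repo constitution, or None when the repo has no overlay.
    repo_constitution: Optional[str] = None


def format_status(status: HarnessStatus) -> List[str]:
    """Render one labeled line per fact so none is folded into another."""
    return [
        f"provider route: {status.provider_route}",
        f"endpoint: {status.endpoint}",
        f"model id: {status.model_id}",
        f"harness profile: {status.profile}",
        f"repo constitution: {status.repo_constitution or 'none'}",
    ]
```

Reporting a missing constitution as an explicit "none" line, rather than omitting the line, keeps the absence of repo law visible too.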

Non-goals

  • Do not implement autonomous harness mutation before evidence, manifests, replay, and promotion gates exist.
  • Do not change provider auth or tool allowlists as part of the first profile schema PR.
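
The first non-goal can be expressed as a guard that refuses promotion until every prerequisite exists. This is a hedged Python sketch only: `PromotionRequest`, its fields, and the explicit `approved_by` requirement are assumptions introduced here to illustrate "no silent promotion", not the PromotionGate design tracked in #2700.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class PromotionRequest:
    candidate: str
    has_evidence: bool    # profile-tagged traces exist for this candidate
    has_manifest: bool    # a candidate manifest describes the change
    replay_passed: bool   # replay/eval ran and passed
    approved_by: str = "" # assumption: empty means nobody approved it


def missing_prerequisites(req: PromotionRequest) -> List[str]:
    """List every prerequisite that is still absent, in a fixed order."""
    missing = []
    if not req.has_evidence:
        missing.append("evidence")
    if not req.has_manifest:
        missing.append("manifest")
    if not req.replay_passed:
        missing.append("replay")
    if not req.approved_by:
        missing.append("approval")
    return missing


def can_promote(req: PromotionRequest) -> bool:
    """A candidate is promotable only when nothing is missing."""
    return not missing_prerequisites(req)
```

Returning the list of missing prerequisites, rather than a bare boolean, lets a caller explain why a promotion was refused.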

Metadata

Assignees

No one assigned

    Labels

    cache-maximalism: DeepSeek V4 cache-maximal context and agent architecture
    context: Context management / context
    documentation: Improvements or additions to documentation
    v0.9.0: Targeting v0.9.0

    Projects

    Status
    Backlog

