chore(ai): explicit model policy — router/chat/program/summarize + preview-vs-stable

## Context

`convex/ai/providers.ts:23-93` ships one generic primary/fallback pair per provider. Two problems:

1. **No tier-by-task policy**. Today every task hits the provider's primary model: Sonnet 4.6, Gemini 3 Flash, or GPT-5.4. The target split is a cheap model for routing, classification, and summarization; a mid-tier model for chat; and a premium model for planning and programming.

2. **Preview models in production**. The config sets `primaryModel: "gemini-3-flash-preview"`, so a preview model is the default for Gemini BYOK users, who are the majority of our users. One silent API change from Google could break the whole product for them.
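
The second problem could be caught mechanically. Below is a minimal sketch of such a check, assuming the `*-preview` naming convention from this issue. The `ProviderModels` shape, the `fallbackModel` key, and both function names are hypothetical; the real config in `convex/ai/providers.ts` may look different.

```typescript
// Hypothetical config shape. Only `primaryModel` is confirmed by this issue.
interface ProviderModels {
  primaryModel: string;
  fallbackModel: string;
}

// Treats any model ID ending in "-preview" as a preview model.
function isPreviewModel(modelId: string): boolean {
  return modelId.endsWith("-preview");
}

// Returns the providers whose default primary model is a preview model.
function findPreviewDefaults(
  config: Record<string, ProviderModels>,
): string[] {
  return Object.entries(config)
    .filter(([, models]) => isPreviewModel(models.primaryModel))
    .map(([provider]) => provider);
}

// Illustrative input only; the fallback value here is an assumption.
const sampleConfig: Record<string, ProviderModels> = {
  google: {
    primaryModel: "gemini-3-flash-preview",
    fallbackModel: "gemini-3-flash",
  },
};

// Providers that would violate the preview-vs-stable rule.
const offenders = findPreviewDefaults(sampleConfig);
```

A check like this could run in a unit test, so that a preview default fails CI rather than reaching users.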

## Files

- `convex/ai/providers.ts:23-93`: current config
- `convex/ai/coach.ts:192-249`: `buildCoachAgents` / `buildCoachAgentsForProvider` consumers
- New (probably): `docs/ai/model-policy.md` or an ADR

## Acceptance

- [ ] Write a short policy doc: which model runs which task tier across all 4 providers
- [ ] Tiers:
  - `router` (cheapest): Haiku 4.5 / Gemini Flash-lite / GPT-5.4-nano / OpenRouter auto
  - `chat` (mid): Sonnet 4.6 / Gemini Pro / GPT-5.4-mini
  - `programming` (premium): Opus 4.6 / Gemini Pro + high effort / GPT-5.4
  - `summarize`: same models as the `router` tier
- [ ] Preview-vs-stable rule: `*-preview` models MUST NOT be the default primary. They may only be enabled by opt-in, through a user setting or a feature flag
- [ ] Move `gemini-3-flash-preview` off the default path; replace with stable `gemini-3-flash` (or latest stable)
- [ ] Wire `prepareStep` to swap models by tier (pairs with #205, #210)
- [ ] Document per-tier cost in the policy doc for future reference
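
The tier list above could be encoded as a lookup table. The following is a rough sketch, not the intended implementation: the provider keys, `TIER_MODELS`, and `resolveModel` are hypothetical names, and the model strings are the display labels from this issue, not verified API model IDs. OpenRouter is left out because the issue names a model for its `router` tier only.

```typescript
type Tier = "router" | "chat" | "programming" | "summarize";
type ConcreteTier = Exclude<Tier, "summarize">;
type Provider = "anthropic" | "google" | "openai"; // hypothetical keys

// Placeholder labels copied from the acceptance list, NOT real API model IDs.
// "High effort" for Gemini programming is a separate setting (see #205) and
// is not modelled here.
const TIER_MODELS: Record<Provider, Record<ConcreteTier, string>> = {
  anthropic: {
    router: "Haiku 4.5",
    chat: "Sonnet 4.6",
    programming: "Opus 4.6",
  },
  google: {
    router: "Gemini Flash-lite",
    chat: "Gemini Pro",
    programming: "Gemini Pro",
  },
  openai: {
    router: "GPT-5.4-nano",
    chat: "GPT-5.4-mini",
    programming: "GPT-5.4",
  },
};

// Resolves a task tier to a model label; `summarize` reuses the router tier.
function resolveModel(provider: Provider, tier: Tier): string {
  const effective: ConcreteTier = tier === "summarize" ? "router" : tier;
  return TIER_MODELS[provider][effective];
}

// Example lookups.
const summarizer = resolveModel("anthropic", "summarize"); // "Haiku 4.5"
const planner = resolveModel("openai", "programming"); // "GPT-5.4"
```

Keeping the mapping in one table means the policy doc and the code can be compared line by line, and a `prepareStep` hook would only need to decide the tier for each step before looking up the model.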

## References

- OpenAI [Compare Models](https://developers.openai.com/api/docs/models/compare)
- Google [Gemini Models](https://ai.google.dev/gemini-api/docs/models)
- Anthropic [pricing tiers](https://platform.claude.com/docs/en/about-claude/pricing)
- [LLM routing: 85% cost reduction case study](https://www.burnwise.io/blog/llm-model-routing-guide)

Related: #190 (intent routing is one consumer of this policy), #205 (effort per step), #210 (prepareStep tool subset)

chore(ai): explicit model policy — router/chat/program/summarize + preview-vs-stable #216
