docs(red-team): follow-up fixes to Quick Start (#325)

Aryansharma28 · claude · web-flow · commit f11d7f6a364f · 2026-04-13T16:14:58.000+02:00
- Step 2: point users to app.langwatch.ai Settings → API Keys so
  the LANGWATCH_API_KEY placeholder isn't a dead end for new users.
- Step 4: replace hardcoded pytest/npm commands with "run your
  tests or ask your coding assistant to run and monitor it".
  The hardcoded paths assumed a directory structure the MCP
  may not produce, and npm test/pytest defaults don't give
  red team runs enough timeout. Also surface cost/time awareness
  (50 turns × LLM calls on two models = real tokens).
- Step 5: drop "(optional)" — viewing runs in LangWatch is the
  natural next step, not an aside.

Co-authored-by: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/docs/docs/pages/advanced/red-teaming/quick-start.mdx b/docs/docs/pages/advanced/red-teaming/quick-start.mdx
@@ -47,7 +47,7 @@ Works with Claude Code, Cursor, Claude Desktop, Codex, and any MCP-compatible cl
 }
 ```
 
-Restart the client. Full reference: [LangWatch MCP Server](https://langwatch.ai/docs/integration/mcp).
+Grab your API key from [app.langwatch.ai](https://app.langwatch.ai) → **Settings → API Keys**, then restart your client. Full reference: [LangWatch MCP Server](https://langwatch.ai/docs/integration/mcp).
 
 ## 3. Ask your assistant to generate the test
 
@@ -150,21 +150,11 @@ describe("Agent security", () => {
 
 ## 4. Run it
 
-:::code-group
-
-```bash [python]
-pytest tests/red_team/ -v
-```
-
-```bash [typescript]
-npm test -- tests/red-team
-```
-
-:::
+Run your usual test command, or just ask your coding assistant to run it and monitor it for you. Red team runs are long — 50 turns can take several minutes and will consume real LLM tokens on both the attacker and target models — so make sure your runner's per-test timeout is generous.
 
 Each turn prints the attacker's message, your agent's response, and a per-turn score. A failing test includes the full transcript and the judge's reasoning — you see exactly which turn broke the agent and how.
 
-## 5. View the run in LangWatch (optional)
+## 5. View the run in LangWatch
 
 If you've instrumented your agent with LangWatch, every red team run appears in the Simulations dashboard: full attack transcripts, per-turn scores, and side-by-side comparison across runs to track whether a prompt change made your agent more or less resilient.