feat: v3.1 — Context Window Guardian (smart auto-save at 50%/75%)

DEX · claude · DEX · commit 32b9f0b16d20 · 2026-03-25T14:26:54.000+08:00
New continuous monitoring behavior:
- Track context window usage via observable signals (turns, tool
  calls, compaction markers, re-reads)
- 50% threshold: lightweight checkpoint save (skip maintenance)
- 75% threshold: full session-wrap + suggest new session
- 80%+ emergency: minimize responses, prioritize saving
- Session behavior adapts: free exploration → targeted reads →
  minimal responses as context grows
- Platform-specific compaction detection (Claude [compacted],
  Cursor truncation, etc.)

Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
diff --git a/README.md b/README.md
@@ -1,6 +1,6 @@
-# session-wrap v3.0
+# session-wrap v3.1
 
-**通用 AI Agent 記憶持久化** — 對話結束自動保存上下文，下次無縫接續。支援 19+ AI 編碼助手與 AI 平台。
+**通用 AI Agent 記憶持久化** — 對話結束自動保存上下文，下次無縫接續。v3.1 新增 Context Window 智慧管理。支援 19+ AI 平台。
 
 Universal memory persistence for every AI agent. Say "wrap up" — context survives to the next session.
 
@@ -171,7 +171,49 @@ npx session-wrap-skill uninstall
 
 ---
 
-## v3.0 新功能：記憶維護引擎
+## v3.1 新功能：Context Window 智慧管理
+
+不用等到「收工」才存。AI 會**持續監控**上下文使用量，自動在關鍵節點保存記憶。
+
+### 三段式閾值機制
+
+```
+上下文使用量
+─────────────────────────────────────────────────
+ 0%                    50%              75%    100%
+ │     🟢 正常工作      │  🟠 檢查點存檔  │ 🔴 完整存檔 │
+ │                     │  (輕量快速)     │ (含維護壓縮) │
+ │                     │                │ → 建議開新對話│
+```
+
+| 閾值 | 行為 |
+|------|------|
+| **~50%** | 🟠 **自動檢查點** — 快速保存當前進度（跳過維護），然後繼續工作 |
+| **~75%** | 🔴 **完整存檔 + 提醒** — 執行全套 session-wrap，建議開新對話 |
+| **> 80%** | 🔴 **緊急模式** — 最小化回應，優先保存所有未存的上下文 |
+
+### 怎麼判斷上下文用了多少？
+
+AI 沒有精確 token 計數器，但可以從訊號估算：
+
+| 訊號 | 說明 |
+|------|------|
+| 對話輪次 | 30+ 輪通常已過半 |
+| 工具呼叫數 | 60+ 次通常已過半 |
+| 系統壓縮訊息 | 看到 `[compacted]` = 已經被壓縮，立刻存檔 |
+| 重複讀檔 | 重新讀之前讀過的檔案 = 早期 context 被裁剪 |
+
+### 各階段行為調整
+
+| 階段 | 行為 |
+|------|------|
+| **早期 (< 30%)** | 自由探索、完整讀檔、詳細解釋 |
+| **中期 (30-60%)** | 精確定位讀取、合併工具呼叫、精簡回覆 |
+| **後期 (60%+)** | 只讀必要內容、引用記憶不重讀、主動存檔 |
+
+---
+
+## v3.0 功能：記憶維護引擎
 
 ### 自動壓縮
 
@@ -204,15 +246,22 @@ npx session-wrap-skill uninstall
 ## 運作原理
 
 ```
-觸發「收工」
+                    持續監控 context window
+                    ┌──────────────────┐
+                    │ < 50%  → 正常    │
+                    │ ~ 50%  → 檢查點  │  ← v3.1 新增
+                    │ ~ 75%  → 完整存檔│
+                    └──────────────────┘
+
+觸發「收工」或達到閾值
     │
     ▼
 ┌───────────────────────────────────┐
 │  1. 偵測平台（19+ agent）          │
 │  2. 決定記憶路徑                   │
 │  3. 掃描 git + 現有記憶            │
 │  4. 分析本次 session 關鍵資訊      │
-│  5. 記憶維護（壓縮/過期/去重）     │  ← v3 新增
+│  5. 記憶維護（壓縮/過期/去重）     │  ← v3.0
 │  6. 寫入/更新記憶檔案              │
 │  7. 更新 MEMORY.md 索引           │
 │  8. Bootstrap 注入（如需要）       │
@@ -319,7 +368,8 @@ auto memory 被動。session-wrap 主動掃描 + 結構化寫入 + 維護壓縮
 
 | 版本 | 日期 | 變更 |
 |------|------|------|
-| **v3.0** | 2026-03-25 | 記憶維護引擎（壓縮/過期/衝突解決/去重）、一鍵安裝 CLI、install.sh、19+ agent、韓語觸發 |
+| **v3.1** | 2026-03-25 | Context Window 智慧管理（50% 檢查點、75% 完整存檔、行為分階段調整） |
+| v3.0 | 2026-03-25 | 記憶維護引擎（壓縮/過期/衝突解決/去重）、一鍵安裝 CLI、install.sh、19+ agent、韓語觸發 |
 | v2.1.1 | 2026-03-25 | 新增 OpenClaw 支援 |
 | v2.1 | 2026-03-25 | 14+ agent、Bootstrap 機制、中文文件 |
 | v2.0 | 2026-03-25 | 通用化（Claude Code + Cursor + Windsurf） |
diff --git a/SKILL.md b/SKILL.md
@@ -1,6 +1,6 @@
 ---
 name: session-wrap
-version: 3.0.0
+version: 3.1.0
 description: |
   Universal session wrap-up: automatically persist project context to memory files for seamless resume.
   Works with ANY AI agent — Claude Code, Codex, Gemini CLI, Cursor, Windsurf, Cline, Roo Code,
@@ -38,6 +38,100 @@ Also trigger proactively when: major milestone reached, user signals finality, o
 
 ---
 
+## Context Window Guardian
+
+**This is a continuous monitoring behavior, not a one-time step.** Throughout the entire session, actively track context window usage and take action at defined thresholds.
+
+### How to Estimate Context Usage
+
+You don't have a precise token counter, but you can estimate based on observable signals:
+
+| Signal | Estimation method |
+|--------|------------------|
+| **Conversation turns** | Count user messages + your responses. Average turn ≈ 500-2000 tokens. |
+| **Tool call accumulation** | Each file read ≈ 1000-5000 tokens. Each large grep/glob result ≈ 500-3000 tokens. |
+| **Code blocks generated** | Each code block ≈ 200-1000 tokens. |
+| **Session duration** | Long sessions (30+ turns, 60+ tool calls) are likely past 50%. |
+| **System compaction hints** | If you see `[compacted]` markers or system messages about compression, you're near the limit. |
+| **Repeated context** | If you're re-reading files you already read earlier, context may have been pruned — you're deep. |
+
+**Rough formula**: `estimated_usage = (user_turns × 800) + (tool_results × 1500) + (your_responses × 600)`
+
+For reference, common context windows:
+- Claude Code (Opus): ~200K tokens (1M with extended)
+- Cursor: ~128K tokens
+- Codex: ~128K tokens
+- Gemini CLI: ~1M tokens
+- Most others: ~64K-200K tokens
+
+### Threshold Actions
+
+| Context % | Action | What to do |
+|-----------|--------|------------|
+| **< 40%** | 🟢 Normal | Continue working. No action needed. |
+| **40-50%** | 🟡 Awareness | Start mentally noting what's important this session. No action yet. |
+| **~50%** | 🟠 **Checkpoint Save** | **Proactively execute a lightweight memory save.** Don't wait for user to say "收工". Announce: "上下文已過半，先存個記憶檢查點。" Then run Steps 1-8 (lightweight mode — skip maintenance, just save current state). |
+| **50-70%** | 🟠 Active management | After checkpoint, continue working but be concise. Avoid re-reading files already in context. Prefer targeted reads over full-file reads. |
+| **~75%** | 🔴 **Full Save + Alert** | Execute full session-wrap (including maintenance). Alert user: "上下文快滿了，建議開新對話繼續。記憶已保存。" |
+| **> 80%** | 🔴 Critical | If still in session, every response should be minimal. Prioritize saving any unsaved work to memory before context is auto-compacted. |
+
+### Checkpoint Save (Lightweight Mode)
+
+When triggered at ~50%, do a fast save without full maintenance:
+
+1. **Skip** memory compression, deduplication, staleness check (Step 5)
+2. **Do** gather state (Step 2) → analyze (Step 3) → write (Step 4) → update index (Step 6)
+3. **Mark** the checkpoint in memory with timestamp: `> Checkpoint: 2026-03-25 14:30 (mid-session save)`
+4. **Continue working** — this is NOT a session end, just a safety save
+
+### Full Save at ~75%
+
+When triggered at ~75%, do the complete session-wrap:
+
+1. Execute **all steps** (1-8) including maintenance
+2. Summarize what's left to do (if any incomplete tasks)
+3. Suggest user start a new session: "記憶已完整保存，建議開新對話繼續未完成的工作。"
+4. If user continues, keep responses minimal and focused
+
+### Context-Aware Behavior Throughout Session
+
+Beyond threshold saves, adopt these habits as context grows:
+
+**Early session (< 30%)**:
+- Read files freely, explore broadly
+- Detailed explanations are fine
+
+**Mid session (30-60%)**:
+- Prefer targeted reads (specific line ranges) over full files
+- Consolidate tool calls — do multiple checks in one call
+- Keep responses concise
+
+**Late session (60%+)**:
+- Only read what's absolutely necessary
+- Reference previous reads by memory, don't re-read
+- Short, action-oriented responses
+- Proactively save any important findings to memory
+
+### Platform-Specific Context Signals
+
+| Platform | How to detect you're running out |
+|----------|----------------------------------|
+| **Claude Code** | System sends `<compacted>` message with conversation summary. If you see this, you've already been compressed — save immediately. |
+| **Cursor** | Chat may slow down or truncate earlier messages. No explicit signal. |
+| **Codex** | May start dropping early tool results from context. |
+| **Gemini CLI** | Has 1M context — less urgent but still track for very long sessions. |
+| **Cline/Roo** | VS Code may show token count in status bar. |
+
+### What to Save at Each Threshold
+
+| Threshold | Save scope |
+|-----------|-----------|
+| **50% checkpoint** | Current task progress, key decisions made, files modified, any gotchas discovered so far |
+| **75% full save** | Everything from checkpoint + lessons learned + what's left to do + handoff notes for next session |
+| **80%+ emergency** | Bare minimum: what was the goal, what's done, what's not done, any critical context that would be lost |
+
+---
+
 ## Step 1: Detect Platform & Memory Path
 
 ### Platform Detection Table
@@ -268,3 +362,6 @@ Report with summary table:
 | Memory file > 50 lines | Compress: summarize, keep last 3 history entries |
 | Expired memory | Mark [STALE] in index, don't auto-delete |
 | Duplicate memories | Merge into more specific file, delete generic one |
+| Context auto-compacted | System compressed history — immediately save all important context to memory before more is lost |
+| User ignores 75% alert | Continue working but save on every significant milestone. Don't nag repeatedly. |
+| Very long session (100+ turns) | Likely past 50% on most platforms. Trigger checkpoint if not already done. |
diff --git a/package.json b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "session-wrap-skill",
-  "version": "3.0.0",
+  "version": "3.1.0",
   "description": "Universal AI agent memory persistence — auto-save project context at session end. Works with Claude Code, Codex, Gemini CLI, Cursor, Windsurf, Cline, Roo Code, Copilot, Amp, OpenClaw, OpenHands, Amazon Q, and more.",
   "keywords": [
     "ai-agent",

Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "session-wrap-skill",`
`3`		`- "version": "3.0.0",`
	`3`	`+ "version": "3.1.0",`
`4`	`4`	`"description": "Universal AI agent memory persistence — auto-save project context at session end. Works with Claude Code, Codex, Gemini CLI, Cursor, Windsurf, Cline, Roo Code, Copilot, Amp, OpenClaw, OpenHands, Amazon Q, and more.",`
`5`	`5`	`"keywords": [`
`6`	`6`	`"ai-agent",`