中文 | English
Robust LLM-Wiki is a document-centered control-plane for long-term Wiki maintenance with AI.
This repository does not publish a private wiki. It publishes the framework for keeping one stable over time: how to preserve Wiki form, keep wikilinks healthy, control agent runtime drift, and make maintenance auditable as the corpus grows.
The project was extracted from two long-running real LLM-Wikis. The goal is not one-shot generation. The goal is repeatable maintenance.
Important
Read This First
If you want the best entry into the schema docs, start with:
schema/README.md
It gives you the reading order first, then points you to:
- `SPEC.md` for architecture and boundaries
- `robust-llm-wiki-CLAUDE.md` for operator policy and agent-team rules
- `details/` for focused playbooks and rule pages
Once an LLM-Wiki becomes large enough, the main problem is no longer "can the model write a page." The problem becomes:
- can the wiki stay navigable,
- can new sources be compiled without polluting old pages,
- can query results be trusted enough to feed later maintenance,
- can the agent runtime stop drifting as tool chains get longer.
This repository exists to answer those questions with reusable rules rather than one-off prompts.
| Item | Value |
|---|---|
| Snapshot date | 2026-04-17 |
| Practice samples | Two anonymized production Wikis (A/B) |
| Markdown files | 3,093 |
| Markdown lines | 779,376 |
| Wikilinks (`[[...]]`) | 17,780 |
At this scale, the core challenge is sustaining maintenance quality over time, not just finishing one-time page creation.
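A minimal sketch of how figures like those in the table could be gathered for a downstream Wiki. The function name `corpus_stats` and the wikilink regex are assumptions made for illustration; the source does not say how the snapshot was actually counted, so results from this sketch may differ from the published numbers.

```python
import re
from pathlib import Path

# Assumed link syntax: anything of the form [[...]] counts as one wikilink,
# including aliased ([[Page|alias]]) and anchored ([[Page#heading]]) links.
WIKILINK = re.compile(r"\[\[([^\[\]]+)\]\]")


def corpus_stats(root: Path) -> dict[str, int]:
    """Count Markdown files, lines, and wikilinks under `root` (recursively)."""
    files = lines = links = 0
    for path in sorted(root.rglob("*.md")):
        text = path.read_text(encoding="utf-8")
        files += 1
        lines += len(text.splitlines())
        links += len(WIKILINK.findall(text))
    return {"markdown_files": files, "markdown_lines": lines, "wikilinks": links}
```

Calling `corpus_stats(Path("wiki"))` on a dated snapshot gives a baseline to compare against later snapshots.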
The motivation is straightforward: long-running operation of two real LLM-Wikis exposed the same classes of failure repeatedly, so the repair patterns were turned into reusable guidance. This repository publishes methods, rules, and validation structure, not the private contents of those Wikis. The recurring failure classes were:
- Human-read and AI-read goals become mixed, so pages are either hard to read or hard to process.
- Overuse of rigid outlines drifts away from the Wiki form.
- Sources are appended at the tail while body-level wikilinks stay weak, creating isolated pages.
- Rules and checks are not layered, so maintenance cost rises quickly with volume.
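The isolated-page failure above can be detected mechanically. The following is a rough sketch, not the repository's own lint: `find_isolated`, the `roots` parameter, and the regex are hypothetical names and choices. It also simplifies the problem by counting every `[[...]]` link equally, without distinguishing body-level links from links appended at the tail.

```python
import re

# Captures the page name from [[Page]], [[Page|alias]], and [[Page#heading]].
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:[|#][^\[\]]*)?\]\]")


def link_targets(body: str) -> set[str]:
    """Return the set of page names that `body` links to."""
    return {name.strip() for name in WIKILINK.findall(body)}


def find_isolated(pages: dict[str, str], roots: frozenset[str] = frozenset({"index"})) -> list[str]:
    """Return names of pages that no other page links to.

    `pages` maps page name -> Markdown text. Entry pages listed in `roots`
    are never reported, since they are reached directly rather than by link.
    """
    inbound = {name: 0 for name in pages}
    for name, body in pages.items():
        for target in link_targets(body):
            if target in inbound and target != name:  # ignore self-links
                inbound[target] += 1
    return sorted(n for n, count in inbound.items() if count == 0 and n not in roots)
```

A page reported here has content but no route in from the rest of the Wiki, which is the symptom the third bullet describes.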
The design direction is document-centered rather than tool-centered.
In practical terms, that means:
- pages are the long-lived unit, not chat transcripts,
- addresses, links, and logs matter as much as generated prose,
- the system should improve its maintenance process, not only add more content.
This is also the historical direction that makes the project legible: closer to Engelbart's document-centered OHS/CODIAK line than to a prompt-only workflow. The durable unit is the addressable page and its links, not the transient tool session.
That is why this repository behaves more like a control-plane than a content repository.
This repository intentionally publishes the control-plane:
- `research/` for evidence, scale, and recurring failure patterns,
- `schema/` for architectural boundaries and operating rules,
- runbooks for how agents should ingest, query, lint, hand off, and promote knowledge.
Downstream implementations are expected to provide the data-plane:
- `raw/` source material,
- `wiki/` maintained knowledge pages,
- `index.md` and `log.md` as first-class operating files.
This split is deliberate. It keeps the framework reusable while letting each real Wiki keep its own private corpus.
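A downstream implementation could verify that its data-plane is in place before any agent runs. This is a small sketch under one loud assumption: that `raw/`, `wiki/`, `index.md`, and `log.md` all sit at the root of the downstream repository. The source names these items but does not fix their location, and `missing_data_plane` is a hypothetical helper.

```python
from pathlib import Path

# Assumed layout: all four entries live directly under the downstream repo root.
REQUIRED_DIRS = ("raw", "wiki")
REQUIRED_FILES = ("index.md", "log.md")


def missing_data_plane(root: Path) -> list[str]:
    """Return the data-plane entries that are absent under `root`."""
    missing = [f"{name}/" for name in REQUIRED_DIRS if not (root / name).is_dir()]
    missing += [name for name in REQUIRED_FILES if not (root / name).is_file()]
    return missing
```

An empty list means the expected layout exists; anything else names what to create before ingesting.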
Robust LLM-Wiki focuses on keeping operations and governance stable.
- Keep the Karpathy kernel unchanged.
- Use `Schema` to define hard boundaries and safe extension space.
- Treat runtime safety as architecture, not as an afterthought.
- Improve the maintenance method itself as the corpus grows.
- Keep implementation choices decoupled from this framework repository.
The intended cycle is simple:
- ingest one source into draft knowledge,
- update the linked entity, concept, or synthesis pages it touches,
- refresh `index.md` and append to `log.md`,
- run lint in layers,
- promote only after evidence and structure checks pass.
The repository does not ship one fixed implementation of that loop. It defines the contract that a downstream implementation should satisfy.
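To make the last two steps of the cycle concrete, here is one way a downstream implementation might express layered lint and a promotion gate. Everything in it is illustrative: `run_lint_layers`, `promote`, the two example checks, and the log message format are hypothetical, and the in-memory `log` list merely stands in for appending to `log.md`.

```python
from dataclasses import dataclass, field
from typing import Callable

# A check inspects draft text and returns problem messages (empty list = pass).
Check = Callable[[str], list[str]]


@dataclass
class LintReport:
    # Maps layer name -> problems found in that layer.
    problems: dict[str, list[str]] = field(default_factory=dict)

    @property
    def passed(self) -> bool:
        return not any(self.problems.values())


def run_lint_layers(draft: str, layers: list[tuple[str, list[Check]]]) -> LintReport:
    """Run layers in order and stop at the first layer that reports problems."""
    report = LintReport()
    for layer_name, checks in layers:
        found = [msg for check in checks for msg in check(draft)]
        report.problems[layer_name] = found
        if found:
            break  # later layers are skipped until this one is clean
    return report


def has_wikilink(draft: str) -> list[str]:
    # Hypothetical structure check: the draft must contain a [[...]] link.
    return [] if "[[" in draft and "]]" in draft else ["no wikilinks in body"]


def cites_source(draft: str) -> list[str]:
    # Hypothetical evidence check: the draft must mention a path under raw/.
    return [] if "raw/" in draft else ["no reference to a raw/ source"]


def promote(draft: str, layers: list[tuple[str, list[Check]]], log: list[str]) -> bool:
    """Promote only if every layer passes; record the outcome either way."""
    report = run_lint_layers(draft, layers)
    if report.passed:
        log.append("promote: ok")
    else:
        failed = next(name for name, msgs in report.problems.items() if msgs)
        log.append(f"promote: blocked at {failed}")
    return report.passed
```

With `layers = [("structure", [has_wikilink]), ("evidence", [cites_source])]`, a draft is promoted only when both layers are clean. Stopping at the first failing layer keeps cheap checks in front of expensive ones, which is one reading of "lint in layers".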
The point of the framework is not "more files." The point is to make long-term Wiki maintenance legible.
The repository already anchors that claim in three ways:
- real operating scale from two anonymized production Wikis,
- evidence and engineering notes in `research/`,
- explicit runtime and promotion rules in `schema/`.
What it publishes is the control logic. What downstream Wikis provide is the live corpus.
.
├── .github/
├── CONTRIBUTING.md
├── CONTRIBUTING.zh-CN.md
├── I18N.md
├── I18N.zh-CN.md
├── README.md
├── README.zh-CN.md
├── research/
└── schema/
Purpose: preserve evidence, dataset scale, ecosystem observations, and problem summaries.
- Main doc (EN): research/RESEARCH.md
- Main doc (ZH): research/RESEARCH.zh-CN.md
- Survey snapshots: `research/snapshots/`
- Engineering notes (A/B anonymized): research/2026-04-17-two-wikis-engineering-notes.md
- Ecosystem survey: research/2026-04-17-llm-wiki-ecosystem-survey.md
Purpose: define red lines, operating rules, extension boundaries, and governance requirements.
- Main spec (EN): schema/SPEC.md
- Main spec (ZH): schema/SPEC.zh-CN.md
- Schema start-here guide (EN): schema/README.md
- Schema start-here guide (ZH): schema/README.zh-CN.md
- Claude operating guidance: schema/robust-llm-wiki-CLAUDE.md
- Runtime design note: schema/details/09-agent-runtime-control-plane.md
- Detailed rules: schema/details/
- Open-source policy (EN): schema/OPEN_SOURCE_POLICY.md
- Open-source policy (ZH): schema/OPEN_SOURCE_POLICY.zh-CN.md
- Start with quickstart/README.md.
- Choose the path that matches you: adapt this framework into an existing LLM-Wiki, or generate a fresh `CLAUDE.md`/`AGENTS.md`/`GEMINI.md` from this repo's operating guide.
- Then continue into schema/README.md, schema/SPEC.md, schema/details/09-agent-runtime-control-plane.md, and schema/robust-llm-wiki-CLAUDE.md.
- It must be a Wiki.
- It must have wikilinks/double-links.
- It must keep the `ingest -> query -> lint` loop.
- It must remain traceable, auditable, and rollback-friendly.
- Code and scripts: Apache-2.0
- Research and standards docs: CC BY 4.0
- Contribution process: DCO sign-off + third-party license declaration
- Current repository license file: Apache-2.0 (`LICENSE`)
Detailed policy: schema/OPEN_SOURCE_POLICY.md
- Contribution guide (EN): CONTRIBUTING.md
- Contribution guide (ZH): CONTRIBUTING.zh-CN.md
- Translation policy (EN): I18N.md
- Translation policy (ZH): I18N.zh-CN.md
- Karpathy original gist: https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f

