spec-coding-skills

Spec-driven skills for agents that plan with acceptance criteria, correct against real feedback, and remember reusable project knowledge.

What This Provides

spec-coding-skills is a small skill set for making agentic coding work more repeatable:

Define the work before implementation.
Validate implementation against observable acceptance criteria.
Use lint, tests, logs, CI, and review feedback to correct behavior.
Save reusable decisions, root causes, setup rules, and fix patterns for future tasks.

The repository is skill-first:

SKILL.md files define workflow and trigger behavior.
Bundled scripts/ handle deterministic or repetitive actions.
Bundled references/ provide templates and reusable formats.

Installation

Install all skills from GitHub:

npx skills add H2Sxxa/spec-coding-skills --all

Install selected skills:

npx skills add H2Sxxa/spec-coding-skills --skill spec-plan --skill spec-crlp --skill spec-index

Install from a local checkout while developing:

npx skills add . --all

Website

A standalone Docusaurus landing site lives in website/.

Run it locally:

cd website
npm install
npm start

Skills

`spec-plan`

Turns ambiguous software requests into BDD-style implementation-ready specs with scope, constraints, assumptions, acceptance criteria, edge cases, validation steps, execution guardrails, and an implementation plan.

Use it when requirements are fuzzy, the user asks for a plan or spec, or implementation would be risky without first making "done" explicit. For complex work, it applies a self-check before implementation handoff so unresolved blocking questions do not turn into execution-time guessing.

`spec-crlp`

Runs a spec-driven correction loop for existing code by comparing the implementation against the spec and using linter output, tests, logs, CI, and review feedback to diagnose gaps.

Use it when code is failing checks, drifting from acceptance criteria, or needs a fix-verify-repeat loop.

`spec-index`

Captures, retrieves, and maintains durable project memory such as decisions, constraints, setup steps, root causes, fixes, patterns, and pitfalls.

Use it when the agent should save reusable knowledge, look up prior solutions, investigate existing notes, clean up weak trigger tags, or feed historical context into planning and correction loops.

Default Workflow

The three skills are designed to reinforce each other:

Read the target repository root SPEC.md when present; otherwise use the built-in defaults from this repository.
Retrieve relevant memory with spec-index when prior decisions, constraints, pitfalls, or fixes could affect the task.
Use spec-plan to define the problem, acceptance criteria, validation plan, and execution steps.
Implement in the normal coding loop.
Use spec-crlp when feedback reveals a mismatch between the implementation and the spec.
Persist reusable findings with spec-index so later tasks can benefit from them.

For small, self-contained tasks, the skills are allowed to stay lightweight and avoid unnecessary persistence or retrieval.

Repository Overrides

The skills work without project-specific configuration. By default, they use the conventions in SPEC.md.

For a target repository that needs custom lint commands, validation order, task spec paths, or knowledge-base paths, copy templates/repository-SPEC.md into the target repository root as SPEC.md and edit it there.

Preference order:

Direct user instructions.
Target repository root SPEC.md.
Built-in defaults from this repository.
Tooling discovered from the target repository when allowed by the defaults.

Partial repository SPEC.md files are fine. Missing fields fall back to the built-in defaults.

Output Language

To change the language used in generated specs, correction summaries, and knowledge-base entries, add a documentation language preference to the target repository root SPEC.md:

## Documentation Language

- Default documentation language: Simplified Chinese.
- Write user-facing specs, plans, correction summaries, and knowledge-base entries in Simplified Chinese.
- Keep code identifiers, commands, file paths, API names, error messages, and quoted logs in their original language.

Direct user instructions still take priority. For example, if a repository defaults to Chinese but the user asks for English in one task, the skills should follow the user for that task.

Knowledge Base

spec-index stores durable project memory as Markdown entries with YAML frontmatter.

Default location:

docs/knowledge-base/

Bundled resources:

template.md defines the canonical memory entry format.
index.py provides deterministic add, best-match search, audit, and rebuild operations.

The helper script uses the Python standard library only and does not require network access.

Demo Benchmark

This repository includes a small 3-prompt demo benchmark for the three core skills. In one local run with Codex, the skill-guided outputs passed a deterministic skill-aligned rubric at 100.0%, compared with 11.1% for baseline generic outputs: +88.9 percentage points.

This is a demonstration benchmark, not a statistically rigorous claim. It uses one run per prompt and checks whether outputs include the structures these skills are designed to produce, such as Context From Memory, Relevant Memory, Memory To Capture, YAML frontmatter, and trigger_tags.

Eval	What was checked	Baseline	With skills	Why it matters in real work
Planning an existing feature	Scope boundaries, acceptance criteria, validation plan, execution plan, and memory context	33.3%	100.0%	Reduces vague requirements before implementation and gives later fixes a stable reference.
Correcting a failing test	Correction summary, relevant memory, root cause, validation, residual risk, and memory to capture	0.0%	100.0%	Makes debugging evidence-driven instead of patch-by-guesswork, and records reusable lessons.
Saving a reusable root cause	YAML frontmatter, `root-cause` classification, trigger tags, applies-when, validation, and related context	0.0%	100.0%	Turns one-off debugging knowledge into searchable project memory for future tasks.
Overall	Mean pass rate across the three demo prompts	11.1%	100.0%	Shows the skills improve process structure, traceability, and memory reuse in this small demo.

These metrics are intentionally process-oriented. They do not claim that the generated code is better by itself; they measure whether the agent produced the workflow artifacts that make real development safer: clear scope, testable acceptance criteria, reproducible correction steps, and retrievable project memory.

The committed eval prompts live in evals/evals.json. See TESTING.md for the reproducible workflow and local review-page generation.

Repository Layout

skills/
  spec-plan/
    SKILL.md
  spec-crlp/
    SKILL.md
  spec-index/
    SKILL.md
    references/
      template.md
    scripts/
      index.py
docs/
  knowledge-base/
    index.md
evals/
  evals.json
templates/
  repository-SPEC.md
website/
  docusaurus.config.ts
  src/
  docs/
TESTING.md
SPEC.md
README.md
README.zh.md
LICENSE

Roadmap

Expand eval coverage beyond the initial 3-prompt demo.
Add example memory entries that demonstrate useful trigger tags.
Run multi-sample benchmarks for more reliable before/after claims.
Collect feedback from real projects.

License

MIT License. See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spec-coding-skills

What This Provides

Installation

Website

Skills

`spec-plan`

`spec-crlp`

`spec-index`

Default Workflow

Repository Overrides

Output Language

Knowledge Base

Demo Benchmark

Repository Layout

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
docs/knowledge-base		docs/knowledge-base
evals		evals
skills		skills
templates		templates
test-workspaces		test-workspaces
website		website
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh.md		README.zh.md
SPEC.md		SPEC.md
TESTING.md		TESTING.md

Folders and files

Latest commit

History

Repository files navigation

spec-coding-skills

What This Provides

Installation

Website

Skills

spec-plan

spec-crlp

spec-index

Default Workflow

Repository Overrides

Output Language

Knowledge Base

Demo Benchmark

Repository Layout

Roadmap

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`spec-plan`

`spec-crlp`

`spec-index`

Packages