Skip to content

AbhijitKumarJ/Universal-Semantic-Interaction-Runtime-USIR-

Repository files navigation

Universal-Semantic-Interaction-Runtime (USIR)

USIR - Universal Semantic Interaction Runtime - Towards a Post-GUI Computing Architecture

USIR Ideation - Ideation FOLDER

Folders are numbered in order of attempts at reining the ideas behind USIR through multiple conversations.

USIR Implementation — USIR_REPO FOLDER

A semantic operating layer that decouples human intent from application implementation. build tests license

Status: Pre-alpha. All core types, runtime, adapters, audio pipeline, federation, VS Code extension, and capability marketplace are implemented. 501 tests pass, 0 lint errors, build is clean across 12 packages. Recent additions: webview audio capture (Node.js host fix), local Whisper fallback (binary→cloud), JSON/SQLite persistence, retry+circuit breaker in executor, TreeWalker DOM extraction (fixes SPA scalability), and 132 adapter tests. See USIR_REPO/IMPLEMENTATION.md for detailed status.

Motivation

Modern computing is built around applications. Humans interact with software through buttons, menus, windows, and screens — abstractions optimized for mice, keyboards, and displays.

LLMs expose a different possibility: humans communicate through intent, context, memory, and relationships — not interface widgets.

USIR proposes a universal semantic runtime that allows humans, agents, and applications to interact through shared semantic representations rather than application-specific user interfaces.

Architecture (6 Pillars + Provenance)

Pillar Package Purpose
1. Universal Intent Ontology @usir/protocol ~50 cognitive verbs across 8 layers (L0–L8)
2. Interaction Memory @usir/runtime Resolves "it", "that", "previous" via 4 reference kinds
3. Semantic Graph @usir/protocol Apps expose entities, not widgets
4. Semantic Snapshot @usir/protocol 3-tier (Hot/Warm/Cold) snapshot every adapter emits
5. Deterministic Execution @usir/runtime LLM plans, runtime executes (DAG, auditable, parallel)
6. Semantic Adapters adapters/* Bridges for VS Code, browser, Playwright, OS, IoT, XR

Plus L0.5 Provenance — tracks why mutations happened, not just what changed, with causal chains across runtimes.

Repository Structure

usir/
├── packages/
│   ├── protocol/              # Universal Intent Ontology, SemanticEntity, SemanticSnapshot
│   ├── runtime/               # Interaction Memory, Intent Router, Topological Executor, A2U
│   ├── audio-pipeline/        # Whisper STT, VAD, FusedIntent
│   ├── federation/            # P2P runtime federation (WebRTC, CRDT sync, L8 handlers)
│   ├── registry/              # Capability marketplace REST API (publish, search, trust, pricing)
│   ├── registry-client/       # Registry client SDK (cache, sync, discovery)
│   ├── adapters-os/           # OS adapters (process, filesystem, window, shell, system)
│   ├── adapters-iot/          # IoT adapters (MQTT, CoAP, Modbus/OPC-UA, sensor fusion)
│   └── adapters-xr/           # XR adapters (Unity bridge, spatial anchors, XR input)
├── adapters/
│   ├── vscode/                # VS Code tiered snapshot engine + 9 tools
│   ├── browser/               # Browser DOM accessibility tree adapter
│   └── playwright/            # Playwright zero-shot adapter (8 tools)
├── apps/
│   └── vscode-extension/      # Deployable VS Code extension (MVP entry point)
├── docs/
│   ├── MASTER-SPEC.md         # Canonical architecture spec
│   ├── FEDERATION.md          # Federation architecture & deployment guide
│   ├── IMPLEMENTATION.md      # Detailed implementation status
│   ├── ROADMAP.md             # 12-month execution plan
│   ├── *.md                   # "Beyond the GUI" 6-part blog series
│   └── semantic-horizon/      # "Semantic Horizon" 5-part blog series
└── ontology/
    └── universal-intent-ontology-v1.md  # 1.0 candidate spec

Packages

Package LOC Tests Description
@usir/protocol ~2,000 41 Shared schemas, ontologies, entity types, Storage interface
@usir/runtime ~2,200 60 Core engine: memory, router, executor (retry + circuit breaker), A2U, provenance, JSON/SQLite persistence (Storage interface), CircuitBreaker
@usir/audio-pipeline ~480 24 Voice capture, VAD, STT, fused intent, local Whisper fallback
@usir/federation ~4,760 73 P2P WebRTC, CRDT graph sync, L8 collaboration, signaling persistence
@usir/registry ~2,440 72 Capability marketplace REST API
@usir/registry-client ~440 8 Registry client SDK
@usir/adapters-os ~990 30 Process, filesystem, window, shell, system adapters
@usir/adapters-iot ~1,080 33 MQTT, CoAP, Modbus/OPC-UA, sensor fusion
@usir/adapters-xr ~710 20 Unity bridge, spatial anchors, XR input
@usir/vscode-adapter ~560 65 VS Code tiered snapshots + 9 tools (debounce, coalescing, caps, engine, tools)
@usir/browser-adapter ~490 68 Browser DOM accessibility tree (7 tools, dom-graph, TreeWalker-based viewport filtering)
@usir/playwright-adapter ~470 7 Playwright DOM extractor (TreeWalker-based, scalable), 8 tools

Key Concepts

  • L0.5 Provenance: Every mutation records intent, actor, rationale, authorization chain, causal parents, and semantic diffs. Auditable, replayable, cross-runtime.
  • A2U Protocol: 3-tier trust gate (auto/confirm/block) keeps humans in control of autonomous agents.
  • 3-Tier Snapshot: Hot (16ms — cursor/focus), Warm (150ms — visible entities), Cold (seconds — full graph).
  • Retry + Circuit Breaker: Steps retry with exponential backoff (+jitter); per-tool circuit breakers (CLOSED/OPEN/HALF_OPEN) fail fast after repeated failures.
  • Dual Persistence: JSON files (zero-dependency default) or SQLite (opt-in via better-sqlite3), both implementing a shared Storage interface.
  • Federated Runtime: P2P WebRTC with Yjs CRDT sync, L8 collaboration handlers (share, discuss, annotate, broadcast).
  • Capability Marketplace: Public registry, trust scoring (weighted factors + exponential decay), pricing & invoicing (free/call/metered/subscription), payout system.

MVP

A VS Code extension that lets developers navigate, edit, and orchestrate their workspace using natural language and voice. The IDE is the perfect "trojan horse" — it has rich semantics (LSP, ASTs, file graphs) and the audience is already AI-fluent.

Targeted first commands:

  • "Focus terminal, start the dev server, tail the logs"
  • "Rename this to user_id everywhere and run the test suite"
  • "Select main.py, run it in python terminal"

Quick Start

# Install
pnpm install

# Build all packages
pnpm build

# Run all tests
pnpm -r test

# Lint all packages
pnpm -r lint

# Run the VS Code extension in dev mode
pnpm --filter @usir/vscode-extension run dev

Documentation

Inspiration

USIR draws from the historical analogy of protocol layers:

Protocol What It Abstracted
TCP/IP Networking
HTML Documents
HTTP Request/response
USIR Interaction

Status

🚧 Pre-alpha — Core runtime, federation, capability marketplace, all adapters, audio pipeline, and VS Code extension are implemented. 501 tests pass across 12 packages with 0 lint errors. Key recent additions: webview audio capture, local Whisper fallback, JSON/SQLite persistence (dual backend), retry+circuit breaker in executor, TreeWalker DOM extraction (fixes SPA scalability), Storage interface in protocol. Next: CI/CD pipeline, publish @usir/protocol to npm, and test the extension in a live VS Code instance.

License

MIT

About

USIR - Universal Semantic Interaction Runtime - Towards a Post-GUI Computing Architecture

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors