LinguaCompanion

Voice-first AI language-learning companion for Russian-speaking IT professionals — with native Russian/English code-switching support, conversational memory, scenario-based practice, and spaced-repetition phrase library.

"Yesterday я работал над automation pipeline"
                    ↓
"Yesterday I worked on an automation pipeline."

Status

Actively developed. Public demo not available yet.

The product is functional and has been live-tested across 9 development sessions in April 2026. 91 backend tests pass · E2E Playwright 10/11 · ElevenLabs TTS confirmed in production. The deployment URL is intentionally not published in this README — the project is in active iteration and the author does not want anonymous traffic burning through production API budgets. A public-access announcement will follow when the workflow stabilises.

If you are evaluating this work for a role / partnership / collaboration: open an issue or reach the author via @CreatmanCEO and a private demo can be arranged.

What it looks like


Free Chat with Grammar / Variants toggle _{Voice or text in. Companion replies bilingually. Per-message reconstruction (✓ Grammar) and 5 alternative phrasings (≡ Variants).}	Scenario practice _{IT-specific role-plays at B1 / B2 levels: stand-up, code review, tech demo, system-design interview, sprint planning, Slack writing.}

Three companions, four voices _{Alex (professional), Sam (casual), Morgan (mentor). US / GB voice variants. Speed 0.8×–1.2×. Topic and CEFR-level filters.}	Phrase Library with spaced repetition _{Saved phrases tagged Professional / Slang. Due-now / Due-for-review queue. Forgot / Hard / Easy review buttons feed an SRS schedule.}

Learning progress _{Streak tracking, session counts, practice time, phrases saved. Recent-activity bar chart for retention loop.}

Why this exists

Existing language apps optimise for vocabulary drills, gamified streaks, or generic conversation practice. None of them are built for how a Russian-speaking IT professional actually wants to speak English — with spontaneous code-switching, IT vocabulary as the lingua franca, and the goal of "sound like a colleague at a stand-up", not "pass an A2 exam."

LinguaCompanion is built around that user. The companion accepts mixed RU/EN speech, reconstructs the intent into natural English, returns a bilingual reply with click-to-listen TTS, and offers grammar correction or 5 alternative phrasings on demand. Scenario mode runs role-plays for daily stand-ups, code reviews, tech demos, system-design interviews, sprint planning, and Slack writing — each tagged at CEFR B1 or B2.

Architecture

Layer	Technology	Notes
Frontend	Next.js 16 (App Router), React 19, Zustand 5, Tailwind, shadcn/ui	port 3001 — messenger-style UI
Backend	Python 3.12, FastAPI, WebSocket, Celery + Redis	port 8001 — `/ws/session` + `/api/v1/*`
STT primary	Deepgram Nova-3 (`language=multi`)	code-switching confirmed: 6/6 spike tests
STT fallback	Groq Whisper large-v3-turbo	sub-second latency, auto-switch on Deepgram failure
LLM main	Groq Llama 3.3 70B via LiteLLM	hot-swap via `LLM_MODEL` env
LLM onboarding	DeepSeek (OpenRouter)	replaced rate-limited Gemma in commit `adbbcbf`
TTS production	ElevenLabs (confirmed: 40 KB audio per response)	three companion voices: Alex, Sam, Morgan
TTS fallbacks	AWS Polly · Edge-TTS · Google Neural2	Polly currently blocked by IAM; Edge-TTS blocked from VPS IP
Database	Supabase (PostgreSQL + pgvector)	conversational memory + phrase library
Cache / queue	Upstash Redis (TLS)	Celery broker + per-session cache
Embeddings	Google Embeddings API	saves ~800 MB RAM vs local sentence-transformers
Pronunciation	Azure Speech SDK	phoneme-level scoring
Monorepo	Turborepo + pnpm workspaces	`apps/web`, `backend/`, `packages/types`, `infra/docker`
Deploy	Docker Compose · Coolify · nginx	sec VPS, TLS, reverse proxy

For diagrams and the WebSocket / agent flow, see docs/ARCHITECTURE.md.

What's built (real surface, not roadmap)

Frontend (`apps/web/src/components/`, 15 components)

CompanionBubble, UserBubble — message rendering with bilingual support
VoiceBar — push-to-talk + recording indicator
ReconstructionBlock — grammar correction with diff highlighting
VariantCards — 5 alternative phrasings on demand
LoginScreen — Google OAuth + email/password
SettingsPanel — 3 companions × 4 voices × speed × topic × CEFR level × theme
PhraseLibrary — saved phrases with spaced-repetition queue (Forgot / Hard / Easy)
StatsScreen — streaks, sessions, messages, practice time, recent-activity chart
SessionSummary, HintOverlay, ThemeToggle, plus ui/ and layout/

Backend (`backend/app/`, 12 agents + 9 routes)

Agents: stt, companion, memory, onboarding, orchestrator, phrase_variants, pronunciation, reconstruction, topic_discovery, tts, analytics, plus prompts/
Routes: auth, opengraph, phrases, push, session, stats, translate, tts, ws (WebSocket)
Migrations: Alembic
Tests: 91 backend pytest passing · E2E Playwright 10/11

Project structure

lingua-companion/
├── apps/
│   └── web/              # Next.js 16 web client
├── backend/              # FastAPI + Celery + 12 agents
│   ├── app/agents/       # stt · companion · memory · reconstruction · …
│   ├── app/api/routes/   # auth · session · phrases · stats · ws · …
│   ├── app/prompts/      # prompt templates
│   ├── migrations/       # Alembic
│   └── tests/            # pytest
├── packages/
│   └── types/            # shared TypeScript types
├── infra/
│   └── docker/           # Docker Compose configs
├── docs/
│   ├── ARCHITECTURE.md       # system architecture + diagrams
│   ├── AI_PIPELINE.md        # agent flow detail
│   ├── API_KEYS.md           # required env vars + setup
│   ├── BACKLOG.md            # current priorities
│   ├── COMPETITIVE_ANALYSIS.md
│   ├── DESIGN_JOURNEY.md
│   ├── VPS_SETUP.md          # deployment runbook
│   ├── architecture.svg      # this README's hero diagram
│   └── screenshots/          # README assets
├── plans/                # per-iteration design docs
├── tests/
│   └── e2e/              # Playwright specs
├── CHANGELOG.md
├── CLAUDE.md
├── Makefile
└── README.md

Working on it

The project is configured for Claude Code as the primary development driver. See CLAUDE.md for the project constitution (stack, commands, CRITICAL RULES, agent inventory). The same author maintains a Claude Code Anti-Regression Setup — the .claude/ config, hooks, and subagents pattern there is what keeps refactors from breaking the existing 91-test suite.

# Bootstrap (assuming pnpm + Python 3.12 + Postgres locally or via .env)
pnpm install
cd backend && pip install -r requirements.txt && cd ..

# Run frontend dev server (port 3001)
pnpm --filter @lingua/web dev

# Run backend (port 8001)
cd backend && uvicorn app.main:app --reload --port 8001

# Tests
cd backend && pytest                          # 91 backend tests
pnpm --filter @lingua/web test               # frontend unit tests (Vitest)
pnpm --filter @lingua/web exec playwright test # E2E

Full session-by-session history of what was built and what broke is in CHANGELOG.md.

Limitations

This is a personal product in active development. Honest constraints:

No public deployment URL is published in this README. The product runs on a personal VPS with metered API budgets. Anonymous traffic would directly burn through the author's Anthropic / Groq / Deepgram / ElevenLabs spend during iteration. Public access will be opened when the workflow stabilises.
Topic Discovery is currently disabled. Earlier iterations had the companion proactively inject "Hey, saw this and thought of you…" snippets from HN / Reddit. It produced repetitive Rust-themed spam during testing and was disabled in commit 5546803. Will return as Rich Link Cards in a future iteration.
TTS provider matrix is partially live. ElevenLabs is confirmed working in production (40 KB audio per response). AWS Polly currently fails with AccessDeniedException (IAM policy fix pending). Edge-TTS is blocked from the VPS IP by Microsoft (HTTP 403). Google Neural2 is the working budget fallback.
Pronunciation analysis is wired but not yet exposed in the UI. Azure Speech SDK integration exists at the agent layer; surfacing per-phoneme scoring in CompanionBubble is pending.
Free Chat sessions can stall on rapid send. tests/UX-Test-Report.md flagged P3 issue: sending multiple messages within ~200 ms drops all but the first. Mitigation: WebSocket message queue with debounced flush, on the backlog.
A2 / B2 mode is fixed at the level toggle. The companion does not auto-detect the user's level and adjust difficulty mid-session — you set it in Settings and it applies to subsequent turns. Adaptive difficulty is on the roadmap.

Claude Code Anti-Regression Setup — sister repo by the same author. The .claude/ config + subagents pattern that keeps the 91-test suite green during refactors.
ai-context-hierarchy — sister repo. The Level 0 / Level 1 hierarchy used by Claude Code on this project to navigate between apps/web, backend, and packages without re-reading the whole tree each session.
claude-statusline — sister repo. Statusline that surfaces context %, model, cost, and the VPS hosting this product during dev sessions.
notebooklm-claude-workflows — sister repo. Used by this project's research workflow when picking design references and competitive analysis.

Author

Nick Podolyak — Python developer and digital architect at CREATMAN

GitHub: @CreatmanCEO
Habr: creatman
dev.to: @creatman

License

MIT · Nick Podolyak

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
.antigravity		.antigravity
.claude		.claude
.github/workflows		.github/workflows
apps/web		apps/web
backend		backend
docs		docs
infra/docker		infra/docker
packages/types		packages/types
plans		plans
tests/e2e		tests/e2e
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README.ru.md		README.ru.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
turbo.json		turbo.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LinguaCompanion

Status

What it looks like

Why this exists

Architecture

What's built (real surface, not roadmap)

Frontend (`apps/web/src/components/`, 15 components)

Backend (`backend/app/`, 12 agents + 9 routes)

Project structure

Working on it

Limitations

Related

Author

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LinguaCompanion

Status

What it looks like

Why this exists

Architecture

What's built (real surface, not roadmap)

Frontend (apps/web/src/components/, 15 components)

Backend (backend/app/, 12 agents + 9 routes)

Project structure

Working on it

Limitations

Related

Author

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Frontend (`apps/web/src/components/`, 15 components)

Backend (`backend/app/`, 12 agents + 9 routes)

Packages