A local-first, speech-to-speech conversational AI with multi-model orchestration, layered memory, and emotional state awareness.
- 🎤 Voice-First Design - Real-time speech-to-speech conversation
- 🧠 Multi-Model Orchestration - Routes to optimal AI model based on complexity
- 💾 Layered Memory - Episodic, semantic, and identity memory systems
- 🌐 Beautiful Web UI - Modern browser interface with emotional orb
- 💻 CLI Mode - Simple text-based chat for quick testing
- 🔒 Local-First - Your data stays on your machine
- ⚡ Ollama Powered - Uses local LLMs (llama3.2, phi4-mini, deepseek-r1)
- Python 3.10+
- Ollama installed and running
git clone https://github.com/Dhevenddra/voice-assitant.git
cd voice-assitant
pip install -e .ollama pull llama3.2Option A: CLI Mode (Type to Chat)
python -m src.cliOption B: Web UI (Visual Interface)
# Terminal 1: Start backend
python -m src.cli
# Terminal 2: Start web server
cd ui && python -m http.server 8080Then open: http://localhost:8080/index.html
| Command | Description |
|---|---|
/help |
Show all commands |
/status |
View current state |
/memory |
See conversation history |
/clear |
Clear conversation |
/quit |
Exit |
- Click anywhere to enable audio
- Type in the text box and press Enter
- Watch the emotional orb respond
- Hear the assistant speak back
voice-assistant/
├── src/ # Main application code
│ ├── cli.py # CLI interface
│ ├── main.py # Full voice mode
│ ├── audio/ # Microphone/speaker
│ ├── stt/ # Speech-to-Text
│ ├── tts/ # Text-to-Speech
│ ├── models/ # AI model routing
│ └── memory/ # Memory systems
├── ui/ # Web interface
│ └── index.html # Browser UI
├── docs/ # Documentation
│ ├── QUICKSTART.md # Getting started guide
│ ├── ARCHITECTURE.md # System design
│ └── API_REFERENCE.md # API documentation
├── config.yaml # Configuration
└── data/ # Memory storage
Copy .env.example to .env and configure:
# Optional: Claude API for synthesis model
ANTHROPIC_API_KEY=your_key_here
# Force offline mode (local models only)
VOICE_ASSISTANT_OFFLINE=1
# Enable debug logging
VOICE_ASSISTANT_DEBUG=1| Guide | Description |
|---|---|
| QUICKSTART.md | Get up and running fast |
| ARCHITECTURE.md | Technical deep-dive |
| API_REFERENCE.md | API documentation |
| HOSTING_GUIDE.md | Deployment guide |
# Install with dev dependencies
pip install -e ".[dev]"
# Run tests
pytest
# Pull additional models
ollama pull phi4-mini:3.8b
ollama pull deepseek-r1:8b- Local-first: All memory and state live locally
- Voice-primary: Designed around real-time spoken dialogue
- Conversation, not commands: Supports interruption, pauses, reflection
- Memory is identity: Selective, structured, meaningful memory
- Privacy is a feature: Full user control over data
MIT License - see LICENSE for details.
Made with ❤️ for natural, human-like AI conversations