Experience the future of voice-to-text with Groq DEV Tier - Ultra-fast transcription that leaves OpenAI's free tier in the dust!
โโโโโโโโโโโโโโโโโโโ
โ ๐ค maVoice โ โ Tiny floating widget (100x22px)
โ โถ โ โช โช โช โช โ Always on top of your screen
โโโโโโโโโโโโโโโโโโโ Double-click to start!
- โก Blazing Fast: Powered by Groq's Whisper Large v3 Turbo model - the fastest inference in the game
- ๐ฏ Native Performance: Built with Rust and Tauri for minimal resource usage
- ๐จ Beautiful UI: Sleek, modern floating widget that stays out of your way
- ๐ Privacy First: Your API key, your data - everything stays local
- ๐ Cross-Platform: Works on Linux (Windows and macOS coming soon!)
- ๐ค Smart Recording: Real-time audio visualization and voice detection
- ๐ Instant Copy: Automatic clipboard integration for seamless workflow
- โ๏ธ Advanced Settings: Comprehensive configuration panel with model selection
- ๐๏ธ Intuitive Controls: Double-click to start, single-click to stop
- ๐ Multi-Language: Support for 100+ languages with custom prompts
maVoice is a floating voice dictation widget that lives on your desktop. Unlike traditional apps with windows and menus, maVoice is a tiny, always-accessible button that floats above your other applications.
Normal State Recording Processing Success
โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ
โ ๐ค maVoice โ โ โ ๐ด โถโถโถโถ โ โ โ ๐ โโโโโ โ โ โ โ
Done! โ
โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโ
(Blue) (Red) (Orange) (Green)
- Size: 100x22 pixels (compact floating button)
- Behavior: Always on top, transparent background, no window borders
- Dragging: Right-click or Ctrl+Left-click to drag to a new position
| Feature | Groq DEV Tier | OpenAI Free |
|---|---|---|
| Speed | ๐ Lightning Fast | ๐ Slow |
| Rate Limits | ๐ช 400 RPM | ๐ Limited |
| Model | ๐ง Whisper v3 Turbo | ๐ค Basic Whisper |
| Quality | ๐ฏ Premium | ๐ Variable |
# Clone and install everything automatically
git clone https://github.com/lliWcWill/maVoice-Linux.git
cd maVoice-Linux
./install.sh
# Add your Groq API key
echo "VITE_GROQ_API_KEY=your_groq_api_key_here" > src-tauri/aquavoice-frontend/.env
# Launch!
npm run dev- Node.js 18+
- Rust 1.70+
- A Groq API key (Get one here)
๐ช WSL2 Setup (Windows Users)
โจ BREAKTHROUGH: WSL2 + WSLg provides PERFECT voice dictation with zero audio issues!
-
Update WSL2 (from Windows PowerShell as Administrator):
wsl --update wsl --version # Ensure version 2 with WSLg
-
Install Debian/Ubuntu if you don't have it:
wsl --install -d Debian
# Install Rust
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source $HOME/.cargo/env
# Install system dependencies
sudo apt update && sudo apt install -y \
build-essential pkg-config libgtk-3-dev libwebkit2gtk-4.1-dev \
libsoup-3.0-dev libjavascriptcoregtk-4.1-dev libdbus-1-dev \
libappindicator3-dev librsvg2-dev libasound2-dev \
xdotool wl-clipboard wtype
# Clone and run
git clone https://github.com/lliWcWill/maVoice-Linux.git
cd maVoice-Linux
./install.sh๐ง Native Linux Setup
Debian/Ubuntu:
sudo apt update
sudo apt install -y \
build-essential pkg-config libgtk-3-dev libwebkit2gtk-4.1-dev \
libsoup-3.0-dev libjavascriptcoregtk-4.1-dev libdbus-1-dev \
libappindicator3-dev librsvg2-dev libasound2-dev \
xdotool wl-clipboard wtypeFedora/Arch - See detailed instructions
# Build the .deb package
npm run build
# The .deb file will be in:
# src-tauri/target/release/bundle/deb/- Launch maVoice - The app appears as a sleek floating widget
- Double-click to start - The microphone activates with visual feedback
- Speak naturally - Real-time audio visualization shows your voice
- Single-click to stop - Transcription appears instantly
- Copy & paste - Text is automatically copied to clipboard
Web Interface (http://localhost:5173)
- Settings panel - Click the gear icon for full configuration
- API key setup - Secure local storage of your Groq key
- Model selection - Choose from Whisper variants
- Custom prompts - Add technical terms, names, or style instructions
- Temperature control - Adjust creativity vs accuracy
- Multi-language - Support for 100+ languages
Ctrl+,- Open settingsAlt+Space- Toggle recordingDouble Alt- Quick recordSpacebar- Stop recording (while active)
We love contributions! Whether it's:
- ๐ Bug reports
- ๐ก Feature requests
- ๐ง Pull requests
- ๐ Documentation improvements
Check out our Contributing Guide to get started.
maVoice leverages Groq's incredible inference speed:
- Transcription Speed: < 500ms for 30-second audio
- Memory Usage: < 50MB idle, < 100MB active
- CPU Usage: < 5% during transcription
- Network: Minimal bandwidth usage with smart chunking
- Local First: All processing happens on your machine
- No Telemetry: We don't track anything
- Secure API: Your Groq API key is stored locally and never shared
- Open Source: Audit the code yourself!
maVoice is MIT licensed. See LICENSE for details.
- Groq - For providing insanely fast inference
- Whisper - OpenAI's amazing speech recognition model
- Tauri - For making native apps actually enjoyable to build
- You - For choosing open-source!
Built with โค๏ธ by developers who were tired of slow dictation
maVoice - Where speed meets simplicity