Skip to content

๐ŸŽ™๏ธ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper Turbo - Open-source, privacy-first, with real-time audio visualization and intuitive click controls

Notifications You must be signed in to change notification settings

lliWcWill/maVoice-Linux

Repository files navigation

๐ŸŽ™๏ธ maVoice

Powered by Groq Whisper Turbo Built with Tauri MIT License

๐Ÿš€ Open-Source Voice Dictation Powered by Groq's Lightning-Fast Inference

Experience the future of voice-to-text with Groq DEV Tier - Ultra-fast transcription that leaves OpenAI's free tier in the dust!

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  ๐ŸŽค maVoice     โ”‚  โ† Tiny floating widget (100x22px)
โ”‚ โ–ถ โ–  โ–ช โ–ช โ–ช โ–ช    โ”‚    Always on top of your screen
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜    Double-click to start!

โœจ Features

  • โšก Blazing Fast: Powered by Groq's Whisper Large v3 Turbo model - the fastest inference in the game
  • ๐ŸŽฏ Native Performance: Built with Rust and Tauri for minimal resource usage
  • ๐ŸŽจ Beautiful UI: Sleek, modern floating widget that stays out of your way
  • ๐Ÿ”’ Privacy First: Your API key, your data - everything stays local
  • ๐ŸŒ Cross-Platform: Works on Linux (Windows and macOS coming soon!)
  • ๐ŸŽค Smart Recording: Real-time audio visualization and voice detection
  • ๐Ÿ“‹ Instant Copy: Automatic clipboard integration for seamless workflow
  • โš™๏ธ Advanced Settings: Comprehensive configuration panel with model selection
  • ๐ŸŽ›๏ธ Intuitive Controls: Double-click to start, single-click to stop
  • ๐ŸŒ Multi-Language: Support for 100+ languages with custom prompts

๐ŸŽฏ What is maVoice?

maVoice is a floating voice dictation widget that lives on your desktop. Unlike traditional apps with windows and menus, maVoice is a tiny, always-accessible button that floats above your other applications.

The Floating Widget Design

Normal State           Recording            Processing           Success
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”       โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”     โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”     โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ ๐ŸŽค maVoice  โ”‚  โ†’    โ”‚ ๐Ÿ”ด โ–ถโ–ถโ–ถโ–ถ     โ”‚  โ†’  โ”‚ ๐ŸŸ  โ—ˆโ—ˆโ—ˆโ—ˆโ—ˆ    โ”‚  โ†’  โ”‚ โœ… Done!    โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜       โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜     โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜     โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
   (Blue)                 (Red)              (Orange)            (Green)
  • Size: 100x22 pixels (compact floating button)
  • Behavior: Always on top, transparent background, no window borders
  • Dragging: Right-click or Ctrl+Left-click to drag to a new position

๐ŸŽ๏ธ Why Groq DEV Tier?

Feature Groq DEV Tier OpenAI Free
Speed ๐Ÿš€ Lightning Fast ๐ŸŒ Slow
Rate Limits ๐Ÿ’ช 400 RPM ๐Ÿ˜” Limited
Model ๐Ÿง  Whisper v3 Turbo ๐Ÿค– Basic Whisper
Quality ๐ŸŽฏ Premium ๐Ÿ“‰ Variable

๐Ÿš€ Quick Start

๐ŸŒŸ ONE-COMMAND Install

# Clone and install everything automatically
git clone https://github.com/lliWcWill/maVoice-Linux.git
cd maVoice-Linux
./install.sh

# Add your Groq API key
echo "VITE_GROQ_API_KEY=your_groq_api_key_here" > src-tauri/aquavoice-frontend/.env

# Launch!
npm run dev

Prerequisites

Platform-Specific Setup

๐ŸชŸ WSL2 Setup (Windows Users)

โœจ BREAKTHROUGH: WSL2 + WSLg provides PERFECT voice dictation with zero audio issues!

Prerequisites

  1. Update WSL2 (from Windows PowerShell as Administrator):

    wsl --update
    wsl --version  # Ensure version 2 with WSLg
  2. Install Debian/Ubuntu if you don't have it:

    wsl --install -d Debian

Installation

# Install Rust
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source $HOME/.cargo/env

# Install system dependencies
sudo apt update && sudo apt install -y \
    build-essential pkg-config libgtk-3-dev libwebkit2gtk-4.1-dev \
    libsoup-3.0-dev libjavascriptcoregtk-4.1-dev libdbus-1-dev \
    libappindicator3-dev librsvg2-dev libasound2-dev \
    xdotool wl-clipboard wtype

# Clone and run
git clone https://github.com/lliWcWill/maVoice-Linux.git
cd maVoice-Linux
./install.sh
๐Ÿง Native Linux Setup

Debian/Ubuntu:

sudo apt update
sudo apt install -y \
    build-essential pkg-config libgtk-3-dev libwebkit2gtk-4.1-dev \
    libsoup-3.0-dev libjavascriptcoregtk-4.1-dev libdbus-1-dev \
    libappindicator3-dev librsvg2-dev libasound2-dev \
    xdotool wl-clipboard wtype

Fedora/Arch - See detailed instructions

๐Ÿ“ฆ Build Debian Package

# Build the .deb package
npm run build

# The .deb file will be in:
# src-tauri/target/release/bundle/deb/

๐ŸŽฎ Usage

Desktop App

  1. Launch maVoice - The app appears as a sleek floating widget
  2. Double-click to start - The microphone activates with visual feedback
  3. Speak naturally - Real-time audio visualization shows your voice
  4. Single-click to stop - Transcription appears instantly
  5. Copy & paste - Text is automatically copied to clipboard

Web Interface (http://localhost:5173)

  • Settings panel - Click the gear icon for full configuration
  • API key setup - Secure local storage of your Groq key
  • Model selection - Choose from Whisper variants
  • Custom prompts - Add technical terms, names, or style instructions
  • Temperature control - Adjust creativity vs accuracy
  • Multi-language - Support for 100+ languages

Keyboard Shortcuts

  • Ctrl+, - Open settings
  • Alt+Space - Toggle recording
  • Double Alt - Quick record
  • Spacebar - Stop recording (while active)

๐Ÿ› ๏ธ Tech Stack

Rust Tauri React TypeScript Tailwind

๐Ÿค Contributing

We love contributions! Whether it's:

  • ๐Ÿ› Bug reports
  • ๐Ÿ’ก Feature requests
  • ๐Ÿ”ง Pull requests
  • ๐Ÿ“– Documentation improvements

Check out our Contributing Guide to get started.

๐Ÿ“ˆ Performance

maVoice leverages Groq's incredible inference speed:

  • Transcription Speed: < 500ms for 30-second audio
  • Memory Usage: < 50MB idle, < 100MB active
  • CPU Usage: < 5% during transcription
  • Network: Minimal bandwidth usage with smart chunking

๐Ÿ” Privacy & Security

  • Local First: All processing happens on your machine
  • No Telemetry: We don't track anything
  • Secure API: Your Groq API key is stored locally and never shared
  • Open Source: Audit the code yourself!

๐Ÿ“œ License

maVoice is MIT licensed. See LICENSE for details.

๐Ÿ™ Acknowledgments

  • Groq - For providing insanely fast inference
  • Whisper - OpenAI's amazing speech recognition model
  • Tauri - For making native apps actually enjoyable to build
  • You - For choosing open-source!

Built with โค๏ธ by developers who were tired of slow dictation

maVoice - Where speed meets simplicity

About

๐ŸŽ™๏ธ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper Turbo - Open-source, privacy-first, with real-time audio visualization and intuitive click controls

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •