🎙️ Speech-to-Text (Google Web Speech API)

A simple Speech-to-Text (STT) system in Python using the SpeechRecognition library and Google’s Web Speech API. It listens to your microphone, sends the audio to Google’s cloud recognizer, and prints the transcribed text.

✨ Features

🎤 Capture live audio from your microphone.
☁️ Cloud-based transcription using Google’s Web Speech API.
⚡ Real-time transcription with minimal setup.
🛠️ Minimal Python dependencies for quick prototyping.

📂 Project Structure

├── stt_speechrecognition.py         # Main script (run this)
├── requirements_speechrecognition.txt  # Python dependencies
├── README.md                        # Documentation (this file)

🚀 Getting Started

1. Prerequisites

Python 3.8+
A working microphone
Internet connection (required for Google Web Speech API)
System audio backend:
- macOS:
```
brew install portaudio
```
- Ubuntu/Debian:
```
sudo apt-get install portaudio19-dev python3-pyaudio
```
- Windows: Download PyAudio wheels from here and install with pip install <wheel-file>.

2. Installation

Clone or download this repo, then install dependencies:

pip install -r requirements_speechrecognition.txt

3. Usage

Run the script:

python stt_speechrecognition.py

Steps:

The program adjusts for background noise.
Speak into your microphone.
The recognized text is printed to the terminal.

⚠️ Notes & Limitations

Requires an active internet connection (Google Web Speech API).
API is free but limited (not suitable for very large-scale transcription).
For offline transcription, consider using OpenAI Whisper.

📌 Example Output

🎙️  Adjusting for ambient noise...
✅ Ready. Speak now!
📝 You said: Hello world, this is my Jarvis project!

🛠️ Next Steps

Enhance your project by adding:

Wake word detection (e.g., “Jarvis”).
Text-to-Speech (TTS) for voice responses.
Custom commands (open apps, fetch info, control IoT devices).

📄 License

This project is licensed under the MIT License — free to use, modify, and distribute.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
Asset		Asset
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
setup.bat		setup.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎙️ Speech-to-Text (Google Web Speech API)

✨ Features

📂 Project Structure

🚀 Getting Started

1. Prerequisites

2. Installation

3. Usage

⚠️ Notes & Limitations

📌 Example Output

🛠️ Next Steps

📄 License

About

Uh oh!

Releases 1

Packages

Languages

License

AnubhavChaturvedi-GitHub/Speech-to-Text-with-Python-Google-Web-Speech-API

Folders and files

Latest commit

History

Repository files navigation

🎙️ Speech-to-Text (Google Web Speech API)

✨ Features

📂 Project Structure

🚀 Getting Started

1. Prerequisites

2. Installation

3. Usage

⚠️ Notes & Limitations

📌 Example Output

🛠️ Next Steps

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages