This project aims to bridge the communication gap between the hearing and speech-impaired community and the general public by converting sign language into speech. The system leverages computer vision techniques and machine learning models to recognize hand gestures in real time and translate them into audible speech output.
- Real-time Gesture Recognition: Uses OpenCV to capture and isolate hand gestures from the live video feed, which the trained model then classifies in real time (see the pipeline sketch after this list).
- Speech Conversion: Implements Google Text-to-Speech (gTTS) for converting recognized text into speech.
- Deep Learning Model: Utilizes a Convolutional Neural Network (CNN) trained on the Sign Language MNIST ASL dataset for accurate gesture classification.
- Webcam Integration: Captures hand gestures through a live video feed.
- User-Friendly Interface: Keeps interaction simple, so users can communicate through gestures with minimal setup.
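The following is a minimal sketch of how these pieces could fit together in `main.py`. The model path (`model.h5`), the A-Z label set, the 64x64 grayscale input, the fixed region of interest, and the confidence threshold are all illustrative assumptions, not values taken from this repository:

```python
import cv2
import numpy as np
from gtts import gTTS
from tensorflow.keras.models import load_model

model = load_model("model.h5")  # assumed path to the trained CNN
labels = [chr(c) for c in range(ord("A"), ord("Z") + 1)]  # assumed A-Z label set

cap = cv2.VideoCapture(0)  # default webcam
last = None
while True:
    ok, frame = cap.read()
    if not ok:
        break
    roi = frame[100:300, 100:300]  # assumed fixed region of interest for the hand
    gray = cv2.cvtColor(roi, cv2.COLOR_BGR2GRAY)
    resized = cv2.resize(gray, (64, 64)) / 255.0  # assumed 64x64 grayscale input
    pred = model.predict(resized.reshape(1, 64, 64, 1), verbose=0)
    letter = labels[int(np.argmax(pred))]
    # Speak only confident, newly seen letters; gTTS needs a network
    # connection, and playback of speech.mp3 is left to an audio player.
    if float(np.max(pred)) > 0.9 and letter != last:
        gTTS(text=letter, lang="en").save("speech.mp3")
        last = letter
    cv2.putText(frame, letter, (100, 90), cv2.FONT_HERSHEY_SIMPLEX, 2, (0, 255, 0), 3)
    cv2.imshow("Sign Language to Speech", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```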
- Python
- OpenCV (for image processing)
- TensorFlow/Keras (for training the deep learning model)
- Google Text-to-Speech (gTTS) (for speech synthesis)
- NumPy & Pandas (for data manipulation)
- MoviePy (for video processing)
Ensure you have Python installed (preferably Python 3.8+). Then, install the necessary dependencies:
pip install opencv-python numpy pandas tensorflow keras gtts moviepy
- Clone the repository:
git clone https://github.com/your-username/sign-language-to-speech.git
cd sign-language-to-speech
- Run the script:
python main.py
- Allow camera access and perform gestures to see the real-time translation.
The project uses the Sign Language MNIST dataset, which contains labeled images of American Sign Language (ASL) gestures. The dataset is preprocessed and augmented to improve model accuracy.
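As an illustration of what that preprocessing and augmentation step could look like with Keras (the `data/train` directory layout with one subfolder per class and the 64x64 target size are assumptions):

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_gen = ImageDataGenerator(
    rescale=1.0 / 255,     # normalize pixel values to [0, 1]
    rotation_range=10,     # small random rotations
    width_shift_range=0.1,
    height_shift_range=0.1,
    zoom_range=0.1,
    validation_split=0.2,  # hold out 20% for validation
)

train_data = train_gen.flow_from_directory(
    "data/train",          # assumed dataset path, one subfolder per letter
    target_size=(64, 64),
    color_mode="grayscale",
    class_mode="categorical",
    subset="training",
)
val_data = train_gen.flow_from_directory(
    "data/train",
    target_size=(64, 64),
    color_mode="grayscale",
    class_mode="categorical",
    subset="validation",
)
```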
The deep learning model is trained using TensorFlow/Keras with a CNN architecture. The training steps include:
- Data Preprocessing: Image resizing, grayscale conversion, and augmentation.
- Model Training: Fitting a CNN with multiple convolutional and pooling layers (see the sketch after this list).
- Evaluation: Measuring accuracy on held-out data and fine-tuning hyperparameters.
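Below is a minimal Keras sketch of such an architecture; the layer counts, filter sizes, and 26-class output are illustrative assumptions rather than the project's exact model:

```python
from tensorflow.keras import layers, models

# Illustrative CNN: two conv/pool blocks feeding a dense classifier.
model = models.Sequential([
    layers.Input(shape=(64, 64, 1)),              # grayscale 64x64 input (assumed)
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.5),                          # regularization against overfitting
    layers.Dense(26, activation="softmax"),       # one class per ASL letter (assumed)
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(train_data, validation_data=val_data, epochs=10)
```

Dropout before the output layer is a common way to curb overfitting on a relatively small gesture dataset.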
- Support for More Sign Languages: Expand beyond ASL to include ISL, BSL, etc.
- Mobile Application: Develop an Android/iOS app for real-world usability.
- Gesture Sentence Prediction: Apply NLP to assemble full sentences from sequences of recognized gestures.
- Kangkan Patowary (Developer & ML Engineer)
This project is open-source under the MIT License.
For inquiries, reach out via:
- Email: [email protected]
- LinkedIn: kangkan-patowary