This project implements a real-time sign language detection system that uses Mediapipe for landmark detection and an LSTM-based deep learning model trained to recognize hand gestures.
It supports collecting a custom dataset, training and evaluating the model, and running real-time predictions via webcam.
## Project Structure

```
.
├── collect.py   # Collect training data from webcam (saves keypoints)
├── config.py    # Configuration (actions, paths, sequence settings)
├── evaluate.py  # Evaluate trained model performance
├── model.py     # Model definition and training logic
├── run.py       # Run real-time sign language detection via webcam
├── utils.py     # Utility functions for preprocessing and visualization
├── MP_DATA/     # Auto-generated dataset directory
├── model/       # Saved trained model files
└── Logs/        # TensorBoard logs
```
## Features

- **Dataset Collection**: collects webcam input, extracts Mediapipe landmarks (pose, face, hands), and saves the keypoints as structured `.npy` files (see the extraction sketch after this list).
- **Configurable Actions**: default gestures are `hello`, `thanks`, and `iloveyou`; you can add more in `config.py`.
- **Deep Learning Model (LSTM)**: a stacked LSTM network is trained on landmark sequences to classify gestures.
- **Evaluation**: generates confusion matrices and accuracy scores for trained models.
- **Real-Time Inference**: recognizes signs live from the webcam, overlays predictions, and displays probability bars.
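Here is a minimal sketch of the keypoint extraction step, assuming the project flattens MediaPipe Holistic landmarks into one fixed-length vector per frame. The `extract_keypoints` name and the exact layout are illustrative, derived from the pose/face/hands description above rather than copied from `utils.py`:

```python
import cv2
import mediapipe as mp
import numpy as np

mp_holistic = mp.solutions.holistic

def extract_keypoints(results):
    """Flatten pose, face, and hand landmarks into one vector.

    Missing detections become zeros, so every frame has the same length:
    33*4 + 468*3 + 21*3 + 21*3 = 1662 values.
    """
    pose = (np.array([[p.x, p.y, p.z, p.visibility]
                      for p in results.pose_landmarks.landmark]).flatten()
            if results.pose_landmarks else np.zeros(33 * 4))
    face = (np.array([[p.x, p.y, p.z]
                      for p in results.face_landmarks.landmark]).flatten()
            if results.face_landmarks else np.zeros(468 * 3))
    left = (np.array([[p.x, p.y, p.z]
                      for p in results.left_hand_landmarks.landmark]).flatten()
            if results.left_hand_landmarks else np.zeros(21 * 3))
    right = (np.array([[p.x, p.y, p.z]
                       for p in results.right_hand_landmarks.landmark]).flatten()
             if results.right_hand_landmarks else np.zeros(21 * 3))
    return np.concatenate([pose, face, left, right])

cap = cv2.VideoCapture(0)
with mp_holistic.Holistic(min_detection_confidence=0.5,
                          min_tracking_confidence=0.5) as holistic:
    ok, frame = cap.read()
    if ok:
        results = holistic.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        keypoints = extract_keypoints(results)   # shape: (1662,)
cap.release()
```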
## Getting Started

**1. Clone the repo**

```bash
git clone https://github.com/yourusername/sign-language-detection.git
cd sign-language-detection
```

**2. Install dependencies**

```bash
pip install -r requirements.txt
```
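The contents of `requirements.txt` are not reproduced here, but given the Requirements section below it likely resolves to something along these lines (package choices and pins are assumptions):

```text
tensorflow>=2.0
mediapipe
opencv-python
numpy
scikit-learn   # assumed for evaluate.py's confusion matrices and accuracy scores
```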
**3. Collect training data**

```bash
python collect.py
```

This creates a dataset under `MP_DATA/` with your defined actions.
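With the default settings (30 sequences × 30 frames per action, per "How It Works" below), the generated layout should look roughly like this:

```
MP_DATA/
├── hello/
│   ├── 0/            # one folder per sequence (0–29)
│   │   ├── 0.npy     # one keypoint file per frame (0–29)
│   │   └── ...
│   └── ...
├── thanks/
└── iloveyou/
```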
**4. Train (or load) the model**

```bash
python model.py
```

If a trained model already exists at `model/sign_language_detection_model.keras`, it is loaded; otherwise a new model is trained and saved.
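A minimal sketch of that load-or-train flow, assuming standard Keras calls; the `X.npy`/`y.npy` file names and the epoch count are hypothetical stand-ins, not the project's actual code:

```python
import os
import numpy as np
from tensorflow.keras.callbacks import TensorBoard
from tensorflow.keras.models import load_model

from model import build_model   # the project's architecture factory (see Customization)

MODEL_PATH = "model/sign_language_detection_model.keras"

if os.path.exists(MODEL_PATH):
    model = load_model(MODEL_PATH)                      # reuse the saved model
else:
    X = np.load("X.npy")                                # keypoint sequences built from MP_DATA/
    y = np.load("y.npy")                                # matching one-hot action labels
    model = build_model()
    model.fit(X, y, epochs=200,
              callbacks=[TensorBoard(log_dir="Logs")])  # feeds the Logs/ directory
    model.save(MODEL_PATH)
```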
**5. Check model performance**

```bash
python evaluate.py
```
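The metrics it reports amount to something like the following scikit-learn sketch (the script's actual code and variable names may differ):

```python
import numpy as np
from sklearn.metrics import accuracy_score, confusion_matrix
from tensorflow.keras.models import load_model

model = load_model("model/sign_language_detection_model.keras")
X_test = np.load("X_test.npy")   # held-out keypoint sequences (hypothetical file name)
y_test = np.load("y_test.npy")   # matching one-hot labels (hypothetical file name)

y_pred = np.argmax(model.predict(X_test), axis=1)
y_true = np.argmax(y_test, axis=1)

print(confusion_matrix(y_true, y_pred))            # rows: true action, cols: predicted
print("accuracy:", accuracy_score(y_true, y_pred))
```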
**6. Run the live system via webcam**

```bash
python run.py
```

Press `q` to quit the video feed.
## How It Works

- Actions defined: `['hello', 'thanks', 'iloveyou']`.
- Each action → 30 sequences × 30 frames per sequence.
- The model learns spatiotemporal patterns in the landmark sequences.
- During real-time detection, predictions update dynamically with probabilities displayed (see the inference sketch after this list).
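A sketch of the sliding-window inference loop this implies: keep the last 30 frames of keypoints and classify the window on every new frame. Names are illustrative, not `run.py`'s exact code:

```python
from collections import deque
import numpy as np

ACTIONS = ["hello", "thanks", "iloveyou"]
window = deque(maxlen=30)                  # holds the last sequence_length frames

def predict_frame(model, keypoints):
    """keypoints: the flattened landmark vector for the newest frame."""
    window.append(keypoints)
    if len(window) < window.maxlen:
        return None, None                  # still warming up, no prediction yet
    seq = np.expand_dims(np.array(window), axis=0)    # (1, 30, num_keypoints)
    probs = model.predict(seq, verbose=0)[0]
    return ACTIONS[int(np.argmax(probs))], probs      # label + per-action bars
```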
## Customization

- **Add new actions**: edit the `actions` array in `config.py`.
- **Adjust sequence length & dataset size**: change `sequence_length` and `num_sequences` in `config.py`.
- **Model architecture**: modify `build_model()` in `model.py` (sketched below).
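For reference, `build_model()` plausibly looks something like the stacked LSTM below. The layer sizes are illustrative assumptions; only the overall shape (an LSTM stack over 30-frame keypoint sequences with a softmax over the actions) follows from this README:

```python
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.models import Sequential

def build_model(num_actions=3, sequence_length=30, num_keypoints=1662):
    model = Sequential([
        LSTM(64, return_sequences=True, activation="relu",
             input_shape=(sequence_length, num_keypoints)),
        LSTM(128, return_sequences=True, activation="relu"),
        LSTM(64, return_sequences=False, activation="relu"),   # last LSTM emits one vector
        Dense(64, activation="relu"),
        Dense(32, activation="relu"),
        Dense(num_actions, activation="softmax"),              # one probability per action
    ])
    model.compile(optimizer="Adam", loss="categorical_crossentropy",
                  metrics=["categorical_accuracy"])
    return model
```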
## Requirements

- Python 3.8+
- Webcam access
- TensorFlow 2.x
- Mediapipe
- OpenCV
## License

This project is licensed under the MIT License – feel free to use and modify.