TLDR; This project uses Google's Mediapipe Hand Landmarks solution to detect hand keypoints and creates a "skeleton" by joining those keypoints on a white background image. These skeleton images are then used to train a CNN with VGG16 as the base model, since the dataset is created by the user and is too small to train a CNN from scratch.
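The skeleton-generation step might look roughly like the sketch below. This is not the project's exact code; the function name `skeleton_from_frame` and the 224x224 canvas size are assumptions, but the MediaPipe Hands and drawing-utils calls are the standard ones.

```python
# Minimal sketch: detect hand landmarks and draw the skeleton on a white canvas.
import cv2
import numpy as np
import mediapipe as mp

mp_hands = mp.solutions.hands
mp_draw = mp.solutions.drawing_utils

def skeleton_from_frame(frame_bgr, size=224):
    """Return a white image with the detected hand skeleton drawn on it."""
    white = np.full((size, size, 3), 255, dtype=np.uint8)
    with mp_hands.Hands(static_image_mode=True, max_num_hands=1) as hands:
        result = hands.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
        if result.multi_hand_landmarks:
            # draw_landmarks scales the normalized coordinates to the canvas size
            mp_draw.draw_landmarks(
                white,
                result.multi_hand_landmarks[0],
                mp_hands.HAND_CONNECTIONS,
            )
    return white
```

For the transfer-learning part, a hedged sketch of a frozen VGG16 base with a small classification head is shown below; the class count, layer sizes, and optimizer are placeholders, not the project's actual values.

```python
# Sketch: VGG16 as a frozen base model for a small, user-created dataset.
from tensorflow.keras.applications import VGG16
from tensorflow.keras import layers, models

NUM_CLASSES = 26  # assumption: one class per ASL alphabet letter

base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False  # keep the pretrained features; the dataset is small

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```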
The saved model is then used to predict hand gestures from the hand-landmark "skeleton". That prediction is passed to a second function that explicitly compares the landmark coordinates against the expected ASL alphabet poses to verify that the prediction is correct.
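An illustrative sketch of this two-stage prediction is given below. The file name, label order, and the coordinate rule for "A" are all assumptions used purely for illustration; the real project's verification rules may differ.

```python
# Sketch: run the saved CNN on the skeleton image, then sanity-check the
# predicted letter against simple landmark-coordinate rules before accepting it.
import numpy as np
import tensorflow as tf

model = tf.keras.models.load_model("asl_model.h5")   # assumed file name
LABELS = [chr(ord("A") + i) for i in range(26)]       # assumed label order

def passes_landmark_rules(letter, landmarks):
    """Toy check: for 'A' (a closed fist), the four fingertips should sit
    below their middle joints. `landmarks` is a MediaPipe landmark list,
    e.g. result.multi_hand_landmarks[0].landmark."""
    if letter == "A":
        tips, pips = (8, 12, 16, 20), (6, 10, 14, 18)
        return all(landmarks[t].y > landmarks[p].y for t, p in zip(tips, pips))
    return True  # other letters: accept the CNN prediction in this sketch

def predict_letter(skeleton_img, landmarks):
    x = skeleton_img[np.newaxis].astype("float32") / 255.0
    letter = LABELS[int(np.argmax(model.predict(x, verbose=0)))]
    return letter if passes_landmark_rules(letter, landmarks) else None
```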
PyQt6 is used for the GUI that displays the result. The user can also press a "Speak" button to pronounce the character, which is implemented with the pyttsx3 library.
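A minimal, self-contained sketch of the "Speak" button idea is shown below: a PyQt6 window that displays a predicted character and speaks it with pyttsx3 when clicked. The widget names and layout are assumptions, not the project's actual GUI code.

```python
# Sketch: PyQt6 window with a "Speak" button backed by pyttsx3.
import sys
import pyttsx3
from PyQt6.QtWidgets import QApplication, QWidget, QVBoxLayout, QLabel, QPushButton

class ResultWindow(QWidget):
    def __init__(self, predicted_char="A"):
        super().__init__()
        self.predicted_char = predicted_char
        self.engine = pyttsx3.init()
        layout = QVBoxLayout(self)
        self.label = QLabel(f"Predicted: {predicted_char}")
        speak_btn = QPushButton("Speak")
        speak_btn.clicked.connect(self.speak)
        layout.addWidget(self.label)
        layout.addWidget(speak_btn)

    def speak(self):
        # Pronounce the currently displayed character.
        self.engine.say(self.predicted_char)
        self.engine.runAndWait()

if __name__ == "__main__":
    app = QApplication(sys.argv)
    win = ResultWindow()
    win.show()
    sys.exit(app.exec())
```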
A few bugs still remain to be smoothed out!