A Sign Language detection project using MediaPipe landmark detection and TensorFlow LSTMs

Last update: Feb 06, 2022

Overview

sign-language-detection

A sign language detection project using MediaPipe landmark detection and a TensorFlow LSTM. The project is built for a vocabulary of 3 words, but more can be added by collecting data for the new words and adding it to the dataset.

Vocabulary

Open
to
Work
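
As a rough illustration of how the vocabulary might map to training labels, here is a minimal sketch. It assumes the labels are one-hot encoded, which this README does not state; `VOCABULARY` and `one_hot` are hypothetical names chosen for the example.

```python
import numpy as np

# The three words this project was built for.
VOCABULARY = ["Open", "to", "Work"]


def one_hot(words, vocabulary=VOCABULARY):
    """Encode each word as a one-hot row over the vocabulary (assumed label format)."""
    label_map = {word: index for index, word in enumerate(vocabulary)}
    ids = [label_map[word] for word in words]
    return np.eye(len(vocabulary), dtype=int)[ids]


print(one_hot(["to", "Work"]).tolist())  # [[0, 1, 0], [0, 0, 1]]
```

Under this assumption, adding a word means appending it to the list, collecting data for it, and retraining with one more output class.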

Output

Disclaimer

Colab cannot access a webcam, so it can't be used for MediaPipe detection or for collecting the dataset through the webcam. Most of that was done locally, and training and inference with TensorFlow were then performed on Colab.

You can uncomment the commented-out part if you wish to do all of that locally. In my case, mediapipe and tensorflow clashed on the ARM-based M1 Mac.

The notebook uses Nicholas Renotte's approach to Sign Language Detection, of course with a whole bunch of tweaks to suit my use case 🙂

Tweaks:

Input and output in the form of videos to work with colab.
Remove face landmarks as they end up just being noise.
Use tanh activation as it works way better with LSTMs compared to relu.
Colors and Cosmetics.
Disclaimer at bottom.
Different threshold value for inference.
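
Three of the tweaks above (no face landmarks, tanh activation, an inference threshold) can be sketched as follows. This is not the notebook's code. It assumes MediaPipe Holistic results, which expose `pose_landmarks`, `left_hand_landmarks` and `right_hand_landmarks`; the 30-frame sequence length, the layer sizes, and the 0.8 threshold are placeholder values, and the function names are hypothetical.

```python
from types import SimpleNamespace

import numpy as np

POSE_DIM = 33 * 4  # MediaPipe pose: 33 landmarks, each (x, y, z, visibility)
HAND_DIM = 21 * 3  # MediaPipe hand: 21 landmarks, each (x, y, z)
N_FEATURES = POSE_DIM + 2 * HAND_DIM  # 258 values per frame once the face is dropped


def extract_keypoints(results):
    """Flatten pose and hand landmarks into one vector, skipping the face."""
    def flatten(group, dim, with_visibility=False):
        if group is None:  # nothing detected in this frame: pad with zeros
            return np.zeros(dim)
        rows = [
            [p.x, p.y, p.z, p.visibility] if with_visibility else [p.x, p.y, p.z]
            for p in group.landmark
        ]
        return np.array(rows).flatten()

    return np.concatenate([
        flatten(results.pose_landmarks, POSE_DIM, with_visibility=True),
        flatten(results.left_hand_landmarks, HAND_DIM),
        flatten(results.right_hand_landmarks, HAND_DIM),
    ])


def build_model(seq_len=30, n_classes=3):
    """Small stacked LSTM using tanh; sizes are illustrative, not the notebook's."""
    import tensorflow as tf  # imported lazily so the helpers above run without it

    return tf.keras.Sequential([
        tf.keras.layers.Input(shape=(seq_len, N_FEATURES)),
        tf.keras.layers.LSTM(64, return_sequences=True, activation="tanh"),
        tf.keras.layers.LSTM(64, activation="tanh"),
        tf.keras.layers.Dense(n_classes, activation="softmax"),
    ])


def decode(probs, vocabulary, threshold=0.8):
    """Return the top word only if its probability clears the threshold."""
    best = int(np.argmax(probs))
    return vocabulary[best] if probs[best] > threshold else None


# Stand-in for a MediaPipe Holistic result where nothing was detected.
empty = SimpleNamespace(
    pose_landmarks=None, left_hand_landmarks=None, right_hand_landmarks=None
)
print(extract_keypoints(empty).shape)  # (258,)
print(decode(np.array([0.05, 0.90, 0.05]), ["Open", "to", "Work"]))  # to
```

Dropping the face mesh removes 468 landmarks per frame, which leaves a much smaller input for the LSTM. tanh is also the default activation of the Keras `LSTM` layer, so the explicit argument is only there to make the choice visible.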


Owner

Hashim
