Use Python, OpenCV, and MediaPipe to control a keyboard with facial gestures

Last update: Nov 09, 2022

Overview

CheekyKeys

A Face-Computer Interface

CheekyKeys lets you control your keyboard using your face.

View a fuller demo and more background on the project at https://youtu.be/rZ0DBi1avMM

CheekyKeys uses OpenCV and MediaPipe's Face Mesh to perform real-time detection of facial landmarks from video input. From there, relative differences between landmarks are calculated to detect specific facial gestures, which are then translated into keyboard commands.
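As a rough illustration of the "relative differences" idea, here is a minimal sketch that measures how far the mouth is open relative to the height of the face, so the value does not change much when you move closer to or farther from the camera. It is not the project's actual code: the function names, the choice of four reference points, and the 0.08 threshold are all assumptions made for the example. The points are plain (x, y) pairs, such as the normalized landmark coordinates Face Mesh returns.

```python
from math import hypot


def distance(a, b):
    """Euclidean distance between two (x, y) points."""
    return hypot(a[0] - b[0], a[1] - b[1])


def mouth_open_ratio(upper_lip, lower_lip, forehead, chin):
    """Lip gap divided by face height (hypothetical helper).

    Dividing by face height makes the value roughly independent of
    how large the face appears in the frame.
    """
    face_height = distance(forehead, chin)
    if face_height == 0:
        return 0.0  # degenerate input, treat as closed
    return distance(upper_lip, lower_lip) / face_height


def is_mouth_open(ratio, threshold=0.08):
    """Compare the ratio to a per-user threshold (0.08 is illustrative)."""
    return ratio > threshold


# Example with made-up normalized coordinates
ratio = mouth_open_ratio((0.5, 0.60), (0.5, 0.70), (0.5, 0.20), (0.5, 0.90))
print(round(ratio, 3), is_mouth_open(ratio))
```

The same pattern (one landmark distance divided by a reference distance, compared to a threshold) could plausibly be reused for eyebrow raises or eye closure.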

This version (0.1) is hardcoded to my facial features, but the thresholds can easily be modified. It's also built for a Mac keyboard, but you can swap in, e.g., the Windows key for Command simply enough.

The primary input is to "type" letters, digits, and symbols via Morse code by opening and closing your mouth quickly for . and slightly longer for -. Rather than waiting a set time after every letter, you scrunch your mouth upward once to finish a letter, or twice to add a space (end a word). Three mouth scrunches type enter/return.
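The Morse input described above can be sketched as a small state machine. This is only an illustration under assumptions: the class name MorseTyper, the 0.3 second dot/dash cutoff, and the idea that the caller has already grouped consecutive scrunches into a count are all hypothetical, and only the letters a to z are included here.

```python
# International Morse code for the letters a-z
MORSE = {
    ".-": "a", "-...": "b", "-.-.": "c", "-..": "d", ".": "e",
    "..-.": "f", "--.": "g", "....": "h", "..": "i", ".---": "j",
    "-.-": "k", ".-..": "l", "--": "m", "-.": "n", "---": "o",
    ".--.": "p", "--.-": "q", ".-.": "r", "...": "s", "-": "t",
    "..-": "u", "...-": "v", ".--": "w", "-..-": "x", "-.--": "y",
    "--..": "z",
}


class MorseTyper:
    """Collects dots and dashes, then commits a letter on a scrunch."""

    def __init__(self, dash_seconds=0.3):
        self.dash_seconds = dash_seconds  # illustrative cutoff
        self.queue = ""    # dots and dashes of the letter in progress
        self.output = []   # characters typed so far

    def mouth_opened(self, duration):
        """A quick open is a dot, a slightly longer one is a dash."""
        self.queue += "-" if duration >= self.dash_seconds else "."

    def scrunch(self, count=1):
        """1 scrunch ends the letter, 2 also add a space, 3 add enter."""
        letter = MORSE.get(self.queue)
        if letter:
            self.output.append(letter)
        self.queue = ""  # unknown sequences are simply dropped
        if count == 2:
            self.output.append(" ")
        elif count >= 3:
            self.output.append("\n")

    def text(self):
        return "".join(self.output)


typer = MorseTyper()
for _ in range(4):          # "...." is h
    typer.mouth_opened(0.1)
typer.scrunch(1)
for _ in range(2):          # ".." is i
    typer.mouth_opened(0.1)
typer.scrunch(2)            # end the letter and the word
print(repr(typer.text()))
```

Ending letters with an explicit gesture, rather than a timeout, means the decoder never has to guess whether a pause was intentional.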

The cheatsheet includes the full alphabet as well as special characters and hotkeys.

Most of the rest of the keyboard and other helpful actions are included as modifier gestures, such as:

shift: close right eye
command: close left eye
arrow up/down: raise left/right eyebrow
arrow left/right: raise left/right eyebrow + duckface (pursed lips)
backspace: duckface + double blink
zoom in: eyes bulge
zoom out: eyes squint
repeat previous letter/command: double raise of both eyebrows
clear current Morse queue: wink right eye, then wink left eye
escape: wink left eye, then wink right eye
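One way to picture the modifier gestures listed above is as a lookup table from sets of active gestures to actions, where the most specific match wins (so eyebrow plus duckface gives arrow left rather than arrow up). The sketch below is an assumption about how this could be organized, not the project's implementation: the gesture and action names are invented labels, only some of the gestures are covered, and the timed sequences (double blink, wink order) are treated as if an earlier step had already recognized them. The command_key helper illustrates swapping Command for the Windows key.

```python
import sys


def command_key(platform=sys.platform):
    """Name of the 'command' modifier: Command on macOS, Windows key elsewhere."""
    return "cmd" if platform == "darwin" else "win"


# (gestures that must all be active, resulting action); names are hypothetical
GESTURE_ACTIONS = [
    (frozenset({"close_right_eye"}), "shift"),
    (frozenset({"close_left_eye"}), "command"),
    (frozenset({"raise_left_eyebrow"}), "arrow_up"),
    (frozenset({"raise_right_eyebrow"}), "arrow_down"),
    (frozenset({"raise_left_eyebrow", "duckface"}), "arrow_left"),
    (frozenset({"raise_right_eyebrow", "duckface"}), "arrow_right"),
    (frozenset({"duckface", "double_blink"}), "backspace"),
    (frozenset({"eyes_bulge"}), "zoom_in"),
    (frozenset({"eyes_squint"}), "zoom_out"),
]


def resolve(active):
    """Return the action whose gesture set is the largest subset of `active`.

    Ties are broken by table order; returns None when nothing matches.
    """
    active = frozenset(active)
    best = None
    for gestures, action in GESTURE_ACTIONS:
        if gestures <= active and (best is None or len(gestures) > len(best[0])):
            best = (gestures, action)
    return best[1] if best else None


print(resolve({"raise_left_eyebrow"}))              # arrow_up
print(resolve({"raise_left_eyebrow", "duckface"}))  # arrow_left
print(command_key("darwin"), command_key("win32"))  # cmd win
```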
