Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Last update: Dec 18, 2022

Overview

QuickDraw - AirGesture

Introduction

Here is my python source code for QuickDraw - an online game developed by google, combined with AirGesture - a simple gesture recognition application. By using my code, you could:

Run an app which you could draw in front of a camera with your hand (If you use laptop, your webcam will be used by default)
Run an app which you could draw on a canvas

Camera app

In order to use this application, you only need to use your hand to draw in front of a camera/webcam. The middle point of your hand will be detected and highlighted by a red dot. When you are ready for drawing, you need to press space button to start drawing. When you want to stop drawing, press space button again. Below is the demo by running the sript camera_app.py:

Camera app demo

Drawing app

The script and demo will be released soon

Categories:

The table below shows 18 categories my model used:


apple	book	bowtie	candle
cloud	cup	door	envelope
eyeglasses	hammer	hat	ice cream
leaf	scissors	star	t-shirt
pants	tree

Trained models

You could find my trained model at data/trained_models/

Docker

For being convenient, I provide Dockerfile which could be used for running training phase as well as launching application

Assume that docker image's name is qd_ag. You already clone this repository and cd into it.

Build:

sudo docker build --network=host -t qd_ag .

Run:

If you want to launch the application, first you need to run xhost + to turn off access control (if you only want to run the training, you could skip this step). Then you run:

sudo docker run --gpus all -it --rm --volume="path/to/your/data:/workspace/code/data -e DISPLAY=$DISPLAY --env="QT_X11_NO_MITSHM=1" -v /tmp/.X11-unix:/tmp/.X11-unix --device=/dev/video0:/dev/video0 qd_ag

Inside docker container, you could run train.py or camera_app.py scripts for training or launching app respectively. By default, the camera_app.py script will automatically generate a video capturing what you have done during the session, at data/output.mp4

Experiments:

For each class, I split the data to training and test sets with ratio of 8:2. The training/test loss/accuracy curves for the experiment are shown below:

Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Related tags

Overview

QuickDraw - AirGesture

Introduction

Camera app

Drawing app

Categories:

Trained models

Docker

Experiments:

Owner

Viet Nguyen

HGCN: Harmonic Gated Compensation Network For Speech Enhancement

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

Simple machine learning library / 簡單易用的機器學習套件

给yolov5加个gui界面，使用pyqt5，yolov5是5.0版本

Keyword2Text This repository contains the code of the paper: "A Plug-and-Play Method for Controlled Text Generation"

DeepLabv3+：Encoder-Decoder with Atrous Separable Convolution语义分割模型在tensorflow2当中的实现

Graph Regularized Residual Subspace Clustering Network for hyperspectral image clustering

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

ObsPy: A Python Toolbox for seismology/seismological observatories.

ICNet and PSPNet-50 in Tensorflow for real-time semantic segmentation

Official PyTorch implementation of "Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning" (ICCV2021 Oral)

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Stable Neural ODE with Lyapunov-Stable Equilibrium Points for Defending Against Adversarial Attacks

A state of the art of new lightweight YOLO model implemented by TensorFlow 2.

Towards Representation Learning for Atmospheric Dynamics (AtmoDist)

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

A super lightweight Lagrangian model for calculating millions of trajectories using ERA5 data

DuBE: Duple-balanced Ensemble Learning from Skewed Data

Deep motion transfer

This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).