In this project, we use the live feed from the webcam to create a virtual mouse with full mouse functionality.

Last update: Dec 20, 2022

Overview

Virtual Mouse Using OpenCV

In this project, we use the live feed from the webcam to create a virtual mouse driven by hand tracking.

Project Description:

In this project, I use my hand as a virtual mouse that can do everything a physical mouse does, without touching the system. The webcam detects my hand, and the program draws a bounding box around it and focuses on two fingers: the forefinger and the middle finger. The forefinger acts as the cursor, so moving it moves the cursor on screen. To click, the program measures the distance between the forefinger and the middle finger. When the two fingers are joined together, it performs a click.
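The cursor mapping and click rule described above can be sketched as two small functions. This is an illustrative sketch, not the project's actual code: the function names, the 40-pixel threshold, and the assumption that fingertip positions are already in pixel coordinates are all hypothetical. (In MediaPipe's hand model, landmarks 8 and 12 are the index and middle fingertips.)

```python
import math


def fingers_joined(index_tip, middle_tip, threshold=40.0):
    """Return True when the two fingertips are close enough to count as a click.

    The threshold (in pixels) is an assumed value; tune it for your camera.
    """
    distance = math.hypot(middle_tip[0] - index_tip[0],
                          middle_tip[1] - index_tip[1])
    return distance < threshold


def to_screen(point, frame_size, screen_size):
    """Linearly map a camera-frame point to screen coordinates.

    The result is clamped so it always stays inside the screen.
    """
    frame_w, frame_h = frame_size
    screen_w, screen_h = screen_size
    x = min(max(point[0] / frame_w * screen_w, 0.0), screen_w - 1)
    y = min(max(point[1] / frame_h * screen_h, 0.0), screen_h - 1)
    return x, y


if __name__ == "__main__":
    # Hypothetical fingertip positions in a 640x480 camera frame.
    index_tip, middle_tip = (320, 240), (335, 245)
    print(to_screen(index_tip, (640, 480), (1920, 1080)))
    print(fingers_joined(index_tip, middle_tip))
```

In a full pipeline, the mapped point would be handed to the mouse-control library (for example `autopy.mouse.move`), and a `True` result from `fingers_joined` would trigger a click.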

Furthermore, a smoothing factor was added because the raw cursor movement was very shaky.
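The README does not say how the smoothing is implemented. One common approach, shown here as a sketch under that assumption, moves the cursor only a fraction of the way toward the newly detected position on each frame. The class name and the default factor of 5 are hypothetical.

```python
class CursorSmoother:
    """Damp jitter by moving only part of the way to each new target."""

    def __init__(self, smoothing=5.0):
        # Larger values give a steadier but slower cursor (assumed default).
        self.smoothing = smoothing
        self.prev = None

    def update(self, target):
        """Return the smoothed position for the latest detected target."""
        if self.prev is None:
            # First frame: nothing to smooth against, so jump to the target.
            self.prev = target
        else:
            prev_x, prev_y = self.prev
            target_x, target_y = target
            self.prev = (
                prev_x + (target_x - prev_x) / self.smoothing,
                prev_y + (target_y - prev_y) / self.smoothing,
            )
        return self.prev


if __name__ == "__main__":
    smoother = CursorSmoother(smoothing=5.0)
    for raw in [(0.0, 0.0), (100.0, 50.0), (100.0, 50.0)]:
        print(smoother.update(raw))
```

With a factor of 1 the cursor follows the hand exactly. Raising the factor trades responsiveness for stability.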

Requirements:

The following modules must be installed for the project to work properly:

OpenCV
Mediapipe
Autopy

OpenCV:

OpenCV is a huge open-source library for computer vision, machine learning, and image processing. OpenCV supports a wide variety of programming languages like Python, C++, Java, etc. It can process images and videos to identify objects, faces, or even the handwriting of a human.

It can be installed using "pip install opencv-python"

Mediapipe:

MediaPipe is a framework for building multimodal (e.g., video, audio, or any time-series data), cross-platform (e.g., Android, iOS, web, edge devices) applied ML pipelines.

It can be installed using "pip install mediapipe"

Autopy:

AutoPy is a simple, cross-platform GUI automation library for Python. It includes functions for controlling the keyboard and mouse, finding colors and bitmaps on-screen, and displaying alerts.

It can be installed using "pip install autopy"
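After installing, a quick check can confirm that all three modules are importable. The sketch below uses only the standard library. The helper name `missing_packages` is hypothetical, and the mapping reflects the fact that the `opencv-python` package is imported as `cv2`.

```python
import importlib.util

# pip package name -> module name used in "import ..."
REQUIRED = {
    "opencv-python": "cv2",
    "mediapipe": "mediapipe",
    "autopy": "autopy",
}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose modules cannot be found."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages, install with: pip install " + " ".join(missing))
    else:
        print("All required modules are installed.")
```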

Important Note:

I faced a lot of dependency issues throughout this project. Some of the issues and their solutions are as follows:

autopy not installing: At the time of writing, autopy did not support Python versions above 3.8.
webcam not opening: This was a bug in mediapipe that was fixed in the latest Python versions.

Hence, for the project to run smoothly, you need to downgrade Python to version 3.8.

How to Downgrade the Python Version:

Follow these steps:

Uninstall Python from Add/Remove Programs.
Go to AppData and remove any Python folders you see.
Download Python 3.8 from this link: Python 3.8
Install it.
Open Command Prompt and run "pip" to confirm the installation.
Your Python version has been downgraded :)
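Once the steps above are done, the interpreter version can also be checked from Python itself. This sketch assumes the 3.8 ceiling stated in this README. The function name `is_supported` is hypothetical.

```python
import sys


def is_supported(version_info=sys.version_info, max_supported=(3, 8)):
    """Return True if the major.minor version is at or below max_supported."""
    return tuple(version_info[:2]) <= max_supported


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    if is_supported():
        print("Python " + current + " is within the supported range.")
    else:
        print("Python " + current + " is newer than 3.8; autopy may not install.")
```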

Contact Information:

For any further queries, feel free to contact me at:

Email: [email protected]

LinkedIn: Hassan Shahzad

Owner

Hassan Shahzad
