pytextractor

python ocr using tesseract/ with EAST opencv text detector

Uses the EAST opencv detector defined here with pytesseract to extract text(default) or numbers from images.

Usage main

usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes

Usage lib

from pytextractor import pytextractor

extractor = pytextractor.PyTextractor()

Running tests

python setup.py test

make sure tesseract is installed *

brew | apt-get install tesseract

python ocr using tesseract/ with EAST opencv detector

Related tags

Overview

pytextractor

Usage main

Usage lib

Running tests

Owner

Danny Crasto

Python Computer Vision from Scratch

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

How to detect objects in real time by using Jupyter Notebook and Neural Networks , by using Yolo3

Detect and fix skew in images containing text

Character Segmentation using TensorFlow

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

The first open-source library that detects the font of a text in a image.

This is a real life mario project using python and mediapipe

Framework for the Complete Gaze Tracking Pipeline

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Play the Namibian game of Owela against a terrible AI. Built using Django and htmx.

Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder

Scene text recognition

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.

Memory tests solver with using OpenCV

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library