An Optical Character Recognition system built with pytesseract and applied to extracting data from blood pressure reports.

Overview

Optical_Character_Recognition

As an IoT/Computer Vision intern in the Graduate Rotational Internship Program (GRIP) at The Sparks Foundation (TSF), my first task is to implement a character detector that extracts printed or handwritten text from an image or video.

To learn more, I applied this detector to cleaning and extracting useful information from images of blood pressure reports.
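As a rough illustration of the extraction step, the sketch below pulls systolic/diastolic pairs out of text that OCR has already produced. It is not taken from this repository: the `120/80` layout, the `extract_readings` name, and the plausibility bounds are all assumptions chosen for the example, and real reports may need a different pattern.

```python
import re

# Assumed layout: readings printed as "systolic/diastolic", e.g. "120/80".
READING = re.compile(r"\b(\d{2,3})\s*/\s*(\d{2,3})\b")


def extract_readings(text):
    """Return plausible blood pressure readings found in OCR text.

    Hypothetical helper: the numeric bounds below are illustrative
    assumptions used to discard dates and other "a/b" patterns.
    """
    readings = []
    for match in READING.finditer(text):
        systolic = int(match.group(1))
        diastolic = int(match.group(2))
        plausible = (
            60 <= systolic <= 260
            and 30 <= diastolic <= 200
            and systolic > diastolic
        )
        if plausible:
            readings.append({"systolic": systolic, "diastolic": diastolic})
    return readings
```

The list of dictionaries can be passed directly to `pandas.DataFrame(...)` for further cleaning.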

Dependencies

  • tesseract-ocr package
  • pytesseract 0.3.8
  • OpenCV
  • pandas

    pytesseract, an open-source Python wrapper for Tesseract, detects text in images and video frames.

    OpenCV handles image processing.

    pandas handles data manipulation.
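One way these three libraries might fit together is sketched below. This is a minimal example under stated assumptions, not the repository's actual code: the function names, the Otsu binarization step, and the `min_conf` cutoff are hypothetical choices. The heavy libraries are imported inside the functions that need them, so the pure-Python grouping helper can be tried without OpenCV or Tesseract installed.

```python
def load_and_binarize(path):
    """Read an image with OpenCV and binarize it with Otsu's threshold."""
    import cv2  # lazy import: only needed when an image is actually read

    image = cv2.imread(path)
    if image is None:
        raise FileNotFoundError(f"could not read image: {path}")
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return binary


def ocr_words(binary_image):
    """Run Tesseract and return word-level results as a dict of lists."""
    import pytesseract
    from pytesseract import Output

    return pytesseract.image_to_data(binary_image, output_type=Output.DICT)


def words_to_lines(data, min_conf=0.0):
    """Group word-level OCR results into text lines.

    Words that are empty or whose confidence is below min_conf
    (an assumed cutoff) are dropped.
    """
    lines = {}
    for i, text in enumerate(data["text"]):
        word = str(text).strip()
        if not word or float(data["conf"][i]) < min_conf:
            continue
        key = (
            data["page_num"][i],
            data["block_num"][i],
            data["par_num"][i],
            data["line_num"][i],
        )
        lines.setdefault(key, []).append(word)
    return [" ".join(words) for _, words in sorted(lines.items())]


def lines_to_frame(lines):
    """Put the recognized lines into a pandas DataFrame for cleaning."""
    import pandas as pd

    return pd.DataFrame({"line": lines})
```

A typical call chain would be `lines_to_frame(words_to_lines(ocr_words(load_and_binarize("report.png")), min_conf=50))`, where `report.png` is a placeholder file name.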

Owner

Ramsis Hammadi