Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of the following tutorial and is intended for learning purposes only: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the Project 🚀

First, clone the repository:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository, you need to extract the features from the noisy images. The script extracts 5 x 5 windows (25-d feature vectors) from the noisy images, together with the target (cleaned) pixel value from the corresponding ground-truth image. These features are then saved to a CSV file (~200MB). To extract the features, run:

$ python3 build_features.py
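The windowing step described above can be sketched roughly as follows. This is a minimal NumPy-only illustration, not the code in build_features.py: the function name extract_features, the edge-replicating padding, and the synthetic 8 x 8 images are all assumptions made for the example, and the real script may preprocess the images differently before sampling them.

```python
import numpy as np


def extract_features(noisy, clean, window=5):
    """Return (features, targets) for every pixel of a grayscale image pair.

    Hypothetical helper for illustration only.
    """
    pad = window // 2
    # Replicate edge pixels so that border pixels also get a full window
    # (an assumed padding strategy).
    padded = np.pad(noisy, pad, mode="edge")
    features, targets = [], []
    for y in range(noisy.shape[0]):
        for x in range(noisy.shape[1]):
            # 5 x 5 window centered on (y, x), flattened to a 25-d vector.
            roi = padded[y:y + window, x:x + window]
            features.append(roi.flatten())
            # The target is the matching pixel in the clean image.
            targets.append(clean[y, x])
    return np.array(features), np.array(targets)


# Tiny synthetic image pair (made-up data, not taken from the dataset).
rng = np.random.default_rng(0)
clean = rng.random((8, 8))
noisy = np.clip(clean + 0.1 * rng.standard_normal((8, 8)), 0.0, 1.0)

X, y = extract_features(noisy, clean)
print(X.shape, y.shape)
```

Each row of X is one 25-d feature vector and the matching entry of y is the pixel value the model should learn to predict. In the repository, build_features.py is the script that actually writes these rows to the CSV file.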

It will generate the following output:

Once that is done, the features need to be loaded and split into training and testing sets in order to train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model, run:

$ python train_denoiser.py
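As a rough idea of what splitting the features and fitting a Random Forest Regressor looks like with scikit-learn, here is a minimal sketch. It does not read the real CSV file: the synthetic rows, the 75/25 split, the number of trees, and the RMSE check are assumptions chosen for the example and may differ from what train_denoiser.py does.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the CSV rows: 25 features per row plus one target.
rng = np.random.default_rng(42)
features = rng.random((500, 25))
# Made-up target: the window's center pixel (index 12) plus a little noise.
targets = np.clip(features[:, 12] + 0.05 * rng.standard_normal(500), 0.0, 1.0)

# Hold out 25% of the rows for evaluation (assumed ratio).
train_x, test_x, train_y, test_y = train_test_split(
    features, targets, test_size=0.25, random_state=42
)

# A small forest keeps the example fast (assumed hyperparameter).
model = RandomForestRegressor(n_estimators=10, random_state=42)
model.fit(train_x, train_y)

# Root mean squared error on the held-out rows.
rmse = np.sqrt(mean_squared_error(test_y, model.predict(test_x)))
print(f"RMSE on held-out rows: {rmse:.4f}")
```

A lower RMSE on the held-out rows means the predicted pixel values are closer to the clean ones. In the repository, train_denoiser.py is the script that performs the actual training on the extracted features.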

And it will generate:

To check that the model performs well, run:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

Some images will be written to disk so you can compare each original image with the output of the model we have just trained.
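Conceptually, denoising a document means predicting every output pixel from its 5 x 5 neighborhood and reshaping the predictions back into an image. The sketch below illustrates that idea under stated assumptions: the denoise function and the CenterPixelModel stand-in are hypothetical names introduced here, any object with a predict method (such as a trained Random Forest Regressor) could be passed in, and the actual logic of denoise_document.py may differ.

```python
import numpy as np


def denoise(noisy, model, window=5):
    """Predict each pixel from its window and rebuild the image.

    Hypothetical helper; expects pixel values scaled to [0, 1].
    """
    pad = window // 2
    padded = np.pad(noisy, pad, mode="edge")
    rows, cols = noisy.shape
    # One flattened window per pixel, in row-major order.
    windows = np.array([
        padded[y:y + window, x:x + window].flatten()
        for y in range(rows)
        for x in range(cols)
    ])
    pixels = model.predict(windows)
    # Reshape the predictions into an image and scale to 8-bit values.
    output = np.clip(pixels.reshape(rows, cols), 0.0, 1.0)
    return (output * 255).astype("uint8")


class CenterPixelModel:
    """Stand-in model that returns the center pixel of each window."""

    def predict(self, windows):
        return windows[:, windows.shape[1] // 2]


image = np.linspace(0.0, 1.0, 16).reshape(4, 4)
result = denoise(image, CenterPixelModel())
print(result.shape, result.dtype)
```

With the stand-in model the output simply reproduces the input, which makes it easy to confirm that the windowing and reshaping line up before swapping in a trained regressor.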

If you have any questions or suggestions, please open an issue.

Owner

Antonio Bri Pérez
