A curated list of papers and resources for scene text detection and recognition

Overview

Awesome

Awesome Scene Text

A curated list of papers and resources for scene text detection and recognition

The year when a paper was first published, including ArXiv publications, is used. As a result, there may be cases when a paper was accepted for example to CVPR 2019, but it is listed in year 2018 because it was published in 2018 on ArXiv.

Table of contents
1. Scene Text Detection
2. Weakly Supervised Scene Text Detection
3. Scene Text Recognition
4. Other scene text papers
5. Scene Text Survey papers

Scene Text Detection (including methods for end-to-end detection and recognition)

2010

  • Detecting text in natural scenes with stroke width transform [CVPR 2010] [paper]
  • A Method for Text Localization and Recognition in Real-World Images [ACCV 2010] [paper]

2011

2012

  • Real-time scene text localization and recognition [CVPR 2012] [paper]

2013

2014

  • Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [ECCV 2014] [paper]

2015

  • Symmetry-based text line detection in natural scenes [CVPR 2015] [paper]
  • Object proposals for text extraction in the wild [ICDAR 2015] [paper]
  • Text-Attentional Convolutional Neural Network for Scene Text Detection [TIP 2016] [paper]
  • Text Flow : A Unified Text Detection System in Natural Scene Images [ICCV 2015] [paper]

2016

2017

2018

2019

  • MSR: Multi-Scale Shape Regression for Scene Text Detection [IJCAI 2019] [paper]
  • Scene Text Detection with Inception Text Proposal Generation Module [ICMLC 2019] [paper]
  • Towards Robust Curve Text Detection with Conditional Spatial Expansion [CVPR 2019] [paper]
  • Curve Text Detection with Local Segmentation Network and Curve Connection [ArXiv] [paper]
  • Pyramid Mask Text Detector [ArXiv] [paper]
  • Tightness-aware Evaluation Protocol for Scene Text Detection [CVPR 2019] [paper]
  • Character Region Awareness for Text Detection [CVPR 2019] [paper]
  • Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes [CVPR 2019] [paper]
  • TextCohesion: Detecting Text for Arbitrary Shapes [ArXiv] [paper]
  • Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation [CVPR 2019] [paper]
  • Learning Shape-Aware Embedding for Scene Text Detection [CVPR 2019] [paper]
  • A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning [ACMMM 2019] [paper]
  • Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [ICCV 2019] [paper]
  • Towards Unconstrained End-to-End Text Spotting [ICCV 2019] [paper]
  • TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting [paper]
  • Convolutional Character Networks [ICCV 2019] [paper]

Weakly supervised Scene Text Detection & Recognition

2017

  • Attention-Based Extraction of Structured Information from Street View Imagery [ICDAR 2017] [paper]
  • WeText: Scene Text Detection under Weak Supervision [ICCV 2017] [paper]
  • SEE: Towards Semi-Supervised End-to-End Scene Text Recognition [AAAI 2018] [paper]

Scene Text Recognition

2014

2015

2016

2017

2018

  • Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition [AAAI 2018] [paper]
  • SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network [AAAI 2018] [paper]
  • Edit Probability for Scene Text Recognition [CVPR 2018] [paper]
  • ASTER: An Attentional Scene Text Recognizer with Flexible Rectification [TPAMI 2018] [paper]
  • Synthetically Supervised Feature Learning for Scene Text Recognition [ECCV 2018] [paper]
  • Scene Text Recognition from Two-Dimensional Perspective [AAAI 2019] [paper]
  • ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification [CVPR 2019] [paper]

2019

Script Identification

Other scene text related papers

2016

2019

  • Scene Text Synthesis for Efficient and Effective Deep Network Training [ArXiv] [paper]

Scene text survey

2018

  • Scene Text Detection and Recognition: The Deep Learning Era [ArXiv] [paper]

2019

  • Scene text detection and recognition with advances in deep learning: a survey [IJDAR 2019] [paper]
Owner
Jan Zdenek
Jan Zdenek
A simple python program to record security cam footage by detecting a face and body of a person in the frame.

SecurityCam A simple python program to record security cam footage by detecting a face and body of a person in the frame. This code was created by me,

1 Nov 08, 2021
Write-ups for the SwissHackingChallenge2021 CTF.

SwissHackingChallenge 2021 : Write-ups This repository contains a collection of my write-ups for challenges solved during the SwissHackingChallenge (S

Julien Béguin 3 Jun 07, 2021
list all open dataset about ocr.

ocr-open-dataset list all open dataset about ocr. printed dataset year Born-Digital Images (Web and Email) 2011-2015 COCO-Text 2017 Text Extraction fr

hongbomin 95 Nov 24, 2022
This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network Introduction This is a tensorflow re-implementation of PSENet: Shape Robu

Michael liu 498 Dec 30, 2022
A list of hyperspectral image super-solution resources collected by Junjun Jiang

A list of hyperspectral image super-resolution resources collected by Junjun Jiang. If you find that important resources are not included, please feel free to contact me.

Junjun Jiang 301 Jan 05, 2023
A set of workflows for corpus building through OCR, post-correction and normalisation

PICCL: Philosophical Integrator of Computational and Corpus Libraries PICCL offers a workflow for corpus building and builds on a variety of tools. Th

Language Machines 41 Dec 27, 2022
An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Tips: A more recent scene text detection algorithm: PixelLink, has been implemented here: https://github.com/ZJULearning/pixel_link Contents: Introduc

dengdan 484 Dec 07, 2022
An interactive document scanner built in Python using OpenCV

The scanner takes a poorly scanned image, finds the corners of the document, applies the perspective transformation to get a top-down view of the document, sharpens the image, and applies an adaptive

Kushal Shingote 1 Feb 12, 2022
Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder

================================= OCRFeeder - A Complete OCR Suite ================================= OCRFeeder is a complete Optical Character Recogn

GNOME Github Mirror 81 Dec 23, 2022
Rotational region detection based on Faster-RCNN.

R2CNN_Faster_RCNN_Tensorflow Abstract This is a tensorflow re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detecti

UCAS-Det 581 Nov 22, 2022
⛓ marc is a small, but flexible Markov chain generator

About marc (markov chain) is a small, but flexible Markov chain generator. Usage marc is easy to use. To build a MarkovChain pass the object a sequenc

Max Humber 65 Oct 27, 2022
Face Anonymizer - FaceAnonApp v1.0

Face Anonymizer - FaceAnonApp v1.0 Blur faces from image and video files in /data/files folder. Contents Repo of the source files for the FaceAnonApp.

6 Apr 18, 2022
ERQA - Edge Restoration Quality Assessment

ERQA - a full-reference quality metric designed to analyze how good image and video restoration methods (SR, deblurring, denoising, etc) are restoring real details.

MSU Video Group 27 Dec 17, 2022
Controlling Volume by Hand Gestures

This program allows the user to control the volume of their device with specific hand gestures involving their thumb and index finger!

Riddhi Bajaj 1 Nov 11, 2021
Detect handwritten words in a text-line (classic image processing method).

Word segmentation Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is fro

Harald Scheidl 190 Jan 03, 2023
PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

PyNeuro PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application

Zach Wang 45 Dec 30, 2022
Localization of thoracic abnormalities model based on VinBigData (top 1%)

Repository contains the code for 2nd place solution of VinBigData Chest X-ray Abnormalities Detection competition. The goal of competition was to auto

33 May 24, 2022
A real-time dolly zoom camera effect

Dolly-Zoom I've always been amazed by the gradual perspective change of dolly zoom, and I have some experience in python and OpenCV, so I decided to c

Dylan Kai Lau 52 Dec 08, 2022
This is a GUI program which consist of 4 OpenCV projects

Tkinter-OpenCV Project Using Tkinter, Opencv, Mediapipe This is a python GUI program using Tkinter which consist of 4 OpenCV projects 1. Finger Counte

Arya Bagde 3 Feb 22, 2022
Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

dio-live-textract2 Repositório de código para o live coding do dia 05/10/2021 sobre extração de dados estruturados e gravação em banco de dados a part

hugoportela 0 Jan 19, 2022