Awesome Scene Text

A curated list of papers and resources for scene text detection and recognition

Papers are listed under the year they were first published, including ArXiv preprints. As a result, a paper accepted to, for example, CVPR 2019 may be listed under 2018 because it first appeared on ArXiv in 2018.

Table of contents
1. Scene Text Detection
2. Weakly Supervised Scene Text Detection & Recognition
3. Scene Text Recognition
4. Other Scene Text Related Papers
5. Scene Text Survey Papers

Scene Text Detection (including methods for end-to-end detection and recognition)

2010

  • Detecting text in natural scenes with stroke width transform [CVPR 2010] [paper]
  • A Method for Text Localization and Recognition in Real-World Images [ACCV 2010] [paper]

2011

2012

  • Real-time scene text localization and recognition [CVPR 2012] [paper]

2013

2014

  • Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [ECCV 2014] [paper]

2015

  • Symmetry-based text line detection in natural scenes [CVPR 2015] [paper]
  • Object proposals for text extraction in the wild [ICDAR 2015] [paper]
  • Text-Attentional Convolutional Neural Network for Scene Text Detection [TIP 2016] [paper]
  • Text Flow: A Unified Text Detection System in Natural Scene Images [ICCV 2015] [paper]

2016

2017

2018

2019

  • MSR: Multi-Scale Shape Regression for Scene Text Detection [IJCAI 2019] [paper]
  • Scene Text Detection with Inception Text Proposal Generation Module [ICMLC 2019] [paper]
  • Towards Robust Curve Text Detection with Conditional Spatial Expansion [CVPR 2019] [paper]
  • Curve Text Detection with Local Segmentation Network and Curve Connection [ArXiv] [paper]
  • Pyramid Mask Text Detector [ArXiv] [paper]
  • Tightness-aware Evaluation Protocol for Scene Text Detection [CVPR 2019] [paper]
  • Character Region Awareness for Text Detection [CVPR 2019] [paper]
  • Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes [CVPR 2019] [paper]
  • TextCohesion: Detecting Text for Arbitrary Shapes [ArXiv] [paper]
  • Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation [CVPR 2019] [paper]
  • Learning Shape-Aware Embedding for Scene Text Detection [CVPR 2019] [paper]
  • A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning [ACMMM 2019] [paper]
  • Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [ICCV 2019] [paper]
  • Towards Unconstrained End-to-End Text Spotting [ICCV 2019] [paper]
  • TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting [ICCV 2019] [paper]
  • Convolutional Character Networks [ICCV 2019] [paper]

Weakly Supervised Scene Text Detection & Recognition

2017

  • Attention-Based Extraction of Structured Information from Street View Imagery [ICDAR 2017] [paper]
  • WeText: Scene Text Detection under Weak Supervision [ICCV 2017] [paper]
  • SEE: Towards Semi-Supervised End-to-End Scene Text Recognition [AAAI 2018] [paper]

Scene Text Recognition

2014

2015

2016

2017

2018

  • Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition [AAAI 2018] [paper]
  • SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network [AAAI 2018] [paper]
  • Edit Probability for Scene Text Recognition [CVPR 2018] [paper]
  • ASTER: An Attentional Scene Text Recognizer with Flexible Rectification [TPAMI 2018] [paper]
  • Synthetically Supervised Feature Learning for Scene Text Recognition [ECCV 2018] [paper]
  • Scene Text Recognition from Two-Dimensional Perspective [AAAI 2019] [paper]
  • ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification [CVPR 2019] [paper]

2019

Script Identification

Other scene text related papers

2016

2019

  • Scene Text Synthesis for Efficient and Effective Deep Network Training [ArXiv] [paper]

Scene text survey

2018

  • Scene Text Detection and Recognition: The Deep Learning Era [ArXiv] [paper]

2019

  • Scene text detection and recognition with advances in deep learning: a survey [IJDAR 2019] [paper]

Owner: Jan Zdenek