TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Last update: Dec 12, 2022

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

The code and trained models of:

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection, TIP 2019 [Paper]

Citation

Please cite the related works in your publications if it helps your research:


@article{xu2018textfield,
  title={TextField: Learning A Deep Direction Field for Irregular Scene Text Detection},
  author={Xu, Yongchao and Wang, Yukang and Zhou, Wei and Wang, Yongpan and Yang, Zhibo and Bai, Xiang},
  journal={arXiv preprint arXiv:1812.01393},
  year={2018}
}

Prerequisite

Caffe and SynthText pretrained model [Link]
Datasets: [Total-Text], [ICDAR2015]
OpenCV 3.4.3
MATLAB

Usage

1. Install Caffe

cp Makefile.config.example Makefile.config
# adjust Makefile.config (for example, enable python layer)
make all -j16
# make sure to include $CAFFE_ROOT/python to your PYTHONPATH.
make pycaffe

Please refer to Caffe Installation to ensure other dependencies.

2. Data and model preparation

# download datasets and pretrained model then
mkdir data && mv [your_dataset_folder] data/
mkdir models && mv [your_pretrained_model] models/

3. Training scripts

# an example on Total-Text dataset
cd examples/TextField/
python train.py --gpu [your_gpu_id] --dataset total --initmodel ../../models/synth_iter_800000.caffemodel

4. Evaluation scripts

# an example on Total-Text dataset
cd evaluation/total/
./eval.sh

Results and Trained Models

Total-Text

Recall	Precision	F-measure	Link
0.816	0.824	0.820	[Google drive]

*lambda=0.50 for post-processing

ICDAR2015

Recall	Precision	F-measure	Link
0.811	0.846	0.828	[Google drive]

*lambda=0.75 for post-processing

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Related tags

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

Citation

Prerequisite

Usage

1. Install Caffe

2. Data and model preparation

3. Training scripts

4. Evaluation scripts

Results and Trained Models

Total-Text

ICDAR2015

Owner

Yukang Wang

An application of high resolution GANs to dewarp images of perturbed documents

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Optical character recognition for Japanese text, with the main focus being Japanese manga

Corner-based Region Proposal Network

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

A Python wrapper for the tesseract-ocr API

Captcha Recognition

list all open dataset about ocr.

SRA's seminar on Introduction to Computer Vision Fundamentals

textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention

Implementation of EAST scene text detector in Keras

Shape Detection - It's a shape detection project with OpenCV and Python.

Controlling Volume by Hand Gestures

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

aardio的opencv库

Rest API Written In Python To Classify NSFW Images.

TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.

A bot that extract text from images using the Tesseract OCR.

Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform sign language recognition.