Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Related tags

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python2.7, CUDA8/9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh   or  cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive;

Test

You can test a model in a single scale:

python eval_all.py

or in multi-scale:

python eval_multiscale.py

Note that, you should modify the model path and the test dataset before testing.

Train

python train.py

To train a new model, you should modify the training settings before training.

License

This code is only for academic purpose.

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Creating a virtual tv using opencv in python3.

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Machine Leaning applied to denoise images to improve OCR Accuracy

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

Detect textlines in document images

A tool to enhance your old/damaged pictures built using python & opencv.

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Read Japanese manga inside browser with selectable text.

Repository for playing the computer vision apps: People analytics on Raspberry Pi.

A Python wrapper for the tesseract-ocr API

A small C++ implementation of LSTM networks, focused on OCR.

Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video

kaldi-asr/kaldi is the official location of the Kaldi project.

OCR of Chicago 1909 Renumbering Plan

Using computer vision method to recognize and calcutate the features of the architecture.

Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"

A simple Digits Recogniser made in Python

Here use convulation with sobel filter from scratch in opencv python .

Controlling the computer volume with your hands // OpenCV