TextBoxes-TensorFlow

TextBoxes re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project Later, we will overwrite this project so make it more flexiable and modularized.

Author: Daitao Xing : [email protected] Jin Huang : [email protected]

Progress

2017/ 03/14

data_processing phase finished Test：

1. Download the dataset， put 1/ folder and gt.mat uner ddata/sythtext/ folder（will wirte script）   
2. python datasets/data2record.py    
3. python image_processing.py

output： batch_size * 300 * 300 * 3 image

2017/ 03/17

Finish the design of training(can start training)

python train.py \
--train_dir=${TRAIN_DIR} \
--dataset_dir=${DATASET_DIR} \
--save_summaries_secs=60 \
--save_interval_secs=600 \
--weight_decay=0.0005 \
--optimizer=adam \
--learning_rate=0.001 \
--batch_size=32

Problems to be solved：

1. Need to redesign visualization		
2. image_processing can be improved

Next steps:

traing on other datasets
fine tunes
test
automatic downloading datasets and so on

TextBoxes re-implement using tensorflow

Related tags

Overview

TextBoxes-TensorFlow

Progress

Problems to be solved：

Next steps:

Owner

Gu Xiaodong

Detect textlines in document images

pulse2percept: A Python-based simulation framework for bionic vision

Give a solution to recognize MaoYan font.

Image Smoothing and Blurring Using OpenCV

color detection using python

Document blur detection based on Laplacian operator and text detection.

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

Simple SDF mesh generation in Python

Convert Text-to Handwriting Using Python

Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

This is the code for our paper DAAIN: Detection of Anomalous and AdversarialInput using Normalizing Flows

Page to PAGE Layout Analysis Tool

Kornia is a open source differentiable computer vision library for PyTorch.

A post-processing tool for scanned sheets of paper.

Controlling the computer volume with your hands // OpenCV

TableBank: A Benchmark Dataset for Table Detection and Recognition

Pre-Recognize Library - library with algorithms for improving OCR quality.