TextBoxes++-TensorFlow

TextBoxes++ re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project

Author: Zhisheng Zou [email protected]

pretrained model

Google drive

environment

python2.7/python3.5

tensorflow-gpu 1.8.0

at least one gpu

how to use

Getting the xml file like this example xml and put the image together because we need the format like this standard xml
1. picture format: *.png or *.PNG
Getting the xml and flags ensure the XML file is under the same directory as the corresponding image.execute the code: convert_xml_format.py
1. python tools/convert_xml_format.py -i in_dir -s split_flag -l save_logs -o output_dir
2. in_dir means the absolute directory which contains the pic and xml
3. split_flag means whether or not to split the datasets
4. save_logs means whether to save train_xml.txt
5. output_dir means where to save xmls
Getting the tfrecords
1. python gene_tfrecords.py --xml_img_txt_path=./logs/train_xml.txt --output_dir=tfrecords
2. xml_img_txt_path like this train xml
3. output_dir means where to save tfrecords
Training
1. python train.py --train_dir =some_path --dataset_dir=some_path --checkpoint_path=some_path
2. train_dir store the checkpoints when training
3. dataset_dir store the tfrecords for training
4. checkpoint_path store the model which needs to be fine tuned
Testing
1. python test.py -m /home/model.ckpt-858 -o test
2. -m which means the model
3. -o which means output_result_dir
4. -i which means the test img dir
5. -c which means use which device to run the test
6. -n which means the nms threshold
7. -s which means the score threshold

Note:

when you are training the model, you can run the eval_result.py to eval your model and save the result

Textboxes_plusplus implementation with Tensorflow (python)

Related tags

Overview

TextBoxes++-TensorFlow

pretrained model

environment

how to use

Note:

Owner

OpenGait is a flexible and extensible gait recognition project

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

Automatic Number Plate Recognition (ANPR) is a highly accurate system capable of reading vehicle number plates without human intervention

Give a solution to recognize MaoYan font.

A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.

Visual Attention based OCR

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

Machine Leaning applied to denoise images to improve OCR Accuracy

Generic framework for historical document processing

Source code of RRPN ---- Arbitrary-Oriented Scene Text Detection via Rotation Proposals

📷 Face Recognition using Haar-Cascade Classifier, OpenCV, and Python

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

A simple QR-Code Reader in Python

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Python package for handwriting and sketching in Jupyter cells

A novel region proposal network for more general object detection ( including scene text detection ).

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"