A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Last update: Jan 05, 2023


Jianqi Ma, Zhetong Liang, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China & OPPO Research

Recovering TextZoom samples

Environment:

Other Python packages that may be required: pyyaml, cv2 (OpenCV), Pillow, and imgaug.
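A quick way to see which of the packages listed above are missing is a small import check. This is a minimal sketch, not part of the repository: the helper name `missing_packages` is hypothetical, and the pip package names (in particular `opencv-python` for `cv2`) are the usual PyPI names rather than something this README specifies.

```python
import importlib.util

# Import name -> usual pip package name (assumed; the README lists names only).
PACKAGES = {
    "yaml": "pyyaml",
    "cv2": "opencv-python",
    "PIL": "Pillow",
    "imgaug": "imgaug",
}


def missing_packages(packages=PACKAGES):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for module_name, pip_name in packages.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages; try: pip install " + " ".join(missing))
    else:
        print("All listed packages are importable.")
```

The check only confirms that each package can be located; it does not verify versions.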

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN: https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the code and change into '$TATT_ROOT$/', then place the pretrained recognizer weights in '$TATT_ROOT$/'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TATT.sh
./train_TATT.sh

Run the test-prefixed shell script to test the corresponding model.

Add '--go_test' to the command in the shell file.
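To illustrate how a switch such as '--go_test' typically selects between training and testing, here is a minimal sketch. It assumes the flag is a plain boolean option handled with `argparse`; only the flag name comes from this README, while `build_parser` and `select_mode` are hypothetical names and the repository's actual argument handling may differ.

```python
import argparse


def build_parser():
    # Hypothetical parser: only the flag name `--go_test` is taken from the README.
    parser = argparse.ArgumentParser(description="Illustrative train/test switch")
    parser.add_argument(
        "--go_test",
        action="store_true",
        help="run evaluation instead of training",
    )
    return parser


def select_mode(argv=None):
    """Return 'test' when --go_test is present, otherwise 'train'."""
    # parse_known_args ignores any other options the real script may accept.
    args, _unknown = build_parser().parse_known_args(argv)
    return "test" if args.go_test else "train"


print(select_mode(["--go_test"]))  # prints "test"
print(select_mode([]))             # prints "train"
```

With `action="store_true"`, the flag takes no value: its presence alone switches the mode.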

Cite this paper:

@article{ma2021text,
title={A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution},
author={Ma, Jianqi and Liang, Zhetong and Zhang, Lei},
journal={},
year={2022}
}

Owner

MA Jianqi, shiki
