SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Last update: Aug 24, 2022

Related tags

Overview

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

This repository implements the SAFL in pytorch.

Installation

conda env create -f environment.yml
conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=10.0 -c pytorch

Train

bash scripts/stn_att_rec.sh

Test

You can test with .lmdb files by

bash scripts/main_test_all.sh

Or test with single image by

bash scripts/main_test_image.sh

Data preparation

We give an example to construct your own datasets. Details please refer to tools/create_svtp_lmdb.py.

Citation

If you find this project helpful for your research, please cite the following papers:

Owner

GitHub Repository

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

InversePrompting Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting Code: The code is provided in the "chinese_ip"

101 Dec 16, 2022

A3C LSTM Atari with Pytorch plus A3G design

NEWLY ADDED A3G A NEW GPU/CPU ARCHITECTURE OF A3C FOR SUBSTANTIALLY ACCELERATED TRAINING!! RL A3C Pytorch NEWLY ADDED A3G!! New implementation of A3C

532 Jan 02, 2023

TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels.

AutoDSP TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels. About Adaptive filtering algorithms are commonplace in sign

48 Sep 19, 2022

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

The Equalization Losses for Long-tailed Object Detection and Instance Segmentation This repo is official implementation CVPR 2021 paper: Equalization

129 Dec 16, 2022

This is the code of using DQN to play Sekiro .

Update for using DQN to play sekiro 2021.2.2（English Version） This is the code of using DQN to play Sekiro . I am very glad to tell that I have writen

144 Dec 25, 2022

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Wav2CLIP 🚧 WIP 🚧 Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP 📄 🔗 Ho-Hsiang Wu, Prem Seetharaman

240 Dec 13, 2022

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

1 Feb 03, 2022

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

TransNAS-Bench-101 This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizabili

17 Nov 20, 2022

Rasterize with the least efforts for researchers.

utils3d Rasterize and do image-based 3D transforms with the least efforts for researchers. Based on numpy and OpenGL. It could be helpful when you wan

8 Dec 15, 2022

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

TFLite-HITNET-Stereo-depth-estimation Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite. Stereo depth e

22 Oct 20, 2022

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

README The code is based on the ILswiss. To run the code, use python run_experiment.py --nosrun -e your YAML file -g gpu id Generally, run_experim

12 Mar 19, 2022

Multilingual Image Captioning

Multilingual Image Captioning Authors: Bhavitvya Malik, Gunjan Chhablani Demo Link: https://huggingface.co/spaces/flax-community/multilingual-image-ca

32 Nov 25, 2022

Weakly Supervised Segmentation with Tensorflow. Implements instance segmentation as described in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).

Weakly Supervised Segmentation with TensorFlow This repo contains a TensorFlow implementation of weakly supervised instance segmentation as described

220 Dec 13, 2022

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Aspect Sentiment Quad Prediction (ASQP) This repo contains the annotated data and code for our paper Aspect Sentiment Quad Prediction as Paraphrase Ge

39 Dec 11, 2022

Realtime micro-expression recognition using OpenCV and PyTorch

Micro-expression Recognition Realtime micro-expression recognition from scratch using OpenCV and PyTorch Try it out with a webcam or video using the e

35 Dec 05, 2022

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Fully Convolutional Networks for Semantic Segmentation This is the reference implementation of the models and code for the fully convolutional network

3.2k Jan 08, 2023

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Related tags

Overview

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Installation

Train

Test

Data preparation

Citation

Owner

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

A3C LSTM Atari with Pytorch plus A3G design

TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels.

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

This is the code of using DQN to play Sekiro .

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Rasterize with the least efforts for researchers.

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

Multilingual Image Captioning

Weakly Supervised Segmentation with Tensorflow. Implements instance segmentation as described in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Realtime micro-expression recognition using OpenCV and PyTorch

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.

🇰🇷 Text to Image in Korean

Federated learning on graph, especially on graph neural networks (GNNs), knowledge graph, and private GNN.

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Related tags

Overview

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Installation

Train

Test

Data preparation

Citation

Owner

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

A3C LSTM Atari with Pytorch plus A3G design

TLDR; Train custom adaptive filter optimizers without hand tuning or extra labels.

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

This is the code of using DQN to play Sekiro .

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Rasterize with the least efforts for researchers.

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

Multilingual Image Captioning

Weakly Supervised Segmentation with Tensorflow. Implements instance segmentation as described in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Realtime micro-expression recognition using OpenCV and PyTorch

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.

🇰🇷 Text to Image in Korean

Federated learning on graph, especially on graph neural networks (GNNs), knowledge graph, and private GNN.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.