Contextual Attention Localization for Offline Handwritten Text Recognition

Last update: Feb 17, 2022

Overview

CALText

This repository contains the source code for the CALText model introduced in the paper "CALText: Contextual Attention Localization for Offline Handwritten Text". The details of the model are presented in: (Add paper link)

Samples of the datasets that were used to train and test the model can be found at: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

The code in this repository is based on the following work:

https://github.com/JianshuZhang/WAP

https://github.com/wwjwhen/Watch-Attend-and-Parse-tensorflow-version

Requirements

Python 3

TensorFlow v1.6

Usage

Upload the data files to your Colab account and create pickle files (train, valid, and test images and labels) from the dataset. You can place the pickle files in any folder, but you must then update the path settings in the code where the data is loaded.

Run "makepickle.ipynb" to create the pickle files for the train and test data. Then split the train pickle file into train and valid pickle files, using the last 907 images and labels of the train set as the valid set.
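The split described above can be sketched as follows. This is a minimal sketch, not the repository's code: it assumes the train pickle holds a pair (images, labels) of equal-length sequences, and the names `split_train_valid`, `split_pickle_file`, and all file paths are hypothetical. Adjust it to the structure that "makepickle.ipynb" actually writes.

```python
import pickle

N_VALID = 907  # number of trailing samples used as the valid set


def split_train_valid(images, labels, n_valid=N_VALID):
    """Split parallel sequences so the last n_valid samples become the valid set."""
    if len(images) != len(labels):
        raise ValueError("images and labels must have the same length")
    if not 0 < n_valid < len(images):
        raise ValueError("n_valid must be between 1 and len(images) - 1")
    train = (images[:-n_valid], labels[:-n_valid])
    valid = (images[-n_valid:], labels[-n_valid:])
    return train, valid


def split_pickle_file(train_path, new_train_path, valid_path, n_valid=N_VALID):
    """Read an (images, labels) pickle and write separate train and valid pickles."""
    with open(train_path, "rb") as f:
        images, labels = pickle.load(f)  # assumed layout: (images, labels)
    train, valid = split_train_valid(images, labels, n_valid)
    with open(new_train_path, "wb") as f:
        pickle.dump(train, f)
    with open(valid_path, "wb") as f:
        pickle.dump(valid, f)
```

Slicing from the end keeps the original sample order, so the valid set is exactly the last 907 image and label pairs and the train set is everything before them.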

For training, set mode="train" and run "CALText.ipynb".

For testing, set mode="test" and run "CALText.ipynb".

For Contextual Attention, set alpha_reg=0 for both training and testing.

For Contextual Attention Localization, set alpha_reg=1 for both training and testing.
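The four combinations of these two settings can be summarised in a small sketch. Only the names `mode` and `alpha_reg` and their values come from the instructions above; the helper `describe_run` is hypothetical and is not part of "CALText.ipynb".

```python
def describe_run(mode, alpha_reg):
    """Return a short description of the run selected by the two settings."""
    if mode not in ("train", "test"):
        raise ValueError('mode must be "train" or "test"')
    if alpha_reg not in (0, 1):
        raise ValueError("alpha_reg must be 0 or 1")
    if alpha_reg == 1:
        model = "Contextual Attention Localization"
    else:
        model = "Contextual Attention"
    action = "training" if mode == "train" else "testing"
    return f"{action} with {model}"


# Example: the values one would set in the notebook before running it.
mode = "train"
alpha_reg = 1
print(describe_run(mode, alpha_reg))
```

Checking the two values up front catches a mistyped setting before a long training run starts.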

Run on Python Compiler

To run the code with a local Python interpreter, copy the code from the notebooks into files named "makepickle.py" and "CALText.py". Use the following commands to run them.

python makepickle.py

python CALText.py

Run on Google Colab

Open the "makepickle.ipynb" and "CALText.ipynb" notebooks in Google Colab and run them.

Run the "%tensorflow_version 1.x" command in the Colab notebook before running "CALText.ipynb".

Change runtime to GPU or TPU for better performance.

Add these lines to the notebook to access data from Google Drive:

from google.colab import drive

drive.mount("/gdrive", force_remount=True)
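Once the drive is mounted, the pickle files can be read from a folder under the mount point. The sketch below only builds the paths. The folder name `CALText_data`, the file name `train.pkl`, and the helper `data_path` are hypothetical, and the `MyDrive` component is an assumption about how Colab exposes the Drive root under the mount point.

```python
import os

MOUNT_POINT = "/gdrive"  # same mount point as in drive.mount("/gdrive", ...)
# Hypothetical data folder inside the mounted drive; change to your own folder.
DATA_DIR = os.path.join(MOUNT_POINT, "MyDrive", "CALText_data")


def data_path(filename, data_dir=DATA_DIR):
    """Build the full path of a dataset file inside the data folder."""
    return os.path.join(data_dir, filename)


train_pickle = data_path("train.pkl")  # hypothetical file name
```

Keeping the folder in one variable means the path settings mentioned under Usage only need to change in one place.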

References

PUCIT Offline Handwritten Urdu Lines (PUCIT-OHUL) Dataset: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

Previous Work:

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/index.html

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/ICFHR2020_manuscript.pdf
