Bnagla hand written document digiiztion

Last update: Dec 10, 2021

Related tags

Overview

Bnagla hand written document digiiztion

This repo addresses the problem of digiizing hand written documents in Bangla. Documents have definite fields of specific information. We target this area and crop this region.

We only focus on extracting amount information (in currency) which is important in tax return. Our approach first select characters and separates numbers from non-number characters. The final classification results of each character are merged to get full amount.

Result

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT

Owner

Mushfiqur Rahman

Greater world Shorter time ....

GitHub Repository

A fast and easy implementation of Transformer with PyTorch.

FasySeq FasySeq is a shorthand as a Fast and easy sequential modeling toolkit. It aims to provide a seq2seq model to researchers and developers, which

7 Jul 18, 2022

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Build a Discord AI Chatbot that Speaks like Your Favorite Character! This is a Discord AI Chatbot that uses the Microsoft DialoGPT conversational mode

231 Dec 30, 2022

KR-FinBert And KR-FinBert-SC

KR-FinBert & KR-FinBert-SC Much progress has been made in the NLP (Natural Language Processing) field, with numerous studies showing that domain adapt

5 Jul 29, 2022

Faster, modernized fork of the language identification tool langid.py

py3langid py3langid is a fork of the standalone language identification tool langid.py by Marco Lui. Original license: BSD-2-Clause. Fork license: BSD

12 Nov 05, 2022

Code to reproduce the results of the paper 'Towards Realistic Few-Shot Relation Extraction' (EMNLP 2021)

Realistic Few-Shot Relation Extraction This repository contains code to reproduce the results in the paper "Towards Realistic Few-Shot Relation Extrac

8 Nov 09, 2022

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

42 Dec 13, 2022

To classify the News into Real/Fake using Features from the Text Content of the article

Hoax-Detector Authenticity of news has now become a major problem. The Idea is to classify the News into Real/Fake using Features from the Text Conten

1 Feb 09, 2022

Translate - a PyTorch Language Library

NOTE PyTorch Translate is now deprecated, please use fairseq instead. Translate - a PyTorch Language Library Translate is a library for machine transl

775 Dec 24, 2022

MRC approach for Aspect-based Sentiment Analysis (ABSA)

B-MRC MRC approach for Aspect-based Sentiment Analysis (ABSA) Paper: Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extracti

1 Apr 05, 2022

👑 spaCy building blocks and visualizers for Streamlit apps

spacy-streamlit: spaCy building blocks for Streamlit apps This package contains utilities for visualizing spaCy models and building interactive spaCy-

620 Dec 29, 2022

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

15k Jan 02, 2023

Bnagla hand written document digiiztion

Related tags

Overview

Bnagla hand written document digiiztion

Result

Contributing

License

Owner

Mushfiqur Rahman

A fast and easy implementation of Transformer with PyTorch.

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

KR-FinBert And KR-FinBert-SC

Faster, modernized fork of the language identification tool langid.py

Code to reproduce the results of the paper 'Towards Realistic Few-Shot Relation Extraction' (EMNLP 2021)

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

To classify the News into Real/Fake using Features from the Text Content of the article

Translate - a PyTorch Language Library

MRC approach for Aspect-based Sentiment Analysis (ABSA)

👑 spaCy building blocks and visualizers for Streamlit apps

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

CMeEE 数据集医学实体抽取

Longformer: The Long-Document Transformer

Precision Medicine Knowledge Graph (PrimeKG)

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Gold standard corpus annotated with verb-preverb connections for Hungarian.

✨Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects.

Summarization module based on KoBART

This repository is home to the Optimus data transformation plugins for various data processing needs.

Machine translation models released by the Gourmet project