A demo for end-to-end English and Chinese text spotting using ABCNet.

Last update: Oct 04, 2022

Related tags

Overview

ABCNet_Chinese

A demo for end-to-end English and Chinese text spotting using ABCNet. This is an old model that was trained a long ago, which serves as a base setting for others to train their own model on Chinese or other language. Official ABCNet_v2 models will be updated in AdelaiDet.

Installation

Install detectron2 using the provided version (support visualizing Chinese text):

python -m pip install -e d2

Install this repo:

python setup.py build develop

If the above succeed, you can now run the demo using the provided model.

Model

This is our model that can be used for evaluation or pretraining.

wget https://drive.google.com/file/d/1iWX2n_BmyltVwQmfj8_oM9z7cJlq1P0m/view?usp=sharing -O model_chn.pth

Simply put the model in the root directory of the repo.

Demo

bash demo.sh

Example results

If you successfully run the demo, you will get the output below:

Other results (same project but not using the provide model):

Document-like Ancient words, e.g., “彝文”:

Cite

If you find this repo useful, please cite:

@article{liu2021abcnet,
  title={ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting},
  author={Liu, Yuliang and Shen, Chunhua and Jin, Lianwen and He, Tong and Chen, Peng and Liu, Chongyu and Chen, Hao},
  journal={arXiv preprint arXiv:2105.03620},
  year={2021}
}

Data

We provide the converted json files of ArT, LSVT, and ReCTS that we have used for training ABCNet_Chinese.

ReCTs [images&label](1.7G) [Origin_of_dataset]
LSVT [images&label](8.2G) [Origin_of_dataset]
ArT [images&label](1.5G) [Origin_of_dataset]
SynChinese130k [images&label](25G) [Origin_of_dataset]

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.

A demo for end-to-end English and Chinese text spotting using ABCNet.

Related tags

Overview

ABCNet_Chinese

Installation

Model

Demo

Example results

Cite

Data

License

Owner

Yuliang Liu

Creating a chess engine using GPT-3

Search with BERT vectors in Solr and Elasticsearch

ADCS - Automatic Defect Classification System (ADCS) for SSMC

The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Source code for AAAI20 "Generating Persona Consistent Dialogues by Exploiting Natural Language Inference".

Repository for the paper: VoiceMe: Personalized voice generation in TTS

In this repository we have tested 3 VQA models on the ImageCLEF-2019 dataset.

keras implement of transformers for humans

An ultra fast tiny model for lane detection, using onnx_parser, TensorRTAPI, torch2trt to accelerate. our model support for int8, dynamic input and profiling. (Nvidia-Alibaba-TensoRT-hackathon2021)

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Treemap visualisation of Maya scene files

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

A PyTorch implementation of the Transformer model in "Attention is All You Need".

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Py65 65816 - Add support for the 65C816 to py65

Rank-One Model Editing for Locating and Editing Factual Knowledge in GPT

Code for lyric-section-to-comment generation based on huggingface transformers.

To create a deep learning model which can explain the content of an image in the form of speech through caption generation with attention mechanism on Flickr8K dataset.

Code to reproduce the results of the paper 'Towards Realistic Few-Shot Relation Extraction' (EMNLP 2021)