Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Last update: Dec 30, 2022

Related tags

Computer Vision TableNet

Overview

TableNet

Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images.

Paper

Overview

Paper: TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

TableNet is a modern deep learning architecture that was proposed by a team from TCS Research year in the year 2019. The main motivation was to extract information from scanned tables through mobile phones or cameras.

They proposed a solution that includes accurate detection of the tabular region within an image and subsequently detecting and extracting information from the rows and columns of the detected table.

Architecture: The architecture is based out of Long et al., an encoder-decoder model for semantic segmentation. The same encoder/decoder network is used as the FCN architecture for table extraction. The images are preprocessed and modified using the Tesseract OCR.

Source: Nanonets

How to run

pip install -r requirements.txt

Download the Marmot Dataset from the link given in readme.
Run data_preprocess/generate_mask.py to generate Table and Column Mask of corresponding images.
Follow the TableNet.ipynb notebook to train and test the model.

Challenges

Require a very decent System with a good GPU for accurate result on High pixel images.

Dataset

Download the dataset provided in paper : Marmot Dataset.

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Related tags

Overview

TableNet

Overview

How to run

Challenges

Dataset

Owner

Jainam Shah

Tesseract Open Source OCR Engine (main repository)

MONAI Label is a server-client system that facilitates interactive medical image annotation by using AI.

This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.

This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.

color detection using python

A Python wrapper for the tesseract-ocr API

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Image Detector and Convertor App created using python's Pillow, OpenCV, cvlib, numpy and streamlit packages.

A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

Fast style transfer

Library used to deskew a scanned document

原神风花节自动弹琴辅助

CNN+LSTM+CTC based OCR implemented using tensorflow.

Hand Detection and Finger Detection on Live Feed

OCR engine for all the languages

Camera Intrinsic Calibration and Hand-Eye Calibration in Pybullet

Deep learning based page layout analysis

3点クリックで円を指定し、極座標変換を行うサンプルプログラム

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"