An implementation of the algorithm in the paper "IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection"

Last update: Dec 12, 2022

Overview

InceptText-Tensorflow

Introduction

Requires TensorFlow 1.4.0.

Preparation

1. gcc 4.9

2. CUDA 8.0

3. cd lib && make

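Errors you may encounter:

Fix: add the CUDA path to the system environment variables, then change the include to #include <cuda.h>

Fix: find the absolute path of nsync_cv.h and include it by that path

Fix: find the absolute path of nsync_mu.h and include it by that path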
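
For the nsync_cv.h and nsync_mu.h fixes, the absolute path of each header has to be located first. The following is a minimal sketch of one way to do that, not part of this repository: find_header is a hypothetical helper, and the directory to search (typically somewhere under the TensorFlow installation's include directory) is an assumption you need to supply yourself.

```python
import os


def find_header(root, name):
    """Return the absolute paths of every file called `name` under `root`.

    Hypothetical helper for illustration only; it is not part of this repository.
    """
    matches = []
    # Walk the whole directory tree below `root` and collect exact file-name matches.
    for dirpath, _dirnames, filenames in os.walk(root):
        if name in filenames:
            matches.append(os.path.abspath(os.path.join(dirpath, name)))
    return sorted(matches)
```

Calling find_header(root, "nsync_cv.h") with root set to your TensorFlow include directory returns the candidate paths; an empty list means the header is not below that directory.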

Download

1. Models trained on ICDAR 2017

2. ResNet V1 50 provided by TensorFlow-Slim: ResNet-v1

Train

python train_main.py

Test

python test.py

Owner

GeorgeJoe

Focus on NLP and OCR

