Learning Camera Localization via Dense Scene Matching, CVPR2021

Last update: Dec 01, 2022

Related tags

Overview

This repository contains code of our CVPR 2021 paper - "Learning Camera Localization via Dense Scene Matching" by Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu and Ping Tan.

This paper presents a new method for scene agnostic camera localization using dense scene matching (DSM), where a cost volume is constructed between a query image and a scene. The cost volume and the corresponding coordinates are processed by a CNN to predict dense coordinates. Camera poses can then be solved by PnP algorithms.

If you find this project useful, please cite:

@inproceedings{Tang2021Learning,
  title={Learning Camera Localization via Dense Scene Matching},
  author={Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu and Ping Tan},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Usage

Environment

The codes are tested along with
- pytorch=1.4.0
- lmdb (optional)
- yaml
- skimage
- opencv
- numpy=1.17
- tensorboard

Installation

Build PyTorch operations

  cd libs/model/ops
  python setup.py install

Build PnP algorithm

  cd libs/utils/lm_pnp
  mkdir build
  cd build
  cmake ..
  make all

Train and Test

Download

You can download the trained models and label files for 7scenes, Cambridge, Scannet.

For 7scenes, you can use the prepared data in the following.

Chess Fire Heads Office Pumpkin Kitchen Stairs

For Cambridge landmarks, you can download image files here, and depths here.
Test

Please refer to configs/7scenes.yaml for detailed explaination of how to set label file path and image file path.
- 7scenes
```
python tools/video_test.py --config configs/7scenes.yaml
```
- Camrbrige
```
python tools/video_test.py --config configs/cambridge.yaml
```
Train

We use ResNet-FPN pretrained model.
```
  python tools/train_net.py
```

Learning Camera Localization via Dense Scene Matching, CVPR2021

Related tags

Overview

Usage

Environment

Installation

Train and Test

Owner

tangshitao

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

Python package for handwriting and sketching in Jupyter cells

PyQT5 app that colorize black & white pictures using CNN(use pre-trained model which was made with OpenCV)

CellProfiler is a open-source application for biological image analysis

A novel region proposal network for more general object detection ( including scene text detection ).

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

一款基于Qt与OpenCV的仿真数字示波器

天池2021"全球人工智能技术创新大赛"【赛道一】：医学影像报告异常检测 - 第三名解决方案

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

3点クリックで円を指定し、極座標変換を行うサンプルプログラム

Rest API Written In Python To Classify NSFW Images.

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

Handwritten_Text_Recognition

1st place solution for SIIM-FISABIO-RSNA COVID-19 Detection Challenge

Optical character recognition for Japanese text, with the main focus being Japanese manga

OCR software for recognition of handwritten text

Volume Control using OpenCV

Image processing using OpenCv

Official code for :rocket: Unsupervised Change Detection of Extreme Events Using ML On-Board :rocket: