Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

Last update: Dec 29, 2022

Related tags

Overview

Pixel-Perfect Structure-from-Motion (ICCV 2021 Oral)

We introduce a framework that improves the accuracy of Structure-from-Motion by refining keypoints, camera poses, and 3D points using the direct alignment of deep features. It is presented in our paper:

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement
to appear at ICCV 2021
Authors: Philipp Lindenberger*, Paul-Edouard Sarlin*, Viktor Larsson, and Marc Pollefeys
Website: psarlin.com/pixsfm (videos, slides, poster)

This repository will host the code to run and evaluate our refinement. Please subscribe to this issue if you wish to be notified of the code release.

Abstract

Finding local features that are repeatable across multiple views is a cornerstone of sparse 3D reconstruction. The classical image matching paradigm detects keypoints per-image once and for all, which can yield poorly-localized features and propagate large errors to the final geometry. In this paper, we refine two key steps of structure-from-motion by a direct alignment of low-level image information from multiple views: we first adjust the initial keypoint locations prior to any geometric estimation, and subsequently refine points and camera poses as a post-processing. This refinement is robust to large detection noise and appearance changes, as it optimizes a featuremetric error based on dense features predicted by a neural network. This significantly improves the accuracy of camera poses and scene geometry for a wide range of keypoint detectors, challenging viewing conditions, and off-the-shelf deep features. Our system easily scales to large image collections, enabling pixel-perfect crowd-sourced localization at scale. Our code will be publicly available at as an add-on to the popular SfM software COLMAP.

BibTex Citation

Please consider citing our work if you use any code from this repo or ideas presented in the paper:

@inproceedings{lindenberger2021pixsfm,
  author    = {Philipp Lindenberger and
               Paul-Edouard Sarlin and
               Viktor Larsson and
               Marc Pollefeys},
  title     = {{Pixel-Perfect Structure-from-Motion with Featuremetric Refinement}},
  booktitle = {ICCV},
  year      = {2021},
}

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

Related tags

Overview

Pixel-Perfect Structure-from-Motion (ICCV 2021 Oral)

Abstract

BibTex Citation

Owner

Computer Vision and Geometry Lab

Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking"

Vision-Language Transformer and Query Generation for Referring Segmentation (ICCV 2021)

A free, multiplatform SDK for real-time facial motion capture using blendshapes, and rigid head pose in 3D space from any RGB camera, photo, or video.

Hcaptcha-challenger - Gracefully face hCaptcha challenge with Yolov5(ONNX) embedded solution

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.

A vanilla 3D face modeling on pose-invariant and multi-lightning image data

(JMLR'19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals, CVPR2021

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

MAU: A Motion-Aware Unit for Video Prediction and Beyond, NeurIPS2021

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Edison AT is software Depression Assistant personal.

Here we present the implementation in TensorFlow of our work about liver lesion segmentation accepted in the Machine Learning 4 Health Workshop

Implementation of Pooling by Sliced-Wasserstein Embedding (NeurIPS 2021)

This program can detect your face and add an Christams hat on the top of your head

Code for "My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack" paper

A CROSS-MODAL FUSION NETWORK BASED ON SELF-ATTENTION AND RESIDUAL STRUCTURE FOR MULTIMODAL EMOTION RECOGNITION

Mesh TensorFlow: Model Parallelism Made Easier