RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Last update: Dec 15, 2022

Overview

RINDNet

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
Mengyang Pu, Yaping Huang, Qingji Guan and Haibin Ling
ICCV 2021 (oral)

Please refer to supplementary material (code:p86d) (~60M) for more results.

Benchmark --- 🔥 🔥 BSDS-RIND 🔥 🔥

BSDS-RIND is the first public benchmark that dedicated to studying simultaneously the four edge types, namely Reflectance Edge (RE), Illumination Edge (IE), Normal Edge (NE) and Depth Edge (DE). It is created by carefully labeling images from the BSDS500. The datasets can be downloaded from:

Original images: BSDS500
Our annotations: BSDS-RIND (BaiDuNetdisk, code:e7rg ; GoogleDrive)

Abstract

As a fundamental building block in computer vision, edges can be categorised into four types according to the discontinuity in surface-Reflectance, Illumination, surface-Normal or Depth. While great progress has been made in detecting generic or individual types of edges, it remains under-explored to comprehensively study all four edge types together. In this paper, we propose a novel neural network solution, RINDNet, to jointly detect all four types of edges. Taking into consideration the distinct attributes of each type of edges and the relationship between them, RINDNet learns effective representations for each of them and works in three stages. In stage I, RINDNet uses a common backbone to extract features shared by all edges. Then in stage II it branches to prepare discriminative features for each edge type by the corresponding decoder. In stage III, an independent decision head for each type aggregates the features from previous stages to predict the initial results. Additionally, an attention module learns attention maps for all types to capture the underlying relations between them, and these maps are combined with initial results to generate the final edge detection results. For training and evaluation, we construct the first public benchmark, BSDS-RIND, with all four types of edges carefully annotated. In our experiments, RINDNet yields promising results in comparison with state-of-the-art methods.

Code and Main results ----- Coming Soon...

Acknowledgments

The work is partially done while Mengyang was at Stony Brook University.
We thank the anonymous reviewers for valuable and inspiring comments and suggestions.

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Related tags

Overview

RINDNet

Benchmark --- 🔥 🔥 BSDS-RIND 🔥 🔥

Abstract

Code and Main results ----- Coming Soon...

Acknowledgments

Owner

Mengyang Pu

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

Concept drift monitoring for HA model servers.

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Unimodal Face Classification with Multimodal Training

Message Passing on Cell Complexes

Contains supplementary materials for reproduce results in HMC divergence time estimation manuscript

Implementation for Learning to Track with Object Permanence

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

HyperCube: Implicit Field Representations of Voxelized 3D Models

ADB-IP-ROTATION - Use your mobile phone to gain a temporary IP address using ADB and data tethering

Attention over nodes in Graph Neural Networks using PyTorch (NeurIPS 2019)

This project is the PyTorch implementation of our CVPR 2022 paper:

Rax is a Learning-to-Rank library written in JAX

This is an official pytorch implementation of Fast Fourier Convolution.

Updated for TTS(CE) = Also Known as TTN V3. The code requires the first server to be 'ttn' protocol.

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

PyTorch package for the discrete VAE used for DALL·E.

RMNA: A Neighbor Aggregation-Based Knowledge Graph Representation Learning Model Using Rule Mining

Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator