Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Last update: Jun 27, 2022

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

Analyzing complex scenes with DNN is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. We propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only.

We also introduce an Occlusion Challenge dataset generated from real-world segmented objects with accurate annotations and propose a taxonomy of occlusion scenarios that pose a particular challenge for computer vision.

NOTICE

dataset links and model will be released in a few days. Update: 18 June

Requirments

The code uses Python 3.6 and it is tested on PyTorch GPU version 1.2, with CUDA-10.0 and cuDNN-7.5.

Installation

Clone the repository with:

git clone https://github.com/XD7479/Multi-Object-Occlusion.git
cd Multi-Object-Occlusion

Install requirments:

pip install -r requirements.txt

Datasets

Download the KINS dataset here and the Occlusion Challenge dataset here.
Enter the project folder and make links for the datasets:

ln -s  kins
ln -s  occ_challenge

Download the pre-trained model here.
Make links for the pre-trained model:

ln -s  models

Check the configuration file configs.py for the dataset and backbone you're using:

dataset_eval = 'occ_challenge'      # kins, occ_challenge
nn_type = 'resnext'             # vgg, resnext

Run the evaluation code with:

python3 eval_meanIoU.py

Segmentation Demo

Citation

@misc{yuan2021robust,
      title={Robust Instance Segmentation through Reasoning about Multi-Object Occlusion}, 
      author={Xiaoding Yuan and Adam Kortylewski and Yihong Sun and Alan Yuille},
      booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
      month = jun,
      year = {2021},
      month_numeric = {6}
}

Contact

If you have any questions you can contact Xiaoding Yuan by [email protected].

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

NOTICE

Requirments

Installation

Datasets

Segmentation Demo

Citation

Contact

Owner

Irene Yuan

Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Pytorch Lightning Distributed Accelerators using Ray

A denoising diffusion probabilistic model (DDPM) tailored for conditional generation of protein distograms

Official PyTorch Implementation of paper "NeLF: Neural Light-transport Field for Single Portrait View Synthesis and Relighting", EGSR 2021.

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

Official Pytorch Implementation of 3DV2021 paper: SAFA: Structure Aware Face Animation.

Tensorflow implementation of DeepLabv2

This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

Fast Soft Color Segmentation

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

Localizing Visual Sounds the Hard Way

Predict and time series avocado hass

A basic duplicate image detection service using perceptual image hash functions and nearest neighbor search, implemented using faiss, fastapi, and imagehash

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

This project is the PyTorch implementation of our CVPR 2022 paper:

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.