Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Last update: Dec 07, 2022

Related tags

Overview

Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth

This codebase implements the loss function described in:

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth Davy Neven, Bert De Brabandere, Marc Proesmans, and Luc Van Gool Conference on Computer Vision and Pattern Recognition (CVPR), june 2019

Our network architecture is a multi-branched version of ERFNet and uses the Lovasz-hinge loss for maximizing the IoU of each instance.

License

This software is released under a creative commons license which allows for personal and research use only. For a commercial license please contact the authors. You can view a license summary here.

Getting started

This codebase showcases the proposed loss function on car instance segmentation using the Cityscapes dataset.

Prerequisites

Dependencies:

Pytorch 1.1
Python 3.6.8 (or higher)
Cityscapes + scripts (if you want to evaluate the model)

Training

Training consists out of 2 steps. We first train on 512x512 crops around each object, to avoid computation on background patches. Afterwards, we finetune on larger patches (1024x1024) to account for bigger objects and background features which are not present in the smaller crops.

To generate these crops do the following:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python utils/generate_crops.py

Afterwards start training:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python train.py

Different options can be modified in train_config.py, e.g. to visualize set display=True.

Testing

You can download a pretrained model here. Save this file in the src/pretrained_models/ or adapt the test_config.py file.

To test the model on the Cityscapes validation set run:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python test.py

The pretrained model gets 56.4 AP on the car validation set.

Acknowledgement

This work was supported by Toyota, and was carried out at the TRACE Lab at KU Leuven (Toyota Research on Automated Cars in Europe - Leuven)

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Related tags

Overview

Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth

License

Getting started

Prerequisites

Training

Testing

Acknowledgement

Owner

Full-featured Decision Trees and Random Forests learner.

A TensorFlow implementation of FCN-8s

links and status of cool gradio demos

Simulation of moving particles under microscopic imaging

[CVPR'21] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation

Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

DeepLearning Anomalies Detection with Bluetooth Sensor Data

MutualGuide is a compact object detector specially designed for embedded devices

JDet is Object Detection Framework based on Jittor.

Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding

Official repository of Semantic Image Matting

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)

FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face Extraction

An unofficial personal implementation of UM-Adapt, specifically to tackle joint estimation of panoptic segmentation and depth prediction for autonomous driving datasets.

Auto-Lama combines object detection and image inpainting to automate object removals

🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"