Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Last update: Dec 29, 2022

Overview

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations

The code of:

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, Jiwoon Ahn, Sunghyun Cho, and Suha Kwak, CVPR 2019 [Paper]

This repository contains a framework for learning instance segmentation with image-level class labels as supervision. The key component of our approach is Inter-pixel Relation Network (IRNet) that estimates two types of information: a displacement vector field and a class boundary map, both of which are in turn used to generate pseudo instance masks from CAMs.

Citation

If you find the code useful, please consider citing our paper using the following BibTeX entry.

@InProceedings{Ahn_2019_CVPR,
author = {Ahn, Jiwoon and Cho, Sunghyun and Kwak, Suha},
title = {Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}

Prerequisite

Python 3.7, PyTorch 1.1.0, and more in requirements.txt
PASCAL VOC 2012 devkit
NVIDIA GPU with more than 1024MB of memory

Usage

Install python dependencies

pip install -r requirements.txt

Download PASCAL VOC 2012 devkit

Follow instructions in http://host.robots.ox.ac.uk/pascal/VOC/voc2012/#devkit

Run run_sample.py or make your own script

python run_sample.py

You can either mannually edit the file, or specify commandline arguments.

Train Mask R-CNN or DeepLab with the generated pseudo labels

For the reports, we used Detectron.
- Run step/make_cocoann.py to create COCO-style annotations.
- Note: Do not employ https://storage.googleapis.com/coco-dataset/external/PASCAL_VOC.zip to measure the performance of the Mask R-CNN! It only contains bounding box annotations.
TorchVision now supports Mask R-CNN and DeepLab. I personally recommend to use this.

TO DO

Training code for MS-COCO
Code refactoring
IRNet v2

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Related tags

Overview

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations

Citation

Prerequisite

Usage

Install python dependencies

Download PASCAL VOC 2012 devkit

Run run_sample.py or make your own script

Train Mask R-CNN or DeepLab with the generated pseudo labels

TO DO

Owner

Jiwoon Ahn

The implementation of 'Image synthesis via semantic composition'.

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

Contains code for Deep Kernelized Dense Geometric Matching

A full-fledged version of Pix2Seq

FLVIS: Feedback Loop Based Visual Initial SLAM

Arxiv harvester - Poor man's simple harvester for arXiv resources

Full Stack Deep Learning Labs

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Stream images from a connected camera over MQTT, view using Streamlit, record to file and sqlite

Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images

Build tensorflow keras model pipelines in a single line of code. Created by Ram Seshadri. Collaborators welcome. Permission granted upon request.

Using Tensorflow Object Detection API to detect Waymo open dataset

Adaptation through prediction: multisensory active inference torque control

⚓ Eurybia monitor model drift over time and securize model deployment with data validation

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

Custom studies about block sparse attention.

Heterogeneous Temporal Graph Neural Network

Convnet transfer - Code for paper How transferable are features in deep neural networks?

Finetune SSL models for MOS prediction