[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Last update: Nov 25, 2022

Overview

This repository contains the source code for the paper Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation.

Abstract

We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks. The key insights of our method are two-fold. First, the dense correspondence fields estimated by semantic matching provide supervision for object co-segmentation by enforcing consistency between the masks predicted for a pair of images. Second, the object masks predicted by object co-segmentation in turn reduce the adverse effects of background clutter and thereby improve semantic matching. Our model is end-to-end trainable and does not require supervision from manually annotated correspondences or object masks. We validate the efficacy of our approach on five benchmark datasets: TSS, Internet, PF-PASCAL, PF-WILLOW, and SPair-71k, and show that our algorithm performs favorably against state-of-the-art methods on both semantic matching and object co-segmentation.
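The first insight, mask consistency under a dense correspondence field, can be illustrated with a small dependency-free sketch. This is not the code in this repository: the function names, the nearest-pixel correspondence format, and the mean-squared-error loss are assumptions chosen for illustration only.

```python
def warp_mask(target_mask, correspondence):
    """Pull target-mask values back into the source frame.

    correspondence[y][x] = (ty, tx) is the target pixel matched to the
    source pixel (y, x). This nearest-pixel format is an assumption.
    """
    height = len(correspondence)
    width = len(correspondence[0])
    warped = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            ty, tx = correspondence[y][x]
            # Matches that fall outside the target image count as background.
            if 0 <= ty < len(target_mask) and 0 <= tx < len(target_mask[0]):
                warped[y][x] = target_mask[ty][tx]
    return warped


def mask_consistency_loss(source_mask, target_mask, correspondence):
    """Mean squared difference between the source mask and the warped target mask."""
    warped = warp_mask(target_mask, correspondence)
    total = 0.0
    count = 0
    for source_row, warped_row in zip(source_mask, warped):
        for s, w in zip(source_row, warped_row):
            total += (s - w) ** 2
            count += 1
    return total / count


# Toy example: the object fills the left column of the source image and the
# right column of the target image, so the correct correspondence flips columns.
source_mask = [[1.0, 0.0], [1.0, 0.0]]
target_mask = [[0.0, 1.0], [0.0, 1.0]]
flip = [[(y, 1 - x) for x in range(2)] for y in range(2)]
identity = [[(y, x) for x in range(2)] for y in range(2)]

consistent_loss = mask_consistency_loss(source_mask, target_mask, flip)
inconsistent_loss = mask_consistency_loss(source_mask, target_mask, identity)
print(consistent_loss, inconsistent_loss)
```

With the correct correspondence the two masks agree after warping and the loss vanishes; with a wrong correspondence the loss is large. A loss of this kind is what lets matching supervise segmentation without annotated masks.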

Citation

If you find our code useful, please consider citing our work using the following BibTeX entries:

@article{MaCoSNet,
    title={Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation},
    author={Chen, Yun-Chun and Lin, Yen-Yu and Yang, Ming-Hsuan and Huang, Jia-Bin},
    journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
    year={2020}
}

@inproceedings{WeakMatchNet,
    title={Deep Semantic Matching with Foreground Detection and Cycle-Consistency},
    author={Chen, Yun-Chun and Huang, Po-Hsiang and Yu, Li-Yu and Huang, Jia-Bin and Yang, Ming-Hsuan and Lin, Yen-Yu},
    booktitle={Asian Conference on Computer Vision (ACCV)},
    year={2018}
}

Environment

Install Anaconda with Python 3.7.
This code was tested on an NVIDIA V100 GPU with 16 GB of memory.

pip install -r requirements.txt

Dataset

Download the PF-PASCAL, PF-WILLOW, SPair-71k, TSS, and Internet datasets.
Set the DATASET_DIR variable in config.py.
Set the CSV_DIR variable in config.py.
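Before training, it can help to confirm that both variables point to existing directories. The sketch below is not part of this repository: the paths are placeholders, the helper missing_dirs is hypothetical, and it assumes that DATASET_DIR and CSV_DIR are plain directory paths.

```python
import os

# Placeholder values (assumptions): replace them with the locations of the
# downloaded datasets and of the CSV files on your machine.
DATASET_DIR = "/path/to/datasets"
CSV_DIR = "/path/to/csv"


def missing_dirs(paths):
    """Return the names whose path is not an existing directory."""
    return [name for name, path in paths.items() if not os.path.isdir(path)]


missing = missing_dirs({"DATASET_DIR": DATASET_DIR, "CSV_DIR": CSV_DIR})
for name in missing:
    print(f"{name} does not point to an existing directory")
```

Running a check like this once after editing config.py catches a mistyped path before a training run starts.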

Training

Select the training dataset by changing the $DATASET variable in train.sh.
Adjust the $BATCH_SIZE variable in train.sh to fit your GPU memory.
The trained model is saved in the trained_models folder.

sh train.sh
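One rough way to pick a starting value for $BATCH_SIZE is to estimate how many samples fit in GPU memory. The helper below is a hypothetical sketch, not part of this repository. It assumes memory use grows linearly with batch size, and the per-sample and overhead figures in the example are made-up numbers that you would need to measure on your own hardware.

```python
def largest_batch_size(gpu_memory_gb, per_sample_gb, fixed_overhead_gb=0.0):
    """Largest power-of-two batch size that fits a linear memory model.

    Assumed model: memory = fixed_overhead_gb + batch_size * per_sample_gb.
    Returns 0 if not even a single sample fits.
    """
    if per_sample_gb <= 0:
        raise ValueError("per_sample_gb must be positive")
    usable = gpu_memory_gb - fixed_overhead_gb
    if usable < per_sample_gb:
        return 0
    batch = 1
    # Keep doubling while the doubled batch still fits in the usable memory.
    while batch * 2 * per_sample_gb <= usable:
        batch *= 2
    return batch


# Illustrative numbers only: a 16 GB GPU, 2 GB of fixed overhead (weights,
# framework buffers) and 0.9 GB per sample.
print(largest_batch_size(16, 0.9, fixed_overhead_gb=2.0))
```

Treat the result as an initial guess: if training runs out of memory, halve the value in train.sh and try again.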

Evaluation

Select the evaluation dataset by changing the $DATASET variable in eval.sh.
Adjust the $BATCH_SIZE variable in eval.sh to fit your GPU memory.

sh eval.sh

Acknowledgement

This code borrows heavily from the code of Rocco et al.

Owner

Yun-Chun Chen
