SCNet: Learning Semantic Correspondence

SCNet Code

Region matching code is contributed by Kai Han.

Dense matching code is contributed by Rafael S. Rezende.

This code is written in MATLAB and implements SCNet [1]. For the dataset, see our project page: http://www.di.ens.fr/willow/research/scnet.

Install Dependencies

Code

SCNet_Matconvnet

Additional MatConvNet modules implemented for SCNet. These files should be copied into the matconvnet/matlab/ folder.
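The copy step above can be sketched in MATLAB as follows. This is only a minimal sketch: the two folder locations are assumptions (both repositories checked out side by side in the current directory), and it assumes the contents of SCNet_Matconvnet are meant to be merged directly into matconvnet/matlab/. Adjust the paths to your own layout.

```matlab
% Assumed locations (hypothetical); change these to match your checkout.
scnetModules  = fullfile(pwd, 'SCNet_Matconvnet');
matconvnetDir = fullfile(pwd, 'matconvnet');
targetDir     = fullfile(matconvnetDir, 'matlab');

% Fail early with a readable message if either folder is missing.
assert(isfolder(scnetModules), 'Folder not found: %s', scnetModules);
assert(isfolder(targetDir),    'Folder not found: %s', targetDir);

% Copy the contents of SCNet_Matconvnet into matconvnet/matlab/.
[ok, msg] = copyfile(scnetModules, targetDir);
assert(ok, 'Copy failed: %s', msg);

% Add MatConvNet (including the copied modules) to the MATLAB path.
run(fullfile(targetDir, 'vl_setupnn.m'));
```

If the copied modules include files that need compiling, MatConvNet may have to be rebuilt afterwards (for example with its `vl_compilenn` script); whether that is required here is not stated in this README.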

SCNet

This is the primary network training and testing code.

  • SCNet_A_init.m, SCNet_AG_init.m, SCNet_AGplus_init.m: initialize SCNet_A, SCNet_AG, and SCNet_AG+, respectively.

  • SCNet_A.m, SCNet_AG.m, SCNet_AGplus.m: train SCNet_A, SCNet_AG, and SCNet_AG+, respectively.

  • eva_PCR_mIoU_SCNet_A.m, eva_PCR_mIoU_SCNet_AG.m, eva_PCR_mIoU_SCNet_AGplus.m: evaluate the trained nets.

  • eva_PCR_mIoU_ImageNet_SCNet_A.m, eva_PCR_mIoU_ImageNet_SCNet_AG.m, eva_PCR_mIoU_ImageNet_SCNet_AGplus.m: evaluate the SCNets with ImageNet-pretrained parameters, i.e., SCNets without any further training.

SCNet_Baselines

Code for comparing our learned SCNet features with HOG features, using the NAM, PHM, and LOM matching algorithms from Proposal Flow [2, 3].

  • NAM_HOG_eva.m, PHM_HOG_eva.m, LOM_HOG_eva.m: evaluate NAM, PHM, and LOM with HOG features.

  • NAM_SCNet_eva.m, PHM_SCNet_eva.m, LOM_SCNet_eva.m: evaluate NAM, PHM, and LOM with learned SCNet features.

  • HOG_SCNet_AG_eva.m: replace the learned SCNet features with HOG features in the SCNet_AG model.

Data

We used the PF-PASCAL, PF-WILLOW, PASCAL Parts, and CUB datasets, and followed Proposal Flow [2, 3] to generate our training data.

The training data preparation code is in the PF-PASCAL-code folder.

Notes

  • The code is provided for academic use only. Use of the code in any commercial or industry-related activities is prohibited.
  • If you use our code or dataset, please cite the paper.
@InProceedings{han2017scnet,
author = {Kai Han and Rafael S. Rezende and Bumsub Ham and Kwan-Yee K. Wong and Minsu Cho and Cordelia Schmid and Jean Ponce},
title = {SCNet: Learning Semantic Correspondence},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2017}
}

References

[1] Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce, "SCNet: Learning Semantic Correspondence", International Conference on Computer Vision (ICCV), 2017.

[2] Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce, "Proposal Flow: Semantic Correspondences from Object Proposals", IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 2017.

[3] Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce, "Proposal Flow", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

Owner
Kai Han
Visual Geometry Group (VGG)