Towards Interpretable Deep Metric Learning with Structural Matching

Last update: Nov 11, 2022

Overview

DIML

Created by Wenliang Zhao*, Yongming Rao*, Ziyi Wang, Jiwen Lu, Jie Zhou

This repository contains the PyTorch implementation of the paper Towards Interpretable Deep Metric Learning with Structural Matching (ICCV 2021).

We present deep interpretable metric learning (DIML), which adopts a structural matching strategy that explicitly aligns the spatial embeddings by computing an optimal matching flow between the feature maps of two images. Our method enables deep models to learn metrics in a more human-friendly way: the similarity of two images can be decomposed into several part-wise similarities and their contributions to the overall similarity. Our method is model-agnostic and can be applied to off-the-shelf backbone networks and metric learning methods.
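To make the idea of a matching flow concrete, here is a minimal NumPy sketch. It is not the code in this repository. It assumes the flow is obtained by entropy-regularised optimal transport (Sinkhorn iterations) with uniform marginals and a cosine cost; the function name `structural_similarity` and the parameters `eps` and `n_iters` are illustrative choices, not names taken from the project.

```python
import numpy as np


def structural_similarity(feat_a, feat_b, eps=0.05, n_iters=100):
    """Decompose the similarity of two feature maps into part-wise terms.

    feat_a: (Na, D) array, one D-dim embedding per spatial location of image A.
    feat_b: (Nb, D) array, same for image B.
    Returns (score, flow, contributions).
    """
    # L2-normalise so that dot products are cosine similarities.
    a = feat_a / np.linalg.norm(feat_a, axis=1, keepdims=True)
    b = feat_b / np.linalg.norm(feat_b, axis=1, keepdims=True)
    sim = a @ b.T            # (Na, Nb) part-wise similarities
    cost = 1.0 - sim         # low cost where parts look alike

    # Assumption: every spatial location carries the same mass.
    mu = np.full(a.shape[0], 1.0 / a.shape[0])
    nu = np.full(b.shape[0], 1.0 / b.shape[0])

    # Sinkhorn iterations: alternately rescale columns and rows of the kernel.
    kernel = np.exp(-cost / eps)
    u = np.ones_like(mu)
    for _ in range(n_iters):
        v = nu / (kernel.T @ u)
        u = mu / (kernel @ v)

    flow = u[:, None] * kernel * v[None, :]   # matching flow, sums to 1
    contributions = flow * sim                # per-pair share of the score
    return float(contributions.sum()), flow, contributions
```

Each entry of `contributions` is one part-to-part term of the overall score, so the pairs with the largest entries show which regions of the two images drive the similarity.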

[arXiv]

Usage

Requirement

python3
PyTorch 1.7

Dataset Preparation

Please follow the instructions in RevisitDML to download the datasets and put them all in the data folder. The structure should be:

data
├── cars196
│   └── images
├── cub200
│   └── images
└── online_products
    ├── images
    └── Info_Files
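Before launching a run, it can help to confirm that the layout matches the tree above. The following is a small sketch of such a check; the helper name `missing_dataset_dirs` is hypothetical and not part of this repository, and the directory list is copied from the tree.

```python
from pathlib import Path

# Expected sub-directories, copied from the tree above.
EXPECTED_DIRS = [
    "cars196/images",
    "cub200/images",
    "online_products/images",
    "online_products/Info_Files",
]


def missing_dataset_dirs(root="data"):
    """Return the expected sub-directories that do not exist under `root`."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    missing = missing_dataset_dirs("data")
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("Dataset layout looks complete.")
```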

Training & Evaluation

To train the baseline models, run the scripts in scripts/baselines. For example:

CUDA_VISIBLE_DEVICES=0 ./scripts/baselines/cub_runs.sh

The checkpoints are saved in the Training_Results folder.

To test the baseline models with our proposed DIML, first edit the checkpoint paths in test_diml.py, then run:

CUDA_VISIBLE_DEVICES=0 ./scripts/diml/test_diml.sh cub200

The results will be written to test_results/test_diml_<dataset>.csv in CSV format.
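To inspect the results from Python, the file can be read with the standard `csv` module. This sketch assumes nothing about the column layout, which is not documented here, and simply returns the raw rows; the helper name `load_result_rows` is hypothetical.

```python
import csv
from pathlib import Path


def load_result_rows(dataset, results_dir="test_results"):
    """Read test_results/test_diml_<dataset>.csv and return its rows as lists of strings."""
    path = Path(results_dir) / f"test_diml_{dataset}.csv"
    with path.open(newline="") as f:
        return list(csv.reader(f))
```

For example, `load_result_rows("cub200")` reads the file produced by the test command above once it has finished.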

You can also incorporate DIML into the training objectives. We provide two examples that apply DIML to the Margin and Multi-Similarity losses. To train DIML models, run:

# ./scripts/diml/train_diml.sh <dataset> <batch_size> <loss> <num_epochs>
# where loss could be margin_diml or multisimilarity_diml
# e.g.
CUDA_VISIBLE_DEVICES=0 ./scripts/diml/train_diml.sh cub200 112 margin_diml 150

Acknowledgement

The code is based on RevisitDML.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{zhao2021towards,
  title={Towards Interpretable Deep Metric Learning with Structural Matching},
  author={Zhao, Wenliang and Rao, Yongming and Wang, Ziyi and Lu, Jiwen and Zhou, Jie},
  booktitle={ICCV},
  year={2021}
}
