Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

Last update: Dec 30, 2022

Related tags

Overview

SALOD

Source code of our work: "Benchmarking Deep Models for Salient Object Detection".
In this works, we propose a new benchmark for SALient Object Detection (SALOD) methods.

We re-implement 14 methods using same settings, including input size, data loader and evaluation metrics (thanks to Metrics). Hyperparameters of optimizer are different because of various network structures and objective functions. We try our best to tune the optimizer for these models to achieve the best performance one-by-one. Some other networks are debugging now, it is welcome for your contributions on these networks to obtain better performance.

Properties

A unify interface for new models. To develop a new network, you only need to 1) set configs; 2) define network; 3) define loss function. See methods/template.
We build a new dataset by collecting several prevalent datasets in SOD task.
Easy to adopt different backbones (Available backbones: ResNet-50, VGG-16, MobileNet-v2, EfficientNet-B0, GhostNet, Res2Net)
Testing all networks on your own device. By input the name of network, you can test all available methods in our benchmark. Comparisons includes FPS, GFLOPs, model size and multiple effectiveness metrics.
We implement a loss factory that you can change the loss functions using command line parameters.

Available Methods:

Methods	Publish.	Input	Weight	Optim.	LR	Epoch	Paper	Src Code
DHSNet	CVPR2016	320^2	95M	Adam	2e-5	30	openaccess	Pytorch
NLDF	CVPR2017	320^2	161M	Adam	1e-5	30	openaccess	Pytorch/TF
Amulet	ICCV2017	320^2	312M	Adam	1e-5	30	openaccess	Pytorch
SRM	ICCV2017	320^2	240M	Adam	5e-5	30	openaccess	Pytorch
PicaNet	CVPR2018	320^2	464M	SGD	1e-2	30	openaccess	Pytorch
DSS	TPAMI2019	320^2	525M	Adam	2e-5	30	IEEE/ArXiv	Pytorch
BASNet	CVPR2019	320^2	374M	Adam	1e-5	30	openaccess	Pytorch
CPD	CVPR2019	320^2	188M	Adam	1e-5	30	openaccess	Pytorch
PoolNet	CVPR2019	320^2	267M	Adam	5e-5	30	openaccess	Pytorch
EGNet	ICCV2019	320^2	437M	Adam	5e-5	30	openaccess	Pytorch
SCRN	ICCV2019	320^2	100M	SGD	1e-2	30	openaccess	Pytorch
GCPA	AAAI2020	320^2	263M	SGD	1e-2	30	aaai.org	Pytorch
ITSD	CVPR2020	320^2	101M	SGD	5e-3	30	openaccess	Pytorch
MINet	CVPR2020	320^2	635M	SGD	1e-3	30	openaccess	Pytorch
`Tuning`	-----	-----	------	------	-----	-----	-----	-----
*PAGE	CVPR2019	320^2	------	------	-----	-----	openaccess	TF
*PFA	CVPR2019	320^2	------	------	-----	-----	openaccess	Pytorch
*F3Net	AAAI2020	320^2	------	------	-----	-----	aaai.org	Pytorch
*PFPN	AAAI2020	320^2	------	------	-----	-----	aaai.org	Pytorch
*LDF	CVPR2020	320^2	------	------	-----	-----	openaccess	Pytorch

Usage

# model_name: lower-cased method name. E.g. poolnet, egnet, gcpa, dhsnet or minet.
python3 train.py model_name --gpus=0

python3 test.py model_name --gpus=0 --weight=path_to_weight 

python3 test_fps.py model_name --gpus=0

# To evaluate generated maps:
python3 eval.py --pre_path=path_to_maps

Results

We report benchmark results here.
More results please refer to Reproduction, Few-shot and Generalization.

Notice: please contact us if you get better results.

VGG16-based:

Methods	#Param.	GFLOPs	Tr. Time	FPS	max-F	ave-F	Fbw	MAE	SM	EM
DHSNet	15.4	52.5	7.5	69.8	.884	.815	.812	.049	.880	.893
Amulet	33.2	1362	12.5	35.1	.855	.790	.772	.061	.854	.876
NLDF	24.6	136	9.7	46.3	.886	.824	.828	.045	.881	.898
SRM	37.9	73.1	7.9	63.1	.857	.779	.769	.060	.859	.874
PicaNet	26.3	74.2	40.5*	8.8	.889	.819	.823	.046	.884	.899
DSS	62.2	99.4	11.3	30.3	.891	.827	.826	.046	.888	.899
BASNet	80.5	114.3	16.9	32.6	.906	.853	.869	.036	.899	.915
CPD	29.2	85.9	10.5	36.3	.886	.815	.792	.052	.885	.888
PoolNet	52.5	236.2	26.4	23.1	.902	.850	.852	.039	.898	.913
EGNet	101	178.8	19.2	16.3	.909	.853	.859	.037	.904	.914
SCRN	16.3	47.2	9.3	24.8	.896	.820	.822	.046	.891	.894
GCPA	42.8	197.1	17.5	29.3	.903	.836	.845	.041	.898	.907
ITSD	16.9	76.3	15.2*	30.6	.905	.820	.834	.045	.901	.896
MINet	47.8	162	21.8	23.4	.900	.839	.852	.039	.895	.909

ResNet50-based:

Methods	#Param.	GFLOPs	Tr. Time	FPS	max-F	ave-F	Fbw	MAE	SM	EM
DHSNet	24.2	13.8	3.9	49.2	.909	.830	.848	.039	.905	.905
Amulet	79.8	1093.8	6.3	35.1	.895	.822	.835	.042	.894	.900
NLDF	41.1	115.1	9.2	30.5	.903	.837	.855	.038	.898	.910
SRM	61.2	20.2	5.5	34.3	.882	.803	.812	.047	.885	.891
PicaNet	106.1	36.9	18.5*	14.8	.904	.823	.843	.041	.902	.902
DSS	134.3	35.3	6.6	27.3	.894	.821	.826	.045	.893	.898
BASNet	95.5	47.2	12.2	32.8	.917	.861	.884	.032	.909	.921
CPD	47.9	14.7	7.7	22.7	.906	.842	.836	.040	.904	.908
PoolNet	68.3	66.9	10.2	33.9	.912	.843	.861	.036	.907	.912
EGNet	111.7	222.8	25.7	10.2	.917	.851	.867	.036	.912	.914
SCRN	25.2	12.5	5.5	19.3	.910	.838	.845	.040	.906	.905
GCPA	67.1	54.3	6.8	37.8	.916	.841	.866	.035	.912	.912
ITSD	25.7	19.6	5.7	29.4	.913	.825	.842	.042	.907	.899
MINet	162.4	87	11.7	23.5	.913	.851	.871	.034	.906	.917

Create New Model

To create a new model, you can copy the template folder and modify it as you want.

cp -r ./methods/template ./methods/new_name

More details please refer to python files in template floder.

Loss Factory

We supply a Loss Factory for an easier way to tune the loss functions. You can set --loss and --lw parameters to use it.

Here are some examples:

loss_dict = {'b': BCE, 's': SSIM, 'i': IOU, 'd': DICE, 'e': Edge, 'c': CTLoss}

python train.py ... --loss=bd
# loss = 1 * bce_loss + 1 * dice_loss

python train.py ... --loss=bs --lw=0.3,0.7
# loss = 0.3 * bce_loss + 0.7 * ssim_loss

python train.py ... --loss=bsid --lw=0.3,0.1,0.5,0.2
# loss = 0.3 * bce_loss + 0.1 * ssim_loss + 0.5 * iou_loss + 0.2 * dice_loss

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

Related tags

Overview

SALOD

Properties

Available Methods:

Usage

Results

VGG16-based:

ResNet50-based:

Create New Model

Loss Factory

Owner

Powerful unsupervised domain adaptation method for dense retrieval.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

Some toy examples of score matching algorithms written in PyTorch

Language-Agnostic Website Embedding and Classification

The Environment I built to study Reinforcement Learning + Pokemon Showdown

Citation Intent Classification in scientific papers using the Scicite dataset an Pytorch

Simple ONNX operation generator. Simple Operation Generator for ONNX.

Code for technical report "An Improved Baseline for Sentence-level Relation Extraction".

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Local-Global Stratified Transformer for Efficient Video Recognition

Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Process text, including tokenizing and representing sentences as vectors and Applying some concepts like RNN, LSTM and GRU to create a classifier can detect the language in which a sentence is written from among 17 languages.

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

Spectralformer: Rethinking hyperspectral image classification with transformers

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

Wordle Env: A Daily Word Environment for Reinforcement Learning