Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

Related tags

Overview

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, Shengfeng He

Prerequisites

The whole training process can be done on eight RTX2080Ti or four RTX3090.

Pytorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png

Testing Set

We test our model on the testing set of DUTS, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD to test our model.

Training

Download the pretrained transformer backbone on ImageNet.

# input the path to training data and pretrained backbone in train.sh
bash train.sh

Testing

Download the pretrained model from Baidu pan(code: uo0a), Google drive, and put it int ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu pan(code: uo0a), Google drive.

After paper submission, we retrain the model, and the performance is improved. Feel free to use the results of our paper or the precomputed saliency maps.

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you think the code and paper are helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}

Unifying Global-Local Representations in Salient Object Detection with Transformer

Related tags

Overview

GLSTR (Global-Local Saliency Transformer)

Prerequisites

Datasets

Training Set

Testing Set

Training

Testing

Evaluation

Contact

Citation

Owner

Learning To Have An Ear For Face Super-Resolution

It's final year project of Diploma Engineering. This project is based on Computer Vision.

Official PyTorch implementation of N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras (ICCV 2021)

Orange Chicken: Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

CT Based COVID 19 Diagnose by Image Processing and Deep Learning

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer

Pretrained Cost Model for Distributed Constraint Optimization Problems

A Python type explainer!

Predictive Maintenance LSTM

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency

Saliency - Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more).

Invertible conditional GANs for image editing

Fader Networks: Manipulating Images by Sliding Attributes - NIPS 2017

[ACM MM 2021] Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)

Pytorch Implementation of Continual Learning With Filter Atom Swapping (ICLR'22 Spolight) Paper

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.