PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).

Last update: Dec 31, 2022

Overview

PFENet

This is the implementation of our paper PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation that has been accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).

Get Started

Environment

torch==1.4.0 (torch version >= 1.0.1.post2 should be okay to run this repo)
numpy==1.18.4
tensorboardX==1.8
cv2==4.2.0 (the cv2 module is provided by the opencv-python package)

Datasets and Data Preparation

Please download the following datasets:

PASCAL-5i, which is based on PASCAL VOC 2012 and SBD. The val images should be excluded from the list of training samples.
COCO 2014.

This code reads data from .txt files in which each line contains the path of an image followed by the path of the corresponding label. The image and label paths are separated by a space. An example is as follows:

image_path_1 label_path_1
image_path_2 label_path_2
image_path_3 label_path_3
...
image_path_n label_path_n

Then update the train/val/test list paths in the config files.
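The list format described above can be checked before training with a short script. This is a minimal sketch and not part of the repository: the function name read_list_file is hypothetical, and it assumes that paths contain no spaces, so each non-empty line splits into exactly two fields.

```python
from pathlib import Path
from typing import List, Tuple


def read_list_file(list_path: str) -> List[Tuple[str, str]]:
    """Parse a list file into (image_path, label_path) pairs.

    Hypothetical helper: assumes one "image_path label_path" pair per line
    and that neither path contains whitespace.
    """
    pairs = []
    lines = Path(list_path).read_text().splitlines()
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        parts = line.split()
        if len(parts) != 2:
            raise ValueError(
                f"{list_path}:{line_no}: expected 2 paths, got {len(parts)}"
            )
        pairs.append((parts[0], parts[1]))
    return pairs
```

Calling len(read_list_file(path)) on a list file gives the number of samples, which can be compared against the counts reported for the provided lists.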

[Update] We have uploaded the lists we use in our paper.

The train/val lists for COCO contain 82081 and 40137 images respectively. They are the default train/val splits of COCO.
The train/val lists for PASCAL-5i contain 5953 and 1449 images respectively. The train list should be voc_sbd_merge_noduplicate.txt and the val list is the original val list of PASCAL VOC (val.txt).

To get voc_sbd_merge_noduplicate.txt:

We first merge the original VOC (voc_original_train.txt) and SBD (sbd_data.txt) training data.
[Important] sbd_data.txt does not overlap with the PASCAL VOC 2012 validation data.
The merged list (voc_sbd_merge.txt) is then processed by the script (duplicate_removal.py) to remove the duplicate images and labels.
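The merge and duplicate-removal steps above can be illustrated as follows. This is a sketch under stated assumptions, not the contents of duplicate_removal.py: the function names are hypothetical, entries are treated as duplicates when they are identical lines (pass key=image_id to compare by image file name instead), and the image ID is assumed to be the image file name without its extension.

```python
from typing import Callable, Iterable, List, Set


def image_id(entry: str) -> str:
    """Return the image file name without directory or extension.

    Assumption: the first field of the entry is the image path.
    """
    image_path = entry.split()[0]
    return image_path.rsplit("/", 1)[-1].rsplit(".", 1)[0]


def merge_without_duplicates(
    lists: Iterable[Iterable[str]],
    key: Callable[[str], str] = lambda entry: entry,
) -> List[str]:
    """Concatenate list entries in order, keeping the first entry per key."""
    seen: Set[str] = set()
    merged: List[str] = []
    for entries in lists:
        for entry in entries:
            entry = entry.strip()
            if not entry:
                continue  # ignore blank lines
            k = key(entry)
            if k not in seen:
                seen.add(k)
                merged.append(entry)
    return merged


def overlap_with_val(train_entries: Iterable[str],
                     val_entries: Iterable[str]) -> Set[str]:
    """Return image IDs that appear in both the train and val entries."""
    train_ids = {image_id(e) for e in train_entries if e.strip()}
    val_ids = {image_id(e) for e in val_entries if e.strip()}
    return train_ids & val_ids
```

An empty result from overlap_with_val on the merged training list and val.txt is consistent with the requirement that the val images are excluded from training.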

Run Demo / Test with Pretrained Models

Please download the pretrained models.
We provide 8 pre-trained models: 4 ResNet-50 based models for PASCAL-5i and 4 VGG-16 based models for COCO.
Update the config file by specifying the target split and the path (weights) for loading the checkpoint.
Execute mkdir initmodel at the root directory.
Download the ImageNet pretrained backbones and put them into the initmodel directory.
Then execute the command:

sh test.sh {*dataset*} {*model_config*}

Example: Test PFENet with ResNet50 on the split 0 of PASCAL-5i:

sh test.sh pascal split0_resnet50

Train

Execute this command at the root directory:

sh train.sh {*dataset*} {*model_config*}

Related Repositories

This project is built upon a very early version of SemSeg: https://github.com/hszhao/semseg.

Other projects in few-shot segmentation:

OSLSM: https://github.com/lzzcd001/OSLSM
CANet: https://github.com/icoz69/CaNet
PANet: https://github.com/kaixin96/PANet
FSS-1000: https://github.com/HKUSTCV/FSS-1000
AMP: https://github.com/MSiam/AdaptiveMaskedProxies
On the Texture Bias for FS Seg: https://github.com/rezazad68/fewshot-segmentation
SG-One: https://github.com/xiaomengyc/SG-One
FS Seg Propagation with Guided Networks: https://github.com/shelhamer/revolver

Many thanks for their great work!

Citation

If you find this project useful, please consider citing:

@article{tian2020pfenet,
  title={Prior Guided Feature Enrichment Network for Few-Shot Segmentation},
  author={Tian, Zhuotao and Zhao, Hengshuang and Shu, Michelle and Yang, Zhicheng and Li, Ruiyu and Jia, Jiaya},
  journal={TPAMI},
  year={2020}
}

Owner

DV Lab
