TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Last update: Dec 16, 2022

Related tags

Deep Learning TransFGU

Overview

Zhaoyun Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

[Preprint]

Getting Started

Create the environment

# create conda env
conda create -n TransFGU python=3.8
# activate conda env
conda activate TransFGU
# install pytorch
conda install pytorch=1.8 torchvision cudatoolkit=10.1
# install other dependencies
pip install mmcv-full -f https://download.openmmlab.com/mmcv/dist/cu101/torch1.8.0/index.html
pip install -r requirements.txt
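
After installation, a quick check can confirm that the expected packages are visible in the active environment. The sketch below is not part of the repository: the helper name `installed_versions` is hypothetical, and it assumes the conda `pytorch` package registers itself under the distribution name `torch`. It only reads package metadata, so it does not verify that CUDA actually works.

```python
from importlib import metadata


def installed_versions(packages=("torch", "torchvision", "mmcv-full")):
    """Return {package: version string, or None if the package is not installed}."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

Any package reported as NOT INSTALLED suggests the corresponding install command above did not complete in this environment.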

Dataset Preparation

MS-COCO Dataset: Download the training set, validation set, annotations and the JSON files, then place the extracted files into root/data/MSCOCO.
PascalVOC Dataset: Download the training/validation data, then place the extracted files into root/data/PascalVOC.
Cityscapes Dataset: Download leftImg8bit_trainvaltest.zip and gtFine_trainvaltest.zip, then place the extracted files into root/data/Cityscapes.
LIP Dataset: Download TrainVal_images.zip and TrainVal_parsing_annotations.zip, then place the extracted files into root/data/LIP.

The dataset folders should be structured as follows:

data/
├── MSCOCO/
│   ├── images/
│   │   ├── train2017/
│   │   └── val2017/
│   └── annotations/
│       ├── train2017/
│       ├── val2017/
│       ├── instances_train2017.json
│       └── instances_val2017.json
├── Cityscapes/
│   ├── leftImg8bit/
│   │   ├── train/
│   │   │   ├── aachen
│   │   │   └── ...
│   │   └── val/
│   │       ├── frankfurt
│   │       └── ...
│   └── gtFine/
│       ├── train/
│       │   ├── aachen
│       │   └── ...
│       └── val/
│           ├── frankfurt
│           └── ...
├── PascalVOC/
│   ├── JPEGImages/
│   ├── SegmentationClass/
│   └── ImageSets/
│       └── Segmentation/
│           ├── train.txt
│           └── val.txt
└── LIP/
    ├── train_images/
    ├── train_segmentations/
    ├── val_images/
    ├── val_segmentations/
    ├── train_id.txt
    └── val_id.txt
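
Before training, it may help to confirm that the extracted files match the tree above. The following is a minimal sketch, not part of the repository: the `EXPECTED` table and the `missing_entries` helper are hypothetical names, and the paths are copied from the tree (city subfolders such as aachen are not checked individually).

```python
from pathlib import Path

# Relative paths copied from the directory tree above.
EXPECTED = {
    "MSCOCO": [
        "images/train2017", "images/val2017",
        "annotations/train2017", "annotations/val2017",
        "annotations/instances_train2017.json",
        "annotations/instances_val2017.json",
    ],
    "Cityscapes": [
        "leftImg8bit/train", "leftImg8bit/val",
        "gtFine/train", "gtFine/val",
    ],
    "PascalVOC": [
        "JPEGImages", "SegmentationClass",
        "ImageSets/Segmentation/train.txt",
        "ImageSets/Segmentation/val.txt",
    ],
    "LIP": [
        "train_images", "train_segmentations",
        "val_images", "val_segmentations",
        "train_id.txt", "val_id.txt",
    ],
}


def missing_entries(data_root, dataset):
    """List the expected paths of `dataset` that do not exist under data_root."""
    base = Path(data_root) / dataset
    return [rel for rel in EXPECTED[dataset] if not (base / rel).exists()]


if __name__ == "__main__":
    # Assumes the script is run from the repository root, next to data/.
    for dataset in EXPECTED:
        missing = missing_entries("data", dataset)
        status = "OK" if not missing else "missing " + ", ".join(missing)
        print(f"{dataset}: {status}")
```

Only the datasets you intend to use need to pass; the others will simply be reported as missing.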

Model download

Download the pretrained DINO model (DeiT-small, 8x8 patches), then place it into root/weight/dino/.
Download the trained models from Google Drive or Baidu Netdisk (code: 1118), then place them into root/weight/trained/.

Name	mIoU	Pixel Accuracy	Model
COCOStuff-27	16.19	44.52	Google Drive
COCOStuff-171	11.93	34.32	Google Drive
COCO-80	12.69	64.31	Google Drive
Cityscapes	16.83	77.92	Google Drive
Pascal-VOC	37.15	83.59	Google Drive
LIP-5	25.16	65.76	Google Drive
LIP-16	15.49	60.08	Google Drive
LIP-19	12.24	42.52	Google Drive
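
For readers unfamiliar with the two metrics in the table, the sketch below shows their standard definitions computed from a confusion matrix. It is an illustration only and not the repository's evaluation code: the function names are hypothetical, and it assumes predicted labels already use the same class IDs as the ground truth. Unsupervised methods usually first map predicted cluster IDs to ground-truth classes (for example by Hungarian matching); that step is omitted here. The table values appear to be percentages, whereas the functions return fractions.

```python
def confusion_matrix(gt, pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for g, p in zip(gt, pred):
        matrix[g][p] += 1
    return matrix


def pixel_accuracy(matrix):
    """Fraction of pixels whose predicted class equals the ground truth."""
    correct = sum(matrix[c][c] for c in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


def mean_iou(matrix):
    """Average of TP / (TP + FP + FN) over classes that occur at all."""
    ious = []
    for c in range(len(matrix)):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                  # class-c pixels predicted as something else
        fp = sum(row[c] for row in matrix) - tp   # other pixels predicted as class c
        if tp + fp + fn > 0:
            ious.append(tp / (tp + fp + fn))
    return sum(ious) / len(ious)


if __name__ == "__main__":
    # Toy example: six flattened pixels, three classes.
    gt = [0, 0, 1, 1, 2, 2]
    pred = [0, 1, 1, 1, 2, 0]
    m = confusion_matrix(gt, pred, num_classes=3)
    print(f"mIoU: {mean_iou(m):.2f}, pixel accuracy: {pixel_accuracy(m):.2f}")
    # prints "mIoU: 0.50, pixel accuracy: 0.67"
```

The gap between the two numbers in the toy example mirrors the table: pixel accuracy is dominated by frequent classes, while mIoU weights every class equally.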

Train and Evaluate Our Method

To train and evaluate our method on different datasets at the desired granularity level, please follow the instructions here.

Citation

If you find our work useful in your research, please consider citing:

@article{yin2021transfgu,
  title={TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation},
  author={Yin, Zhaoyun and Wang, Pichao and Wang, Fan and Xu, Xianzhe and Zhang, Hanling and Li, Hao and Jin, Rong},
  journal={arXiv preprint arXiv:2112.01515},
  year={2021}
}

LICENSE

The code is released under the MIT license.

Owner

DamoCV
