Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

Last update: Dec 31, 2022

Overview

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Figure 1: Performance of SegFormer-B0 to SegFormer-B5.

Project page | Paper | Demo (Youtube) | Demo (Bilibili)

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.
Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, and Ping Luo.
Technical Report 2021.

This repository contains the PyTorch training/evaluation code and the pretrained models for SegFormer.

SegFormer is a simple, efficient and powerful semantic segmentation method, as shown in Figure 1.

We use MMSegmentation v0.13.0 as the codebase.

Installation

For install and data preparation, please refer to the guidelines in MMSegmentation v0.13.0.

Other requirements: pip install timm==0.3.2

Evaluation

Download trained weights.

Example: evaluate SegFormer-B1 on ADE20K:

# Single-gpu testing
python tools/test.py local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file

# Multi-gpu testing
./tools/dist_test.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file <GPU_NUM>

# Multi-gpu, multi-scale testing
tools/dist_test.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file <GPU_NUM> --aug-test

Training

Download weights pretrained on ImageNet-1K, and put them in a folder pretrained/.

Example: train SegFormer-B1 on ADE20K:

# Single-gpu training
python tools/train.py local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py 

# Multi-gpu training
./tools/dist_train.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py <GPU_NUM>

License

Please check the LICENSE file. SegFormer may be used non-commercially, meaning for research or evaluation purposes only. For business inquiries, please contact [email protected].

Citation

@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

Related tags

Overview

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Project page | Paper | Demo (Youtube) | Demo (Bilibili)

Installation

Evaluation

Training

License

Citation

Owner

NVIDIA Research Projects

FastFace: Lightweight Face Detection Framework

Command-line tool for downloading and extending the RedCaps dataset.

Twin-deep neural network for semi-supervised learning of materials properties

ML-Ensemble – high performance ensemble learning

Convert human motion from video to .bvh

Image process framework based on plugin like imagej, it is esay to glue with scipy.ndimage, scikit-image, opencv, simpleitk, mayavi...and any libraries based on numpy

PyTorch framework for Deep Learning research and development.

Relative Human dataset, CVPR 2022

A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).

The code for SAG-DTA: Prediction of Drug–Target Affinity Using Self-Attention Graph Network.

Adversarial Graph Augmentation to Improve Graph Contrastive Learning

8-week curriculum for AI Builders

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Face Recognition plus identification simply and fast | Python

This is a Deep Leaning API for classifying emotions from human face and human audios.

ICCV2021 - A New Journey from SDRTV to HDRTV.

PyTorch(Geometric) implementation of G^2GNN in "Imbalanced Graph Classification via Graph-of-Graph Neural Networks"

The implementation of "Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer"

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

A PyTorch implementation of deep-learning-based registration