Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Supports major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Supports all Detectron2 models.
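
To make the "unified view" concrete, here is a minimal pure-Python sketch of how mask classification can produce a semantic segmentation map. It assumes the formulation described in the paper: each query predicts a class distribution (with an extra "no object" entry) and a binary mask, and a pixel's label is the class with the highest sum over queries of class probability times mask probability. The function name semantic_inference, the argument names, and the toy inputs are illustrative assumptions, not the repository's API.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(value):
    return 1.0 / (1.0 + math.exp(-value))


def semantic_inference(class_logits, mask_logits):
    """Combine per-query class and mask predictions into a label map.

    class_logits: Q lists of K + 1 logits; the last entry is "no object".
    mask_logits:  Q masks, each an H x W nested list of logits.
    Returns an H x W nested list of class indices in [0, K).
    (Hypothetical helper for illustration only.)
    """
    num_queries = len(class_logits)
    num_classes = len(class_logits[0]) - 1
    # Drop the "no object" probability after normalizing.
    class_probs = [softmax(q)[:num_classes] for q in class_logits]
    height = len(mask_logits[0])
    width = len(mask_logits[0][0])

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over queries of
            # P(class c | query) * P(pixel belongs to query's mask).
            scores = [
                sum(
                    class_probs[q][c] * sigmoid(mask_logits[q][y][x])
                    for q in range(num_queries)
                )
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Toy input: 2 queries, 2 classes plus "no object", a 1 x 2 image.
# Query 0 predicts class 0 and covers the left pixel;
# query 1 predicts class 1 and covers the right pixel.
toy_class_logits = [[5.0, 0.0, 0.0], [0.0, 5.0, 0.0]]
toy_mask_logits = [[[4.0, -4.0]], [[-4.0, 4.0]]]
labels = semantic_inference(toy_class_logits, toy_mask_logits)
print(labels)  # [[0, 1]]
```

The same per-query (class, mask) pairs could instead be kept as separate segments, which is why one output format can serve both semantic- and instance-level tasks. A real implementation would do this with batched tensor operations rather than Python loops.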

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Owner

Facebook Research

