Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Last update: Jan 02, 2023

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

[arXiv] [Project] [BibTeX]

Features

A single architecture for panoptic, instance and semantic segmentation.
Support major segmentation datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for Mask2Former.

See Getting Started with Mask2Former.

Advanced usage

See Advanced Usage of Mask2Former.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the Mask2Former Model Zoo.

License

Shield:

The majority of Mask2Former is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license, Deformable-DETR is licensed under the Apache-2.0 License.

Citing Mask2Former

If you use Mask2Former in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

If you find the code useful, please also consider the following BibTeX entry.

@inproceedings{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={NeurIPS},
  year={2021}
}

Acknowledgement

Code is largely based on MaskFormer (https://github.com/facebookresearch/MaskFormer).

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Features

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

Owner

Meta Research

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Code for our CVPR 2021 paper "MetaCam+DSCE"

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL [Deep Graph Library] and PyTorch.

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

Least Square Calibration for Peer Reviews

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

Object-aware Contrastive Learning for Debiased Scene Representation

A simple Neural Network that predicts the label for a series of handwritten digits

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.

Syntax-Aware Action Targeting for Video Captioning

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

This repo contains the implementation of YOLOv2 in Keras with Tensorflow backend.

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

PerfFuzz: Automatically Generate Pathological Inputs for C/C++ programs

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.

An implementation of RetinaNet in PyTorch.

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Features

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

Owner

Meta Research

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Code for our CVPR 2021 paper "MetaCam+DSCE"

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL [Deep Graph Library] and PyTorch.

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

Least Square Calibration for Peer Reviews

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

Object-aware Contrastive Learning for Debiased Scene Representation

A simple Neural Network that predicts the label for a series of handwritten digits

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.

Syntax-Aware Action Targeting for Video Captioning

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

This repo contains the implementation of YOLOv2 in Keras with Tensorflow backend.

2021搜狐校园文本匹配算法大赛 分比我们低的都是帅哥队

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

PerfFuzz: Automatically Generate Pathological Inputs for C/C++ programs

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.

An implementation of RetinaNet in PyTorch.

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队