Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Last update: Dec 16, 2021

Related tags

Deep Learning Mask2Former

Overview

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar [arXiv]

Features

A single architecture for three tasks: panoptic, instance, and semantic segmentation. This mini project was built as part of the main project, IST: A TensorFlow 2 compatible instance segmentation toolbox, with the aim of porting recent segmentation research to TensorFlow.
Supports common benchmark datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.
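
As a rough illustration of how one set of outputs can serve more than one task, the sketch below follows the semantic-inference rule described in the MaskFormer paper: per-query class probabilities (with the "no object" class dropped) are weighted by per-query sigmoid mask probabilities, and each pixel takes the highest-scoring class. It is plain Python written for readability, not code from this repository; the function name `semantic_inference` and the nested-list input layout are assumptions made for the example.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def semantic_inference(class_logits, mask_logits):
    """Hypothetical helper: merge per-query predictions into a label map.

    class_logits: Q lists of length C + 1 (last entry is "no object").
    mask_logits:  Q masks, each an H x W nested list of logits.
    Returns an H x W nested list of class indices in [0, C).
    """
    # Per-query class probabilities without the trailing "no object" entry.
    probs = [softmax(q)[:-1] for q in class_logits]
    num_classes = len(probs[0])
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    labels = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class c at this pixel: sum over queries of
            # P(class c | query) * P(pixel belongs to query mask).
            scores = [
                sum(probs[q][c] * sigmoid(mask_logits[q][y][x])
                    for q in range(len(probs)))
                for c in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        labels.append(row)
    return labels


# Two queries, two classes, a 1 x 2 image: query 0 covers the left
# pixel and votes for class 0, query 1 covers the right pixel and
# votes for class 1.
example = semantic_inference(
    [[5.0, 0.0, 0.0], [0.0, 5.0, 0.0]],
    [[[4.0, -4.0]], [[-4.0, 4.0]]],
)
```

Instance and panoptic results start from the same per-query class and mask predictions and differ only in post-processing, which is what lets a single model cover all three tasks.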

Getting started

The project is still under construction. So far, SwinTransformerV1, SwinTransformerV2, and a few supporting pieces are ready.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citation

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

Owner

Phan Nguyen
