Applying PVT to Semantic Segmentation

Last update: Nov 30, 2022

Related tags

Deep Learning PVTv2-Seg

Overview

Applying PVT to Semantic Segmentation

Here, we take MMSegmentation v0.13.0 as an example, applying PVTv2 to SemanticFPN.

For details see Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions.

If you use this code for a paper please cite:

@misc{wang2021pyramid,
      title={Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions}, 
      author={Wenhai Wang and Enze Xie and Xiang Li and Deng-Ping Fan and Kaitao Song and Ding Liang and Tong Lu and Ping Luo and Ling Shao},
      year={2021},
      eprint={2102.12122},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Usage

Install MMSegmentation.

Data preparation

First, prepare ADE20K according to the guidelines in MMSegmentation.

Then, download the weights pretrained on ImageNet at here, and put them in a folder pretrained/

Results and models

Backbone	Iters	mIoU	Config
PVTv2-B0 + Semantic FPN	40K	37.2	config
PVTv2-B1 + Semantic FPN	40K	42.5	config
PVTv2-B2 + Semantic FPN	40K	45.2	config
PVTv2-B3 + Semantic FPN	40K	47.3	config
PVTv2-B4 + Semantic FPN	40K	47.9	config
PVTv2-B5 + Semantic FPN	40K	48.7	config

Evaluation

To evaluate PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_test.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py /path/to/checkpoint_file 8 --out results.pkl --eval mIoU

Training

To train PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_train.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py 8

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Applying PVT to Semantic Segmentation

Related tags

Overview

Applying PVT to Semantic Segmentation

Usage

Data preparation

Results and models

Evaluation

Training

License

Owner

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

Universal Probability Distributions with Optimal Transport and Convex Optimization

Wafer Fault Detection using MlOps Integration

Split your patch similarly to `git add -p` but supporting multiple buckets

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

This repository contains codes of ICCV2021 paper: SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

Customer-Transaction-Analysis - This analysis is based on a synthesised transaction dataset containing 3 months worth of transactions for 100 hypothetical customers.

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing

Hippocampal segmentation using the UNet network for each axis

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

This implementation contains the application of GPlearn's symbolic transformer on a commodity futures sector of the financial market.

PyTorch version implementation of DORN

Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework

Open CV - Convert a picture to look like a cartoon sketch in python

Contrastive Learning for Metagenomic Binning

MCMC samplers for Bayesian estimation in Python, including Metropolis-Hastings, NUTS, and Slice