TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet(caffe)

Implemented Architecture of Pyramid Scene Parsing Network in Keras.

For the best compability please use Python3.5

Setup

Install dependencies:
- Tensorflow (-gpu)
- Keras
- numpy
- scipy
- pycaffe(PSPNet)(optional for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. Weights(in .h5 .json format) have to be downloaded and placed into directory weights/keras

Already converted weights can be downloaded here:

Convert weights by yourself(optional)

(Note: this is not required if you use .h5/.json weights)

Running this needs the compiled original PSPNet caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image>  -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images

Keras results:

Implementation details

The interpolation layer is implemented as custom layer "Interp"
Forward step takes about ~1 sec on single image

Memory usage can be optimized with:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3 
sess = tf.Session(config=config)

ndimage.zoom can take a long time

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Related tags

Overview

Keras implementation of PSPNet(caffe)

Setup

Convert weights by yourself(optional)

Usage:

Keras results:

Implementation details

Owner

VladKry

Cereal box identification in store shelves using computer vision and a single train image per model.

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Code for "Typilus: Neural Type Hints" PLDI 2020

Bottom-up Human Pose Estimation

Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond

Code for models used in Bashiri et al., "A Flow-based latent state generative model of neural population responses to natural images".

Semi-Supervised Signed Clustering Graph Neural Network (and Implementation of Some Spectral Methods)

VOGUE: Try-On by StyleGAN Interpolation Optimization

Unified MultiWOZ evaluation scripts for the context-to-response task.

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

DCGAN LSGAN WGAN-GP DRAGAN PyTorch

A full-fledged version of Pix2Seq

Deep Learning for Human Part Discovery in Images - Chainer implementation

Inflated i3d network with inception backbone, weights transfered from tensorflow

A High-Performance Distributed Library for Large-Scale Bundle Adjustment

OntoProtein: Protein Pretraining With Ontology Embedding

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

Minimalist Error collection Service compatible with Rollbar clients. Sentry or Rollbar alternative.