Implementation of SegNet: A Deep Convolutional Encoder-Decoder Architecture for Semantic Pixel-Wise Labelling

Last update: Jan 02, 2023

Related tags

Overview

Caffe SegNet

This is a modified version of Caffe which supports the SegNet architecture

As described in SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation Vijay Badrinarayanan, Alex Kendall and Roberto Cipolla, PAMI 2017 [http://arxiv.org/abs/1511.00561]

Updated Version:

This version supports cudnn v2 acceleration. @TimoSaemann has a branch supporting a more recent version of Caffe (Dec 2016) with cudnn v5.1: https://github.com/TimoSaemann/caffe-segnet-cudnn5

Getting Started with Example Model and Webcam Demo

If you would just like to try out a pretrained example model, then you can find the model used in the SegNet webdemo and a script to run a live webcam demo here: https://github.com/alexgkendall/SegNet-Tutorial

For a more detailed introduction to this software please see the tutorial here: http://mi.eng.cam.ac.uk/projects/segnet/tutorial.html

Dataset

Prepare a text file of space-separated paths to images (jpegs or pngs) and corresponding label images alternatively e.g. /path/to/im1.png /another/path/to/lab1.png /path/to/im2.png /path/lab2.png ...

Label images must be single channel, with each value from 0 being a separate class. The example net uses an image size of 360 by 480.

Net specification

Example net specification and solver prototext files are given in examples/segnet. To train a model, alter the data path in the data layers in net.prototxt to be your dataset.txt file (as described above).

In the last convolution layer, change num_output to be the number of classes in your dataset.

Training

In solver.prototxt set a path for snapshot_prefix. Then in a terminal run ./build/tools/caffe train -solver ./examples/segnet/solver.prototxt

Publications

If you use this software in your research, please cite our publications:

http://arxiv.org/abs/1511.02680 Alex Kendall, Vijay Badrinarayanan and Roberto Cipolla "Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding." arXiv preprint arXiv:1511.02680, 2015.

http://arxiv.org/abs/1511.00561 Vijay Badrinarayanan, Alex Kendall and Roberto Cipolla "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation." PAMI, 2017.

License

This extension to the Caffe library is released under a creative commons license which allows for personal and research use only. For a commercial license please contact the authors. You can view a license summary here: http://creativecommons.org/licenses/by-nc/4.0/

Implementation of SegNet: A Deep Convolutional Encoder-Decoder Architecture for Semantic Pixel-Wise Labelling

Related tags

Overview

Caffe SegNet

Updated Version:

Getting Started with Example Model and Webcam Demo

Dataset

Net specification

Training

Publications

License

Owner

Alex Kendall

CVPR2021 Content-Aware GAN Compression

This is the repo of the manuscript "Dual-branch Attention-In-Attention Transformer for speech enhancement"

DetCo: Unsupervised Contrastive Learning for Object Detection

Dark Finix: All in one hacking framework with almost 100 tools

Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Kohei's 5th place solution for xview3 challenge

CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)

Where-Got-Time - An NUS timetable generator which uses a genetic algorithm to optimise timetables to suit the needs of NUS students

This is an early in-development version of training CLIP models with hivemind.

Robust & Reliable Route Recommendation on Road Networks

Understanding the Effects of Datasets Characteristics on Offline Reinforcement Learning

[NeurIPS-2021] Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

"Domain Adaptive Semantic Segmentation without Source Data" (ACM MM 2021)

Rule-based Customer Segmentation

A curated list of programmatic weak supervision papers and resources

Benchmark for evaluating open-ended generation

A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

TC-GNN with Pytorch integration