Using fully convolutional networks for semantic segmentation with Caffe on the Cityscapes dataset

Last update: Jun 06, 2022

Overview

This repository applies fully convolutional networks for semantic segmentation (Shelhamer et al.) to the Cityscapes dataset, using the Caffe framework.

How to get started

Download the Cityscapes dataset and the VGG 16-layer net.
Preprocess the dataset images with cut_images.py or downscale_images.py to make training and evaluation less resource-demanding.
Create the 32-pixel-stride net with net_32.py.
Edit the paths in train.txt and val.txt (first line: path to the training/validation images; second line: path to the annotations).
Start training with solve_start.py.
Run evaluate_models.py to evaluate your model, or create_eval_images.py to create images with pixel label IDs.
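
As a rough illustration of the two-line layout of train.txt and val.txt described above, the sketch below reads such a file and pairs each image with its annotation. It is not taken from this repository's scripts: the helper names read_path_file and pair_images_with_labels are hypothetical, and the file suffixes _leftImg8bit.png and _gtFine_labelIds.png are assumed from the standard Cityscapes naming scheme, so adjust them if your preprocessed files are named differently.

```python
import os
import tempfile


def read_path_file(path_file):
    """Return (image_dir, annotation_dir) from a two-line path file."""
    with open(path_file) as f:
        lines = [line.strip() for line in f if line.strip()]
    if len(lines) < 2:
        raise ValueError("expected two lines: image path, then annotation path")
    return lines[0], lines[1]


def pair_images_with_labels(image_dir, annotation_dir,
                            image_suffix="_leftImg8bit.png",
                            label_suffix="_gtFine_labelIds.png"):
    """Pair each image with its label file by swapping the filename suffix.

    Assumption: both trees share the same sub-directory layout (one folder
    per city), as in the original Cityscapes download.
    """
    pairs = []
    for root, _dirs, files in os.walk(image_dir):
        rel_dir = os.path.relpath(root, image_dir)
        for name in files:
            if not name.endswith(image_suffix):
                continue
            label_name = name[:-len(image_suffix)] + label_suffix
            label_path = os.path.normpath(
                os.path.join(annotation_dir, rel_dir, label_name))
            # Skip images that have no matching annotation file.
            if os.path.isfile(label_path):
                pairs.append((os.path.join(root, name), label_path))
    return sorted(pairs)


# Demo on a throwaway directory tree that mimics the Cityscapes layout.
with tempfile.TemporaryDirectory() as tmp:
    img_dir = os.path.join(tmp, "leftImg8bit", "train")
    ann_dir = os.path.join(tmp, "gtFine", "train")
    os.makedirs(os.path.join(img_dir, "aachen"))
    os.makedirs(os.path.join(ann_dir, "aachen"))
    stem = "aachen_000000_000019"
    open(os.path.join(img_dir, "aachen", stem + "_leftImg8bit.png"), "w").close()
    open(os.path.join(ann_dir, "aachen", stem + "_gtFine_labelIds.png"), "w").close()

    # First line: image path, second line: annotation path.
    path_file = os.path.join(tmp, "train.txt")
    with open(path_file, "w") as f:
        f.write(img_dir + "\n" + ann_dir + "\n")

    image_dir, annotation_dir = read_path_file(path_file)
    pairs = pair_images_with_labels(image_dir, annotation_dir)
    print(len(pairs), "image/label pair(s) found")
```

Running a check like this before training can catch a mistyped path or a mismatch between images and annotations early, for example after cutting or downscaling only one of the two directories.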

Sources

Fully Convolutional Models for Semantic Segmentation:

Shelhamer, Evan, Jonathan Long, and Trevor Darrell. "Fully Convolutional Networks for Semantic Segmentation." PAMI, 2016, URL http://fcn.berkeleyvision.org

Cityscapes Dataset (Semantic Understanding of Urban Street Scenes):

Cordts, Marius, et al. "The Cityscapes Dataset." CVPR Workshop on The Future of Datasets in Vision, 2015, URL https://www.cityscapes-dataset.com

Caffe Deep Learning Framework:

Jia, Yangqing, et al. "Caffe: Convolutional architecture for fast feature embedding." Proceedings of the 22nd ACM international conference on Multimedia. ACM, 2014, URL http://caffe.berkeleyvision.org

Owner

Simon Guist
