Last update: Dec 23, 2022

Overview

Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes

Introduction

This is the unofficial code of Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes. which achieve state-of-the-art trade-off between accuracy and speed on cityscapes and camvid, without using inference acceleration and extra data!on single 2080Ti GPU, DDRNet-23-slim yields 77.4% mIoU at 109 FPS on Cityscapes test set and 74.4% mIoU at 230 FPS on CamVid test set.

The code mainly borrows from HRNet-Semantic-Segmentation OCR and the official repository, thanks for their work.

requirements

Here I list the software and hardware used in my experiment

pytorch==1.7.0
3080*2
cuda==11.1

Quick start

0. Data preparation

You need to download the Cityscapesdatasets. and rename the folder cityscapes, then put the data under data folder.

└── data
  ├── cityscapes
  └── list

1. Pretrained model

download the pretrained model on imagenet or the segmentation model from the official，and put the files in ${PROJECT}/pretrained_models folder

VAL

use the official pretrained model and our eval.py code. with ydhongHIT's advice now can reach the same accuracy in the paper. Thanks.

cd ${PROJECT}
python tools/eval.py --cfg experiments/cityscapes/ddrnet23_slim.yaml

model	Train Set	Test Set	OHEM	Multi-scale	Flip	mIoU	Link
DDRNet23_slim	unknown	eval	Yes	No	No	77.83	official
DDRNet23_slim	unknown	eval	Yes	No	Yes	78.42	official
DDRNet23	unknown	eval	Yes	No	No	79.51	official
DDRNet23	unknown	eval	Yes	No	Yes	79.98	official

Note

with the ALIGN_CORNERS: false in ***.yaml will reach higher accuracy.

TRAIN

download the imagenet pretrained model, and then train the model with 2 nvidia-3080

cd ${PROJECT}
python -m torch.distributed.launch --nproc_per_node=2 tools/train.py --cfg experiments/cityscapes/ddrnet23_slim.yaml

the own trained model coming soon

OWN model

model	Train Set	Test Set	OHEM	Multi-scale	Flip	mIoU	Link
DDRNet23_slim	train	eval	Yes	No	Yes	77.77	Baidu/password:it2s
DDRNet23_slim	train	eval	Yes	Yes	Yes	79.57	Baidu/password:it2s
DDRNet23	train	eval	Yes	No	Yes	~	None
DDRNet39	train	eval	Yes	No	Yes	~	None

Note

set the ALIGN_CORNERS: true in ***.yaml, because i use the default setting in HRNet-Semantic-Segmentation OCR.
Multi-scale with scales: 0.5,0.75,1.0,1.25,1.5,1.75. it runs too slow.
from ydhongHIT, can change the align_corners=True with better performance, the default option is False

Reference

[1] HRNet-Semantic-Segmentation OCR branch

[2] the official repository

This is the unofficial code of Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes. which achieve state-of-the-art trade-off between accuracy and speed on cityscapes and camvid, without using inference acceleration and extra data

Related tags

Overview

Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes

Introduction

requirements

Quick start

0. Data preparation

1. Pretrained model

VAL

TRAIN

OWN model

Reference

Owner

chenjun

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Implementation of CVAE. Trained CVAE on faces from UTKFace Dataset to produce synthetic faces with a given degree of happiness/smileyness.

diablo2 resurrected loot filter

Article Reranking by Memory-enhanced Key Sentence Matching for Detecting Previously Fact-checked Claims.

Instance-level Image Retrieval using Reranking Transformers

A general python framework for visual object tracking and video object segmentation, based on PyTorch

Supplementary materials for ISMIR 2021 LBD paper "Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes"

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

HeatNet is a python package that provides tools to build, train and evaluate neural networks designed to predict extreme heat wave events globally on daily to subseasonal timescales.

Sentinel-1 vessel detection model used in the xView3 challenge

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

Cognate Detection Repository

Deep Learning Theory

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

(under submission) Bayesian Integration of a Generative Prior for Image Restoration

pybaum provides tools to work with pytrees which is a concept burrowed from JAX.

Tensor-Based Quantum Machine Learning

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"