Semantic segmentation for the ADE20K and Cityscapes datasets, based on several models.

Last update: Oct 13, 2022

Overview

semantic-segmentation-tensorflow

This is a TensorFlow implementation of semantic segmentation models on the MIT ADE20K scene parsing dataset and the Cityscapes dataset. We reproduce the inference phase of several models, including PSPNet, FCN, and ICNet, by converting the released pre-trained weights into TensorFlow format and applying them to handcrafted models. We also refer to the ENet implementation from freg856's GitHub repository. Integrating the tasks is still a work in progress.
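One step in converting weights to TensorFlow format is reordering convolution kernels. The sketch below assumes the released weights store kernels in Caffe-style (out, in, height, width) order, which this README does not state, and reorders them into the (height, width, in, out) order that TensorFlow convolutions expect. The function name `caffe_conv_to_tf` and the kernel shape are hypothetical and are not taken from this repository.

```python
import numpy as np


def caffe_conv_to_tf(kernel):
    """Reorder a conv kernel from (out, in, height, width) to (height, width, in, out)."""
    kernel = np.asarray(kernel)
    if kernel.ndim != 4:
        raise ValueError("expected a 4-D convolution kernel")
    # Axes 2 and 3 (height, width) move to the front; axis 0 (out) moves to the end.
    return kernel.transpose(2, 3, 1, 0)


# Hypothetical kernel: 64 output channels, 3 input channels, 7x7 window.
caffe_kernel = np.zeros((64, 3, 7, 7), dtype=np.float32)
tf_kernel = caffe_conv_to_tf(caffe_kernel)
print(tf_kernel.shape)  # (7, 7, 3, 64)
```

Biases and other 1-D parameters would need no reordering under the same assumption.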

Models

PSPNet
FCN
ENet
ICNet

...to be continued

Install

Get the corresponding converted pre-trained weights and put them into the model directory:

FCN	PSPNet	ICNet
Google drive	Google drive	Google drive

Inference

Run the following command:

python inference.py --img-path /Path/To/Image --dataset Model_Type

Arg list

--model - choose from "icnet"/"pspnet"/"fcn"/"enet"
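To illustrate how such flags could be validated, here is a minimal `argparse` sketch. It is not the actual `inference.py`: it covers only `--img-path` and the `--model` choices listed above, and the function name `build_parser` and the help strings are assumptions. How the real script treats `--dataset` is not shown here.

```python
import argparse

# Model names taken from the arg list above.
MODEL_CHOICES = ("icnet", "pspnet", "fcn", "enet")


def build_parser():
    """Build a parser for the two flags sketched here (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        description="Run semantic segmentation on one image.")
    parser.add_argument("--img-path", required=True,
                        help="path to the input image")
    parser.add_argument("--model", required=True, choices=MODEL_CHOICES,
                        help="which model to run")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["--img-path", "/Path/To/Image", "--model", "pspnet"])
print(args.img_path, args.model)
```

Restricting `--model` with `choices` makes the parser reject an unknown model name before any weights are loaded.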

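Import the module in your code:

import tensorflow as tf

from model import FCN8s, PSPNet50, ICNet, ENet

img_path = '/Path/To/Image'    # placeholder: input image
model_path = '/Path/To/Model'  # placeholder: pre-trained weights

model = PSPNet50()  # or another model

model.read_input(img_path)  # read image data from path

config = tf.ConfigProto()  # assumption: default session options (TensorFlow 1.x)
sess = tf.Session(config=config)
init = tf.global_variables_initializer()
sess.run(init)

model.load(model_path, sess)  # load the pre-trained model
preds = model.forward(sess)  # get the prediction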
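A common follow-up step is turning the prediction into a color image for viewing. The sketch below assumes the prediction can be reduced to an (H, W) array of integer class indices. The shape and dtype of `preds` are not documented in this README, and the helper `colorize` and the three-color palette are hypothetical.

```python
import numpy as np


def colorize(label_map, palette):
    """Map an (H, W) array of class indices to an (H, W, 3) uint8 RGB image."""
    label_map = np.asarray(label_map)
    palette = np.asarray(palette, dtype=np.uint8)
    if label_map.min() < 0 or label_map.max() >= len(palette):
        raise ValueError("label map contains a class index outside the palette")
    # Integer-array indexing looks up one RGB triple per pixel.
    return palette[label_map]


# Hypothetical three-class palette and a 2x2 label map.
palette = [(0, 0, 0), (128, 64, 128), (70, 70, 70)]
labels = np.array([[0, 1], [2, 1]])
color = colorize(labels, palette)
print(color.shape)  # (2, 2, 3)
```

A real palette would need one entry per class of the chosen dataset.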

Results

ade20k

Input Image	PSPNet	FCN

cityscapes

Input Image	ICNet	ENet

Citation

@inproceedings{zhao2017pspnet,
  author = {Hengshuang Zhao and
            Jianping Shi and
            Xiaojuan Qi and
            Xiaogang Wang and
            Jiaya Jia},
  title = {Pyramid Scene Parsing Network},
  booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
}

Scene Parsing through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. Computer Vision and Pattern Recognition (CVPR), 2017. (http://people.csail.mit.edu/bzhou/publication/scene-parse-camera-ready.pdf)

@inproceedings{zhou2017scene,
    title={Scene Parsing through ADE20K Dataset},
    author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    year={2017}
}

Semantic Understanding of Scenes through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. arXiv:1608.05442. (https://arxiv.org/pdf/1608.05442.pdf)

@article{zhou2016semantic,
  title={Semantic understanding of scenes through the ade20k dataset},
  author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
  journal={arXiv preprint arXiv:1608.05442},
  year={2016}
}

Owner

HsuanKung Yang
