Real-Time Semantic Segmentation in Mobile device

Last update: Jan 01, 2023

Overview

Real-Time Semantic Segmentation in Mobile device

This project is an example project of semantic segmentation for mobile real-time app.

The architecture is inspired by MobileNetV2 and U-Net.

LFW, Labeled Faces in the Wild, is used as a Dataset.

The goal of this project is to detect hair segments with reasonable accuracy and speed in mobile device. Currently, it achieves 0.89 IoU.

About speed vs accuracy, more details are available at my post.

Example application

iOS
Android (TODO)

Requirements

Python 3.8
pip install -r requirements.txt -f https://download.pytorch.org/whl/torch_stable.html
CoreML for iOS app.

About Model

At this time, there is only one model in this repository, MobileNetV2_unet. As a typical U-Net architecture, it has encoder and decoder parts, which consist of depthwise conv blocks proposed by MobileNets.

Input image is encoded to 1/32 size, and then decoded to 1/2. Finally, it scores the results and make it to original size.

Steps to training

Data Preparation

Data is available at LFW. To get mask images, refer issue #11 for more. After you got images and masks, put the images of faces and masks as shown below.

data/
  lfw/
    raw/
      images/
        0001.jpg
        0002.jpg
      masks/
        0001.ppm
        0002.ppm

Training

If you use 224 x 224 as input size, pre-trained weight of MobileNetV2 is available. It will be automatically downloaded when you train model with the following command.

cd src
python run_train.py params/002.yaml

Dice coefficient is used as a loss function.

Pretrained model

Input size	IoU	Download
224	0.89	Google Drive

Converting

As the purpose of this project is to make model run in mobile device, this repository contains some scripts to convert models for iOS and Android.

run_convert_coreml.py
- It converts trained PyTorch model into CoreML model for iOS app.

TBD

Report speed vs accuracy in mobile device.
Convert pytorch to Android using TesorFlow Light

Real-Time Semantic Segmentation in Mobile device

Related tags

Overview

Real-Time Semantic Segmentation in Mobile device

Example application

Requirements

About Model

Steps to training

Data Preparation

Training

Pretrained model

Converting

TBD

Owner

Fast and robust certifiable relative pose estimation

[ECCV'20] Convolutional Occupancy Networks

This library is a location of the LegacyLogger for PyTorch Lightning.

Learning Intents behind Interactions with Knowledge Graph for Recommendation, WWW2021

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

Mail classification with tensorflow and MS Exchange Server (ham or spam).

A python bot to move your mouse every few seconds to appear active on Skype, Teams or Zoom as you go AFK. 🐭 🤖

Repository for Traffic Accident Benchmark for Causality Recognition (ECCV 2020)

Used to record WKU's utility bills on a regular basis.

A Transformer-Based Siamese Network for Change Detection

Use your Philips Hue lights as Racing Flags. Works with Assetto Corsa, Assetto Corsa Competizione and iRacing.

The undersampled DWI image using Slice-Interleaved Diffusion Encoding (SIDE) method can be reconstructed by the UNet network.

68 keypoint annotations for COFW test data

A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes

Official repository for "On Improving Adversarial Transferability of Vision Transformers" (2021)

AI that generate music

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)