Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Last update: Nov 02, 2022

Overview

ZF_UNET_224 Pretrained Model

Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Requirements

Python 3.*, Keras 2.1, Tensorflow 1.4

Usage

from zf_unet_224_model import ZF_UNET_224, dice_coef_loss, dice_coef
from keras.optimizers import Adam

model = ZF_UNET_224(weights='generator')
optim = Adam()
model.compile(optimizer=optim, loss=dice_coef_loss, metrics=[dice_coef])

model.fit(...)

Notes

"ZF_UNET_224" Model based on UNET code from following paper: https://arxiv.org/abs/1505.04597
This model used to get 2nd place in DSTL competition: https://www.kaggle.com/c/dstl-satellite-imagery-feature-detection
For training used DICE coefficient: https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient
Input shape for model is 224x224 (the same as for other popular CNNs like VGG or ResNet)
It has 3 input channels (to process standard RGB (BGR) images). You can change it with variable "INPUT_CHANNELS"
In most cases model ZF_UNET_224 is ok to be used without pretrained weights.
This code should work fine on both Theano and Tensorflow backends. Code prepared for Keras 2.1, if you need code for Keras 1.2 then use this link:

Pretrained weights

Download: Weights for Tensorflow backend ~123 MB (Keras 2.1, Dice coef: 0.998)

Weights were obtained with random image generator (generator code available here: train_infinite_generator.py). See example of images from generator below.

Dice coefficient for pretrained weights: ~0.998. See history of learning below:

Comments

Extended example
Hi, I have created extended example based on your repository: https://github.com/mrgloom/keras-semantic-segmentation-example

It also use random colors for foreground and background (not like lighter and darker like here https://github.com/ZFTurbo/ZF_UNET_224_Pretrained_Model/blob/master/train_infinite_generator.py#L24 ), one idea behind it is that in that case network can learn 'shape of object' not just 'thresholding and separating background and foreground', also looks like using random colors make problem harder and network converges slower.

Also I have experienced some problems:

Netwoks not always converges on second run with fixed params even for this toy problem, looks like it depens on random seed.

Dice loss and jaccard loss are harder to train than binary crossentropy, any ideas why? Network architecture is the same just loss differs, I even tried to load trained weights from binary crossentropy loss network and use them in dice loss network which show high dice coef.
opened by mrgloom 8
Deeper network

I know this is not an issue, but I wanted to contact you to know how did you make the network deeper in keras for the DSTL competition using this model?

opened by nassarofficial 6

Tensorflow problem

When I use tensorflow-1.3.0 as backend, I get this kind of error:

builtins.ValueError: Dimension 2 in both shapes must be equal, but are 3 and 32 for 'Assign' (op: 'Assign') with input shapes: [3,3,3,32], [3,3,32,3].

opened by lawlite19 5

preprocess_batch for real data
Here is preprocessing for the batch (looks like 256 should be 255 ;) ) https://github.com/ZFTurbo/ZF_UNET_224_Pretrained_Model/blob/master/zf_unet_224_model.py#L27

Is it ok for real images to use code like this or it should be calculated for entire dataset?

batch=batch-np.mean(batch) batch=batch/np.std(batch)

Also how crucial is impact of data normalization for U-net? In my tests even on this simple synthetic data network doesn't converges if input is not normalized.
opened by mrgloom 2
Applying pretrained weights to 128*128 size image

You have generated pretrained weights for 224224 input size, but I have 128128. How can we use such weights in this situation, but without padding/upsampling 128*128 images. Sorry for silly question - is it worth trying in kaggle salt competition?

opened by Diyago 1
Attribute Error

Traceback (most recent call last): File "train.py", line 11, in import segmentation_models as sm File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/segmentation_models/init.py", line 98, in set_framework(_framework) File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/segmentation_models/init.py", line 68, in set_framework import efficientnet.keras # init custom objects File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/efficientnet/keras.py", line 17, in init_keras_custom_objects() File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/efficientnet/init.py", line 71, in init_keras_custom_objects keras.utils.generic_utils.get_custom_objects().update(custom_objects) AttributeError: module 'keras.utils' has no attribute 'generic_utils'

when I run the code, I got the result below but don't know why there is no generic_utils attribute in the library since there is in the keras.

opened by melih1996 0
How to run the model for 6 input channels?

Is it possible to run the model for 6 input channels? Three inputs in that are RGB values and the other three are metrics I want to pass on into the architecture for my use case.

opened by ShreyaPandita01 2
dice and jaccard metrics

Thanks for the repo. I am wondering why do you use a smoothing factor of 1.0 in both dice and jaccard coefficients? Where does this value comes from? And what about using another smaller value close to zero, e.g. K.epsilon()

opened by tinalegre 3
model.fit step

Hi! I would like to know how I should perform the model.fit instruction. model.fit(trainSet, mask_trainSet, batch_size=20, nb_epoch=1, verbose=1,validation_split=0.2, shuffle=True, callbacks=[model_checkpoint])¿? What I write in callback??

And how should I use the weights if I wan't to use pretained weights??

Thank you very much and sorry for the inconvenience!

opened by AmericaBG 7
How to generate img and mask correctly

I run your code and then find that the img batch has a shape(16,224,224,3),but mask batch has a shape(16,1,224,224). I don't understand it.Can you explain it to me?I use my dataset to train unet and then the dice coef is high，but the real effect is bad.

opened by wong-way 6

Releases(v1.0)

v1.0(Mar 19, 2018)

Weights for Tensorflow backend ~123 MB (Keras 2.1, Dice coef: 0.998)
Source code(tar.gz)
Source code(zip)
zf_unet_224.h5(120.21 MB)

Owner

GitHub Repository

IGCN : Image-to-graph convolutional network

IGCN : Image-to-graph convolutional network IGCN is a learning framework for 2D/3D deformable model registration and alignment, and shape reconstructi

7 Oct 27, 2022

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

This is a playground for pytorch beginners, which contains predefined models on popular dataset. Currently we support mnist, svhn cifar10, cifar100 st

2.4k Dec 28, 2022

PlenOctrees: NeRF-SH Training & Conversion

PlenOctrees Official Repo: NeRF-SH training and conversion This repository contains code to train NeRF-SH and to extract the PlenOctree, constituting

323 Dec 29, 2022

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Informative-tracking-benchmark Informative tracking benchmark (ITB) higher diversity. It contains 9 representative scenarios and 180 diverse videos. m

15 Nov 26, 2022

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

This repository is no longer maintained. Please use our new Softlearning package instead. Soft Actor-Critic Soft actor-critic is a deep reinforcement

752 Jan 07, 2023

Code for "Single-view robot pose and joint angle estimation via render & compare", CVPR 2021 (Oral).

Single-view robot pose and joint angle estimation via render & compare Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic CVPR: Conference on C

51 Oct 14, 2022

Rendering Point Clouds with Compute Shaders

Compute Shader Based Point Cloud Rendering This repository contains the source code to our techreport: Rendering Point Clouds with Compute Shaders and

460 Jan 05, 2023

Language Used: Python . Made in Jupyter(Anaconda) notebook.

FACE-DETECTION-ATTENDENCE-SYSTEM Made in Jupyter(Anaconda) notebook. Language Used: Python Steps to perform before running the program : Install Anaco

1 Jan 12, 2022

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning This is the official repository for Conservative and Adaptive Penalty fo

7 Nov 22, 2022

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

When2com: Multi-Agent Perception via Communication Graph Grouping This is the PyTorch implementation of our paper: When2com: Multi-Agent Perception vi

34 Nov 09, 2022

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

3k Jan 08, 2023

The code is the training example of AAAI2022 Security AI Challenger Program Phase 8: Data Centric Robot Learning on ML models.

Example code of [Tianchi AAAI2022 Security AI Challenger Program Phase 8]

22 Oct 14, 2022

Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"

StyleAttack Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer" Prepare Pois

19 Nov 20, 2022

Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Related tags

Overview

ZF_UNET_224 Pretrained Model

Requirements

Usage

Notes

Pretrained weights

Comments

Releases(v1.0)

v1.0(Mar 19, 2018)

Owner

IGCN : Image-to-graph convolutional network

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

PlenOctrees: NeRF-SH Training & Conversion

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

Code for "Single-view robot pose and joint angle estimation via render & compare", CVPR 2021 (Oral).

Rendering Point Clouds with Compute Shaders

Language Used: Python . Made in Jupyter(Anaconda) notebook.

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

The code is the training example of AAAI2022 Security AI Challenger Program Phase 8: Data Centric Robot Learning on ML models.

Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"

Download and preprocess popular sequential recommendation datasets

LSTM model trained on a small dataset of 3000 names written in PyTorch

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

A particular navigation route using satellite feed and can help in toll operations & traffic managemen

This repository contains the code used in the paper "Prompt-Based Multi-Modal Image Segmentation".

OOD Dataset Curator and Benchmark for AI-aided Drug Discovery

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.