A playable implementation of Fully Convolutional Networks with Keras.

Last update: Sep 07, 2022

Overview

keras-fcn

A re-implementation of Fully Convolutional Networks with Keras

Installation

Dependencies

Install with `pip`

$ pip install git+https://github.com/JihongJu/keras-fcn.git

Build from source

$ git clone https://github.com/JihongJu/keras-fcn.git
$ cd keras-fcn
$ pip install --editable .

Usage

FCN with VGG16

from keras_fcn import FCN
fcn_vgg16 = FCN(input_shape=(500, 500, 3), classes=21,  
                weights='imagenet', trainable_encoder=True)
fcn_vgg16.compile(optimizer='rmsprop',
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])
fcn_vgg16.fit(X_train, y_train, batch_size=1)

FCN with VGG19

from keras_fcn import FCN
fcn_vgg19 = FCN_VGG19(input_shape=(500, 500, 3), classes=21,  
                      weights='imagenet', trainable_encoder=True)
fcn_vgg19.compile(optimizer='rmsprop',
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])
fcn_vgg19.fit(X_train, y_train, batch_size=1)

Custom FCN (VGG16 as an example)

from keras.layers import Input
from keras.models import Model
from keras_fcn.encoders import Encoder
from keras_fcn.decoders import VGGUpsampler
from keras_fcn.blocks import (vgg_conv, vgg_fc)
inputs = Input(shape=(224, 224, 3))
blocks = [vgg_conv(64, 2, 'block1'),
          vgg_conv(128, 2, 'block2'),
          vgg_conv(256, 3, 'block3'),
          vgg_conv(512, 3, 'block4'),
          vgg_conv(512, 3, 'block5'),
          vgg_fc(4096)]
encoder = Encoder(inputs, blocks, weights='imagenet',
                  trainable=True)
feat_pyramid = encoder.outputs   # A feature pyramid with 5 scales
feat_pyramid = feat_pyramid[:3]  # Select only the top three scale of the pyramid
feat_pyramid.append(inputs)      # Add image to the bottom of the pyramid


outputs = VGGUpsampler(feat_pyramid, scales=[1, 1e-2, 1e-4], classes=21)
outputs = Activation('softmax')(outputs)

fcn_custom = Model(inputs=inputs, outputs=outputs)

And implement a custom Fully Convolutional Network becomes simply define a series of convolutional blocks that one stacks on top of another.

Custom decoders

from keras_fcn.blocks import vgg_upsampling
from keras_fcn.decoders import Decoder
decode_blocks = [
vgg_upsampling(classes=21, target_shape=(None, 14, 14, None), scale=1),            
vgg_upsampling(classes=21, target_shape=(None, 28, 28, None),  scale=0.01),
vgg_upsampling(classes=21, target_shape=(None, 224, 224, None),  scale=0.0001)
]
outputs = Decoder(feat_pyramid[-1], decode_blocks)

The decode_blocks can be customized as well.

from keras_fcn.layers import BilinearUpSampling2D

def vgg_upsampling(classes, target_shape=None, scale=1, block_name='featx'):
    """A VGG convolutional block with bilinear upsampling for decoding.

    :param classes: Integer, number of classes
    :param scale: Float, scale factor to the input feature, varing from 0 to 1
    :param target_shape: 4D Tuples with targe_height, target_width as
    the 2nd, 3rd elements if `channels_last` or as the 3rd, 4th elements if
    `channels_first`.

    >>> from keras_fcn.blocks import vgg_upsampling
    >>> feat1, feat2, feat3 = feat_pyramid[:3]
    >>> y = vgg_upsampling(classes=21, target_shape=(None, 14, 14, None),
    >>>                    scale=1, block_name='feat1')(feat1, None)
    >>> y = vgg_upsampling(classes=21, target_shape=(None, 28, 28, None),
    >>>                    scale=1e-2, block_name='feat2')(feat2, y)
    >>> y = vgg_upsampling(classes=21, target_shape=(None, 224, 224, None),
    >>>                    scale=1e-4, block_name='feat3')(feat3, y)

    """
    def f(x, y):
        score = Conv2D(filters=classes, kernel_size=(1, 1),
                       activation='linear',
                       padding='valid',
                       kernel_initializer='he_normal',
                       name='score_{}'.format(block_name))(x)
        if y is not None:
            def scaling(xx, ss=1):
                return xx * ss
            scaled = Lambda(scaling, arguments={'ss': scale},
                            name='scale_{}'.format(block_name))(score)
            score = add([y, scaled])
        upscore = BilinearUpSampling2D(
            target_shape=target_shape,
            name='upscore_{}'.format(block_name))(score)
        return upscore
    return f

Try Examples

Download VOC2011 dataset

$ wget "http://host.robots.ox.ac.uk/pascal/VOC/voc2011/VOCtrainval_25-May-2011.tar"
$ tar -xvzf VOCtrainval_25-May-2011.tar
$ mkdir ~/Datasets
$ mv TrainVal/VOCdevkit/VOC2011 ~/Datasets

Mount dataset from host to container and start bash in container image

From repository keras-fcn

$ nvidia-docker run -it --rm -v `pwd`:/root/workspace -v ${Home}/Datasets/:/root/workspace/data jihong/keras-gpu bash

or equivalently,

$ make bash

Within the container, run the following codes.

$ cd ~/workspace
$ pip setup.py -e .
$ cd voc2011
$ python train.py

More details see source code of the example in Training Pascal VOC2011 Segmention

Model Architecture

FCN8s with VGG16 as base net:

TODO

Add ResNet

A playable implementation of Fully Convolutional Networks with Keras.

Related tags

Overview

keras-fcn

Installation

Dependencies

Install with `pip`

Build from source

Usage

FCN with VGG16

FCN with VGG19

Custom FCN (VGG16 as an example)

Custom decoders

Try Examples

Model Architecture

TODO

Owner

JihongJu

Source code of our BMVC 2021 paper: AniFormer: Data-driven 3D Animation with Transformer

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch

Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

1st place solution in CCF BDCI 2021 ULSEG challenge

Deep Networks with Recurrent Layer Aggregation

Repo for the Video Person Clustering dataset, and code for the associated paper

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Bringing Computer Vision and Flutter together , to build an awesome app !!

Scikit-learn compatible estimation of general graphical models

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Code for paper: Towards Tokenized Human Dynamics Representation

Code for the paper: Fighting Fake News: Image Splice Detection via Learned Self-Consistency

Cosine Annealing With Warmup

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。

LERP : Label-dependent and event-guided interpretable disease risk prediction using EHRs

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

[ICCV 2021] HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

A playable implementation of Fully Convolutional Networks with Keras.

Related tags

Overview

keras-fcn

Installation

Dependencies

Install with pip

Build from source

Usage

FCN with VGG16

FCN with VGG19

Custom FCN (VGG16 as an example)

Custom decoders

Try Examples

Model Architecture

TODO

Owner

JihongJu

Source code of our BMVC 2021 paper: AniFormer: Data-driven 3D Animation with Transformer

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"

NuPIC Studio is an all­-in-­one tool that allows users create a HTM neural network from scratch

Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

1st place solution in CCF BDCI 2021 ULSEG challenge

Deep Networks with Recurrent Layer Aggregation

Repo for the Video Person Clustering dataset, and code for the associated paper

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Bringing Computer Vision and Flutter together , to build an awesome app !!

Scikit-learn compatible estimation of general graphical models

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Code for paper: Towards Tokenized Human Dynamics Representation

Code for the paper: Fighting Fake News: Image Splice Detection via Learned Self-Consistency

Cosine Annealing With Warmup

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。

LERP : Label-dependent and event-guided interpretable disease risk prediction using EHRs

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

[ICCV 2021] HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

Install with `pip`

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch