Semi-Supervised Learning with Ladder Networks in Keras. Get 98% test accuracy on MNIST with just 100 labeled examples!

Overview

Semi-Supervised Learning with Ladder Networks in Keras

This is an implementation of the Ladder Network in Keras. The Ladder Network is a model for semi-supervised learning. Refer to the paper Semi-Supervised Learning with Ladder Networks by A. Rasmus, H. Valpola, M. Honkala, M. Berglund, and T. Raiko.

This implementation was used in the official code of our paper Unsupervised Clustering using Pseudo-semi-supervised Learning. The code can be found here and the blog post can be found here.

The model achieves 98% test accuracy on MNIST with just 100 labeled examples.

The code only works with the TensorFlow backend.
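If your Keras installation is configured for a different backend, it can be switched to TensorFlow before Keras is first imported, e.g. via the KERAS_BACKEND environment variable (a minimal sketch):

import os
os.environ['KERAS_BACKEND'] = 'tensorflow'  # must be set before the first `import keras`
import keras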

Requirements

  • Python 2.7+/3.6+
  • TensorFlow (1.4.0)
  • NumPy
  • Keras (2.1.4)

Note that other versions of TensorFlow/Keras should also work.
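To check which versions are installed in your environment (a quick, optional sanity check):

import tensorflow, keras, numpy
print(tensorflow.__version__, keras.__version__, numpy.__version__)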

How to use

Load the dataset

from keras.datasets import mnist
import keras
import random
import numpy as np  # needed below to repeat the labeled set

# get the dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()

x_train = x_train.reshape(60000, 28*28).astype('float32')/255.0
x_test = x_test.reshape(10000, 28*28).astype('float32')/255.0

y_train = keras.utils.to_categorical( y_train )
y_test = keras.utils.to_categorical( y_test )

# only select 100 training samples 
idxs_annot = list(range(x_train.shape[0]))  # list() so it can be shuffled in Python 3
random.seed(0)
random.shuffle( idxs_annot )
idxs_annot = idxs_annot[ :100 ]

x_train_unlabeled = x_train
x_train_labeled = x_train[ idxs_annot ]
y_train_labeled = y_train[ idxs_annot  ]

Repeat the labeled dataset so its size matches the unlabeled dataset

n_rep = x_train_unlabeled.shape[0] // x_train_labeled.shape[0]  # integer division
x_train_labeled_rep = np.concatenate([x_train_labeled]*n_rep)
y_train_labeled_rep = np.concatenate([y_train_labeled]*n_rep)

Initialize the model

from ladder_net import get_ladder_network_fc
inp_size = 28*28 # size of mnist dataset 
n_classes = 10
model = get_ladder_network_fc( layer_sizes = [ inp_size , 1000, 500, 250, 250, 250, n_classes ]  )
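The model takes two inputs of the same length, a labeled batch and an unlabeled batch, which is why the labeled set was repeated above. Assuming get_ladder_network_fc returns a standard compiled Keras model, its structure can be inspected with:

model.summary()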

Train the model

model.fit([ x_train_labeled_rep , x_train_unlabeled   ] , y_train_labeled_rep , epochs=100)
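Optionally, the test accuracy can be tracked after every epoch with a small Keras callback. This is only a sketch; it assumes model.test_model is the supervised branch used for prediction (see the evaluation step below):

from keras.callbacks import Callback
from sklearn.metrics import accuracy_score

class TestAccuracy(Callback):
    # evaluate the supervised branch on the held-out test set after each epoch
    def on_epoch_end(self, epoch, logs=None):
        y_pr = model.test_model.predict(x_test, batch_size=100)
        print('epoch', epoch, 'test accuracy',
              accuracy_score(y_test.argmax(-1), y_pr.argmax(-1)))

model.fit([x_train_labeled_rep, x_train_unlabeled], y_train_labeled_rep,
          epochs=100, callbacks=[TestAccuracy()])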

Get the test accuracy

from sklearn.metrics import accuracy_score

y_test_pr = model.test_model.predict(x_test, batch_size=100)
print("test accuracy:", accuracy_score(y_test.argmax(-1), y_test_pr.argmax(-1)))