Code implementation of "Sparsity Probe: Analysis tool for Deep Learning Models"

Last update: Jun 09, 2021

Related tags

Overview

Sparsity Probe: Analysis tool for Deep Learning Models

This repository is a limited implementation of Sparsity Probe: Analysis tool for Deep Learning Models by I. Ben-Shaul and S. Dekel (2021).

Downloading the Repo

git clone https://github.com/idobenshaul10/SparsityProbe.git
pip install -r requirements.txt

Requirements

torch==1.7.0
umap_learn==0.4.6
matplotlib==3.3.2
tqdm==4.49.0
seaborn==0.11.0
torchvision==0.8.1
numpy==1.19.2
scikit_learn==0.24.2
umap==0.1.1

Usage

The first step of using this Repo should be to look at this example: CIFAR10 Example. In this example, we demonstrate running the Sparsity-Probe on a trained Resnet18 on the CIFAR10 dataset, at selected layers.

Creating a new enviorment:

Create a new environment in the environments directory, inheriting from BaseEnviorment. This enviorment should include the train and test datasets(including the matching transforms), the model layers we want to test the alpha-scores on(see cifar10_env example), and the trained model.

Training a model:

It is possible to train a basic model with the train.py script, which uses an environment to load the model and the datasets. Example Usage: python train/train_mnist.py --output_path "results" --batch_size 32 --epochs 100

Running the Sparsity Probe

Done using the DL_smoothness.py script. Arguments:
trees - Number of trees in the forest.
depth - Maximum depth of each tree.
batch_size - batch used in the forward pass(when computing the layer outputs)
env_name - enviorment which is loaded to measure alpha-scores on
epsilon_1 - the epsilon_low used for the numerical approximation. By default, epsilon_high is inited as 4*epsilon_low
only_umap - only create umaps of the intermediate layers(without computing alpha-scores)
use_clustering - run KMeans on intermediate layers
calc_test - calculate test accuracy(More metrics coming soon)
output_folder - location where all outputs are saved
feature_dimension - to reduce computation costs, we compute the alpha-scores on the features after a dimensionality reduction technique has been applied. As of now, if the dim(layer_outputs)>feature_dimension, the TruncatedSVD is used to reduce dim(layer_outputs) to feature_dimension. Default feature_dimension is 2500.

Plotting Results

Result plots can be created using this script.

Acknowledgements

Our pretrained CIFAR10 Resnet18 network used in the example is taken from This Repo.

License

This repository is MIT licensed, as found in the LICENSE file.

Code implementation of "Sparsity Probe: Analysis tool for Deep Learning Models"

Related tags

Overview

Sparsity Probe: Analysis tool for Deep Learning Models

Downloading the Repo

Requirements

Usage

Creating a new enviorment:

Training a model:

Running the Sparsity Probe

Plotting Results

Acknowledgements

License

Owner

This is RFA-Toolbox, a simple and easy-to-use library that allows you to optimize your neural network architectures using receptive field analysis (RFA) and create graph visualizations of your architecture.

A Novel Plug-in Module for Fine-grained Visual Classification

An expansion for RDKit to read all types of files in one line

Python Library for Signal/Image Data Analysis with Transport Methods

Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".

Minecraft agent to farm resources using reinforcement learning

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

PyTorch Lightning + Hydra. A feature-rich template for rapid, scalable and reproducible ML experimentation with best practices. ⚡🔥⚡

HiddenMarkovModel implements hidden Markov models with Gaussian mixtures as distributions on top of TensorFlow

Semi-supervised Implicit Scene Completion from Sparse LiDAR

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

A framework for joint super-resolution and image synthesis, without requiring real training data

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

Just Randoms Cats with python

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

High performance distributed framework for training deep learning recommendation models based on PyTorch.

Implementation of Online Label Smoothing in PyTorch

Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut