A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).

Last update: Sep 26, 2022

Related tags

Overview

LegoNet

This code is the implementation of ICML2019 paper LegoNet: Efficient Convolutional Neural Networks with Lego Filters

Run

python train.py

You could achieve an VGG16 with 93.88% accuracy on CIFAR10 dataset, the lego filters occupy ~3.8M parameters.

LegoConv2d

self.lego = nn.Parameter(nn.init.kaiming_normal_(torch.rand(self.n_lego, self.basic_channels, self.kernel_size, self.kernel_size)))
self.aux_coefficients = nn.Parameter(init.kaiming_normal_(torch.rand(self.n_split, self.out_channels, self.n_lego, 1, 1)))
self.aux_combination = nn.Parameter(init.kaiming_normal_(torch.rand(self.n_split, self.out_channels, self.n_lego, 1, 1)))

lego: Lego Filters

aux_coefficients: combination coefficients used during combination

aux_combination: combination index

Note

The aux_coefficients and aux_combination should be saved as sparse matrix for saving memory. This code does not include this part.

Citation

@inproceedings{legonet,
	title={LegoNet: Efficient Convolutional Neural Networks with Lego Filters},
	author={Yang, Zhaohui and Wang, Yunhe and Liu, Chuanjian and Chen, Hanting and Xu, Chunjing and Shi, Boxin and Xu, Chao and Xu, Chang},
	booktitle={International Conference on Machine Learning},
	pages={7005--7014},
	year={2019}
}

A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).

Related tags

Overview

LegoNet

Run

LegoConv2d

Note

Citation

Owner

YangZhaohui

Libraries, tools and tasks created and used at DeepMind Robotics.

AdamW optimizer for bfloat16 models in pytorch.

This is a work in progress reimplementation of Instant Neural Graphics Primitives

Code for our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

Pun Detection and Location

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

optimization routines for hyperparameter tuning

Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)

Atomistic Line Graph Neural Network

Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of Alphafold2, as a standalone Pytorch module

Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.

Checking fibonacci - Generating the Fibonacci sequence is a classic recursive problem

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet.

Pytorch implementation of face attention network

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with Concept Drift

Implementation for our AAAI2021 paper (Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction).

Deep generative modeling for time-stamped heterogeneous data, enabling high-fidelity models for a large variety of spatio-temporal domains.