Official Code for "Non-deep Networks"

Last update: Dec 12, 2022

Related tags

Overview

Non-deep Networks
arXiv:2110.07641
Ankit Goyal, Alexey Bochkovskiy, Jia Deng, Vladlen Koltun

Overview: Depth is the hallmark of DNNs. But more depth means more sequential computation and higher latency. This begs the question -- is it possible to build high-performing ``non-deep" neural networks? We show that it is. We show, for the first time, that a network with a depth of just 12 can achieve top-1 accuracy over 80% on ImageNet, 96% on CIFAR10, and 81% on CIFAR100. We also show that a network with a low-depth (12) backbone can achieve an AP of 48% on MS-COCO.

If you find our work useful, please consider citing it:

@article{goyal2021nondeep,
  title={Non-deep Networks},
  author={Goyal, Ankit and Bochkovskiy, Alexey and Deng, Jia and Koltun, Vladlen},
  journal={arXiv:2110.07641},
  year={2021}
}

Code Coming Soon!

Comments

when will the code of the model be released?

I am very interested in your research, when will the code of the model be released? I saw on October 23rd that you said it would be released in 4 weeks

opened by Dr-Goopher 6
When will the code be released?

I am very interested in your work and would like to further study. I hope you can release the code as soon as possible in your busy schedule. Thank you！

opened by SenShu96 5
what is the meaning of 'Shuffle' of fusion block in Fig. A1?

Hello. Thank you for your great study. I wonder the meaning of 'Shuffle' of fusion block in Fig. A1. Is it pixel shuffle layer? Please let me know the meaning of that.

Thank you.

opened by jhcha08 3
Question about SSE module

Hi. Figure 2b shows that there's one 1x1conv in a branch of SSE, how to match the channel of output by 1x1conv with the channel of input after shortcut? If I set the output channel of 1x1conv the same as input, the channels of the outputs by RepVGG block and SSE will not match.

opened by Tsianmy 2
Really faster than ResNet? I am very confused

Hello, my friend, appreciate for your great work! I have tested the code on https://github.com/Pritam-N/ParNet by Pritam-N and change the ResNet code in my model by using your ParNet , but the actual time is quite slow than the paper said. My block size is [64, 128, 256, 512, 2048], and the time of "forward()" is more than 5s average while the Resnet is 0.02s in my device. I have use the time function for every line in the forward(), find that the encode stuff is the main reason. I continue write time.perf_counter() in the encode stuff, find that the "self.stream2_fusion" and "self.stream3_fusion" is the most time user. Do you know why ?

opened by StonepageVan 1
fusion module, accuracy about cifar100
what is your shuffle code in your fusion module?

what is your model architecture in cifar-100? I just changed front two downsample modules based on the ParNet for Imagenet in the paper. But the accuracy is lower. And How do you set the LR, MILESTONES and NUM_EPOCH to meet high accuracy?
opened by qq769852576 2

Releases(v.0.1.0)

v.0.1.0(Dec 24, 2021)

Preliminary version containing code for the imagenet dataset.
Source code(tar.gz)
Source code(zip)
ft2_init_lr_0.001_cosine_epoch_16_is_320_we_0.0_zero_init_head_2_scale_0.5_1.0_mixup_0.1_reprob_0.6.pth.tar(1067.21 MB)
planes_128_256_512_2048_num_blocks_5_6_6_1_sebv_13.pth.tar(446.48 MB)
planes_160_320_640_2560_num_blocks_5_6_6_1_sebv_13_dropout_lin.pth.tar(689.49 MB)
planes_92_192_384_1280_num_blocks_5_6_6_1_sebv_13.pth.tar(240.13 MB)
reg_se13_cosine_planes_200_400_800_3200_num_blocks_5_6_6_1_sebv_13_dropout_lin.pth.tar(1067.21 MB)
reg_se13_planes_200_400_800_3200_num_blocks_5_6_6_1_sebv_13_dropout_lin.pth.tar(1067.21 MB)
resnet101.pth.tar(511.15 MB)
resnet34.pth.tar(249.76 MB)
resnet50.pth.tar(293.15 MB)

Owner

Ankit Goyal

Phd Candidate @Princeton | Works in CV and AI

GitHub Repository

Transformer in Computer Vision

Transformer-in-Vision A paper list of some recent Transformer-based CV works. If you find some ignored papers, please open issues or pull requests. **

506 Dec 26, 2022

Implementation of Pix2Seq in PyTorch

pix2seq-pytorch Implementation of Pix2Seq paper Different from the paper image input size 1280 bin size 1280 LambdaLR scheduler used instead of Linear

9 Dec 15, 2022

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

Self-Supervised Document Similarity Ranking (SDR) via Contextualized Language Models and Hierarchical Inference This repo is the implementation for SD

36 Nov 28, 2022

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

TSForecasting This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the tim

80 Dec 30, 2022

A project to build an AI voice assistant using Python . The Voice assistant interacts with the humans to perform basic tasks.

AI_Personal_Voice_Assistant_Using_Python A project to build an AI voice assistant using Python . The Voice assistant interacts with the humans to perf

1 Oct 30, 2021

Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method

C++/ROS Source Codes for "Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method" published in IEEE Trans. Intelligent Transportation Systems

88 Dec 23, 2022

A computational optimization project towards the goal of gerrymandering the results of a hypothetical election in the UK.

1 Jan 18, 2022

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral] Learning to Disambiguate Strongly In

40 Dec 22, 2022

Neural Turing Machines (NTM) - PyTorch Implementation

PyTorch Neural Turing Machine (NTM) PyTorch implementation of Neural Turing Machines (NTM). An NTM is a memory augumented neural network (attached to

519 Dec 21, 2022

Spatial Single-Cell Analysis Toolkit

Single-Cell Image Analysis Package Scimap is a scalable toolkit for analyzing spatial molecular data. The underlying framework is generalizable to spa

30 Nov 08, 2022

Scheme for training and applying a label propagation framework

Factorisation-based Image Labelling Overview This is a scheme for training and applying the factorisation-based image labelling (FIL) framework. Some

2 Dec 17, 2021

A PyTorch-based library for semi-supervised learning

News If you want to join TorchSSL team, please e-mail Yidong Wang ([email protected]<

1k Jan 06, 2023

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021

Deep Representation One-class Classification (DROC). This is not an officially supported Google product. Tensorflow 2 implementation of the paper: Lea

137 Dec 23, 2022

Official Code for "Non-deep Networks"

Related tags

Overview

Code Coming Soon!

Comments

when will the code of the model be released?

When will the code be released?

what is the meaning of 'Shuffle' of fusion block in Fig. A1?

Question about SSE module

Really faster than ResNet? I am very confused

fusion module, accuracy about cifar100

Releases(v.0.1.0)

v.0.1.0(Dec 24, 2021)

Owner

Ankit Goyal

Transformer in Computer Vision

Implementation of Pix2Seq in PyTorch

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

A project to build an AI voice assistant using Python . The Voice assistant interacts with the humans to perform basic tasks.

Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method

A computational optimization project towards the goal of gerrymandering the results of a hypothetical election in the UK.

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Neural Turing Machines (NTM) - PyTorch Implementation

Spatial Single-Cell Analysis Toolkit

Scheme for training and applying a label propagation framework

A PyTorch-based library for semi-supervised learning

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

RIM: Reliable Influence-based Active Learning on Graphs.

yolov5 deepsort 行人车辆跟踪检测计数

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Age Progression/Regression by Conditional Adversarial Autoencoder

BEGAN in PyTorch

Inferred Model-based Fuzzer

Official Code for "Non-deep Networks"

Related tags

Overview

Code Coming Soon!

Comments

when will the code of the model be released?

When will the code be released?

what is the meaning of 'Shuffle' of fusion block in Fig. A1?

Question about SSE module

Really faster than ResNet? I am very confused

fusion module, accuracy about cifar100

Releases(v.0.1.0)

v.0.1.0(Dec 24, 2021)

Owner

Ankit Goyal

Transformer in Computer Vision

Implementation of Pix2Seq in PyTorch

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

A project to build an AI voice assistant using Python . The Voice assistant interacts with the humans to perform basic tasks.

Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method

A computational optimization project towards the goal of gerrymandering the results of a hypothetical election in the UK.

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Neural Turing Machines (NTM) - PyTorch Implementation

Spatial Single-Cell Analysis Toolkit

Scheme for training and applying a label propagation framework

A PyTorch-based library for semi-supervised learning

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

RIM: Reliable Influence-based Active Learning on Graphs.

yolov5 deepsort 行人 车辆 跟踪 检测 计数

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Age Progression/Regression by Conditional Adversarial Autoencoder

BEGAN in PyTorch

Inferred Model-based Fuzzer

yolov5 deepsort 行人车辆跟踪检测计数