[CVPR 2021] 'Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator'

Last update: Dec 01, 2022

Overview

This is the entire codebase for the paper "Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator".

In one-shot NAS, sub-networks must be searched from the supernet to meet different hardware constraints. However, the search cost is high, and N separate searches are needed for N different constraints. In this work, we propose a novel search strategy called architecture generator, which searches sub-networks by generating them, making the search process much more efficient and flexible. With the trained architecture generator, given target hardware constraints as input, N good architectures can be generated for N constraints in a single forward pass, without re-searching or retraining the supernet. Moreover, we propose a novel single-path supernet, called unified supernet, to further improve search efficiency and reduce the GPU memory consumption of the architecture generator. With the architecture generator and the unified supernet, we propose a flexible and efficient one-shot NAS framework called Searching by Generating NAS (SGNAS). The search time of SGNAS for N different hardware constraints is only 5 GPU hours, which is 4N times faster than previous SOTA single-path methods. The top-1 accuracy of SGNAS on ImageNet is 77.1%, which is comparable with the SOTAs.
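The cost claim in the abstract can be checked with simple arithmetic. The sketch below assumes, as the "4N times faster" figure implies, that a previous single-path method costs about 20 GPU hours per constraint (4 x 5). That per-search figure is inferred from the abstract, not stated in it, and the function names are illustrative only.

```python
# Figures taken from the abstract.
SGNAS_GPU_HOURS = 5.0         # fixed search cost, independent of N
SPEEDUP_PER_CONSTRAINT = 4.0  # "4N times faster" for N constraints


def baseline_gpu_hours(n_constraints: int) -> float:
    """Implied total cost of a baseline that re-searches for every constraint."""
    return SPEEDUP_PER_CONSTRAINT * n_constraints * SGNAS_GPU_HOURS


def speedup(n_constraints: int) -> float:
    """Ratio of the implied baseline cost to the SGNAS cost."""
    return baseline_gpu_hours(n_constraints) / SGNAS_GPU_HOURS


for n in (1, 3, 10):
    print(f"N={n}: baseline ~{baseline_gpu_hours(n):.0f} GPU hours, "
          f"SGNAS {SGNAS_GPU_HOURS:.0f} GPU hours, speedup {speedup(n):.0f}x")
```

The SGNAS cost stays flat as N grows, so the gap widens linearly with the number of hardware constraints.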

Model Zoo

Model	FLOPs (M)	Param (M)	Top-1 (%)	Weights
SGNAS-A	373	6.0	77.1	Google drive
SGNAS-B	326	5.5	76.8	Google drive
SGNAS-C	281	4.7	76.2	Google drive
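As a small illustration of how the table might be used, the sketch below picks the most accurate released model that fits a FLOPs budget. The numbers are copied from the table above; the `ZooEntry` type and the `best_under_budget` helper are hypothetical and not part of this codebase.

```python
from typing import NamedTuple, Optional


class ZooEntry(NamedTuple):
    name: str
    flops_m: float   # FLOPs in millions
    params_m: float  # parameters in millions
    top1: float      # ImageNet top-1 accuracy in percent


# Values copied from the Model Zoo table.
MODEL_ZOO = [
    ZooEntry("SGNAS-A", 373, 6.0, 77.1),
    ZooEntry("SGNAS-B", 326, 5.5, 76.8),
    ZooEntry("SGNAS-C", 281, 4.7, 76.2),
]


def best_under_budget(flops_budget_m: float) -> Optional[ZooEntry]:
    """Return the highest-accuracy model within the budget, or None if none fits."""
    candidates = [m for m in MODEL_ZOO if m.flops_m <= flops_budget_m]
    return max(candidates, key=lambda m: m.top1) if candidates else None


print(best_under_budget(330))
```

With a budget of 330 M FLOPs, SGNAS-A is excluded and the helper returns the SGNAS-B entry.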

Requirements

pip3 install -r requirements.txt

[Optional] Convert the ImageNet dataset to LMDB format with utils/folder2lmdb.py
- With the LMDB format, you can speed up the entire training process (30 mins per epoch with 4 GeForce GTX 1080 Ti GPUs)

Getting Started

Search

Training Unified Supernet

For ImageNet training, use the config file ./config_file/imagenet_config.yml. For CIFAR100 training, use the config file ./config_file/config.yml.
Set the hyperparameter warmup_epochs in the config file to specify the number of epochs for training the unified supernet.

python3 search.py --cfg [CONFIG_FILE] --title [EXPERIMENT_TITLE]

Training Architecture Generator

For ImageNet training, use the config file ./config_file/imagenet_config.yml. For CIFAR100 training, use the config file ./config_file/config.yml.
Set the hyperparameter warmup_epochs in the config file so that supernet training is skipped, and set the hyperparameter search_epochs to specify the number of epochs for training the architecture generator.

python3 search.py --cfg [CONFIG_FILE] --title [EXPERIMENT_TITLE]
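Both stages run through the same search.py entry point, and the two hyperparameters decide how the epochs are split. The following is a minimal sketch of that split, not the code in search.py. The function name `phase_schedule` is hypothetical, and the sketch assumes that warmup_epochs supernet epochs run first, followed by search_epochs generator epochs, and that a warmup_epochs value of 0 is what skips supernet training.

```python
from typing import List


def phase_schedule(warmup_epochs: int, search_epochs: int) -> List[str]:
    """List which component is trained in each epoch (assumed ordering)."""
    if warmup_epochs < 0 or search_epochs < 0:
        raise ValueError("epoch counts must be non-negative")
    # Assumption: supernet warm-up comes first, generator training second.
    return ["supernet"] * warmup_epochs + ["generator"] * search_epochs


# Train the supernet for 2 epochs, then the generator for 3.
print(phase_schedule(warmup_epochs=2, search_epochs=3))
# Skip supernet training (assumed: warmup_epochs = 0) and train only the generator.
print(phase_schedule(warmup_epochs=0, search_epochs=3))
```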

Train From Scratch

CIFAR10 or CIFAR100

Set train_portion in ./config_file/config.yml to 1

python3 train_cifar.py --cfg [CONFIG_FILE] --flops [TARGET_FLOPS] --title [EXPERIMENT_TITLE]

ImageNet

Set the target FLOPs and the corresponding config file path in run_example.sh

bash ./run_example.sh

Validate

ImageNet

SGNAS-A

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 365 --se True --activation hswish

SGNAS-B

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 320 --se True --activation hswish

SGNAS-C

python3 validate.py [VAL_PATH] --checkpoint [CHECKPOINT_PATH] --config_path [CONFIG_FILE] --target_flops 275 --se True --activation hswish
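The three commands above differ only in the --target_flops value, so they can be generated from one template. The sketch below is a convenience wrapper under stated assumptions: the `validate_command` helper is hypothetical, and the validation and checkpoint paths are placeholders to replace with your own. The flags and target FLOPs values are the ones shown above.

```python
import shlex

# Target FLOPs per model, as used in the commands above.
TARGET_FLOPS = {"SGNAS-A": 365, "SGNAS-B": 320, "SGNAS-C": 275}


def validate_command(model: str, val_path: str, checkpoint: str, config_path: str) -> str:
    """Build the validate.py command line for one released model."""
    args = [
        "python3", "validate.py", val_path,
        "--checkpoint", checkpoint,
        "--config_path", config_path,
        "--target_flops", str(TARGET_FLOPS[model]),
        "--se", "True",
        "--activation", "hswish",
    ]
    # Quote each argument so paths containing spaces survive the shell.
    return " ".join(shlex.quote(a) for a in args)


for name in TARGET_FLOPS:
    # Placeholder paths: substitute your own dataset and checkpoint locations.
    print(validate_command(
        name,
        val_path="./data/imagenet/val",
        checkpoint=f"./checkpoints/{name}.pth",
        config_path="./config_file/imagenet_config.yml",
    ))
```

The printed lines can be pasted into a shell or written to a script.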

Reference

Citation

@InProceedings{sgnas,
author = {Sian-Yao Huang and Wei-Ta Chu},
title = {Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator},
booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition},
year = {2021}
}
