MobileFormer

An implementation of MobileFormer proposed by Yinpeng Chen, Xiyang Dai et al.

Including

[1] Mobile-Former proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Mobile-Former: Bridging MobileNet and Transformer. 
                        arxiv.org/abs/2108.05895
[2] Dynamtic ReLU proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Dynamtic ReLU. 
                        arxiv.org/abs/2003.10027v2
[3] Lite-BottleNeck proposed in: 
                        Yunsheng Li, Yinpeng Chen et al., MicroNet: Improving Image Recognition with Extremely Low FLOPs. 
                        arxiv.org/abs/2108.05894v1
[4] Adam-W proposed in:
                        Ilya Loshchilov & Frank Hutter, Decoupled Weight Decay Regularization.
                        arxiv.org/abs/1711.05101v3
[5] Mixup proposed in:
                        Hongyi Zhang, Moustapha Cisse et al., Mixup: Beyond Empircal Risk Minimization.
                        arxiv.org/abs/1710.09412
[6] Multi-FocalLoss (not used), focal loss is proposed in:
                        Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár, Focal Loss for Dense Object Detection.
                        arxiv.org/abs/1708.02002

Note

(1) Due to the expanded DW conv used in strided Mobile-Former blocks, 
    the out_channel should be divisible by expand_size of the next block.
(2) Adam-W and Mixup is embedded in train.py.
(3) Use run() in train.py to train('run') or search('search'). There is an example in the train.py.

'###### The '#'s #######'

'##### are aligned #####'

No pre-train parameters for now.

An implementation of MobileFormer

Related tags

Overview

MobileFormer

Including

Note

'###### The '#'s #######'

'##### are aligned #####'

Owner

slwang9353

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

[CVPR 2021] Region-aware Adaptive Instance Normalization for Image Harmonization

Prefix-Tuning: Optimizing Continuous Prompts for Generation

[NeurIPS'21] Projected GANs Converge Faster

Fake videos detection by tracing the source using video hashing retrieval.

Learning to Estimate Hidden Motions with Global Motion Aggregation

Contains supplementary materials for reproduce results in HMC divergence time estimation manuscript

Multiview Dataset Toolkit

Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

[CVPR2021] De-rendering the World's Revolutionary Artefacts

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

An implementation of the BADGE batch active learning algorithm.

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......

Waymo motion prediction challenge 2021: 3rd place solution

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite and .pb from .tflite.

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution