Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Last update: Apr 12, 2022

Related tags

Deep Learning Cut-Thumbnail

Overview

Cut-Thumbnail (Accepted at ACM MULTIMEDIA 2021)

Tianshu Xie, Xuan Cheng, Xiaomin Wang, Minghui Liu, Jiali Deng, Tao Zhou, Ming Liu

This is the official Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

This implementation is based on these repositories:

Main Requirements

torch == 1.0.1
torchvision == 0.2.0
Python 3

Training Examples

Mixed Single Thumbnail

python train.py -d [datasetlocation] --depth 50 --mode mst --size 112 --lam 0.25 --participation_rate 0.8

Self Thumbnail

python train.py -d [datasetlocation] --depth 50 --mode st --size 112 --lam 0.25 --participation_rate 0.8

Results

ImageNet Results

Model	Accuracy (%)
ResNet50 + CutMix	78.60*
ResNet50 + Cut-Thumbnail (ST)	77.74
ResNet50 + Cut-Thumbnail (MST)	79.21

* denotes results reported in the original papers.

CIFAR-100 Results

Model	Accuracy (%)
WideResNet-28-10 + Cut-Thumbnail (ST)	81.41
WideResNet-28-10 + Cut-Thumbnail (MST)	83.35

CUB-200-2011 Results

Model	Accuracy (%)
ResNet50 + Cut-Thumbnail (ST)	85.72
ResNet50 + Cut-Thumbnail (MST)	86.56
ResNet50 + Cut-Thumbnail (MDT)	86.72

Citation

If you find our paper and this repo useful, please cite as

@inproceedings{xie20cut-thumbnail,
    author = {Xie, Tianshu and Cheng, Xuan and Wang, Xiaomin and Liu, Minghui and Deng, Jiali and Zhou, Tao and Liu, Ming},
    title = {Cut-Thumbnail: A Novel Data Augmentation for Convolutional Neural Network},
    year = {2021},
    isbn = {9781450386517},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3474085.3475302},
    doi = {10.1145/3474085.3475302},
    booktitle = {Proceedings of the 29th ACM International Conference on Multimedia},
    pages = {1627–1635},
    numpages = {9},
    location = {Virtual Event, China},
    series = {MM '21}
}

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Related tags

Overview

Cut-Thumbnail (Accepted at ACM MULTIMEDIA 2021)

Tianshu Xie, Xuan Cheng, Xiaomin Wang, Minghui Liu, Jiali Deng, Tao Zhou, Ming Liu

Main Requirements

Training Examples

Results

Citation

Owner

Linear algebra python - Number of operations and problems in Linear Algebra and Numerical Linear Algebra

[ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, Yingyan Lin

A coin flip game in which you can put the amount of money below or equal to 1000 and then choose heads or tail

The Official PyTorch Implementation of "VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models" (ICLR 2021 spotlight paper)

E2VID_ROS - E2VID_ROS: E2VID to a real-time system

CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator

OpenL3: Open-source deep audio and image embeddings

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

Hand gesture recognition model that can be used as a remote control for a smart tv.

SOTR: Segmenting Objects with Transformers [ICCV 2021]

ALBERT-pytorch-implementation - ALBERT pytorch implementation

Self-Supervised Image Denoising via Iterative Data Refinement

Automatic 2D-to-3D Video Conversion with CNNs

Unified MultiWOZ evaluation scripts for the context-to-response task.

Vehicles Counting using YOLOv4 + DeepSORT + Flask + Ngrok

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

Inhomogeneous Social Recommendation with Hypergraph Convolutional Networks

Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.