General Multi-label Image Classification with Transformers

Last update: Dec 21, 2022

Overview

General Multi-label Image Classification with Transformers
Jack Lanchantin, Tianlu Wang, Vicente Ordóñez Román, Yanjun Qi
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
[paper] [poster] [slides]

Training and Running C-Tran

Python version 3.7 is required and all major packages used and their versions are listed in requirements.txt.

C-Tran on COCO80 Dataset

Download COCO data (19G)

wget http://cs.virginia.edu/~jjl5sw/data/vision/coco.tar.gz
mkdir -p data/
tar -xvf coco.tar.gz -C data/

Train New Model

python main.py  --batch_size 16  --lr 0.00001 --optim 'adam' --layers 3  --dataset 'coco' --use_lmt --dataroot data/

C-Tran on VOC20 Dataset

Download VOC2007 data (1.7G)

wget http://cs.virginia.edu/~jjl5sw/data/vision/voc.tar.gz
mkdir -p data/
tar -xvf voc.tar.gz -C data/

Train New Model

python main.py  --batch_size 16  --lr 0.00001 --optim 'adam' --layers 3  --dataset 'voc' --use_lmt --grad_ac_step 2 --dataroot data/

Citing

@article{lanchantin2020general,
  title={General Multi-label Image Classification with Transformers},
  author={Lanchantin, Jack and Wang, Tianlu and Ordonez, Vicente and Qi, Yanjun},
  journal={arXiv preprint arXiv:2011.14027},
  year={2020}
}

General Multi-label Image Classification with Transformers

Related tags

Overview

Training and Running C-Tran

C-Tran on COCO80 Dataset

C-Tran on VOC20 Dataset

Citing

Owner

QData

DL course co-developed by YSDA, HSE and Skoltech

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch

GAN example for Keras. Cuz MNIST is too small and there should be something more realistic.

N-Omniglot is a large neuromorphic few-shot learning dataset

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

A benchmark for the task of translation suggestion

[ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"

Prototype for Baby Action Detection and Classification

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Pytorch and Keras Implementations of Hyperspectral Image Classification -- Traditional to Deep Models: A Survey for Future Prospects.

NLP made easy

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Codebase for the Summary Loop paper at ACL2020

最新版本yolov5+deepsort目标检测和追踪，支持5.0版本可训练自己数据集

李云龙二次元风格化!打滚卖萌，使用了animeGANv2进行了视频的风格迁移

Semantically Contrastive Learning for Low-light Image Enhancement