vit for few-shot classification

Last update: Nov 30, 2022

Related tags

Deep Learning few-shot-vit

Overview

Few-Shot ViT

Requirements

PyTorch (>= 1.9)
TorchVision
timm (latest)
einops
tqdm
numpy
scikit-learn
scipy
argparse
tensorboardx

Pretrained Checkpoints

Currently we provide SUN-M (Visformer) trained on miniImageNet (5-way 1-shot and 5-way 5-shot), see Google Drive for details.

More pretrained checkpoints coming soon.

Evaluate the Pretrained Checkpoints

Prepare data

For example, miniImageNet:

cd test_phase

Download miniImageNet dataset from miniImageNet (courtesy of Spyros Gidaris)

unzip the package to materials/mini-imagenet, then obtain materials/mini-imagenet with pickle files.

Prapare pretrained checkpoints

Download corresponding checkpoints from Google Drive and store the checkpoints in test_phase/ directory.

Evaluation

cd test_phase
python test_few_shot.py --config configs/test_1_shot.yaml --shot 1 --gpu 1 # for 1-shot
python test_few_shot.py --config configs/test_5_shot.yaml --shot 5 --gpu 1 # for 5-shot

For 1-shot, you can obtain: test epoch 1: acc=67.80 +- 0.45 (%)

For 5-shot, you can obtain: test epoch 1: acc=83.25 +- 0.28 (%)

Test accuracy may slightly vary with different pytorch/cuda versions or different hardwares

TODO

more checkpoints
training code

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

So-ViT: Mind Visual Tokens for Vision Transformer Introduction This repository contains the source code under PyTorch framework and models trai

44 Nov 24, 2022

A PyTorch Implementation of ViT (Vision Transformer)

ViT - Vision Transformer This is an implementation of ViT - Vision Transformer by Google Research Team through the paper "An Image is Worth 16x16 Word

7 May 11, 2022

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

MoCo v3 for Self-supervised ResNet and ViT Introduction This is a PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT. The original M

887 Jan 8, 2023

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer This repository contains the PyTorch code for Evo-ViT. This work proposes a slow-fas

53 Dec 5, 2022

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

75 Dec 2, 2022

A simple approach to emable dense segmentation with ViT.

Comments

timm version

hello, I met a question when run your code as follow? Traceback (most recent call last): File "train_classifier.py", line 296, in <module> main(config) File "train_classifier.py", line 133, in main lr_scheduler = CosineLRScheduler(optimizer, warmup_lr_init=float(config['optimizer_args']['warmup_lr']), t_initial=config['max_epoch'], cycle_decay=0.1, warmup_t=int(config['optimizer_args']['warmup'])) TypeError: __init__() got an unexpected keyword argument 'cycle_decay' I think it's the version of timm package is not right, and the requirement in your code just say that is the latest version. can your provide the version of timm package??

opened by JIAOJIAYUASD 2
The variant of visformer

Hi Bowen

Thanks for opensource the inference code. I am just curious which variant of the visformer achieves the best results in Table 5 on mini-ImageNet? Is it visformer_80_small?

opened by RongKaiWeskerMA 1

vit for few-shot classification

Related tags

Overview

Few-Shot ViT

Requirements

Pretrained Checkpoints

Evaluate the Pretrained Checkpoints

Prepare data

Prapare pretrained checkpoints

Evaluation

TODO

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

A PyTorch Implementation of ViT (Vision Transformer)

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

A simple approach to emable dense segmentation with ViT.

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

A simple program for training and testing vit

Implementing Vision Transformer (ViT) in PyTorch

Comments

timm version

The variant of visformer

Releases(SUN)

SUN(Jun 5, 2022)

Owner

Martin Dong

Multi-Glimpse Network With Python

Sync2Gen Code for ICCV 2021 paper: Scene Synthesis via Uncertainty-Driven Attribute Synchronization

A framework for joint super-resolution and image synthesis, without requiring real training data

Offical implementation for "Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation".

CSKG is a commonsense knowledge graph that combines seven popular sources into a consolidated representation

Parasite: a tool allowing you to compress and decompress files, to reduce their size

Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"

CS550 Machine Learning course project on CNN Detection.

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

A simple program for training and testing vit

Feature extraction made simple with torchextractor

Code of TIP2021 Paper《SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition》. We provide both MxNet and Pytorch versions.

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

Pytorch implementation of the paper "Topic Modeling Revisited: A Document Graph-based Neural Network Perspective"

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

Second-Order Neural ODE Optimizer, NeurIPS 2021 spotlight

iris - Open Source Photos Platform Powered by PyTorch

Quick program made to generate alpha and delta tables for Hidden Markov Models

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

Self Governing Neural Networks (SGNN): the Projection Layer