PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Last update: Mar 16, 2022

Overview

PatchGame: Learning to Signal Mid-level Patches in Referential Games

This repository is the official implementation of the paper - "PatchGame: Learning to SignalMid-level Patches in Referential Games"

Requirements

We recommend using anaconda or miniconda for python. Our code has been tested with python=3.8 on linux.

To create a new environment with conda

conda create -n patchgame python=3.8
conda activate patchgame

We recommend installing the latest pytorch and torchvision packages You can install them using

conda install pytorch torchvision -c pytorch

Make sure the following requirements are met

torch>=1.8.1
torchvision>=0.9.1

Installing `torchsort`

Note we only tried installing torchsort with following cuda==10.2.89 and gcc==6.3.0.

export TORCH_CUDA_ARCH_LIST="Pascal;Volta;Turing"
unzip torchsort.zip && cd torchsort
python setup.py install --user
cd .. && rm -rf torchsort

Dataset

We use ImageNet-1k (ILSVRC2012) data in all our experiments. Please download and save the data from the official website.

Training

To train the model(s) in the paper on 1-8 GPUs, run this command (where nproc_per_node is the number of gpus):

python -m torch.distributed.launch --nproc_per_node=1 train.py \
    --data_path /patch/to/imagenet/dir/train \
    --output_dir /path/to/checkpoint/dir \
    --patch_size 32 --epochs 100

Pre-trained Models

You can download pretrained models here trained on ImageNet using parameters using above command (and default hyperparameters).

Evaluation

PatchRank with ViT

python eval_patchrank.py --patch-model mymodel.pth --data-path <path to dataset> --topk <no. of patches to use>

This achieves the following accuracy on ImageNet.

Model name	Top 1 Accuracy	Top 5 Accuracy
PatchGame(S=32, topk=75, size=384x384)	58.4%	80.9%

k-NN classification ImageNet with listener's vision module

python -m torch.distributed.launch --nproc_per_node=1 eval_knn.py \
    --pretrained_weights /path/to/checkpoint/dir/checkpoint.pth \
    --arch resnet18 --nb_knn 20 \
    --batch_size_per_gpu 1024 --use_cuda 0 \
    --data_path /patch/to/imagenet/dir

This achieves the following accuracy on ImageNet

Model name	Top 1 Accuracy	Top 5 Accuracy
PatchGame(S=32)	30.3%	49.9%

Acknowledgements

We would like to thank several public repos from where we borrowed various utilities

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Related tags

Overview

PatchGame: Learning to Signal Mid-level Patches in Referential Games

Requirements

Installing `torchsort`

Dataset

Training

Pre-trained Models

Evaluation

PatchRank with ViT

k-NN classification ImageNet with listener's vision module

Acknowledgements

License

Owner

Kamal Gupta

PaRT: Parallel Learning for Robust and Transparent AI

Latent Network Models to Account for Noisy, Multiply-Reported Social Network Data

Implementing Graph Convolutional Networks and Information Retrieval Mechanisms using pure Python and NumPy

Reinforcement learning library in JAX.

Computer Vision Script to recognize first person motion, developed as final project for the course "Machine Learning and Deep Learning"

Build fully-functioning computer vision models with PyTorch

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators

Official code for On Path Integration of Grid Cells: Group Representation and Isotropic Scaling (NeurIPS 2021)

Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

Open source Python module for computer vision

DC3: A Learning Method for Optimization with Hard Constraints

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

atmaCup #11 の Public 4th / Pricvate 5th Solution のリポジトリです。

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm

An open source library for face detection in images. The face detection speed can reach 1000FPS.

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales

PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Related tags

Overview

PatchGame: Learning to Signal Mid-level Patches in Referential Games

Requirements

Installing torchsort

Dataset

Training

Pre-trained Models

Evaluation

PatchRank with ViT

k-NN classification ImageNet with listener's vision module

Acknowledgements

License

Owner

Kamal Gupta

PaRT: Parallel Learning for Robust and Transparent AI

Latent Network Models to Account for Noisy, Multiply-Reported Social Network Data

Implementing Graph Convolutional Networks and Information Retrieval Mechanisms using pure Python and NumPy

Reinforcement learning library in JAX.

Computer Vision Script to recognize first person motion, developed as final project for the course "Machine Learning and Deep Learning"

Build fully-functioning computer vision models with PyTorch

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators

Official code for On Path Integration of Grid Cells: Group Representation and Isotropic Scaling (NeurIPS 2021)

Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

Open source Python module for computer vision

DC3: A Learning Method for Optimization with Hard Constraints

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

atmaCup #11 の Public 4th / Pricvate 5th Solution のリポジトリです。

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm

An open source library for face detection in images. The face detection speed can reach 1000FPS.

IRON Kaggle project done while doing IRONHACK Bootcamp where we had to analyze and use a Machine Learning Project to predict future sales

Installing `torchsort`