Official Implementation of Few-shot Visual Relationship Co-localization

Last update: Oct 13, 2022

Related tags

Deep Learning VRC

Overview

VRC

Official implementation of the Few-shot Visual Relationship Co-localization (ICCV 2021) paper

project page | paper

Requirements

Use python >= 3.8.5. Conda recommended : https://docs.anaconda.com/anaconda/install/linux/
Use pytorch 1.7.0 CUDA 10.2
Other requirements from 'requirements.txt'

To setup environment

# create new env vrc
$ conda create -n vrc python=3.8.5

# activate vrc
$ conda activate vrc

# install pytorch, torchvision
$ conda install pytorch==1.7.0 torchvision==0.8.0 cudatoolkit=10.2 -c pytorch

# install other dependencies
$ pip install -r requirements.txt

Training

Preparing dataset

Download VG images from https://visualgenome.org/
Extract faster_rcnn features of VG images using data_preparation/vrc_extract_frcnn_feats.py. Please follow instructions here.
Download VrR-VG dataset from http://vrr-vg.com/ or Google Drive Link

Training VR Encoder (VTransE)

Training parameters

To check and update training, model and dataset parameters see VR_Encoder/configs

To train VR Encoder:

$ python train_vr_encoder.py

Training VR Similarity Network (Relation Network)

Training parameters

To check and update training, testing, model and dataset parameters see VR_SimilarityNetwork/configs

To train VR Similarity Network:

$ python SimilarityNetworkTrain.py

To train VR Similarity Network (w/ concat as VR Encoding):

$ python ConcatplusSimilarityNetworkTrain.py

To evaluate (set eval setting in test_config.yaml)

$ python FullModelTest.py

Cite

If you find this code/paper useful for your research, please consider citing.

@InProceedings{teotiaMMM2021,
  author    = "Teotia, Revant and Mishra, Vaibhav and Maheshwari, Mayank and Mishra, Anand",
  title     = "Few-shot Visual Relationship Co-Localization",
  booktitle = "ICCV",
  year      = "2021",
}

Acknowledgements

This repo uses https://gitlab.com/meetshah1995/vqa-maskrcnn-benchmark and scripts from https://github.com/facebookresearch/mmf for Faster R-CNN feature extraction.

Code provided by https://github.com/zawlin/cvpr17_vtranse and https://github.com/yangxuntu/vrd helped in implementing VR encoder.

Contact

For any clarification, comment, or suggestion please create an issue or contact Revant, Vaibhav or Mayank.

Official Implementation of Few-shot Visual Relationship Co-localization

Related tags

Overview

VRC

Requirements

Training

Preparing dataset

Training VR Encoder (VTransE)

Training parameters

To train VR Encoder:

Training VR Similarity Network (Relation Network)

Training parameters

To train VR Similarity Network:

To train VR Similarity Network (w/ concat as VR Encoding):

To evaluate (set eval setting in test_config.yaml)

Cite

Acknowledgements

Contact

Owner

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

Riemann Noise Injection With PyTorch

Deep Learning with PyTorch made easy 🚀 !

MARE - Multi-Attribute Relation Extraction

Cervix ROI Segmentation Using U-NET

🗺 General purpose U-Network implemented in Keras for image segmentation

Source code for paper "Deep Diffusion Models for Robust Channel Estimation", TBA.

Notebooks em Python para Métodos Eletromagnéticos

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Automatic caption evaluation metric based on typicality analysis.

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks

[ICCV'21] Official implementation for the paper Social NCE: Contrastive Learning of Socially-aware Motion Representations

SEC'21: Sparse Bitmap Compression for Memory-Efficient Training onthe Edge

BankNote-Net: Open dataset and encoder model for assistive currency recognition

Semi-supervised Implicit Scene Completion from Sparse LiDAR

Mesh TensorFlow: Model Parallelism Made Easier

AI that generate music

Personalized Federated Learning using Pytorch (pFedMe)