[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Last update: Nov 21, 2022

Overview

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021)

[arXiv][Project page >> coming soon]

Sanath Narayan^, Akshita Gupta^, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah

( 🌟 denotes equal contribution)

Installation

The codebase is built on PyTorch 1.1.0 and tested on Ubuntu 16.04 environment (Python3.6, CUDA9.0, cuDNN7.5).

For installing, follow these intructions

conda create -n mlzsl python=3.6
conda activate mlzsl
conda install pytorch=1.1 torchvision=0.3 cudatoolkit=9.0 -c pytorch
pip install matplotlib scikit-image scikit-learn opencv-python yacs joblib natsort h5py tqdm pandas

Install warmup scheduler

cd pytorch-gradual-warmup-lr; python setup.py install; cd ..

Attention Visualization

Results


Our approach on NUS-WIDE Dataset.	Our approach on OpenImages Dataset.

Training and Evaluation

NUS-WIDE

Step 1: Data preparation

Download pre-computed features from here and store them at features folder inside BiAM/datasets/NUS-WIDE directory.
[Optional] You can extract the features on your own by using the original NUS-WIDE dataset from here and run the below script:

python feature_extraction/extract_nus_wide.py

Step 2: Training from scratch

To train and evaluate multi-label zero-shot learning model on full NUS-WIDE dataset, please run:

sh scripts/train_nus.sh

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on NUS-WIDE. You can download the pretrained weights from here and store them at NUS-WIDE folder inside pretrained_weights directory.

sh scripts/evaluate_nus.sh

OPEN-IMAGES

Step 1: Data preparation

Please download the annotations for training, validation, and testing into this folder.
Store the annotations inside BiAM/datasets/OpenImages.
To extract the features for OpenImages-v4 dataset run the below scripts for crawling the images and extracting features of them:

## Crawl the images from web
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `train`: download images into `./image_data/train/`
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `validation`: download images into `./image_data/validation/`
python ./datasets/OpenImages/download_imgs.py  #`data_set` == `test`: download images into `./image_data/test/`

## Run feature extraction codes for all the 3 splits
python feature_extraction/extract_openimages_train.py
python feature_extraction/extract_openimages_test.py
python feature_extraction/extract_openimages_val.py

Step 2: Training from scratch

To train and evaluate multi-label zero-shot learning model on full OpenImages-v4 dataset, please run:

sh scripts/train_openimages.sh
sh scripts/evaluate_openimages.sh

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on OpenImages. You can download the pretrained weights from here and store them at OPENIMAGES folder inside pretrained_weights directory.

sh scripts/evaluate_openimages.sh

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Citation

If you find this repository useful, please consider giving a star ⭐ and citation 🎊 :

@article{narayan2021discriminative,
title={Discriminative Region-based Multi-Label Zero-Shot Learning},
author={Narayan, Sanath and Gupta, Akshita and Khan, Salman and  Khan, Fahad Shahbaz and Shao, Ling and Shah, Mubarak},
journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
publisher = {IEEE},
year={2021}
}

Contact

Should you have any question, please contact 📧 [email protected]

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Related tags

Overview

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021)

Sanath Narayan*, Akshita Gupta*, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah

Installation

Attention Visualization

Results

Training and Evaluation

NUS-WIDE

Step 1: Data preparation

Step 2: Training from scratch

Step 3: Evaluation using pretrained weights

OPEN-IMAGES

Step 1: Data preparation

Step 2: Training from scratch

Step 3: Evaluation using pretrained weights

License

Citation

Contact

Owner

Akshita Gupta

This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Angular & Electron desktop UI framework. Angular components for native looking and behaving macOS desktop UI (Electron/Web)

RIM: Reliable Influence-based Active Learning on Graphs.

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

CTRL-C: Camera calibration TRansformer with Line-Classification

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Code and data (Incidents Dataset) for ECCV 2020 Paper "Detecting natural disasters, damage, and incidents in the wild".

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

code for CVPR paper Zero-shot Instance Segmentation

A curated list of awesome resources combining Transformers with Neural Architecture Search

AI-generated-characters for Learning and Wellbeing

a baseline to practice

An implementation of Deep Graph Infomax (DGI) in PyTorch

Official implementation of GraphMask as presented in our paper Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking.

Sanath Narayan^, Akshita Gupta^, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah