Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Last update: May 26, 2022

Related tags

Overview

Deep-RTC [project page]

This repository contains the source code accompanying our ECCV 2020 paper.

Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier
Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos

@inproceedings{Wu20DeepRTC,
	title={Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier},
	author={Tz-Ying Wu and Pedro Morgado and Pei Wang and Chih-Hui Ho and Nuno Vasconcelos},
	booktitle={European Conference on Computer Vision (ECCV)},
	year={2020}
}

Dependencies

Python (3.5.6)
PyTorch (1.2.0)
torchvision (0.4.0)
NumPy (1.15.2)
Pillow (5.2.0)
PyYaml (5.1.2)
tensorboardX (1.8)

Data preparation

CIFAR100 [Raw images] [Long-tail version]
AWA2 [Raw images]
ImageNet [Raw images] [Long-tail version]
iNaturalist [Raw images]

These datasets can be downloaded from the above links. Please organize the images in the hierarchical folders that represent the dataset hierarchy, and put the root folder under prepro/raw. For example,

prepro/raw/imagenet
--abstraction
----bubble
------ILSVRC2012_val_00014026.JPEG
------ILSVRC2012_val_00000697.JPEG
...
--physical_entity
----object
...

While CIFAR100 and iNaturalist have released taxonomies, we built the tree-type taxonomy of AWA2 and ImageNet with WordNet. All the taxonomies are provided in prepro/data/{dataset}/tree.npy, and the data splits are provided in prepro/splits/{dataset}/{split}.json. Please refer to prepro/README.md for more details. After the raw images are managed hierarchically, run

$ ./prepare_data.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. This will automatically generate the data lists for all splits, and build the codeword matrices needed for training Deep-RTC. Note that our codes can be applied to other datasets once they are organized hierarchically.

Training and evaluation

To train and evaluate Deep-RTC, run

$ export PYTHONPATH=${PWD}/prepro:${PYTHONPATH}
$ ./run.sh {dataset}

where {dataset}=awa2/cifar100/imagenet/inaturalist. Our pretrained models can be downloaded here.

Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Related tags

Overview

Deep-RTC [project page]

Dependencies

Data preparation

Training and evaluation

Owner

Gina Wu

Planner_backend - Academic planner application designed for students and counselors.

Graph Representation Learning via Graphical Mutual Information Maximization

TensorFlow implementation of "Attention is all you need (Transformer)"

Generic ecosystem for feature extraction from aerial and satellite imagery

YoloAll is a collection of yolo all versions. you you use YoloAll to test yolov3/yolov5/yolox/yolo_fastest

Built a deep neural network (DNN) that functions as an end-to-end machine translation pipeline

Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

《A-CNN: Annularly Convolutional Neural Networks on Point Clouds》(2019)

Official DGL implementation of "Rethinking High-order Graph Convolutional Networks"

🚀 An end-to-end ML applications using PyTorch, W&B, FastAPI, Docker, Streamlit and Heroku

Neural style in TensorFlow! 🎨

Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'

Cross View SLAM

This is the implementation of the paper "Self-supervised Outdoor Scene Relighting"

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Code and Datasets from the paper "Self-supervised contrastive learning for volcanic unrest detection from InSAR data"

Code for BMVC2021 "MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation"

Some pre-commit hooks for OpenMMLab projects

DeconvNet : Learning Deconvolution Network for Semantic Segmentation

Code for the Convolutional Vision Transformer (ConViT)