The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Last update: Dec 25, 2022

Related tags

Overview

Equalization Loss for Long-Tailed Object Recognition

Jingru Tan, Changbao Wang, Buyu Li, Quanquan Li, Wanli Ouyang, Changqing Yin, Junjie Yan

⚠️ We recommend to use the EQLv2 repository (code) which is based on mmdetection. It also includes EQL and other algorithms, such as cRT (classifier-retraining), BAGS (BalanceGroup Softmax).

[arXiv] [BibTeX]

In this repository, we release code for Equalization Loss (EQL) in Detectron2. EQL protects the learning for rare categories from being at a disadvantage during the network parameter updating under the long-tailed situation.

Installation

Install Detectron 2 following INSTALL.md. You are ready to go!

LVIS Dataset

Following the instruction of README.md to set up the lvis dataset.

Training

To train a model with 8 GPUs run:

cd /path/to/detectron2/projects/EQL
python train_net.py --config-file configs/eql_mask_rcnn_R_50_FPN_1x.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

cd /path/to/detectron2/projects/EQL
python train_net.py --config-file configs/eql_mask_rcnn_R_50_FPN_1x.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint

Pretrained Models

Instance Segmentation on LVIS

Backbone	Method	AP	AP.r	AP.c	AP.f	AP.bbox	download
R50-FPN	MaskRCNN	21.2	3.2	21.1	28.7	20.8	model \| metrics
R50-FPN	MaskRCNN-EQL	24.0	9.4	25.2	28.4	23.6	model \| metrics
R50-FPN	MaskRCNN-EQL-Resampling	26.1	17.2	27.3	28.2	25.4	model \| metrics
R101-FPN	MaskRCNN	22.8	4.3	22.7	30.2	22.3	model \| metrics
R101-FPN	MaskRCNN-EQL	25.9	10.0	27.9	29.8	25.9	model \| metrics
R101-FPN	MaskRCNN-EQL-Resampling	27.4	17.3	29.0	29.4	27.1	model \| metrics

The AP in this repository is higher than that of the origin paper. Because all those models use:

Scale jitter
Class-specific mask head
Better ImageNet pretrain models (of caffe rather than pytorch)

Note that the final results of these configs have large variance across different runs.

Citing EQL

If you use EQL, please use the following BibTeX entry.

@InProceedings{tan2020eql,
  title={Equalization Loss for Long-Tailed Object Recognition},
  author={Jingru Tan, Changbao Wang, Buyu Li, Quanquan Li, 
  Wanli Ouyang, Changqing Yin, Junjie Yan},
  journal={ArXiv:2003.05176},
  year={2020}
}

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Related tags

Overview

Equalization Loss for Long-Tailed Object Recognition

Installation

LVIS Dataset

Training

Evaluation

Pretrained Models

Instance Segmentation on LVIS

Citing EQL

Owner

Jingru Tan

Real-CUGAN - Real Cascade U-Nets for Anime Image Super Resolution

An open software package to develop BCI based brain and cognitive computing technology for recognizing user's intention using deep learning

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers

Simple object detection app with streamlit

A machine learning package for streaming data in Python. The other ancestor of River.

System Combination for Grammatical Error Correction Based on Integer Programming

Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control.

A python library for face detection and features extraction based on mediapipe library

The code for our paper CrossFormer: A Versatile Vision Transformer Based on Cross-scale Attention.

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

DFM: A Performance Baseline for Deep Feature Matching

A setup script to generate ITK Python Wheels

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

Shared Attention for Multi-label Zero-shot Learning

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Using image super resolution models with vapoursynth and speeding them up with TensorRT

Official implementation of NeurIPS 2021 paper "One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective"

10th place solution for Google Smartphone Decimeter Challenge at kaggle.