Recurrent Scale Approximation (RSA) for Object Detection

Last update: Dec 28, 2022

Related tags

Overview

Recurrent Scale Approximation (RSA) for Object Detection

Codebase for Recurrent Scale Approximation for Object Detection in CNN published at ICCV 2017, [arXiv]. Here we offer the training and test code for two modules in the paper, scale-forecast network and recurrent scale approximation (RSA). Models for face detection trained on some open datasets are also provided.

Note: This project is still underway. Please stay tuned for more features soon!

Codebase at a Glance

train/: Training code for modules scale-forecast network and RSA

predict/: Test code for the whole detection pipeline

afw_gtmiss.mat: Revised face data annotation mentioned in Section 4.1 in the paper.

Grab and Go (Demo)

Caffe models for face detection trained on popular datasets.

Base RPN model: predict/output/ResNet_3b_s16/tot_wometa_1epoch, trained on Widerface (fg/bg), COCO (bg only) and ImageNet Det (bg only)
RSA model: predict/output/ResNet_3b_s16_fm2fm_pool2_deep/65w, trained on Widerface, COCO, and ImageNet Det

Steps to run the test code:

Compile CaffeMex_v2 with matlab interface
Add CaffeMex_v2/matlab/ to matlab search path
See tips in predict/script_start.m and run it!
After processing for a few minutes, the detection and alignment results will be shown in an image window. Please click the image window to view all results. If you set line 8 in script_start.m to false as default, you should observe some results as above.

Train Your Own Model

Still in progress, this part will be released later.

FAQ

We will list the common issues of this project as time goes. Stay tuned! :)

Citation

Please kindly cite our work if it helps your research:

@inproceedings{liu_2017_rsa,
  Author = {Yu Liu and Hongyang Li and Junjie Yan and Fangyin Wei and Xiaogang Wang and Xiaoou Tang},
  Title = {Recurrent Scale Approximation for Object Detection in CNN},
  Journal = {IEEE International Conference on Computer Vision},
  Year = {2017}
}

Acknowledgment

We appreciate the contribution of the following researchers:

Dong Chen @Microsoft Research, some basic ideas are inspired by him when Yu Liu worked as an intern at MSR.

Jiongchao Jin @Beihang University, some baseline results are provided by him.

Recurrent Scale Approximation (RSA) for Object Detection

Related tags

Overview

Recurrent Scale Approximation (RSA) for Object Detection

Codebase at a Glance

Grab and Go (Demo)

Train Your Own Model

FAQ

Citation

Acknowledgment

Owner

Yu Liu (Louis)

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

E-Ink Magic Calendar that automatically syncs to Google Calendar and runs off a battery powered Raspberry Pi Zero

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking"

Repository for the paper "From global to local MDI variable importances for random forests and when they are Shapley values"

A nutritional label for food for thought.

This repo is to be freely used by ML devs to check the GAN performances without coding from scratch.

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

SysWhispers Shellcode Loader

Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via Self-Supervised Multi-Task Learning. Code will be available soon.

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Satellite labelling tool for manual labelling of storm top features such as overshooting tops, above-anvil plumes, cold U/Vs, rings etc.

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

Official Implementation of SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations

An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py

CVAT is free, online, interactive video and image annotation tool for computer vision

Automatically erase objects in the video, such as logo, text, etc.

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env