Learning Calibrated-Guidance for Object Detection in Aerial Images

Last update: Sep 22, 2022

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

We propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which adaptively determines the calibration weights for each channel based on the global feature affinity pairs. Specifically, given a set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. It then re-represents each channel by aggregating all channels, weighted by this guidance. CG can be plugged into any deep neural network; the resulting network is named CG-Net. To demonstrate its effectiveness and efficiency, we carry out extensive experiments on both oriented and horizontal object detection in aerial images. Results on two challenging benchmarks (DOTA and HRSC2016) demonstrate that CG-Net achieves state-of-the-art accuracy with a fair computational overhead.
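The two steps described above (channel-to-channel similarity, then re-representation by weighted aggregation) can be illustrated with a minimal NumPy sketch. This is not the code from this repository: the function name `calibrated_guidance`, the use of dot-product similarity, the softmax normalization, and the fact that each channel also attends to itself are all assumptions made for illustration.

```python
import numpy as np


def calibrated_guidance(features):
    """Illustrative channel-wise guidance on a (C, H, W) feature map.

    Hypothetical sketch: dot-product similarity and softmax
    normalization are assumptions, not the authors' implementation.
    Returns the re-represented features and the (C, C) guidance weights.
    """
    c, h, w = features.shape
    flat = features.reshape(c, h * w)            # one row per channel: (C, N)

    # Step 1: similarity between every pair of channels -> (C, C)
    affinity = flat @ flat.T

    # Normalize each row into weights that sum to 1 (numerically stable softmax)
    affinity = affinity - affinity.max(axis=1, keepdims=True)
    weights = np.exp(affinity)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Step 2: re-represent each channel as a weighted sum of all channels
    out = weights @ flat                         # (C, N)
    return out.reshape(c, h, w), weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 3, 5))           # toy feature map: 4 channels, 3x5 spatial
    y, guidance = calibrated_guidance(x)
    print(y.shape, guidance.shape)
```

The output keeps the input shape, so a block like this could in principle be dropped between existing layers, which matches the plug-in claim above. A real module would likely add learned projections and a residual connection, but those details are not specified here.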

Introduction

This codebase was created to build benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please check out the pytorch-0.4.1 branch.

Results

Visualization results for oriented object detection on the test set of DOTA.

Comparison to the baseline on DOTA for oriented object detection with ResNet-101. Blue boxes show the results of the baseline, and pink boxes show the results of our proposed CG-Net.

Experiment

ImageNet Pretrained Model from PyTorch

Effectiveness of the proposed method with different backbone networks on the DOTA test set.

Backbone	+CG	Weight	mAP(%)
ResNet-50		download	73.26
ResNet-50	+	download	74.21
ResNet-101		download	73.06
ResNet-101	+	download	74.30
ResNet-152		download	72.78
ResNet-152	+	download	73.53
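To make the effect of adding CG easier to read off, the per-backbone gain can be computed directly from the mAP values in the table above. The snippet below is only a convenience calculation; the variable names are illustrative.

```python
# mAP (%) on the DOTA test set, copied from the table above: (baseline, +CG)
results = {
    "ResNet-50": (73.26, 74.21),
    "ResNet-101": (73.06, 74.30),
    "ResNet-152": (72.78, 73.53),
}

# Gain from adding CG, rounded to two decimals to match the table precision
gains = {name: round(cg - base, 2) for name, (base, cg) in results.items()}

for name, gain in gains.items():
    print(f"{name}: +{gain:.2f} mAP")
```

This gives gains of 0.95, 1.24, and 0.75 mAP for ResNet-50, ResNet-101, and ResNet-152 respectively.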

CG-Net results on DOTA.

Backbone	Aug Rotate	Task	Weight	mAP(%)
ResNet-101	+	Oriented	download	77.89
ResNet-101	+	Horizontal	download	78.26

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use our work, please consider citing:

@InProceedings{liang2021learning,
      title={Learning Calibrated-Guidance for Object Detection in Aerial Images}, 
      author={Dong, Liang and Zongqi, Wei and Dong, Zhang and Qixiang, Geng and Liyan, Zhang and Han, Sun and Huiyu, Zhou and Mingqiang, Wei and Pan, Gao},
      booktitle ={arXiv:2103.11399},
      year={2021}
}

Thanks to the Third Party Libs

PyTorch

mmdetection

AerialDetection
