The sixth place winning solution (6/220) in 2021 Gaofen Challenge.

Last update: Dec 02, 2022

Related tags

Overview

SwinTransformer + OBBDet

The sixth place winning solution (6/220) in the track of Fine-grained Object Recognition in High-Resolution Optical Images, 2021 Gaofen Challenge on Automated High-Resolution Earth Observation Image Interpretation.

Members

Qi Ming, Junjie Song, Yunpeng Dong.

Solution

Off-line date augmentation
We use random combination of affine transformation, flip, scaling, optical distortion for data augmentation.
Multi-scale training and testing
The training images are resized into sizes of 600, 800, and 1024 for training and testing.
Strong backbone
Swin transformer is adopt in ORCNN and RoI Transformer for better performance.
Model ensemble
We have merged the results from RoI Transformer, ORCNN, S2ANet, and ReDet.
Lower confidence
Set the output threshold into 0.005.

Tried but didn't work

Soft-NMS.
Adjust NMS threshold.
Class-agnostic NMS.
Mosaic, and mix up for data augmentation.
Oversample the categories with fewer instances.
Train the detectors for specific classes with low AP.
Multi-scale training and testing on SwinTransformer-based detectors (even dropped by about 1% mAP).

The sixth place winning solution (6/220) in 2021 Gaofen Challenge.

Related tags

Overview

SwinTransformer + OBBDet

Members

Solution

Tried but didn't work

Detections

Owner

ming71

Official repository for Fourier model that can generate periodic signals

TVNet: Temporal Voting Network for Action Localization

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

[NAACL & ACL 2021] SapBERT: Self-alignment pretraining for BERT.

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

This program generates a random 12 digit/character password (upper and lowercase) and stores it in a file along with your username and app/website.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Employs neural networks to classify images into four categories: ship, automobile, dog or frog

Face recognition. Redefined.

This project aims at building a real-time wide band channel sounder using USRPs

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Decorators for maximizing memory utilization with PyTorch & CUDA

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file

A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)

A Web API for automatic background removal using Deep Learning. App is made using Flask and deployed on Heroku.

High level network definitions with pre-trained weights in TensorFlow

The sixth place winning solution (6/220) in 2021 Gaofen Challenge.

Related tags

Overview

SwinTransformer + OBBDet

Members

Solution

Tried but didn't work

Detections

Owner

ming71

Official repository for Fourier model that can generate periodic signals

TVNet: Temporal Voting Network for Action Localization

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

[NAACL & ACL 2021] SapBERT: Self-alignment pretraining for BERT.

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

This program generates a random 12 digit/character password (upper and lowercase) and stores it in a file along with your username and app/website.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Employs neural networks to classify images into four categories: ship, automobile, dog or frog

Face recognition. Redefined.

This project aims at building a real-time wide band channel sounder using USRPs

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Decorators for maximizing memory utilization with PyTorch & CUDA

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file

A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)

A Web API for automatic background removal using Deep Learning. App is made using Flask and deployed on Heroku.

High level network definitions with pre-trained weights in TensorFlow

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.