Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

Poisson Surface Reconstruction for LiDAR Odometry and Mapping

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

The codes reproduce the figures and statistics in the paper, "Controlling for multiple covariates," by Mark Tygert.

A repository for benchmarking neural vocoders by their quality and speed.

A small library of 3D related utilities used in my research.

A new version of the CIDACS-RL linkage tool suitable to a cluster computing environment.

Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021.

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching（CVPR2021）

Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels.

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

DziriBERT: a Pre-trained Language Model for the Algerian Dialect

Buffon’s needle: one of the oldest problems in geometric probability

Classify the disease status of a plant given an image of a passion fruit

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

PoolFormer: MetaFormer is Actually What You Need for Vision

SE3 Pose Interp - Interpolate camera pose or trajectory in SE3, pose interpolation, trajectory interpolation

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Large scale and asynchronous Hyperparameter Optimization at your fingertip.

eXPeditious Data Transfer