You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Last update: Jan 03, 2023

Related tags

Deep Learning YOLOF

Overview

You Only Look One-level Feature (YOLOF), CVPR2021

A simple, fast, and efficient object detector without FPN.

This repo provides a neat implementation for YOLOF based on Detectron2. A cvpods version can be found in https://github.com/megvii-model/YOLOF.

You Only Look One-level Feature,
Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun

Getting Started

Our project is developed on detectron2. Please follow the official detectron2 installation.

Install mish-cuda to speed up the training and inference when using CSPDarkNet-53 as the backbone (optional)

git clone https://github.com/thomasbrandon/mish-cuda
cd mish-cuda
python setup.py build install
cd ..

Install YOLOF by:
```
python setup.py develop
```
Then link your dataset path to datasets
```
cd datasets/
ln -s /path/to/coco coco
```
Download the pretrained model in OneDrive or in the Baidu Cloud with code qr6o to train with the CSPDarkNet-53 backbone (optional)
```
mkdir pretrained_models
# download the `cspdarknet53.pth` to the `pretrained_models` directory
```

Train with yolof

python ./tools/train_net.py --num-gpus 8 --config-file ./configs/yolof_R_50_C5_1x.yaml

Test with yolof

python ./tools/train_net.py --num-gpus 8 --config-file ./configs/yolof_R_50_C5_1x.yaml --eval-only MODEL.WEIGHTS /path/to/checkpoint_file

Note that there might be API changes in future detectron2 releases that make the code incompatible.

Main results

The models listed below can be found in this onedrive link or in the BaiduCloud link with code qr6o. The FPS is tested on a 2080Ti GPU. More models will be available in the near future.

Model	COCO val mAP	FPS
YOLOF_R_50_C5_1x	37.7	36
YOLOF_R_50_DC5_1x	39.2	23
YOLOF_R_101_C5_1x	39.8	23
YOLOF_R_101_DC5_1x	40.5	17
YOLOF_CSP_D_53_DC5_3x	41.2	41

Note that, the speed reported in this repo is 2~3 FPS faster than the one reported in the cvpods version.

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{chen2021you,
  title={You Only Look One-level Feature},
  author={Chen, Qiang and Wang, Yingming and Yang, Tong and Zhang, Xiangyu and Cheng, Jian and Sun, Jian},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Related tags

Overview

You Only Look One-level Feature (YOLOF), CVPR2021

Getting Started

Main results

Citation

Owner

qiang chen

Calculates carbon footprint based on fuel mix and discharge profile at the utility selected. Can create graphs and tabular output for fuel mix based on input file of series of power drawn over a period of time.

LSTM-VAE Implementation and Relevant Evaluations

Source code for our Paper "Learning in High-Dimensional Feature Spaces Using ANOVA-Based Matrix-Vector Multiplication"

The implementation of 'Image synthesis via semantic composition'.

TCube generates rich and fluent narratives that describes the characteristics, trends, and anomalies of any time-series data (domain-agnostic) using the transfer learning capabilities of PLMs.

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

'Aligned mixture of latent dynamical systems' (amLDS) for stimulus decoding probabilistic manifold alignment across animals. P. Herrero-Vidal et al. NeurIPS 2021 code.

TensorFlow port of PyTorch Image Models (timm) - image models with pretrained weights.

This repo contains the code required to train the multivariate time-series Transformer.

S-attack library. Official implementation of two papers "Are socially-aware trajectory prediction models really socially-aware?" and "Vehicle trajectory prediction works, but not everywhere".

Auto-Encoding Score Distribution Regression for Action Quality Assessment

Facial recognition project

SpanNER: Named EntityRe-/Recognition as Span Prediction

optimization routines for hyperparameter tuning

Noether Networks: meta-learning useful conserved quantities

网络协议2天集训

SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summary.

CIFAR-10 Photo Classification

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"