Multispectral Object Detection with Yolov5

Last update: Jan 01, 2023

Overview

Multispectral-Object-Detection

Intro

Official Code for Cross-Modality Fusion Transformer for Multispectral Object Detection.

Multispectral Object Detection with Transformer and Yolov5

Citation

If you use this repo for your research, please cite our paper:

@article{fang2021cross,
  title={Cross-Modality Fusion Transformer for Multispectral Object Detection},
  author={Fang Qingyun and Han Dapeng and Wang Zhaokui},
  journal={arXiv preprint arXiv:2111.00273},
  year={2021}
}

Installation

Python>=3.6.0 is required with all requirements.txt installed including PyTorch>=1.7 (The same as yolov5 https://github.com/ultralytics/yolov5 ).

Clone the repo

git clone https://github.com/DocF/multispectral-object-detection

Install requirements

$ cd  multispectral-object-detection
$ pip install -r requirements.txt

Dataset

-[FLIR] download A new aligned version.

-[LLVIP] download

-[VEDAI] download

Run

Download the pretrained weights

yolov5 weights:

CFT weights:

Add the some file

create runs/train, runs/test and runs/detect three files for save the results.

Change the data cfg

some example in data/multispectral/

Train Test and Detect

train: python train.py

test: python test.py

detect: python detect_twostream.py

Results

Dataset	CFT	mAP50	mAP75	mAP
FLIR		73.0	32.0	37.4
FLIR	✔️	77.7 (Δ4.7)	34.8 (Δ2.8)	40.0 (Δ2.6)
LLVIP		95.8	71.4	62.3
LLVIP	✔️	97.5 (Δ1.7)	72.9 (Δ1.5)	63.6 (Δ1.3)
VEDAI		79.7	47.7	46.8
VEDAI	✔️	85.3 (Δ5.6)	65.9(Δ18.2)	56.0 (Δ9.2)

Multispectral Object Detection with Yolov5

Related tags

Overview

Multispectral-Object-Detection

Intro

Citation

Installation

Clone the repo

Install requirements

Dataset

Run

Download the pretrained weights

Add the some file

Change the data cfg

Train Test and Detect

Results

Owner

Richard Fang

Smart edu-autobooking - Johnson @ DMI-UNICT study room self-booking system

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Code to reproduce the results for Statistically Robust Neural Network Classification, published in UAI 2021

Code to reproduce results from the paper "AmbientGAN: Generative models from lossy measurements"

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

A Survey on Deep Learning Technique for Video Segmentation

Use unsupervised and supervised learning to predict stocks

Tensorflow Implementation of Pixel Transposed Convolutional Networks (PixelTCN and PixelTCL)

Inhomogeneous Social Recommendation with Hypergraph Convolutional Networks

Predictive Maintenance LSTM

Image-Stitching - Panorama composition using SIFT Features and a custom implementaion of RANSAC algorithm

Using pretrained language models for biomedical knowledge graph completion.

Dynamic Capacity Networks using Tensorflow

Use deep learning, genetic programming and other methods to predict stock and market movements

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

An easy-to-use app to visualise attentions of various VQA models.

This is a collection of our NAS and Vision Transformer work.

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

Experiments for Fake News explainability project