Visualizing Yolov5's layers using GradCam

Last update: Jan 01, 2023

Overview

YOLO-V5 GRADCAM

I constantly desired to know to which part of an object the object-detection models pay more attention. So I searched for it, but I didn't find any for Yolov5. Here is my implementation of Grad-cam for YOLO-v5. To load the model I used the yolov5's main codes, and for computing GradCam I used the codes from the gradcam_plus_plus-pytorch repository. Please follow my GitHub account and star ⭐ the project if this functionality benefits your research or projects.

Installation

pip install -r requirements.txt

Infer

python main.py --model-path yolov5s.pt --img-path images/cat-dog.jpg --output-dir outputs

NOTE: If you don't have any weights and just want to test, don't change the model-path argument. The yolov5s model will be automatically downloaded thanks to the download function from yolov5.

NOTE: For more input arguments, check out the main.py or run the following command:

python main.py -h

Examples

Note

I checked the code, but I couldn't find an explanation for why the truck's heatmap does not show anything. Please inform me or create a pull request if you find the reason.

TO Do

Add GradCam++
Add ScoreCam
Add the functionality to the deep_utils library

References

Citation

Please cite yolov5-gradcam if it helps your research. You can use the following BibTeX entry:

@misc{deep_utils,
	title = {yolov5-gradcam},
	author = {Mohammadi Kazaj, Pooya},
	howpublished = {\url{github.com/pooya-mohammadi/yolov5-gradcam}},
	year = {2021}
}

Visualizing Yolov5's layers using GradCam

Related tags

Overview

YOLO-V5 GRADCAM

Installation

Infer

Examples

Note

TO Do

References

Citation

Owner

Pooya Mohammadi Kazaj

Unsupervised Feature Loss (UFLoss) for High Fidelity Deep learning (DL)-based reconstruction

Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

Video Matting Refinement For Python

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Repo público onde postarei meus estudos de Python, buscando aprender por meio do compartilhamento do aprendizado!

Vit-ImageClassification - Pytorch ViT for Image classification on the CIFAR10 dataset

Py4fi2nd - Jupyter Notebooks and code for Python for Finance (2nd ed., O'Reilly) by Yves Hilpisch.

some classic model used to segment the medical images like CT、X-ray and so on

Modular Gaussian Processes

🤗 Paper Style Guide

More than a hundred strange attractors

Official code for Score-Based Generative Modeling through Stochastic Differential Equations

QAHOI: Query-Based Anchors for Human-Object Interaction Detection (paper)

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

Pytorch library for fast transformer implementations

Towards End-to-end Video-based Eye Tracking

ByteTrack超详细教程！训练自己的数据集&&摄像头实时检测跟踪

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".