Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Last update: Dec 30, 2022

Related tags

Deep Learning yolo_slowfast

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

we choose yolov5 as an object detector instead of detectron2, it is faster and more convenient
we use a tracker(deepsort) to allocate action labels to all objects(with same ids) in different frames
our processing speed reached 24.2 FPS at 30 inference barch size (on a single RTX 2080Ti GPU)

Relevant infomation: FAIR/PytorchVideo; Ultralytics/Yolov5

Demo comparison betwween original(<-left) and ours(->right).

Installation

create a new python environment:
```
conda create -n env_name python=3.7.11
```
install requiments:
```
pip install -r requirements.txt
```
download weights file(ckpt.t7) from [deepsort] to this folder:
```
./deep_sort/deep_sort/deep/checkpoint/
```
test on your video:
```
python yolo_slowfast.py --input {path to your video}
```
The first time to execute this command may take some times to download the yolov5 code and it's weights file from torch.hub, keep your network connected.

References

Thanks for these great works:

[1] Ultralytics/Yolov5

[2] ZQPei/deepsort

[3] FAIR/PytorchVideo

[2] AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. paper

[3] SlowFast Networks for Video Recognition. paper

Citation

If you find our work useful, please cite as follow:

{   yolo_slowfast,
    author = {Wu Fan},
    title = { A realtime action detection frame work based on PytorchVideo},
    year = {2021},
    url = {\url{https://github.com/wufan-tb/gmm_dae}}
}

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Related tags

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

Demo comparison betwween original(<-left) and ours(->right).

Installation

References

Citation

Owner

WuFan

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

HiFT: Hierarchical Feature Transformer for Aerial Tracking (ICCV2021)

Distributed Asynchronous Hyperparameter Optimization better than HyperOpt.

Self-supervised Augmentation Consistency for Adapting Semantic Segmentation (CVPR 2021)

Shallow Convolutional Neural Networks for Human Activity Recognition using Wearable Sensors

Neural Cellular Automata + CLIP

Semantically Contrastive Learning for Low-light Image Enhancement

Fuzzing JavaScript Engines with Aspect-preserving Mutation

Graph Transformer Architecture. Source code for

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

Code for "The Box Size Confidence Bias Harms Your Object Detector"

Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

You Only 👀 One Sequence

An Inverse Kinematics library aiming performance and modularity

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning