Libtorch yolov3 deepsort

Last update: Dec 13, 2022

Overview

This project is for my undergraduate thesis at Tsinghua University.

There are four modules in the project:

Detection: YOLOv3
Tracking: SORT and DeepSORT
Processing: Run detection and tracking, then display and save the results (a compressed video, a few snapshots for each target)
GUI: Display the results

YOLOv3

A LibTorch implementation of the YOLOv3 object detection algorithm, written in modern C++.

The code is based on walktree's implementation.

The config file in .\models can be found at Darknet.
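As a rough illustration of what happens to the network output, here is a minimal sketch of the standard YOLOv3 box decoding step. It is not taken from this project's source: the names `Box`, `sigmoid`, and `decode_box` are hypothetical, and it assumes anchor sizes are given in input-image pixels, as in the Darknet config files.

```cpp
#include <cassert>
#include <cmath>

// Axis-aligned box in input-image pixels, centre format (hypothetical type).
struct Box {
    float cx, cy, w, h;
};

inline float sigmoid(float x) { return 1.0f / (1.0f + std::exp(-x)); }

// Decode one raw YOLOv3 prediction (tx, ty, tw, th) for grid cell (col, row).
// stride:   input pixels per grid cell (e.g. 32, 16, 8 for the three scales)
// anchor_*: anchor size in input pixels (assumption: as in the Darknet cfg)
Box decode_box(float tx, float ty, float tw, float th,
               int col, int row, float stride,
               float anchor_w, float anchor_h) {
    assert(stride > 0.0f);
    Box b;
    // The sigmoid keeps the predicted centre inside its own grid cell.
    b.cx = (sigmoid(tx) + static_cast<float>(col)) * stride;
    b.cy = (sigmoid(ty) + static_cast<float>(row)) * stride;
    // Width and height scale the anchor exponentially.
    b.w = anchor_w * std::exp(tw);
    b.h = anchor_h * std::exp(th);
    return b;
}
```

With all four raw values at zero, the box sits at the centre of its grid cell and has exactly the anchor's size.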

SORT

I also integrated SORT for tracking.

A similar Python project is here, which is also rewritten from the most starred version and SORT.
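SORT associates existing tracks with new detections by bounding-box overlap (IoU). The sketch below shows that idea only. It is not this project's code: `Rect`, `iou`, and `match_greedy` are hypothetical names, and the greedy matching is a deliberate simplification of the Hungarian assignment that SORT uses. The Kalman filter prediction step is left out.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Box as corners: (x1, y1) top-left, (x2, y2) bottom-right (hypothetical type).
struct Rect {
    float x1, y1, x2, y2;
};

// Intersection over union of two boxes; 0 when they do not overlap.
float iou(const Rect& a, const Rect& b) {
    const float iw = std::min(a.x2, b.x2) - std::max(a.x1, b.x1);
    const float ih = std::min(a.y2, b.y2) - std::max(a.y1, b.y1);
    if (iw <= 0.0f || ih <= 0.0f) return 0.0f;
    const float inter = iw * ih;
    const float uni = (a.x2 - a.x1) * (a.y2 - a.y1) +
                      (b.x2 - b.x1) * (b.y2 - b.y1) - inter;
    return uni > 0.0f ? inter / uni : 0.0f;
}

// Greedy stand-in for the Hungarian assignment: each track, in order, takes
// the unused detection with the highest IoU, provided it reaches min_iou.
// Returns (track index, detection index) pairs.
std::vector<std::pair<int, int>> match_greedy(const std::vector<Rect>& tracks,
                                              const std::vector<Rect>& dets,
                                              float min_iou) {
    assert(min_iou > 0.0f && min_iou <= 1.0f);
    std::vector<bool> used(dets.size(), false);
    std::vector<std::pair<int, int>> matches;
    for (std::size_t t = 0; t < tracks.size(); ++t) {
        int best = -1;
        float best_iou = min_iou;
        for (std::size_t d = 0; d < dets.size(); ++d) {
            if (used[d]) continue;
            const float v = iou(tracks[t], dets[d]);
            if (v >= best_iou) {
                best = static_cast<int>(d);
                best_iou = v;
            }
        }
        if (best >= 0) {
            used[static_cast<std::size_t>(best)] = true;
            matches.emplace_back(static_cast<int>(t), best);
        }
    }
    return matches;
}
```

Tracks left without a match would age out, and unmatched detections would start new tracks. Greedy matching can pick a worse overall assignment than the Hungarian algorithm when several targets overlap.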

DeepSORT

I recently reimplemented DeepSORT, which employs another CNN for re-identification (re-id). It seems to give better results, but it also slows the program down a bit. A PyTorch version is also available at ZQPei, thanks!
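The re-id CNN maps each detection to an appearance embedding, and DeepSORT compares embeddings by cosine distance against the features stored for each track. The following is a minimal sketch of that comparison. It is not the project's code: `cosine_distance` and `min_gallery_distance` are hypothetical names, and plain `std::vector<float>` stands in for whatever tensor type the real implementation uses.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Cosine distance between two embeddings: 1 - cos(angle between them).
// 0 means same direction, 1 orthogonal, 2 opposite.
float cosine_distance(const std::vector<float>& a, const std::vector<float>& b) {
    assert(!a.empty() && a.size() == b.size());
    float dot = 0.0f, na = 0.0f, nb = 0.0f;
    for (std::size_t i = 0; i < a.size(); ++i) {
        dot += a[i] * b[i];
        na += a[i] * a[i];
        nb += b[i] * b[i];
    }
    const float denom = std::sqrt(na) * std::sqrt(nb);
    // Assumption: a zero vector carries no appearance information,
    // so treat it as orthogonal to everything.
    if (denom == 0.0f) return 1.0f;
    return 1.0f - dot / denom;
}

// Smallest distance from a query embedding to any embedding stored for a
// track (its "gallery" of recent appearance features).
float min_gallery_distance(const std::vector<std::vector<float>>& gallery,
                           const std::vector<float>& query) {
    assert(!gallery.empty());
    float best = 2.0f;  // upper bound of cosine distance
    for (const std::vector<float>& feat : gallery) {
        best = std::min(best, cosine_distance(feat, query));
    }
    return best;
}
```

A detection would be accepted for a track only if this distance falls below a threshold. The extra CNN forward pass per detection is where the slowdown mentioned above comes from.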

Performance

Currently, on a GTX 1060 6G, it consumes about 1 GB of RAM and runs at 37 FPS.

The test video is TownCentreXVID.avi.

GUI

With wxWidgets, I developed the GUI module for visualization of results.

Previously I used Dear ImGui. However, I do not think it suits my purpose.

Pre-trained network

This project uses pre-trained network weights from others

How to build

This project requires LibTorch, OpenCV, wxWidgets and CMake to build.

LibTorch can be easily integrated with CMake, but there are a lot of strange things...

On Ubuntu 16.04, I installed the others with apt install, and everything worked fine. On Windows 10 + Visual Studio 2017, I used the latest stable versions of the others from their official websites.

Snapshots

Here is some intermediate output from the detection and tracking modules:

Here is a snapshot of the processing module:

Here is a snapshot of the GUI module:


Owner

Xu Wei
