A new state-of-the-art lightweight YOLO model implemented in TensorFlow 2.

Last update: Dec 21, 2022

Overview

CSL-YOLO: A New Lightweight Object Detection System for Edge Computing

This project provides a SOTA-level lightweight YOLO called "Cross-Stage Lightweight YOLO" (CSL-YOLO). It achieves better detection performance than Tiny-YOLOv4 with only 43% of the FLOPs and 52% of the parameters.

Paper Link: https://arxiv.org/abs/2107.04829

Requirements

How to Get Started?

#Predict
python3 main.py -p cfg/predict_coco.cfg

#Train
python3 main.py -t cfg/train_coco.cfg

#Eval
python3 main.py -ce cfg/eval_coco.cfg

WebCam DEMO (on CPU)

This DEMO runs in a CPU-only environment. The CPU is an i7-6600U (2.6 GHz to 3.4 GHz), the model scale is 224x224, and the frame rate is about 10 FPS.

Run the following command to start this DEMO. The "camera_idx" entry in the cfg file selects which camera to use.

#Camera DEMO
python3 main.py -d cfg/demo_coco.cfg

More Info

Change Model Scale

The model's default scale is 224x224. To change it to a scale between 320 and 512, open cfg/XXXX.cfg and change the following two parts:

# input_shape=[512,512,3]
# out_hw_list=[[64,64],[48,48],[32,32],[24,24],[16,16]]
# input_shape=[416,416,3]
# out_hw_list=[[52,52],[39,39],[26,26],[20,20],[13,13]]
# input_shape=[320,320,3]
# out_hw_list=[[40,40],[30,30],[20,20],[15,15],[10,10]]
input_shape=[224,224,3]
out_hw_list=[[28,28],[21,21],[14,14],[10,10],[7,7]]

weight_path=weights/224_nolog.hdf5

                         |
                         | 224 to 320
                         V
                         
# input_shape=[512,512,3]
# out_hw_list=[[64,64],[48,48],[32,32],[24,24],[16,16]]
# input_shape=[416,416,3]
# out_hw_list=[[52,52],[39,39],[26,26],[20,20],[13,13]]
input_shape=[320,320,3]
out_hw_list=[[40,40],[30,30],[20,20],[15,15],[10,10]]
# input_shape=[224,224,3]
# out_hw_list=[[28,28],[21,21],[14,14],[10,10],[7,7]]

weight_path=weights/320_nolog.hdf5
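
The four configurations listed above follow a regular pattern, which the sketch below reproduces. The helper name `out_hw_list_for` is hypothetical and is not part of this repository, and the rule is inferred from the listed values only: three base feature maps at strides 8, 16 and 32, plus two intermediate maps at the midpoint of neighbouring base sizes, rounded with Python's built-in `round`. Treat it as a way to sanity-check a cfg edit, not as the project's own logic.

```python
def out_hw_list_for(input_size):
    """Return five [h, w] pairs for a square input of side `input_size`.

    Assumption: the rule is inferred from the 224/320/416/512 entries in
    the cfg file; the repository may compute these values differently.
    """
    # Base feature maps at strides 8, 16 and 32.
    base = [input_size // stride for stride in (8, 16, 32)]
    # Intermediate maps sit halfway between neighbouring base sizes.
    # Python's round() rounds halves to the nearest even integer,
    # which matches the listed values (10.5 -> 10, 19.5 -> 20).
    mids = [round((base[i] + base[i + 1]) / 2) for i in range(2)]
    sizes = [base[0], mids[0], base[1], mids[1], base[2]]
    return [[s, s] for s in sizes]


if __name__ == "__main__":
    for size in (224, 320, 416, 512):
        print(f"input_shape=[{size},{size},3]")
        print(f"out_hw_list={out_hw_list_for(size)}")
```

Scales other than the four listed are untested here, and a matching weight file (such as weights/320_nolog.hdf5 for 320x320) is still required.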

Full Dataset

The entire MS-COCO dataset is too large to include, so only a few pictures are stored here for the DEMO. If you need the complete data, please download it on this page.

Our Data Format

We do not use the official MS-COCO format. We express a bounding box as follows:

[ left_top_x<float>, left_top_y<float>, w<float>, h<float>, confidence<float>, class<str> ]

The bounding boxes contained in a picture are represented by a single JSON file.

For the detailed format, please refer to the JSON files in "data/coco/train/json".
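
As a rough illustration of the six-field box described above, the sketch below converts corner coordinates into that layout and serializes the boxes of one picture to JSON. The helper name `make_box`, the use of a confidence of 1.0 for labelled boxes, and the top-level layout (a plain list of boxes) are all assumptions made for this example; the files in "data/coco/train/json" remain the authoritative reference.

```python
import json


def make_box(x1, y1, x2, y2, confidence, class_name):
    """Build [left_top_x, left_top_y, w, h, confidence, class].

    (x1, y1) is the top-left corner and (x2, y2) the bottom-right corner.
    Hypothetical helper, not part of the repository.
    """
    return [
        float(x1),
        float(y1),
        float(x2 - x1),  # width
        float(y2 - y1),  # height
        float(confidence),
        str(class_name),
    ]


# Assumption: one picture maps to a plain JSON list of boxes.
boxes = [
    make_box(10, 20, 50, 80, 1.0, "person"),
    make_box(100, 120, 180, 200, 1.0, "dog"),
]
json_text = json.dumps(boxes)

if __name__ == "__main__":
    print(json_text)
```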

AP Performance on MS-COCO

For the detailed COCO report, please refer to "mscoco_result".

TODOs

Improve the FLOPs calculator script.
Using Focal Loss causes overfitting; we need to investigate why.

Owner

Miles Zhang
