A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Last update: Jan 07, 2023

Related tags

Overview

S³FD: Single Shot Scale-invariant Face Detector

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Eval

python wider_eval_pytorch.py

cd eval/eval_tools_old-version
octave wider_eval_pytorch.m

Model

s3fd_convert.7z

Test

python test.py --model data/s3fd_convert.pth --path data/test01.jpg

References

SFD

Comments

RGB <-> BGR

From this line, I assume you use RGB: img = img - np.array([104,117,123])

However opencv uses BGR, so this line returns BGR: if args.path=='CAMERA': ret, img = cap.read()

Then BGR is fed to the network bboxlist = detect(net,img)

I fed RGB to the network and got worse results. Is it possible that you meant RGB in all places but the network is actually trained for BGR? (If then it should be img = img - np.array([123,117,104]))

opened by elbaro 3
How Convert Weights

Dear @clcarwin, Thank you for your nice work. Would you please tell me how you can convert Caffe weights and model of S3FD into PyTorch? Can you convert the model & pre-trained weights of RefineDet into PyTorch?

opened by ahkarami 2
evaluation accuracy is not good as the original paper

hi @clcarwin,

I test you evaluation results on wider face as (easy 92.8, medium 91.5, hard 84.2). But with the original model provided by sfzhang15/SFD, I can get (easy 93.8, medium 92.4, hard 85.1).

Did I test correctly? If so, why there is accuracy loss?

Great work! Best,

opened by marvis 2
'float' object cannot be interpreted as an integer??

Sir,I'm sorry to disturb you about this object. I run this object on windows 10,python 3.5.2 ,pytorch 0.3. After : python test.py --model data/s3fd_convert.pth --path data/test01.jpg, the screen display: D:\Python\Pytorch_cw_sfd\SFD_pytorch>python test.py --model data/s3fd_convert.pth --path data/test01.jpg Traceback (most recent call last): File "test.py", line 71, in bboxlist = detect(net,img) File "test.py", line 27, in detect for i in range(len(olist)/2): olist[i2] = F.softmax(olist[i2]) TypeError: 'float' object cannot be interpreted as an integer

Why ???

opened by door5719 1
padding size of fc6

Hi @clcarwin,

Why do you set the padding size of fc6 to 3? This is inconsistent with the original paper. See https://github.com/clcarwin/SFD_pytorch/blob/master/net_s3fd.py#L42

Best,

opened by marvis 1
Optimization

Good: It is accurate.

Bad: The inference time is more than 80 ms for realtime usage. To make it work for realtime image has to be resized to less than 200x200 which reduces accuracy.

So in order to make it usable the only way is to make it faster. Have you tried using TensorRT or TVM or Pytorch serving in C++ ?

opened by jamessmith90 0
Several speed & code updates

Seems nobody's looking at PR's here, but letting others know I've made a number of improvements.

It runs smoothly on modern pytorch (1.3) and refactored the code to eliminate redundant code. I also added some convenient methods that make it easier to do common things, like detect_faces. Also, added integration tests.

I independently found the same speed-up as @kir-dan in https://github.com/clcarwin/SFD_pytorch/pull/4 and moved all that code into pytorch instead of numpy, so it can be fully run on GPU.

opened by leopd 0
Very high GPU memory usage

Hi, I have been running the model using test.py and modified it run multiple files. The GPU memory keeps on increasing,from 3gigs to 9 gigs. Is this due to poor garbage collection?

opened by vaishnavm217 2
Change Anchor Boxes Aspect Ratio

Dear @clcarwin, If one wants to change the aspect ratio of anchor boxes, must just changed the detect method in test.py? For example, line https://github.com/clcarwin/SFD_pytorch/blob/96fdfbe22eef176a04802d915834b82a131a854d/test.py#L39 or other methods moreover must changed?

opened by ahkarami 0
About data augmentation

When I use the Tensorflow to build the project, I have some trouble in data augmentation which describe in the paper. Can you tell the details of the data augmentation or show your data augmentation code to me. Thank you

opened by ckqsars 0

Releases(v0.1)

v0.1(Nov 21, 2017)

Source code(tar.gz)
Source code(zip)
s3fd_convert.7z(8.14 MB)

Owner

carwin

GitHub Repository

DFM: A Performance Baseline for Deep Feature Matching

DFM: A Performance Baseline for Deep Feature Matching Python (Pytorch) and Matlab (MatConvNet) implementations of our paper DFM: A Performance Baselin

143 Jan 02, 2023

A dataset for online Arabic calligraphy

Calliar Calliar is a dataset for Arabic calligraphy. The dataset consists of 2500 json files that contain strokes manually annotated for Arabic callig

114 Dec 28, 2022

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Optimizing Dense Retrieval Model Training with Hard Negatives Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Jiafeng Guo, Min Zhang, Shaoping Ma This repo provi

99 Dec 27, 2022

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models Pouya Samangouei*, Maya Kabkab*, Rama Chellappa [*: authors co

212 Dec 07, 2022

PiRapGenerator - Make anyone rap the digits of pi

PiRapGenerator Make anyone rap the digits of pi (sample files are of Ted Nivison

7 Oct 02, 2022

Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.

Deep Constrained Least Squares for Blind Image Super-Resolution [Paper] This is the official implementation of 'Deep Constrained Least Squares for Bli

141 Dec 30, 2022

Code, environments, and scripts for the paper: "How Private Is Your RL Policy? An Inverse RL Based Analysis Framework"

Privacy-Aware Inverse RL (PRIL) Analysis Framework Code, environments, and scripts for the paper: "How Private Is Your RL Policy? An Inverse RL Based

1 Dec 06, 2021

Allows including an action inside another action (by preprocessing the Yaml file). This is how composite actions should have worked.

actions-includes Allows including an action inside another action (by preprocessing the Yaml file). Instead of using uses or run in your action step,

70 Nov 04, 2022

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Related tags

Overview

S³FD: Single Shot Scale-invariant Face Detector

Eval

Model

Test

References

Comments

Releases(v0.1)

v0.1(Nov 21, 2017)

Owner

carwin

DFM: A Performance Baseline for Deep Feature Matching

A dataset for online Arabic calligraphy

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)

PiRapGenerator - Make anyone rap the digits of pi

Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.

Code, environments, and scripts for the paper: "How Private Is Your RL Policy? An Inverse RL Based Analysis Framework"

Allows including an action inside another action (by preprocessing the Yaml file). This is how composite actions should have worked.

A 2D Visual Localization Framework based on Essential Matrices [ICRA2020]

Code Release for Learning to Adapt to Evolving Domains

HW3 ― GAN, ACGAN and UDA

Artifacts for paper "MMO: Meta Multi-Objectivization for Software Configuration Tuning"

Match SafeGraph POIs with Data collected through a cultural resource survey in Washington DC.

Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)

Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.

Listing arxiv - Personalized list of today's articles from ArXiv

thundernet ncnn

[AAAI-2021] Visual Boundary Knowledge Translation for Foreground Segmentation

[ICCV'21] Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages