End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model

Last update: Dec 30, 2022

Related tags

Deep Learning onnx-facial-lmk-detector

Overview

onnx-facial-lmk-detector

End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model, model.onnx.

Demo

You can try this model at the following link. Thanks for hysts.

https://huggingface.co/spaces/hysts/atksh-onnx-facial-lmk-detector

Code

See src.

Example

import onnxruntime as ort
import cv2

sess = ort.InferenceSession("model.onnx")
img = cv2.imread("input.jpg")

scores, bboxes, keypoints, aligned_imgs, landmarks, affine_matrices = sess.run(None, {"input": img})
# float32 int64 int64 uint8 int64 float32
# (N,) (N, 4) (N, 5, 2) (N, 224, 224, 3) (N, 106, 2) (N, 2, 3)

This model requires onnxruntime>=1.11.

How does it work?

This is simply a merged model of the following underlying models with some pre- and post-processing.

Underlying models

	model	reference
face detection	SCRFD_10G_KPS	https://github.com/deepinsight/insightface/tree/master/detection/scrfd#pretrained-models
landmark detection	2d106det	https://github.com/deepinsight/insightface/blob/master/alignment/coordinate_reg/README.md#pretrained-models

Pre- and Post-Processing

Implemented the following processing by PyTorch and exported to ONNX.

Input transform:
- Resize and pad to (1920, 1920)
- BGR to RGB conversion
- Transpose (H, W, C) to (C, H, W)
(Face Detection)
Post-processing of face detection
- Predicted bounding boxes and Confidence Score Processing
- NMS (ONNX Operator)
Norm estimation and face cropping
- Estimate the norm and apply an affine transformation to each face.
- Crop the faces and resize them to (192, 192).
(Landmark Detection)
Perform post-processing for landmark detection.
- Process the predicted landmarks and apply the inverse affine transform to each face.

Note

Please check with the model provider regarding the license for your use.

This model includes the work that is distributed in the Apache License 2.0.

End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model

Related tags

Overview

onnx-facial-lmk-detector

Demo

Code

Example

How does it work?

Underlying models

Pre- and Post-Processing

Note

Owner

atksh

Human4D Dataset tools for processing and visualization

tf2-keras implement yolov5

Benchmarks for the Optimal Power Flow Problem

GULAG: GUessing LAnGuages with neural networks

This is a pytorch implementation of the NeurIPS paper GAN Memory with No Forgetting.

A modified version of DeepMind's Alphafold2 to divide CPU part (MSA and template searching) and GPU part (prediction model)

Face and Pose detector that emits MQTT events when a face or human body is detected and not detected.

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

Code for "Layered Neural Rendering for Retiming People in Video."

Detail-Preserving Transformer for Light Field Image Super-Resolution

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Digan - Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

bespoke tooling for offensive security's Windows Usermode Exploit Dev course (OSED)

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

Extending JAX with custom C++ and CUDA code

PyTorch implementation of SQN based on CloserLook3D's encoder

SNE-RoadSeg in PyTorch, ECCV 2020

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

Official codes: Self-Supervised Learning by Estimating Twin Class Distribution