Expand human face editing via Global Direction of StyleCLIP, especially to maintain similarity during editing.

Last update: Nov 17, 2022

Related tags

Overview

Oh-My-Face

This project is based on StyleCLIP, RIFE, and encoder4editing, which aims to expand human face editing via Global Direction of StyleCLIP, especially to maintain similarity during editing.

StyleCLIP is an excellent algorithm that acts on the latent code of StyleGAN2 to edit images guided by texts. Global Direction uses models such as e4e to convert images into latent codes and then further editing. However, this conversion causes information loss of the original image and dissimilarities.

Thus, we use the optical flow model to detect the change in different regions between the StyleCLIP generated image and the original image, sample more from the original in slightly-edited areas, then use frame interpolation to perform weighted fusion, which is simple yet efficient.

We will further release weights for cat face editing, containing cat facial landmark recognition from pycatfd and e4e-cat model. e4e-cat is trained via afhq-cat dataset and StyleGAN2-cat weights. StyleGAN2-pytorch/convert_weights.py is used to convert the tensorflow weights.

Usage

Prerequisites

NVIDIA GPU + CUDA11.0 CuDNN
Python 3.6

Installation

Clone this repository

git clone https://github.com/P2Oileen/oh-my-face

Dependencies

To install all the dependencies, please run the following commands.

wget https://developer.nvidia.com/compute/cuda/10.0/Prod/local_installers/cuda-repo-ubuntu1604-10-0-local-10.0.130-410.48_1.0-1_amd64 -O cuda-repo-ubuntu1604-10-0-local-10.0.130-410.48_1.0-1_amd64.deb
dpkg -i cuda-repo-ubuntu1604-10-0-local-10.0.130-410.48_1.0-1_amd64.deb
apt-key add /var/cuda-repo-10-0-local/7fa2af80.pub
apt-get update
apt-get -y install gcc-7 g++-7
apt-get -y install cuda 

export PATH=/usr/local/cuda/bin${PATH:+:${PATH}}
export LD_LIBRARY_PATH=/usr/local/cuda/lib64\${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
export CUDA_HOME=/usr/local/cuda

pip install tensorflow-gpu==1.15.2
pip install ftfy regex tqdm gdown
pip install git+https://github.com/openai/CLIP.git
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

wget https://github.com/ninja-build/ninja/releases/download/v1.8.2/ninja-linux.zip
sudo unzip ninja-linux.zip -d /usr/local/bin/
sudo update-alternatives --install /usr/bin/ninja ninja /usr/local/bin/ninja 1 --force

Download Weights Currently, We only provide weights for human face editing, PLEASE wait for further weights.

cd oh-my-face
wget https://drive.google.com/file/d/1efFoGShtZhcd6SCxOPu3AMbKZus478au/view?usp=sharing
tar -zxvf ffhq.tar.gz
mv ffhq src/
wget https://drive.google.com/file/d/1bXhWOnwCTTXTz7T7zJ1iXA717tyj-n3U/view?usp=sharing
tar -zxvf oh-my-face/weights-face.tar.gz
mv weights oh-my-face/src/

Edit image via oh-my-face

python3 run.py \
--input_dir='input.jpg' \ # Path to your input image
--output_dir='output.jpg' \ # Path to output directory
--option_beta=0.15 \ # Range from 0.08 to 0.3, corresponds to the disentanglement threshold
--option_alpha=4.1 \ # Range from -10.0 to 10.0, corresponds to the manipulation strength
--option_gamma=3 \ # Range from 1 to 10, corresponds to RIFE's sample strength
--neutral='face' \ # Origin description
--target='face with smile' \ # Target description

Expand human face editing via Global Direction of StyleCLIP, especially to maintain similarity during editing.

Related tags

Overview

Oh-My-Face

Usage

Prerequisites

Installation

Edit image via oh-my-face

Owner

AiLin Huang

Authors implementation of LieTransformer: Equivariant Self-Attention for Lie Groups

Oriented Response Networks, in CVPR 2017

La source de mon module 'pyfade' disponible sur Pypi.

Empower Sequence Labeling with Task-Aware Language Model

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"

Face detection using deep learning.

Contrastive Multi-View Representation Learning on Graphs

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

scAR (single-cell Ambient Remover) is a package for data denoising in single-cell omics.

Alignment Attention Fusion framework for Few-Shot Object Detection

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Code for HodgeNet: Learning Spectral Geometry on Triangle Meshes, in SIGGRAPH 2021.

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

Table-Extractor 表格抽取

An Implementation of SiameseRPN with Feature Pyramid Networks

Boundary-preserving Mask R-CNN (ECCV 2020)

An open source library for face detection in images. The face detection speed can reach 1000FPS.

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition (NeurIPS 2019)