Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Last update: Dec 20, 2022

Artifact Detection/Correction - Official PyTorch Implementation

This repo provides the official PyTorch implementation of the following paper:

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
Hwanil Choi, Wonjoon Chang, Jaesik Choi*
Korea Advanced Institute of Science and Technology, KAIST

Abstract
Even though image generation with Generative Adversarial Networks (GANs) has been showing remarkable ability to generate high-quality images, GANs do not always guarantee photorealistic images will be generated. Sometimes they generate images that have defective or unnatural objects, which are referred to as 'artifacts'. Research to determine why the artifacts emerge and how they can be detected and removed has not been sufficiently carried out. To analyze this, we first hypothesize that rarely activated neurons and frequently activated neurons have different purposes and responsibilities for the progress of generating images. By analyzing the statistics and the roles for those neurons, we empirically show that rarely activated neurons are related to failed results of making diverse objects and lead to artifacts. In addition, we suggest a correction method, called 'sequential ablation', to repair the defective part of the generated images without complex computational cost and manual efforts.
https://arxiv.org/abs/1812.04948
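The abstract's two ideas, activation statistics and ablation applied layer by layer, can be illustrated with a toy sketch. This is not the paper's or the repo's implementation. It assumes that a neuron counts as "activated" for a sample when its value exceeds a threshold, that "rarely activated" means an activation rate at or below a cutoff, and that ablation means zeroing a neuron's output. All function names and parameters (`activation_rates`, `rarely_activated`, `ablate`, `forward_with_ablation`, `threshold`, `max_rate`) are hypothetical.

```python
from typing import Callable, List, Sequence


def activation_rates(activations: Sequence[Sequence[float]],
                     threshold: float = 0.0) -> List[float]:
    """Fraction of samples in which each neuron exceeds `threshold`.

    activations[i][j] is the value of neuron j for sample i (toy data).
    """
    n_samples = len(activations)
    counts = [0] * len(activations[0])
    for sample in activations:
        for j, value in enumerate(sample):
            if value > threshold:
                counts[j] += 1
    return [c / n_samples for c in counts]


def rarely_activated(rates: Sequence[float], max_rate: float = 0.1) -> List[int]:
    """Indices of neurons whose activation rate is at or below `max_rate` (assumed cutoff)."""
    return [j for j, rate in enumerate(rates) if rate <= max_rate]


def ablate(values: Sequence[float], neuron_ids: Sequence[int]) -> List[float]:
    """Zero out the selected neurons and leave the others unchanged."""
    ids = set(neuron_ids)
    return [0.0 if j in ids else v for j, v in enumerate(values)]


def forward_with_ablation(x: Sequence[float],
                          layers: Sequence[Callable[[Sequence[float]], List[float]]],
                          rare_per_layer: Sequence[Sequence[int]]) -> List[float]:
    """Run the layers in order, ablating each layer's rare neurons before the next layer."""
    h = list(x)
    for layer, rare in zip(layers, rare_per_layer):
        h = ablate(layer(h), rare)
    return h
```

In a real generator the "neurons" would be feature-map channels and the statistics would be gathered over many generated samples; the sketch only shows the bookkeeping.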

Dependencies

PyTorch 1.4.0
Python 3.6
CUDA 10.0.x
cuDNN 7.6.3
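To compare an environment against the versions listed above, a small check along these lines may help. It is a sketch, not part of the repo: the function name `report_versions` is hypothetical, and it only reports what is installed without enforcing anything.

```python
import sys


def report_versions():
    """Collect the Python, PyTorch, CUDA and cuDNN versions visible to this interpreter."""
    info = {"python": ".".join(str(part) for part in sys.version_info[:3])}
    try:
        import torch
    except ImportError:
        # PyTorch is not installed, so nothing more can be reported.
        info["torch"] = None
        return info
    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda               # None on CPU-only builds
    info["cudnn"] = torch.backends.cudnn.version()  # integer such as 7603, or None
    return info


if __name__ == "__main__":
    for name, version in report_versions().items():
        print(name, version)
```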

Pre-Trained Models (Official) - GenForce

Dataset \ Model	PGGAN	StyleGAN2
CelebA-HQ (Official)	1024 x 1024	X
FFHQ (Official)	X	1024 x 1024
LSUN-Church (Official)	256 x 256	256 x 256
LSUN-CAT (Official)	256 x 256	256 x 256

For the implementation below, download the StyleGAN2 FFHQ weights into the current directory. Otherwise, set the '--weight_path' option to the directory that contains the weights.

More pre-trained weights are available in genforce-model-zoo

Optional: StyleGAN3

Implementation

Options

optional arguments:
  -h, --help                show this help message and exit
  --gpu GPU                 gpu index number
  --batch_size BATCH_SIZE
                            batch size for pre processing and generating process
  --sample_size SAMPLE_SIZE
                            sample size for statistics
  --freq_path FREQ_PATH
                            loading saved frequencies of neurons
  --model MODEL             pggan, stylegan2
  --dataset DATASET         ffhq, cat, church, etc
  --resolution RESOLUTION
                            dataset resolution
  --weight_path WEIGHT_PATH
                            pre-trained weight path
  --detection DETECTION
                            implement normal/artifact detection
  --correction CORRECTION
                            implement correction task
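The option list above looks like argparse help output, so a parser of roughly the following shape would produce it. This is a reconstruction for illustration, not the repo's main.py: the argument types, the defaults (taken from the usage command) and the `str2bool` helper are assumptions. The helper is included because argparse's `type=bool` treats any non-empty string, including "False", as true.

```python
import argparse


def str2bool(value):
    """Parse 'True'/'False' style strings (hypothetical helper, not from the repo)."""
    if isinstance(value, bool):
        return value
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError("expected a boolean, got %r" % value)


def build_parser():
    """Build a parser with the options listed above; types and defaults are assumed."""
    parser = argparse.ArgumentParser(description="Artifact detection/correction")
    parser.add_argument("--gpu", type=int, default=0, help="gpu index number")
    parser.add_argument("--batch_size", type=int, default=30,
                        help="batch size for pre processing and generating process")
    parser.add_argument("--sample_size", type=int, default=30000,
                        help="sample size for statistics")
    parser.add_argument("--freq_path", type=str, default="./stats",
                        help="loading saved frequencies of neurons")
    parser.add_argument("--model", type=str, default="stylegan2",
                        help="pggan, stylegan2")
    parser.add_argument("--dataset", type=str, default="ffhq",
                        help="ffhq, cat, church, etc")
    parser.add_argument("--resolution", type=int, default=1024,
                        help="dataset resolution")
    parser.add_argument("--weight_path", type=str, default="./",
                        help="pre-trained weight path")
    parser.add_argument("--detection", type=str2bool, default=True,
                        help="implement normal/artifact detection")
    parser.add_argument("--correction", type=str2bool, default=True,
                        help="implement correction task")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```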

Usage

python main.py --gpu 0 --batch_size 30 --sample_size 30000 --freq_path ./stats \
               --model stylegan2 --dataset ffhq --resolution 1024 --weight_path ./ \
               --detection True --correction True
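With the values in the command above, 30000 samples at a batch size of 30 come to 1000 batches. The sketch below shows that arithmetic. The function name `batch_sizes` is hypothetical, and how the repo handles a sample size that is not a multiple of the batch size is not stated; the sketch assumes a smaller final batch.

```python
def batch_sizes(sample_size, batch_size):
    """Split `sample_size` samples into batches of at most `batch_size` samples."""
    if sample_size < 0 or batch_size <= 0:
        raise ValueError("sample_size must be >= 0 and batch_size must be > 0")
    full, rest = divmod(sample_size, batch_size)
    # Assumption: leftover samples form one smaller final batch.
    return [batch_size] * full + ([rest] if rest else [])


if __name__ == "__main__":
    print(len(batch_sizes(30000, 30)))
```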

If you are on a remote server, you need to do the following to display the results (X11 forwarding).

X11 forwarding
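When a session is opened with X11 forwarding (for example `ssh -X`), the DISPLAY environment variable is set on the remote side. A script can check it before trying to open a window. This is a minimal sketch; the function name `has_display` is hypothetical and the repo itself may not perform any such check.

```python
import os


def has_display(environ=None):
    """Return True if an X11 display is advertised through the DISPLAY variable."""
    env = os.environ if environ is None else environ
    return bool(env.get("DISPLAY"))


if __name__ == "__main__":
    if has_display():
        print("DISPLAY is set; result windows should be shown through X11.")
    else:
        print("DISPLAY is not set; reconnect with X11 forwarding or save results to files.")
```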

You can also run our code in a Jupyter Notebook, which offers more flexibility. Use the 'notebook.ipynb' file.

Detection results for 50K samples

Bottom 60 images

Top 60 images

Correction results

Owner

CHOI HWAN IL
