Image reconstruction done with untrained neural networks.

Last update: Nov 30, 2022

Overview

PyTorch Deep Image Prior

An implementation of image reconstruction methods from Deep Image Prior (Ulyanov et al., 2017) in PyTorch.

The point of the paper is to execute some common image manipulation tasks using neural networks untrained on data prior to use.

Architectures differ from those used in the actual paper. The authors use some specific networks for specific tasks. This repo uses a couple of alternative architectures to produce similar results. One where upsampling is done on the basis of pixel shuffling, and the other using transposed convolutions. Pixel shuffling results in some hotspots that do not disappear with further training.

Requirements

Python3 with PyTorch, torchvision and NumPy. CUDA and cuDNN are optional (settable within the script in a self-explanatory way) but strongly recommended.

To use

It's relatively easy to play around with the settings from within the scripts. To reproduce the results in the repo, do the following.

Make a directory to hold the network output:

mkdir output

Generate output images with:

python3 deep_image_prior.py

Consolidate output images into a training gif and sample some actual data with:

python3 parse_ec2_results.py

Results

Note that the images here have been downsampled for formatting sensibly in the README. Full sized samples are in the repo if you would like to have a closer look.

Training was done over 25k iterations on an Amazon GPU instance. Takes roughly an hour on 512x512 images.

Upsampling with transposed convolutions:

Note the grid like speckles during training. These are caused by convolutional kernels overlapping with one another during upsampling.

Ground truth	Input	Output	Training

Upsampling with pixel shuffling:

No speckles, however there is a hotspot (in the out of focus region towards the bunny's behind) that becomes a black spot. The appearance of these hotspots seems commonplace through both architectures, but the extra smoothness given by the convolution transpose layers repairs these more effectively.

Ground truth	Input	Output	Training

Image reconstruction done with untrained neural networks.

Related tags

Overview

PyTorch Deep Image Prior

Requirements

To use

Results

Upsampling with transposed convolutions:

Upsampling with pixel shuffling:

Owner

Atiyo Ghosh

Finetune alexnet with tensorflow - Code for finetuning AlexNet in TensorFlow >= 1.2rc0

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

Code and datasets for the paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction"

Performance Analysis of Multi-user NOMA Wireless-Powered mMTC Networks: A Stochastic Geometry Approach

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

SGoLAM - Simultaneous Goal Localization and Mapping

JAX bindings to the Flatiron Institute Non-uniform Fast Fourier Transform (FINUFFT) library

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Start-to-finish tutorial for interactive music co-creation in PyTorch and Tensorflow.js

This is a custom made virus code in python, using tkinter module.

Evaluating different engineering tricks that make RL work

Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

Official PyTorch implementation of "Adversarial Reciprocal Points Learning for Open Set Recognition"

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"

classify fashion-mnist dataset with pytorch