Official PaddlePaddle implementation of Paint Transformer

Last update: Dec 31, 2022

Related tags

Deep Learning PaintTransformer

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

[Paper] [Paddle Implementation]

Update

We have optimized the serial inference procedure to achieve better rendering quality and faster speed.

Overview

This repository contains the official PaddlePaddle implementation of paper:

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction,

Songhua Liu*, Tianwei Lin*, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang (* indicates equal contribution)

ICCV 2021 (Oral)

Prerequisites

Linux or macOS
Python 3.6+
PaddlePaddle 2.0+ and other dependencies (numpy, cv2, and other common python libs)
```
python -m pip install paddlepaddle-gpu
```

Getting Started

Clone this repository:

git clone https://github.com/wzmsltw/PaintTransformer
cd PaintTransformer

Download pretrained model from Google Drive and move it to inference directory:
```
mv [Download Directory]/paint_best.pdparams inference/
cd inference
```
Inference:
```
python inference.py
```
- Input image path, output path, and etc can be set in the main function.
- Notably, there is a flag serial as one parameter of the main function:
  - If serial is True, strokes would be rendered serially. The consumption of video memory will be low but it requires more time. Serial inference can achieve better rendering quality.
  - If serial is False, strokes would be rendered in parallel. The consumption of video memory will be high but it would be faster.
  - If animated results are required, serial must be True.
Train:
- You can send email to us for the training codes.

More Results

Input	Animated Output

App

Do not want to run the code? Try an App 一刻相册 downloaded from here!

Citation

If you find ideas or codes useful for your research, please cite:

@inproceedings{liu2021paint,
  title={Paint Transformer: Feed Forward Neural Painting with Stroke Prediction},
  author={Liu, Songhua and Lin, Tianwei and He, Dongliang and Li, Fu and Deng, Ruifeng and Li, Xin and Ding, Errui and Wang, Hao},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Contact

For any question, please file an issue or contact

Songhua Liu: s[email protected]
Tianwei Lin: [email protected]

Official PaddlePaddle implementation of Paint Transformer

Related tags

Overview

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

Update

Overview

Prerequisites

Getting Started

More Results

App

Citation

Contact

Owner

TianweiLin

Generative Flow Networks

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

Code for EmBERT, a transformer model for embodied, language-guided visual task completion.

Implementation of Axial attention - attending to multi-dimensional data efficiently

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)

Code for "Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation" ICCV'21

Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet.

Simple image captioning model - CLIP prefix captioning.

A web application that provides real time temperature and humidity readings of a house.

EZ graph is an easy to use AI solution that allows you to make and train your neural networks without a single line of code.

A simple rest api serving a deep learning model that classifies human gender based on their faces. (vgg16 transfare learning)

Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.

Code for "The Box Size Confidence Bias Harms Your Object Detector"

Implementation of ToeplitzLDA for spatiotemporal stationary time series data.

UMich 500-Level Mobile Robotics Course