Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Last update: Dec 31, 2022

Related tags

Deep Learning White-box-Cartoonization

Overview

[CVPR2020]Learning to Cartoonize Using White-box Cartoon Representations

Tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”.
Improved method for facial images are now available:
https://github.com/SystemErrorWang/FacialCartoonization

Use cases

Scenery

Food

Indoor Scenes

People

More Images Are Shown In The Supplementary Materials

Online demo

Some kind people made online demo for this project
Demo link: https://cartoonize-lkqov62dia-de.a.run.app/cartoonize
Code: https://github.com/experience-ml/cartoonize
Sample Demo: https://www.youtube.com/watch?v=GqduSLcmhto&feature=emb_title

Prerequisites

Training code: Linux or Windows
NVIDIA GPU + CUDA CuDNN for performance
Inference code: Linux, Windows and MacOS

How To Use

Installation

Assume you already have NVIDIA GPU and CUDA CuDNN installed
Install tensorflow-gpu, we tested 1.12.0 and 1.13.0rc0
Install scikit-image==0.14.5, other versions may cause problems

Inference with Pre-trained Model

Store test images in /test_code/test_images
Run /test_code/cartoonize.py
Results will be saved in /test_code/cartoonized_images

Train

Place your training data in corresponding folders in /dataset
Run pretrain.py, results will be saved in /pretrain folder
Run train.py, results will be saved in /train_cartoon folder
Codes are cleaned from production environment and untested
There may be minor problems but should be easy to resolve
Pretrained VGG_19 model can be found at following url: https://drive.google.com/file/d/1j0jDENjdwxCDb36meP6-u5xDBzmKBOjJ/view?usp=sharing

Datasets

Due to copyright issues, we cannot provide cartoon images used for training
However, these training datasets are easy to prepare
Scenery images are collected from Shinkai Makoto, Miyazaki Hayao and Hosoda Mamoru films
Clip films into frames and random crop and resize to 256x256
Portrait images are from Kyoto animations and PA Works
We use this repo(https://github.com/nagadomi/lbpcascade_animeface) to detect facial areas
Manual data cleaning will greatly increace both datasets quality

Acknowledgement

We are grateful for the help from Lvmin Zhang and Style2Paints Research

License

license (https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode).
Commercial application is prohibited, please remain this license if you clone this repo

Citation

If you use this code for your research, please cite our paper:

@InProceedings{Wang_2020_CVPR, author = {Wang, Xinrui and Yu, Jinze}, title = {Learning to Cartoonize Using White-Box Cartoon Representations}, booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2020} }

中文社区

我们有一个除了技术什么东西都聊的以技术交流为主的宇宙超一流二次元相关技术交流吹水群“纸片协会”。如果你一次加群失败，可以多次尝试。

纸片协会总舵：184467946

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Related tags

Overview

[CVPR2020]Learning to Cartoonize Using White-box Cartoon Representations

Use cases

Scenery

Food

Indoor Scenes

People

More Images Are Shown In The Supplementary Materials

Online demo

Prerequisites

How To Use

Installation

Inference with Pre-trained Model

Train

Datasets

Acknowledgement

License

Citation

中文社区

Owner

The datasets and code of ACL 2021 paper "Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions".

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Code of TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation

Implementation for Learning to Track with Object Permanence

Auto White-Balance Correction for Mixed-Illuminant Scenes

Physical Anomalous Trajectory or Motion (PHANTOM) Dataset

Free like Freedom

The Deep Learning with Julia book, using Flux.jl.

Single/multi view image(s) to voxel reconstruction using a recurrent neural network

Mail classification with tensorflow and MS Exchange Server (ham or spam).

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Qlib is an AI-oriented quantitative investment platform

Convert Table data to approximate values with GUI

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

PyTorchVideo is a deeplearning library with a focus on video understanding work

PyTorch DepthNet Training on Still Box dataset

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch

A minimalist tool to display a network graph.