GitHub repository for "Improving Video Generation for Multi-functional Applications"

Last update: Dec 07, 2022

Improving Video Generation for Multi-functional Applications

Paper Link

For more information please refer to our homepage.

Requirements

TensorFlow 1.2.1
Python 2.7
ffmpeg

Data Format

Each video is stored as a single JPEG in which the frames are stacked vertically. Every frame must be at least 64x64 pixels, and each video must contain between 16 and 32 frames. For an example dataset, see: http://carlvondrick.com/tinyvideo/#data
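
To illustrate the layout described above, here is a minimal sketch that computes the crop box of every frame in a vertically stacked image and checks the size and frame-count constraints. The function name `frame_boxes` and the default frame height of 64 are assumptions made for this example; they are not part of the repository.

```python
def frame_boxes(image_width, image_height, frame_height=64):
    """Return one (left, top, right, bottom) box per vertically stacked frame.

    Hypothetical helper: the name and the default frame height are
    assumptions for illustration, not taken from the repository.
    """
    # Every frame needs to be at least 64x64 pixels.
    if image_width < 64 or frame_height < 64:
        raise ValueError("frames must be at least 64x64 pixels")
    # A stacked image must hold a whole number of frames.
    if image_height % frame_height != 0:
        raise ValueError("image height is not a multiple of the frame height")
    num_frames = image_height // frame_height
    # Videos contain between 16 and 32 frames.
    if not 16 <= num_frames <= 32:
        raise ValueError("expected 16 to 32 frames, got %d" % num_frames)
    return [(0, i * frame_height, image_width, (i + 1) * frame_height)
            for i in range(num_frames)]


if __name__ == "__main__":
    # A 64-pixel-wide image holding 32 stacked 64x64 frames.
    boxes = frame_boxes(64, 64 * 32)
    print(len(boxes), boxes[0], boxes[-1])
```

Each box uses the (left, upper, right, lower) convention, so it could be passed to an image library's crop function to extract the individual frames.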

Training

python main_train.py

Important Parameters:

mode: one of 'generate', 'predict', 'bw2rgb', 'inpaint', depending on whether you want to generate videos, predict future frames, colorize videos, or do inpainting.
batch_size: 64 is recommended; use 32 for colorization to avoid memory issues.
root_dir: root directory of dataset
index_file: file located in root_dir that lists all training clips; each path is relative to root_dir.
experiment_name: name of experiment
output_every: output loss to stdout and write to tensorboard summary every xx steps.
sample_every: generate a visual sample every xx steps.
save_model_every: save the model every xx steps.
recover_model: if true, recover the saved model and continue training.
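
As a rough sketch of how an index file for `index_file` might be produced, the snippet below walks `root_dir` and writes the path of every clip relative to that directory. The one-path-per-line format, the file name `index.txt`, the `.jpg` extension filter, and the helper name `write_index` are all assumptions for illustration; check the training script for the format it actually expects.

```python
import os


def write_index(root_dir, index_name="index.txt", extension=".jpg"):
    """Write the relative paths of all clips under root_dir to an index file.

    Hypothetical helper: the file name, the extension filter and the
    one-path-per-line format are assumptions, not taken from the repository.
    """
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            if name.lower().endswith(extension):
                full_path = os.path.join(dirpath, name)
                # Store each path relative to root_dir.
                paths.append(os.path.relpath(full_path, root_dir))
    paths.sort()
    # The index file must live inside root_dir.
    with open(os.path.join(root_dir, index_name), "w") as handle:
        for path in paths:
            handle.write(path + "\n")
    return paths
```

Calling `write_index("/path/to/dataset")` would create `/path/to/dataset/index.txt`, whose name would then be given as `index_file`.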

Owner

Bernhard Kratzwald
