A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

Last update: Dec 24, 2022

Related tags

Overview

GainedVAE

A Pytorch Implementation of a continuously rate adjustable learned image compression framework, Gained Variational Autoencoder(GainedVAE).

Note that This Is Not An Official Implementation Code.

More details can be found in the following paper:

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation.
Huawei Technologies, CVPR 2021
Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, Bo Bai

Todo: Reproduce Implementation of the following paper:

INTERPOLATION VARIABLE RATE IMAGE COMPRESSION
Alibaba Group, arxiv 2021.9.20
Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li

Environment

Python == 3.7.10
Pytorch == 1.7.1
CompressAI

Dataset

Training set

I use a part of the OpenImages Dataset to train the models (train06, train07, train08, about 54w images). You can download from here. Download OpenImages Maybe train08 (14w images) is enough.

Test set

Download Kodak dataset

Train Your Own Model

python3 trainGain.py -d /path/to/your/image/dataset/ --epochs 200 -lr 1e-4 --batch-size 16 --model-save /path/to/your/model/save/dir --cuda

Result

I try to train the Gained Mean-Scale Hyperprior model and here is the result.

Acknowledgement

The framework is based on CompressAI, I add the model in compressai.models.gain, compressai.models.gain_utils.
And trainGain/trainGain.py is modified with reference to compressai_examples/train.py.

More Variable Rate Image Compression Repositories

"Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform" (ICCV 2021).
code

"Variable Bitrate Image Compression with Quality Scaling Factors" (ICASSP 2020).
code

"Variable Rate Deep Image Compression with Modulated Autoencoders" (IEEE SPL 2020)
code

"Slimmable Compressive Autoencoders for Practical Neural Image Compression" (CVPR 2021)
code

Contact

Feel free to contact me if there is any question about the code or to discuss any problems with image and video compression. ([email protected])

A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

Related tags

Overview

GainedVAE

Environment

Dataset

Training set

Test set

Train Your Own Model

Result

Acknowledgement

More Variable Rate Image Compression Repositories

Contact

Owner

Artificial intelligence technology inferring issues and logically supporting facts from raw text

FastReID is a research platform that implements state-of-the-art re-identification algorithms.

Code to reproduce the experiments from our NeurIPS 2021 paper " The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective"

SWA Object Detection

[ICCV 2021] Our work presents a novel neural rendering approach that can efficiently reconstruct geometric and neural radiance fields for view synthesis.

Tweesent-back - Tweesent backend uses fastAPI as the web framework

👐OpenHands : Making Sign Language Recognition Accessible (WiP 🚧👷‍♂️🏗)

A PyTorch implementation of the architecture of Mask RCNN

GRF: Learning a General Radiance Field for 3D Representation and Rendering

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Course materials for Fall 2021 "CIS6930 Topics in Computing for Data Science" at New College of Florida

Synthetic LiDAR sequential point cloud dataset with point-wise annotations

E-RAFT: Dense Optical Flow from Event Cameras

Official Implementation of DDOD (Disentangle your Dense Object Detector), ACM MM2021

Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

The FIRST GANs-based omics-to-omics translation framework

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation

Implementation of "Selection via Proxy: Efficient Data Selection for Deep Learning" from ICLR 2020.

Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)