A PyTorch library and evaluation platform for end-to-end compression research

Overview


CompressAI (compress-ay) is a PyTorch library and evaluation platform for end-to-end compression research.

CompressAI currently provides:

  • custom operations, layers and models for deep learning based data compression
  • a partial port of the official TensorFlow compression library
  • pre-trained end-to-end compression models for learned image compression
  • evaluation scripts to compare learned models against classical image/video compression codecs

Figure: PSNR performance on the Kodak dataset.

Note: Multi-GPU support is now experimental.

Installation

CompressAI supports Python 3.6+ and PyTorch 1.7+.

pip:

pip install compressai

Note: wheels are available for Linux and macOS.

From source:

A C++17 compiler, a recent version of pip (19.0+), and common Python packages are also required (see setup.py for the full list).

To get started locally and install the development version of CompressAI, run the following commands in a virtual environment:

git clone https://github.com/InterDigitalInc/CompressAI compressai
cd compressai
pip install -U pip && pip install -e .

For a custom installation, you can also run one of the following commands:

  • pip install -e '.[dev]': install the packages required for development (testing, linting, docs)
  • pip install -e '.[tutorials]': install the packages required for the tutorials (notebooks)
  • pip install -e '.[all]': install all the optional packages

Note: Docker images will be released in the future. Conda environments are not officially supported.

Documentation

Full documentation is hosted on GitHub Pages: https://interdigitalinc.github.io/CompressAI/

Usage

Examples

Script and notebook examples can be found in the examples/ directory.

To encode/decode images with the provided pre-trained models, run the codec.py example:

python3 examples/codec.py --help
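
Pre-trained models can also be used directly from Python via compressai.zoo. The following is a minimal sketch; the chosen architecture, quality level, and input filename are only illustrative, and other pre-trained architectures are available in the zoo:

import torch
from PIL import Image
from torchvision import transforms

from compressai.zoo import bmshj2018_factorized

# Load a pre-trained model (quality levels range from 1 to 8 for this architecture).
net = bmshj2018_factorized(quality=3, pretrained=True).eval()

# Load an image and convert it to a 1xCxHxW tensor with values in [0, 1].
img = Image.open("kodim01.png").convert("RGB")  # placeholder input image
x = transforms.ToTensor()(img).unsqueeze(0)

with torch.no_grad():
    out = net(x)
x_hat = out["x_hat"].clamp(0, 1)  # reconstructed image

# Estimate the rate in bits per pixel from the learned likelihoods.
num_pixels = x.size(0) * x.size(2) * x.size(3)
bpp = sum(torch.log2(l).sum() for l in out["likelihoods"].values()) / -num_pixels
print(f"estimated bpp: {bpp.item():.4f}")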

An example training script with a rate-distortion loss is provided in examples/train.py. You can replace the model used in the training script with your own model implemented within CompressAI, and then run the script for a simple training pipeline:

python3 examples/train.py -d /path/to/my/image/dataset/ --epochs 300 -lr 1e-4 --batch-size 16 --cuda --save
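
The training example minimizes a rate-distortion objective of the form loss = bpp + λ · distortion. Below is a minimal sketch of such a loss, similar in spirit to the one defined in examples/train.py; the λ value is illustrative:

import math

import torch
import torch.nn as nn

class RateDistortionLoss(nn.Module):
    """Rate term (estimated bits per pixel) plus weighted MSE distortion."""

    def __init__(self, lmbda=0.01):  # illustrative trade-off parameter
        super().__init__()
        self.mse = nn.MSELoss()
        self.lmbda = lmbda

    def forward(self, output, target):
        N, _, H, W = target.size()
        num_pixels = N * H * W
        # Rate: sum of the negative log-likelihoods from the entropy models, in bits per pixel.
        bpp_loss = sum(
            torch.log(likelihoods).sum() / (-math.log(2) * num_pixels)
            for likelihoods in output["likelihoods"].values()
        )
        # Distortion: MSE between the reconstruction and the original image.
        mse_loss = self.mse(output["x_hat"], target)
        return bpp_loss + self.lmbda * 255 ** 2 * mse_loss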

Note: the training example uses a custom ImageFolder structure.
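
Assuming the compressai.datasets.ImageFolder interface, the dataset root is expected to contain train/ and test/ sub-directories of images. A sketch of how the training split might be loaded (paths, crop size, and batch size are placeholders):

from torch.utils.data import DataLoader
from torchvision import transforms

from compressai.datasets import ImageFolder

# Assumed layout:
#   /path/to/my/image/dataset/train/*.png
#   /path/to/my/image/dataset/test/*.png
train_transforms = transforms.Compose(
    [transforms.RandomCrop(256), transforms.ToTensor()]
)
train_dataset = ImageFolder(
    "/path/to/my/image/dataset/", split="train", transform=train_transforms
)
train_loader = DataLoader(train_dataset, batch_size=16, shuffle=True, num_workers=4)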

A Jupyter notebook illustrating the usage of a pre-trained model for learned image compression is also provided in the examples directory:

pip install -U ipython jupyter ipywidgets matplotlib
jupyter notebook examples/

Evaluation

To evaluate a trained model on your own dataset, CompressAI provides an evaluation script:

python3 -m compressai.utils.eval_model checkpoint /path/to/images/folder/ -a $ARCH -p $MODEL_CHECKPOINT...
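
For example, to evaluate a checkpoint trained with one of the provided architectures (the architecture name and checkpoint path below are placeholders):

python3 -m compressai.utils.eval_model checkpoint /path/to/images/folder/ -a bmshj2018-factorized -p /path/to/checkpoint.pth.tar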

To evaluate traditional image/video codecs:

python3 -m compressai.utils.bench --help
python3 -m compressai.utils.bench bpg --help
python3 -m compressai.utils.bench vtm --help

Tests

Run tests with pytest:

pytest -sx --cov=compressai --cov-append --cov-report term-missing tests

Slow tests can be skipped with the -m "not slow" option.
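
For example:

pytest -m "not slow" tests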

License

CompressAI is licensed under the Apache License, Version 2.0.

Contributing

We welcome feedback and contributions. Please open a GitHub issue to report bugs, request enhancements or if you have any questions.

Before contributing, please read the CONTRIBUTING.md file.

Authors

  • Jean Bégaint, Fabien Racapé, Simon Feltman and Akshay Pushparaja, InterDigital AI Lab.

Citation

If you use this project, please cite the relevant original publications for the models and datasets, and cite this project as:

@article{begaint2020compressai,
	title={CompressAI: a PyTorch library and evaluation platform for end-to-end compression research},
	author={B{\'e}gaint, Jean and Racap{\'e}, Fabien and Feltman, Simon and Pushparaja, Akshay},
	year={2020},
	journal={arXiv preprint arXiv:2011.03029},
}
