Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Last update: Jan 07, 2023

Related tags

Overview

Trainable multi-codebook quantization

This repository implements a utility for use with PyTorch, and ideally GPUs, for training an efficient quantizer based on multiple single-byte codebooks. The prototypical scenario is that you have some distribution over vectors in some space, say, of dimension 512, that might come from a neural net embedding, and you want a means of encoding a vector into a short sequence of bytes (say, 4 or 8 bytes) that can be used to reconstruct the vector with minimal expected loss, measured as squared distance, i.e. squared l2 loss.

This repository provides Quantizer object that lets you do this quantization, and an associated QuantizerTrainer object that you can use to train the Quantizer. For example, you might invoke the QuantizerTrainer with 20,000 minibatches of vectors.

Usage

Installation

python3 setup.py install

Example

import torch
import quantization

trainer = quantization.QuantizerTrainer(dim=256, bytes_per_frame=4,
                                        device=torch.device('cuda'))
while not trainer.done():
   # let x be some tensor of shape (*, dim), that you will train on
   # (should not be the same on each minibatch)
   trainer.step(x)
quantizer = trainer.get_quantizer()

# let x be some tensor of shape (*, dim)..
encoded = quantizer.encode(x)  # (*, 4), dtype=uint8
x_approx = quantizer.decode(quantizer.encode(x))

To avoid versioning issues and so on, it may be easier to just include quantization.py in your repository directly (and add its requirements to your requirements.txt).

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Related tags

Overview

Trainable multi-codebook quantization

Usage

Installation

Example

Owner

Daniel Povey

Hierarchical Motion Encoder-Decoder Network for Trajectory Forecasting (HMNet)

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection?

code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification

A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning

Official PyTorch implementation for "Low Precision Decentralized Distributed Training with Heterogenous Data"

GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs

a project for 3D multi-object tracking

FastCover: A Self-Supervised Learning Framework for Multi-Hop Influence Maximization in Social Networks by Anonymous.

Fast, flexible and fun neural networks.

[IJCAI-2021] A benchmark of data-free knowledge distillation from paper "Contrastive Model Inversion for Data-Free Knowledge Distillation"

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

The official PyTorch code for 'DER: Dynamically Expandable Representation for Class Incremental Learning' accepted by CVPR2021

DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

Keyword-BERT: Keyword-Attentive Deep Semantic Matching

PyTorch implementation of Interpretable Explanations of Black Boxes by Meaningful Perturbation

Fast, general, and tested differentiable structured prediction in PyTorch

ICRA 2021 "Towards Precise and Efficient Image Guided Depth Completion"

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)