UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Last update: Dec 03, 2022

Related tags

Deep Learning UMEC

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Code for this paper UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Jiayi Shen, Haotao Wang*, Shupeng Gui*, Jianchao Tan, Zhangyang Wang, and Ji Liu

Overview

We propose a unified model and embedding compression (UMEC) framework to hammer an efficient neural network-based recommendation system. Our framework jointly learns input feature selection and neural network compression together, and solve them as an end-to-end resource-constrained optimization problem using ADMM.

Main Results

Implementation

We perform the compression process on DLRM, which is a public recommendation model. Our proposed algorithm is mainly implemented inrc_optimizer.py and rc_utils.py. Other files are inherited from the original DLRM code repo, with several lines of modifications, such as joint_train.py, input_selection.py, and finetune.py, in order to plug in our algorithm. To run the code in this repo, you have to first follow the instructions in the original repo to download the dataset, and run the corresponding training part, to finish the data preprocessing process.

Unified Framework

To implement to joint training and compressing under the resource constraint, please see the script in script/joint_train.sh.

Input feature selection

To implement to joint training and compressing under the resource constraint, please see the script in script/input_selection.sh.

Acknowledgement

We thank the author of DLRM for providing a recommendation model benchmark.

Citation

@inproceedings{
shen2021umec,
title={{\{}UMEC{\}}: Unified model and embedding compression for efficient recommendation systems},
author={Jiayi Shen and Haotao Wang and Shupeng Gui and Jianchao Tan and Zhangyang Wang and Ji Liu},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=BM---bH_RSh}
}

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Related tags

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Overview

Main Results

Implementation

Unified Framework

Input feature selection

Acknowledgement

Citation

Owner

VITA

Navigating StyleGAN2 w latent space using CLIP

《Truly shift-invariant convolutional neural networks》(2021)

Prometheus exporter for Cisco Unified Computing System (UCS) Manager

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones Code

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

RoboDesk A Multi-Task Reinforcement Learning Benchmark

This is the official repository of the paper Stocastic bandits with groups of similar arms (NeurIPS 2021). It contains the code that was used to compute the figures and experiments of the paper.

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch

EfficientNetV2-with-TPU - Cifar-10 case study

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

A DCGAN to generate anime faces using custom mined dataset

Resilient projection-based consensus actor-critic (RPBCAC) algorithm

Manifold Alignment for Semantically Aligned Style Transfer

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Related tags

Overview

UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems

Overview

Main Results

Implementation

Unified Framework

Input feature selection

Acknowledgement

Citation

Owner

VITA

Navigating StyleGAN2 w latent space using CLIP

《Truly shift-invariant convolutional neural networks》(2021)

Prometheus exporter for Cisco Unified Computing System (UCS) Manager

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones Code

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

RoboDesk A Multi-Task Reinforcement Learning Benchmark

This is the official repository of the paper Stocastic bandits with groups of similar arms (NeurIPS 2021). It contains the code that was used to compute the figures and experiments of the paper.

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch

EfficientNetV2-with-TPU - Cifar-10 case study

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

A DCGAN to generate anime faces using custom mined dataset

Resilient projection-based consensus actor-critic (RPBCAC) algorithm

Manifold Alignment for Semantically Aligned Style Transfer

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI