OptNet: Differentiable Optimization as a Layer in Neural Networks

Last update: Dec 24, 2022

This repository is by Brandon Amos and J. Zico Kolter and contains the PyTorch source code to reproduce the experiments in our ICML 2017 paper OptNet: Differentiable Optimization as a Layer in Neural Networks.

If you find this repository helpful in your publications, please consider citing our paper.

@InProceedings{amos2017optnet,
  title = {{O}pt{N}et: Differentiable Optimization as a Layer in Neural Networks},
  author = {Brandon Amos and J. Zico Kolter},
  booktitle = {Proceedings of the 34th International Conference on Machine Learning},
  pages = {136--145},
  year = {2017},
  volume = {70},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}

Informal Introduction

Mathematical optimization is a well-studied language for expressing solutions to many real-life problems that arise in machine learning and in fields such as mechanics, economics, electrical engineering, operations research, control engineering, geophysics, and molecular modeling. As we build machine learning systems that interact with real data from these fields, we often cannot (but sometimes can) simply "learn away" the optimization sub-problems by adding more layers to the network. A well-defined optimization problem can be added to a model when you understand your feature space thoroughly, but often we lack that understanding and fall back on automatic feature learning for our tasks.

Before this repository, no modern deep learning library provided a way to add a learnable optimization layer to a model formulation (other than unrolling an optimization procedure, which is inefficient and inexact), so there was no quick way to try one and see whether it is a good way of expressing our data.

See our paper OptNet: Differentiable Optimization as a Layer in Neural Networks and code at locuslab/optnet if you are interested in learning more about our initial exploration in this space of automatically learning quadratic program layers for signal denoising and sudoku.
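To make "differentiating through an optimization problem" concrete, here is a minimal NumPy sketch for the simplest case: a quadratic program with equality constraints only. It is an illustration under stated assumptions, not the code in this repository or the qpth solver, and it leaves out inequality constraints. The function names `solve_eq_qp` and `grad_wrt_p_and_b` are hypothetical. The idea is that the solution satisfies a linear KKT system, so gradients of a downstream loss with respect to the problem data come from one more solve against the same matrix.

```python
import numpy as np

def solve_eq_qp(Q, p, A, b):
    """Solve min_z 0.5 z'Qz + p'z subject to Az = b via its KKT system.

    Hypothetical helper for illustration; assumes Q is symmetric positive
    definite and A has full row rank, so the KKT matrix is invertible.
    """
    n, m = Q.shape[0], A.shape[0]
    # KKT conditions: Qz + A'nu = -p and Az = b, stacked into one linear system.
    K = np.block([[Q, A.T], [A, np.zeros((m, m))]])
    sol = np.linalg.solve(K, np.concatenate([-p, b]))
    return sol[:n], K

def grad_wrt_p_and_b(K, n, dl_dz):
    """Backpropagate dl/dz through the solution map to get dl/dp and dl/db.

    K is symmetric, so one solve against K with right-hand side [dl/dz; 0]
    gives both gradients.
    """
    rhs = np.concatenate([dl_dz, np.zeros(K.shape[0] - n)])
    d = np.linalg.solve(K, rhs)
    return -d[:n], d[n:]

# A small made-up problem: two variables, one equality constraint.
Q = np.array([[2.0, 0.5], [0.5, 1.0]])
p = np.array([1.0, -1.0])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])

z_star, K = solve_eq_qp(Q, p, A, b)

# Downstream loss l(z) = 0.5 * ||z||^2, so dl/dz = z.
grad_p, grad_b = grad_wrt_p_and_b(K, n=2, dl_dz=z_star)
```

In a network, `p` (or `Q`, `A`, `b`) would depend on earlier layers or be a learnable parameter, and `grad_p` would be passed further back. Unrolling an iterative solver would reach similar gradients only approximately and at a higher cost.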

Setup and Dependencies

Python/numpy/PyTorch
qpth: Our fast QP solver for PyTorch released in conjunction with this paper.
bamos/block: Our intelligent block matrix library for numpy, PyTorch, and beyond.
Optional: bamos/setGPU: A small library to set CUDA_VISIBLE_DEVICES on multi-GPU systems.
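If you prefer not to install the optional setGPU dependency, CUDA_VISIBLE_DEVICES can also be set by hand. The sketch below is not setGPU's implementation; `select_gpu` is a hypothetical helper that takes an explicit device index and relies only on the standard library. The variable has to be set before CUDA is initialized, so call it before importing PyTorch.

```python
import os

def select_gpu(device_index):
    """Restrict CUDA to a single device by setting CUDA_VISIBLE_DEVICES.

    Hypothetical helper for illustration. Call it before CUDA is
    initialized, in practice before importing torch.
    """
    os.environ["CUDA_VISIBLE_DEVICES"] = str(device_index)
    return os.environ["CUDA_VISIBLE_DEVICES"]

# Expose only the first GPU to this process.
visible = select_gpu(0)
```

The same effect is available from the shell by prefixing the command with the variable assignment, for example `CUDA_VISIBLE_DEVICES=0 python main.py`.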

Denoising Experiments

denoising
├── create.py - Script to create the denoising dataset.
├── plot.py - Plot the results from any experiment.
├── main.py - Run the FC baseline and OptNet denoising experiments. (See arguments.)
├── main.tv.py - Run the TV baseline denoising experiment.
└── run-exps.sh - Run all experiments. (May need to uncomment some lines.)
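For readers unfamiliar with the TV baseline, the sketch below writes out the standard 1D total-variation denoising objective in NumPy: a squared data-fit term plus an L1 penalty on first differences of the signal. It is an assumption that main.tv.py uses exactly this form. The names `first_difference` and `tv_objective` and the weight `lam` are hypothetical and chosen for illustration.

```python
import numpy as np

def first_difference(n):
    """Build the (n-1) x n first-difference matrix D, where (Dz)_i = z_i - z_{i+1}."""
    D = np.zeros((n - 1, n))
    idx = np.arange(n - 1)
    D[idx, idx] = 1.0
    D[idx, idx + 1] = -1.0
    return D

def tv_objective(z, y, lam):
    """Evaluate 0.5 * ||y - z||^2 + lam * ||Dz||_1 for a candidate signal z."""
    D = first_difference(len(y))
    return 0.5 * np.sum((y - z) ** 2) + lam * np.sum(np.abs(D @ z))

# A made-up piecewise-constant signal with a single jump.
y = np.array([0.0, 0.0, 1.0, 1.0])
value = tv_objective(y, y, lam=0.5)
```

Minimizing this objective over `z` favors piecewise-constant reconstructions, because every jump between neighboring entries is penalized in proportion to its size.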

Sudoku Experiments

The dataset we used in our experiments is available in sudoku/data.

sudoku
├── create.py - Script to create the dataset.
├── plot.py - Plot the results from any experiment.
├── main.py - Run the FC baseline and OptNet Sudoku experiments. (See arguments.)
└── models.py - Models used for Sudoku.

Classification Experiments

cls
├── train.py - Run the FC baseline and OptNet classification experiments. (See arguments.)
├── plot.py - Plot the results from any experiment.
└── models.py - Models used for classification.

Acknowledgments

The rapid development of this work would not have been possible without the immense amount of help from the PyTorch team, particularly Soumith Chintala and Adam Paszke.

Licensing

Unless otherwise stated, the source code is copyright Carnegie Mellon University and licensed under the Apache 2.0 License.
