Open-L2O

This repository establishes the first comprehensive benchmark efforts of existing learning to optimize (L2O) approaches on a number of problems and settings. We release our software implementation and data as the Open-L2O package, for reproducible research and fair benchmarking in the L2O field. [Paper]

Overview

What is learning to optimize (L2O)?

L2O (Learning to optimize) aims to replace manually designed analytic optimization algorithms (SGD, RMSProp, Adam, etc.) with learned update rules.

How does L2O work?

L2O serves as functions that can be fit from data. L2O gains experience from training optimization tasks in a principled and automatic way.

What can L2O do for you?

L2O is particularly suitable for solving a certain type of optimization over a specific distribution of data repeatedly. In comparison to classic methods, L2O is shown to find higher-quality solutions and/or with much faster convergence speed for many problems.

Open questions for research?

There are significant theoretical and practicality gaps between manually designed optimizers and existing L2O models.

Main Results

Learning to optimize sparse recovery

Learning to optimize Lasso functions

Learning to optimize non-convex Rastrigin functions

Learning to optimize neural networks

Supported Model-base Learnable Optimizers

All codes are available at here.

LISTA (feed-forward form) from Learning fast approximations of sparse coding [Paper]
LISTA-CP from Theoretical Linear Convergence of Unfolded ISTA and its Practical Weights and Thresholds [Paper]
LISTA-CPSS from Theoretical Linear Convergence of Unfolded ISTA and its Practical Weights and Thresholds [Paper]
LFISTA from Understanding Trainable Sparse Coding via Matrix Factorization [Paper]
LAMP from AMP-Inspired Deep Networks for Sparse Linear Inverse Problems [Paper]
ALISTA from ALISTA: Analytic Weights Are As Good As Learned Weights in LISTA [Paper]
GLISTA from Sparse Coding with Gated Learned ISTA [Paper]

Supported Model-free Learnable Optimizers

L2O-DM from Learning to learn by gradient descent by gradient descent [Paper] [Code]
L2O-RNNProp Learning Gradient Descent: Better Generalization and Longer Horizons from [Paper] [Code]
L2O-Scale from Learned Optimizers that Scale and Generalize [Paper] [Code]
L2O-enhanced from Training Stronger Baselines for Learning to Optimize [Paper] [Code]
L2O-Swarm from Learning to Optimize in Swarms [Paper] [Code]
L2O-Jacobian from HALO: Hardware-Aware Learning to Optimize [Paper] [Code]
L2O-Minmax from Learning A Minimax Optimizer: A Pilot Study [Paper] [Code]

Supported Optimizees

Convex Functions:

Quadratic
Lasso

Non-convex Functions:

Rastrigin

Minmax Functions:

Saddle
Rotated Saddle
Seesaw
Matrix Game

Neural Networks:

MLPs on MNIST
ConvNets on MNIST and CIFAR-10
LeNet
NAS searched archtectures

Other Resources

This is a Pytorch implementation of L2O-DM. [Code]
This is the original L2O-Swarm repository. [Code]
This is the original L2O-Jacobian repository. [Code]

Future Works

TF2.0 Implementated toolbox v2 with a unified framework and lib dependency.

Cite

@misc{chen2021learning,
      title={Learning to Optimize: A Primer and A Benchmark}, 
      author={Tianlong Chen and Xiaohan Chen and Wuyang Chen and Howard Heaton and Jialin Liu and Zhangyang Wang and Wotao Yin},
      year={2021},
      eprint={2103.12828},
      archivePrefix={arXiv},
      primaryClass={math.OC}
}

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

Related tags

Overview

Open-L2O

Overview

What is learning to optimize (L2O)?

How does L2O work?

What can L2O do for you?

Open questions for research?

Main Results

Learning to optimize sparse recovery

Learning to optimize Lasso functions

Learning to optimize non-convex Rastrigin functions

Learning to optimize neural networks

Supported Model-base Learnable Optimizers

Supported Model-free Learnable Optimizers

Supported Optimizees

Other Resources

Future Works

Cite

Owner

VITA

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Official implementation of Deep Convolutional Dictionary Learning for Image Denoising.

Official PyTorch implementation of "Improving Face Recognition with Large AgeGaps by Learning to Distinguish Children" (BMVC 2021)

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Earth Vision Foundation

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

This is a Keras-based Python implementation of DeepMask- a complex deep neural network for learning object segmentation masks

Scripts for training an AI to play the endless runner Subway Surfers using a supervised machine learning approach by imitation and a convolutional neural network (CNN) for image classification

The PyTorch implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

This repository includes code of my study about Asynchronous in Frequency domain of GAN images.

[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation

Open source annotation tool for machine learning practitioners.

Object Detection and Multi-Object Tracking

TensorFlow (Python) implementation of DeepTCN model for multivariate time series forecasting.

Implementation of neural class expression synthesizers

Fully-automated scripts for collecting AI-related papers

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

Learning Confidence for Out-of-Distribution Detection in Neural Networks

This project contains an implemented version of Face Detection using OpenCV and Mediapipe. This is a code snippet and can be used in projects.