Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Overview

Continual Learning in Environments with Polynomial Mixing Times

This repository provides the official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times".

Basic Setup

Clone this repository, then change into its directory:

cd polynomial-mixing-times

Create either a Python virtualenv or a conda environment and activate it. For example, with virtualenv:

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a script, run_bottleneck.sh, with all the relevant hyperparameters used for both the baselines and our proposed model. Running it runs all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution, and a random seed of 1, use the following command (the arguments are the seed, the number of rooms, and the task evolution, in that order):

bash run_bottleneck.sh 1 4 "random"
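
To repeat the run above over several seeds, one option is to generate one invocation per seed. The sketch below only prints the commands, so they can be inspected before anything is executed. The helper name print_sweep and the seed values 1 2 3 are illustrative and not part of the repository; the sketch assumes the same argument order as the command above (seed, number of rooms, task evolution).

```shell
# Hypothetical helper (not part of the repository): prints one
# run_bottleneck.sh invocation per seed instead of executing it.
# Usage: print_sweep ROOMS EVOLUTION SEED [SEED ...]
print_sweep() {
  sweep_rooms="$1"
  sweep_evolution="$2"
  shift 2
  for sweep_seed in "$@"; do
    # Same argument order as the single-run command: seed, rooms, evolution.
    printf 'bash run_bottleneck.sh %s %s "%s"\n' \
      "$sweep_seed" "$sweep_rooms" "$sweep_evolution"
  done
}

# Illustrative values: 4 rooms, "random" task evolution, seeds 1 to 3.
print_sweep 4 random 1 2 3
```

Once the printed commands look right, piping the output to bash (print_sweep 4 random 1 2 3 | bash) from the repository root would run them one after another.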

Available Models

Online Q learning
Q learning with Replay
Q learning with Dyna
Model-based n-step TD
Vanilla Policy Gradient
On-policy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used Python 3.7 to run all our experiments.

Owner

Sharath Raparthy
